In the pursue to make an impactful app that uses Gemma 3n model, recently published by Google, I have been looking for available implementations for using Gemma 3n on local consumer devices, such as your regular iPhone and Android phones, maybe macOS and even Windows-based laptops.
Gemma 3n is a multi-modal model that can process text, images, and audio. See overview https://ai.google.dev/gemma/docs/gemma-3n
This is a very beginning of hackathon, so the open-source implementations available today are limited to those that google pushed to the public just days before the release. The vest survey so far was published on Hugging Face, see https://huggingface.co/blog/gemma3n
It says "Gemma 3n fully available in the open-source ecosystem", but definition of "fully" is blurry as the multi-modal context is very complicated topic and the designs of some ecosystem toolings are not even ready to accept such "multi" context to begin with. Le