Loading video...

Video Failed to Load

Go Home

📲 Gemma 3n delivers advanced on-device AI with optimized performance and multimodal understanding. Take a closer look at the early preview text capabilities available in Google AI Studio, GenAI SDK, and text/image capabilities via MediaPipe.

16,628 views • 1 year ago •via X (Twitter)

9 Comments

Google AI Developers's profile picture
Google AI Developers1 year ago

Check out the blog for more details ↓

ksminnovation's profile picture
ksminnovation1 year ago

AI is transforming healthcare! A KSM-led study shows AI can detect Celiac disease 4 years earlier @TalPatalon @MedPredict

Albert@PepinoCapital's profile picture
Albert@PepinoCapital1 year ago

Related but can you add support for Google login (since it's a Google app?) so I'm logged in already when I try to download a model from @huggingface? Thank you

Sam Hocking's profile picture
Sam Hocking1 year ago

No video/live video available yet. Seems really good!

Tsukuyomi's profile picture
Tsukuyomi1 year ago

on-device AI that gets smarter? sounds like a plot twist waiting to happen. let’s see how well it handles the chaos of reality.

Nicolas's profile picture
Nicolas1 year ago

it does not seem to work with liteRT for web

KeySS Inc's profile picture
KeySS Inc1 year ago

Sounds cool! Can't wait to see how this AI performs better than my coffee maker on a Monday morning.

Vishal's profile picture
Vishal1 year ago

On-device AI with multimodal skills is a smart move.

hamhamtar0xOwO's profile picture
hamhamtar0xOwO1 year ago

@MirraTerminal good update

Related Videos

We've officially released and open-sourced HunyuanImage 2.1, our latest text-to-image model. The new model delivers on our commitment to balancing performance and quality. With native 2K image generation, HunyuanImage 2.1 is an advanced open-source text-to-image model.🎨 ✨ New in 2.1: 🔹Advanced Semantics: Supports ultra-long and complex prompts of up to 1000 tokens, and precisely controls the generation of multiple subjects in a single image. 🔹Precise Chinese and English Text Rendering with seamless image–text integration: The model naturally integrates text into images, making it suitable for a wide range of applications such as product covers, illustrations, and poster design to meet the needs of various fields. 🔹Rich Styles and High Aesthetic: Capable of generating images in various styles—including photorealistic portraits, comics, and vinyl figures—it delivers outstanding visual appeal and artistic quality. 🔹High-Quality Generation: Efficiently produces ultra-high-definition (2K) images in the same time other models take to generate a 1K image. HunyuanImage 2.1 uses two text encoders: a multimodal large language model (MLLM) to improve the model's image and text alignment capabilities, and a multi-language character-aware encoder to improve text rendering capabilities. The model is a single- and double-stream diffusion transformer with 17B parameters. We've also open-sourced the weights of the the accelerated version with meanflow which reduces inference steps from 100 to just 8, and PromptEnhancer, the first industrial-grade rewriting model that enhances your prompts for more nuanced and expressive image generation. Now, creators turn complex ideas—like posters with slogans or multi-panel comics—into visuals faster than ever. We’re just getting started. Stay tuned for our native multimodal image generation model coming soon. 🌐Website: 🔗Github: 🤗Hugging Face: ✨Hugging Face Demo:

Tencent Hy

89,257 views • 8 months ago