We are implementing Google's Gemini 1.5 Pro for character description and image generation (with Imagen)! Soon you'll be able to create character images from text prompts and then bring them to life with Viggle. Stay tuned!

43,989 views • 1 year ago • via X (Twitter)

10 Comments

madpencil_ • 1 year ago

Awesome upgrade 👌

ViggleAI • 1 year ago

Will let you know when it’s available!

Brent Lynch • 1 year ago

You sprung for fancy pants Gemini 1.5 PRO so we can create people with it! Good going! Having to make kitchen sinks and other non-human characters dance would have been fun too, but I appreciate the extra mile! :) @RyanMorrisonJer @TheoMediaAI

Heather Cooper • 1 year ago

This is great! I need this 👏

ViggleAI • 1 year ago

Yeeeess coming soon!!

Tom Blake • 1 year ago

This is pretty cool! 🔥

Andy Holland • 1 year ago

Super cool! Can't wait to use this and start creating my own characters. The possibilities are endless!!

Sri • 1 year ago

Great innovation 🔥

Jay Jay 3D 🤖 • 1 year ago

Looking forward to this Viggle :D

ViggleAI • 1 year ago

Let’s goooo 🔥

Related Videos

We've officially released and open-sourced HunyuanImage 2.1, our latest text-to-image model. The new model delivers on our commitment to balancing performance and quality. With native 2K image generation, HunyuanImage 2.1 is an advanced open-source text-to-image model. 🎨

✨ New in 2.1:

🔹 Advanced Semantics: Supports ultra-long and complex prompts of up to 1000 tokens, and precisely controls the generation of multiple subjects in a single image.
🔹 Precise Chinese and English Text Rendering with seamless image-text integration: The model naturally integrates text into images, making it suitable for applications such as product covers, illustrations, and poster design.
🔹 Rich Styles and High Aesthetic: Capable of generating images in various styles, including photorealistic portraits, comics, and vinyl figures, it delivers outstanding visual appeal and artistic quality.
🔹 High-Quality Generation: Efficiently produces ultra-high-definition (2K) images in the same time other models take to generate a 1K image.

HunyuanImage 2.1 uses two text encoders: a multimodal large language model (MLLM) to improve the model's image and text alignment capabilities, and a multi-language character-aware encoder to improve text rendering capabilities. The model is a single- and double-stream diffusion transformer with 17B parameters.

We've also open-sourced the weights of the accelerated version with meanflow, which reduces inference steps from 100 to just 8, and PromptEnhancer, the first industrial-grade rewriting model that enhances your prompts for more nuanced and expressive image generation.

Now, creators turn complex ideas, like posters with slogans or multi-panel comics, into visuals faster than ever. We’re just getting started. Stay tuned for our native multimodal image generation model coming soon.

🌐 Website: 🔗 GitHub: 🤗 Hugging Face: ✨ Hugging Face Demo:

Tencent Hy

89,257 views • 9 months ago