We are implementing Google's Gemini 1.5 Pro for character description and image generation (with Imagen)! Soon you'll be able to create character images from text prompts and then bring them to life with Viggle. Stay tuned!

43,989 views • 1 year ago • via X (Twitter)

10 comments

madpencil_ • 1 year ago

Awesome upgrade 👌

ViggleAI • 1 year ago

Will let you know when it’s available!

Brent Lynch • 1 year ago

You sprung for fancy pants Gemini 1.5 PRO so we can create people with it! Good going! Having to make kitchen sinks and other non-human characters dance would have been fun too, but appreciate the extra mile! :) @RyanMorrisonJer @TheoMediaAI

Heather Cooper • 1 year ago

This is great! I need this👏

ViggleAI • 1 year ago

Yeeeess coming soon!!

Tom Blake • 1 year ago

This is pretty cool! 🔥

Andy Holland • 1 year ago

Super cool! Can't wait to use this and start creating my own characters. The possibilities are endless!!

Sri • 1 year ago

Great innovation 🔥

Jay Jay 3D 🤖 • 1 year ago

Looking forward to this Viggle :D

ViggleAI • 1 year ago

Let’s goooo🔥

Related videos

We've officially released and open-sourced HunyuanImage 2.1, our latest text-to-image model. The new model delivers on our commitment to balancing performance and quality. With native 2K image generation, HunyuanImage 2.1 is an advanced open-source text-to-image model.🎨

✨ New in 2.1:
🔹Advanced Semantics: Supports ultra-long and complex prompts of up to 1000 tokens, and precisely controls the generation of multiple subjects in a single image.
🔹Precise Chinese and English Text Rendering with seamless image–text integration: The model naturally integrates text into images, making it suitable for a wide range of applications such as product covers, illustrations, and poster design.
🔹Rich Styles and High Aesthetic: Capable of generating images in various styles, including photorealistic portraits, comics, and vinyl figures, it delivers outstanding visual appeal and artistic quality.
🔹High-Quality Generation: Efficiently produces ultra-high-definition (2K) images in the same time other models take to generate a 1K image.

HunyuanImage 2.1 uses two text encoders: a multimodal large language model (MLLM) to improve the model's image and text alignment capabilities, and a multi-language character-aware encoder to improve text rendering capabilities. The model is a single- and double-stream diffusion transformer with 17B parameters.

We've also open-sourced the weights of the accelerated version with meanflow, which reduces inference steps from 100 to just 8, and PromptEnhancer, the first industrial-grade rewriting model that enhances your prompts for more nuanced and expressive image generation.

Now, creators can turn complex ideas, like posters with slogans or multi-panel comics, into visuals faster than ever. We’re just getting started. Stay tuned for our native multimodal image generation model coming soon.

🌐Website: 🔗Github: 🤗Hugging Face: ✨Hugging Face Demo:

Tencent Hy

89,257 views • 9 months ago