Загрузка видео...

Не удалось загрузить видео

На главную

Whether you need expressive storytelling, clean narration, or multilingual characters, the Qwen3‑TTS Family makes it feel effortless🎧 With peak performance and powerful control capabilities, the model is able to meet global application demands and adapt voice based on instructions and text semantics, while significantly improving robustness to input text noise.

120,908 просмотров • 4 месяцев назад •via X (Twitter)

Комментарии: 0

Нет доступных комментариев

Здесь появятся комментарии из оригинального поста

Похожие видео

We’re excited to announce the release and open-source of HunyuanImage 3.0 — the largest and most powerful open-source text-to-image model to date, with over 80 billion total parameters, of which 13 billion are activated per token during inference.The effect is completely comparable to the industry’s flagship closed-source model.🚀🚀🚀 HunyuanImage 3.0 originates from our internally developed native multimodal large language model, with fine-tuning and post-training focused on text-to-image generation. This unique foundation gives the model a powerful set of capabilities: ✅Reason with world knowledge ✅Understand complex, thousand-word prompts ✅Generate precise text within images Different from traditional DiT architecture image generation models, HunyuanImage 3.0’s MoE architecture uses a Transfusion-based approach to deeply couple Diffusion and LLM training for a single, powerful system. Built on Hunyuan-A13B, HunyuanImage 3.0 was trained on a massive dataset: 5 billion image-text pairs, video frames, interleaved image-text data, and 6 trillion tokens of text corpora. This hybrid training across multimodal generation, understanding, and LLM capabilities allows the model to seamlessly integrate multiple tasks. Whether you're an illustrator, designer, or creator, this is built to slash your workflow from hours to minutes. HunyuanImage 3.0 can generate intricate text, detailed comics, expressive emojis, and lively, engaging illustrations for educational content. The current release focuses solely on text-to-image generation and future updates will include image-to-image, image editing, multi-turn interaction, and more. 👉🏻Try it now: 🔗GitHub: 🤗Hugging Face:

Tencent Hy

412,616 просмотров • 9 месяцев назад

Typing just became… Typeless. Meet Typeless 1.0.2 for Mac — a tool that transforms your voice into clear, accurate writing across your Mac. Speak naturally and let Typeless handle the typing, corrections, and structure for you. With Typeless, you can dictate, translate, or ask for quick edits in any language or accent. It converts your speech into polished text up to 10× faster than traditional typing, while automatically fixing mistakes along the way. --- 1️⃣ Dictation Typeless acts as a powerful voice keyboard that works across all applications on your Mac. When you speak, it understands your intent, organizes your ideas, and converts your natural speech into well-structured writing. Whether you're drafting emails, notes, documents, or messages, Typeless helps you turn spoken thoughts into clean text instantly. Controls - Press Fn to start or stop dictation - Hold Fn for quick, short dictation --- 2️⃣ Translation Typeless makes writing in other languages effortless. You can speak in your native language and have Typeless translate your words instantly into the language you want. This allows you to communicate, write, and respond in foreign languages smoothly and naturally. Controls - Press Fn + Space to start translation - Press Fn to stop translation --- 3️⃣ Ask Anything Typeless Typeless also lets you interact with your text using voice commands. You can select any text and simply say how you want it changed. Typeless can edit, rewrite, answer questions, or perform quick actions based on your request, making editing and improving text much faster. Controls - Press Fn + Space to start Ask Anything - Press Fn to stop Ask Anything --- With Typeless, your voice becomes the fastest and easiest way to write, edit, and communicate on your Mac. Your voice is now your keyboard. Get Typeless → Available now on Mac, Windows, iOS, and Android. #Typeless

Kuria Chronicles

43,623 просмотров • 3 месяцев назад

We've officially released and open-sourced HunyuanImage 2.1, our latest text-to-image model. The new model delivers on our commitment to balancing performance and quality. With native 2K image generation, HunyuanImage 2.1 is an advanced open-source text-to-image model.🎨 ✨ New in 2.1: 🔹Advanced Semantics: Supports ultra-long and complex prompts of up to 1000 tokens, and precisely controls the generation of multiple subjects in a single image. 🔹Precise Chinese and English Text Rendering with seamless image–text integration: The model naturally integrates text into images, making it suitable for a wide range of applications such as product covers, illustrations, and poster design to meet the needs of various fields. 🔹Rich Styles and High Aesthetic: Capable of generating images in various styles—including photorealistic portraits, comics, and vinyl figures—it delivers outstanding visual appeal and artistic quality. 🔹High-Quality Generation: Efficiently produces ultra-high-definition (2K) images in the same time other models take to generate a 1K image. HunyuanImage 2.1 uses two text encoders: a multimodal large language model (MLLM) to improve the model's image and text alignment capabilities, and a multi-language character-aware encoder to improve text rendering capabilities. The model is a single- and double-stream diffusion transformer with 17B parameters. We've also open-sourced the weights of the the accelerated version with meanflow which reduces inference steps from 100 to just 8, and PromptEnhancer, the first industrial-grade rewriting model that enhances your prompts for more nuanced and expressive image generation. Now, creators turn complex ideas—like posters with slogans or multi-panel comics—into visuals faster than ever. We’re just getting started. Stay tuned for our native multimodal image generation model coming soon. 🌐Website: 🔗Github: 🤗Hugging Face: ✨Hugging Face Demo:

Tencent Hy

89,257 просмотров • 9 месяцев назад