Video wird geladen...

Video konnte nicht geladen werden

Beim Laden dieses Videos ist ein Problem aufgetreten. Dies könnte an einem vorübergehenden Netzwerkproblem liegen oder das Video ist möglicherweise nicht verfügbar.

New text and image to video generation AI model Open-Sora-Plan-v1.3.0

AK

505,877 subscribers

51,838 Aufrufe • vor 1 Jahr •via X (Twitter)

Anya Rossi• Live Now

Private livecam show

6 Kommentare

Profilbild von AK

AKvor 1 Jahr

model:

Profilbild von Cesar Silva

Cesar Silvavor 1 Jahr

How can i use it?

Profilbild von Aditya Singh

Aditya Singhvor 1 Jahr

How many H100s do we need?

Profilbild von Agbomekhe Iwonii

Agbomekhe Iwoniivor 1 Jahr

Waoh

Profilbild von Romain Abdel-Aal

Romain Abdel-Aalvor 1 Jahr

awesome :)

Profilbild von Jonah

Jonahvor 1 Jahr

@TomLikesRobots So Sora is only on hugging face?

Ähnliche Videos

New Kling, Runway, Luma competitor? Open text and image video generation model Pyramidal Flow Matching for Efficient Video Generative Modeling

New Kling, Runway, Luma competitor? Open text and image video generation model Pyramidal Flow Matching for Efficient Video Generative Modeling

AK

90,965 Aufrufe • vor 1 Jahr

The first truly open-source audio-video model. LTX-2 is a DiT-based foundation model with all core video generation capabilities in one unified model. Designed to run locally on consumer GPUs. - text-to-video - image-to-video - and video-to-video modes 100% open-source.

The first truly open-source audio-video model. LTX-2 is a DiT-based foundation model with all core video generation capabilities in one unified model. Designed to run locally on consumer GPUs. - text-to-video - image-to-video - and video-to-video modes 100% open-source.

Akshay 🚀

66,042 Aufrufe • vor 5 Monaten

This is wild. Luma AI just dropped Dream Machine that generates AI video from text and image. Unlike Sora, it's open to public today. The quality is insane. 1. Kaku Drop 架空飴

This is wild. Luma AI just dropped Dream Machine that generates AI video from text and image. Unlike Sora, it's open to public today. The quality is insane. 1. Kaku Drop 架空飴

Min Choi

757,416 Aufrufe • vor 2 Jahren

Bytedance drops an open-source Gemini Omni!!! Bernini is a new AI video generation + editing framework. > Edit videos with text prompts > Image/video references > Code available

Bytedance drops an open-source Gemini Omni!!! Bernini is a new AI video generation + editing framework. > Edit videos with text prompts > Image/video references > Code available

⚡AI Search⚡

43,567 Aufrufe • vor 1 Monat

Exciting News from Open-Sora! 🚀 They've just made the ENTIRE suite of their video-generation model open source! Dive into the world of cutting-edge AI with access to model weights, comprehensive training source code, and detailed architecture insights. Start building your dream video-generation model today! Check it out 👉

Exciting News from Open-Sora! 🚀 They've just made the ENTIRE suite of their video-generation model open source! Dive into the world of cutting-edge AI with access to model weights, comprehensive training source code, and detailed architecture insights. Start building your dream video-generation model today! Check it out 👉

Yang You

245,742 Aufrufe • vor 2 Jahren

TLDR: Meet ✨Lumiere✨ our new text-to-video model from Google AI! Lumiere is designed to create entire clips in just one go! Seamlessly opening up possibilities for many applications: Image-to-video 🖼️ Stylized generation 🖌️ Video editing 🪩 and beyond. See 🧵👇

TLDR: Meet ✨Lumiere✨ our new text-to-video model from Google AI! Lumiere is designed to create entire clips in just one go! Seamlessly opening up possibilities for many applications: Image-to-video 🖼️ Stylized generation 🖌️ Video editing 🪩 and beyond. See 🧵👇

Hila Chefer

252,598 Aufrufe • vor 2 Jahren

The winner of Lovable's weekend competition: Kolbo ai - A powerful tool to help make all sorts of social media content with AI Features of the winning app: - Supabase for backend - Project-based organization system - OpenAI for text & image generation - Anthropic for text generation - Google Gemini for text generation - Midjourney for image generation - for image generation - Text-to-speech - Speech-to-text - Stripe for payments - mu for music generation Built by Zohar Vanunu 👇

The winner of Lovable's weekend competition: Kolbo ai - A powerful tool to help make all sorts of social media content with AI Features of the winning app: - Supabase for backend - Project-based organization system - OpenAI for text & image generation - Anthropic for text generation - Google Gemini for text generation - Midjourney for image generation - for image generation - Text-to-speech - Speech-to-text - Stripe for payments - mu for music generation Built by Zohar Vanunu 👇

Lovable

35,841 Aufrufe • vor 1 Jahr

Compute is the backbone of the AI-driven future. OptimAI Compute Engine would enable scalable, high-performance workloads across image and video models, supporting: • Text-to-image generation • Image editing and inpainting • 2K / 4K / 8K super-resolution • Brand-aligned visual synthesis • OCR and image intelligence • Video generation and motion synthesis • Frame interpolation and enhancement • Multimodal model inference Distributed compute built for faster inference, shorter training cycles, and production-grade AI execution.

OptimAI Network

18,784 Aufrufe • vor 4 Monaten

Grok Imagine Video is now dominating the Artificial Analysis Arena #1 in both Text→Video and Image→Video Outranking Veo 3.1, Sora 2 and Kling 2.6 Pro in every way - Best overall quality - Fastest generation times - Lowest cost-to-performance ratio - Native audio-video support xAI is leading the next wave of AI video generation

Grok Imagine Video is now dominating the Artificial Analysis Arena #1 in both Text→Video and Image→Video Outranking Veo 3.1, Sora 2 and Kling 2.6 Pro in every way - Best overall quality - Fastest generation times - Lowest cost-to-performance ratio - Native audio-video support xAI is leading the next wave of AI video generation

X Freeze

16,972 Aufrufe • vor 5 Monaten

Seedream v4.5 is HERE! ByteDance’s new-generation image creation model that combines image generation + image editing in one unified architecture. Cleaner text, sharper details, smarter edits, all in a single model. Pair it with ImagineArt’s 71% OFF Cyber Monday deal + give access to 40 creators on your plan. Follow+ RT + like + reply with “ImagineArt Seedream” to get 250 credits in DMs.

Seedream v4.5 is HERE! ByteDance’s new-generation image creation model that combines image generation + image editing in one unified architecture. Cleaner text, sharper details, smarter edits, all in a single model. Pair it with ImagineArt’s 71% OFF Cyber Monday deal + give access to 40 creators on your plan. Follow+ RT + like + reply with “ImagineArt Seedream” to get 250 credits in DMs.

ImagineArt

534,583 Aufrufe • vor 7 Monaten

We've officially released and open-sourced HunyuanImage 2.1, our latest text-to-image model. The new model delivers on our commitment to balancing performance and quality. With native 2K image generation, HunyuanImage 2.1 is an advanced open-source text-to-image model.🎨 ✨ New in 2.1: 🔹Advanced Semantics: Supports ultra-long and complex prompts of up to 1000 tokens, and precisely controls the generation of multiple subjects in a single image. 🔹Precise Chinese and English Text Rendering with seamless image–text integration: The model naturally integrates text into images, making it suitable for a wide range of applications such as product covers, illustrations, and poster design to meet the needs of various fields. 🔹Rich Styles and High Aesthetic: Capable of generating images in various styles—including photorealistic portraits, comics, and vinyl figures—it delivers outstanding visual appeal and artistic quality. 🔹High-Quality Generation: Efficiently produces ultra-high-definition (2K) images in the same time other models take to generate a 1K image. HunyuanImage 2.1 uses two text encoders: a multimodal large language model (MLLM) to improve the model's image and text alignment capabilities, and a multi-language character-aware encoder to improve text rendering capabilities. The model is a single- and double-stream diffusion transformer with 17B parameters. We've also open-sourced the weights of the the accelerated version with meanflow which reduces inference steps from 100 to just 8, and PromptEnhancer, the first industrial-grade rewriting model that enhances your prompts for more nuanced and expressive image generation. Now, creators turn complex ideas—like posters with slogans or multi-panel comics—into visuals faster than ever. We’re just getting started. Stay tuned for our native multimodal image generation model coming soon. 🌐Website: 🔗Github: 🤗Hugging Face: ✨Hugging Face Demo:

We've officially released and open-sourced HunyuanImage 2.1, our latest text-to-image model. The new model delivers on our commitment to balancing performance and quality. With native 2K image generation, HunyuanImage 2.1 is an advanced open-source text-to-image model.🎨 ✨ New in 2.1: 🔹Advanced Semantics: Supports ultra-long and complex prompts of up to 1000 tokens, and precisely controls the generation of multiple subjects in a single image. 🔹Precise Chinese and English Text Rendering with seamless image–text integration: The model naturally integrates text into images, making it suitable for a wide range of applications such as product covers, illustrations, and poster design to meet the needs of various fields. 🔹Rich Styles and High Aesthetic: Capable of generating images in various styles—including photorealistic portraits, comics, and vinyl figures—it delivers outstanding visual appeal and artistic quality. 🔹High-Quality Generation: Efficiently produces ultra-high-definition (2K) images in the same time other models take to generate a 1K image. HunyuanImage 2.1 uses two text encoders: a multimodal large language model (MLLM) to improve the model's image and text alignment capabilities, and a multi-language character-aware encoder to improve text rendering capabilities. The model is a single- and double-stream diffusion transformer with 17B parameters. We've also open-sourced the weights of the the accelerated version with meanflow which reduces inference steps from 100 to just 8, and PromptEnhancer, the first industrial-grade rewriting model that enhances your prompts for more nuanced and expressive image generation. Now, creators turn complex ideas—like posters with slogans or multi-panel comics—into visuals faster than ever. We’re just getting started. Stay tuned for our native multimodal image generation model coming soon. 🌐Website: 🔗Github: 🤗Hugging Face: ✨Hugging Face Demo:

Tencent Hy

89,257 Aufrufe • vor 9 Monaten

This video was generated using OpenAI’s powerful new Sora text-to-video model. Prompt: a man beats up a kid for being rude and disrespectful

This video was generated using OpenAI’s powerful new Sora text-to-video model. Prompt: a man beats up a kid for being rude and disrespectful

george glooney

875,361 Aufrufe • vor 2 Jahren

Struggling with slow inference of diffusion and flow models? Check out the video below—I’ve been using our new FastGen library to achieve 7-28x acceleration for text-2-image and {text,image,video}-2-video generation without sacrificing visual fidelity!

Struggling with slow inference of diffusion and flow models? Check out the video below—I’ve been using our new FastGen library to achieve 7-28x acceleration for text-2-image and {text,image,video}-2-video generation without sacrificing visual fidelity!

Julius Berner

13,643 Aufrufe • vor 4 Monaten

🚀 Q2 Image Model is Live! ✨Text to image, Reference to image, and Image editing supported ✨Ultra-fast generation (as fast as 5s), 4K quality, super consistency ✨One-stop workflow: turn reference images into subjects and reuse them for video creation ✨Unlimited image generation for members until Dec 31 New users: use code VIDUQ2RTI for bonus credits 🎁 #ViduQ2RTI #ViduQ2 #Viduai #vidu

🚀 Q2 Image Model is Live! ✨Text to image, Reference to image, and Image editing supported ✨Ultra-fast generation (as fast as 5s), 4K quality, super consistency ✨One-stop workflow: turn reference images into subjects and reuse them for video creation ✨Unlimited image generation for members until Dec 31 New users: use code VIDUQ2RTI for bonus credits 🎁 #ViduQ2RTI #ViduQ2 #Viduai #vidu

Vidu AI

24,747 Aufrufe • vor 7 Monaten

Big news from OpenAI! They just dropped Sora, their new video generation model. And it looks wild! Here’s what you need to know:

Big news from OpenAI! They just dropped Sora, their new video generation model. And it looks wild! Here’s what you need to know:

Julian Goldie SEO

17,969 Aufrufe • vor 1 Jahr

~cowboyz~ made with sora text to video experimental gen ai

~cowboyz~ made with sora text to video experimental gen ai

Dave Clark

12,041 Aufrufe • vor 1 Jahr

🎇 Video Model Comparison: Image2Video Video Model Comparison: Image2Video Same input image + text prompt on each model: • Pika 2.0 • PixVerse 3.5 • Runway Gen-3 • Kling AI 1.6 • Luma Dream Machine • Hailuo MiniMax I used the same Midjourney image and text prompt in each model and chose my favorite results.

🎇 Video Model Comparison: Image2Video Video Model Comparison: Image2Video Same input image + text prompt on each model: • Pika 2.0 • PixVerse 3.5 • Runway Gen-3 • Kling AI 1.6 • Luma Dream Machine • Hailuo MiniMax I used the same Midjourney image and text prompt in each model and chose my favorite results.

Heather Cooper

36,110 Aufrufe • vor 1 Jahr