Google DeepMind's banner
Google DeepMind's profile picture

Google DeepMind

@GoogleDeepMind1,446,014 subscribers

The engine room of @Google. Building AI safely and responsibly to solve the world’s most complex problems. Join us: https://t.co/jUHQA27iBL

Shorts

Image generation with Gemini just got a bananas upgrade and is the new state-of-the-art image generation and editing model. 🤯 From photorealistic masterpieces to mind-bending fantasy worlds, you can now natively produce, edit and refine visuals with new levels of reasoning, control and creativity. A quick dive into Gemini 2.5 Flash’s capabilities 🧵

Image generation with Gemini just got a bananas upgrade and is the new state-of-the-art image generation and editing model. 🤯 From photorealistic masterpieces to mind-bending fantasy worlds, you can now natively produce, edit and refine visuals with new levels of reasoning, control and creativity. A quick dive into Gemini 2.5 Flash’s capabilities 🧵

1,479,728 views

Google Flow 🤝 Gemini Omni Create more cinematic stories with our latest model, which brings batch editing, improved character consistency and more. Here’s what else is new for Flow by Google → #GoogleIO

Google Flow 🤝 Gemini Omni Create more cinematic stories with our latest model, which brings batch editing, improved character consistency and more. Here’s what else is new for Flow by Google → #GoogleIO

54,779 views

We just dropped Lyria 3: our latest generative music model. 🔊 It can turn photos and text into dynamic tracks - complete with vocals and lyrics. 🧵

We just dropped Lyria 3: our latest generative music model. 🔊 It can turn photos and text into dynamic tracks - complete with vocals and lyrics. 🧵

339,638 views

You can even reimagine the action in a video you took by asking Gemini Omni. Transform your world instantly - change the environment, add new objects, or create something completely unexpected.

You can even reimagine the action in a video you took by asking Gemini Omni. Transform your world instantly - change the environment, add new objects, or create something completely unexpected.

44,286 views

We built an AI model to simulate how a fruit fly walks, flies and behaves – in partnership with HHMI | Janelia. 🪰 Our computerized insect replicates realistic motion, and can even use its eyes to control its actions. Here’s how we developed it – and what it means for science. 🧵

We built an AI model to simulate how a fruit fly walks, flies and behaves – in partnership with HHMI | Janelia. 🪰 Our computerized insect replicates realistic motion, and can even use its eyes to control its actions. Here’s how we developed it – and what it means for science. 🧵

1,056,633 views

Powered by Gemini 3, the YouTube Playables Builder web app can help creators develop fun, bite-sized games with text, video or image prompts. 🕹️ Find out more →

Powered by Gemini 3, the YouTube Playables Builder web app can help creators develop fun, bite-sized games with text, video or image prompts. 🕹️ Find out more →

397,501 views

Announcing AlphaFold 3: our state-of-the-art AI model for predicting the structure and interactions of all life’s molecules. 🧬 Here’s how we built it with Isomorphic Labs and what it means for biology. 🧵

Announcing AlphaFold 3: our state-of-the-art AI model for predicting the structure and interactions of all life’s molecules. 🧬 Here’s how we built it with Isomorphic Labs and what it means for biology. 🧵

1,466,950 views

Our new state-of-the-art AI model Aeneas transforms how historians connect the past. 📜 Ancient inscriptions often lack context – it's like solving a puzzle with 90% of the pieces lost to time. It helps researchers interpret and situate inscriptions in their past context. 🧵

Our new state-of-the-art AI model Aeneas transforms how historians connect the past. 📜 Ancient inscriptions often lack context – it's like solving a puzzle with 90% of the pieces lost to time. It helps researchers interpret and situate inscriptions in their past context. 🧵

555,050 views

For decades, your mouse only tracked where you were pointing. AI helps it understand what you're pointing at. 💭 This means a photo of a scribbled note could turn into an interactive to-do list, or a paused video frame can become a restaurant booking link.

For decades, your mouse only tracked where you were pointing. AI helps it understand what you're pointing at. 💭 This means a photo of a scribbled note could turn into an interactive to-do list, or a paused video frame can become a restaurant booking link.

40,755 views

SIMA 2 🤝 Genie 3 We tested SIMA 2’s abilities in simulated 3D worlds created by our world model Genie 3. It demonstrated unprecedented adaptability by navigating its surroundings and took meaningful steps toward goals.

SIMA 2 🤝 Genie 3 We tested SIMA 2’s abilities in simulated 3D worlds created by our world model Genie 3. It demonstrated unprecedented adaptability by navigating its surroundings and took meaningful steps toward goals.

270,821 views

Veo is getting new precision editing capabilities that let you easily add or remove elements from a scene - all while preserving the integrity of your original video. 🎥

Veo is getting new precision editing capabilities that let you easily add or remove elements from a scene - all while preserving the integrity of your original video. 🎥

266,666 views

Veo 2 is more than just a video generation tool in @GoogleCloud’s #VertexAI. 🎥 Here’s a rundown of its features. 🧵

Veo 2 is more than just a video generation tool in @GoogleCloud’s #VertexAI. 🎥 Here’s a rundown of its features. 🧵

335,145 views

We're sharing progress on our video-to-audio (V2A) generative technology. 🎥 It can add sound to silent clips that match the acoustics of the scene, accompany on-screen action, and more. Here are 4 examples - turn your sound on. 🧵🔊

We're sharing progress on our video-to-audio (V2A) generative technology. 🎥 It can add sound to silent clips that match the acoustics of the scene, accompany on-screen action, and more. Here are 4 examples - turn your sound on. 🧵🔊

529,159 views

Extract – a system built by the UK government, using our Gemini foundational model – will help council planners make faster decisions. 🚀 Using multimodal reasoning, it turns complex planning documents – even handwritten notes and blurry maps – into digital data in just 40s. Find out more. ↓

Extract – a system built by the UK government, using our Gemini foundational model – will help council planners make faster decisions. 🚀 Using multimodal reasoning, it turns complex planning documents – even handwritten notes and blurry maps – into digital data in just 40s. Find out more. ↓

267,317 views

So our 45-person team developed entirely new AI capabilities, enabling them to: 🎨 Fine-tune custom Veo and Imagen models on their paintings and artwork 📹 Provide a desired look through rough animations, which the models transformed into stylized videos 🎭 Edit specific regions without regenerating entire shots from scratch

So our 45-person team developed entirely new AI capabilities, enabling them to: 🎨 Fine-tune custom Veo and Imagen models on their paintings and artwork 📹 Provide a desired look through rough animations, which the models transformed into stylized videos 🎭 Edit specific regions without regenerating entire shots from scratch

92,872 views

Introducing 2️⃣ new AI systems for robotics: 🤖 ALOHA Unleashed to perform two-armed manipulation tasks 🦾 DemoStart to control a multi-fingered robotic hand They learned to tackle a range of actions requiring dexterity. Here's how. 🧵

Introducing 2️⃣ new AI systems for robotics: 🤖 ALOHA Unleashed to perform two-armed manipulation tasks 🦾 DemoStart to control a multi-fingered robotic hand They learned to tackle a range of actions requiring dexterity. Here's how. 🧵

297,041 views

The Gemini 2.0 era is here. And we’re excited for you to start building with it. A quick rewind of what we just released ⏪ Gemini 2.0 Flash ⚡ comes with low latency and better performance. 🔵 You can now access an experimental version in G3mini on the web, while Gemini Advanced users can try Deep Research, a new AI research assistant. 🔵 Developers can begin building through the Gemini API in Google AI Studio and Vertex AI 2.0 is also enabling new research prototypes of AI agents, including: 🔵 Project Astra, which explores future capabilities of a universal AI assistant 🔵 Project Mariner, which shows what’s possible for human-agent interaction, starting with your browser 🔵 Jules, an experimental AI-powered coding agent Finally, we’re exploring how 2.0 can be used in agents across domains — from navigating the virtual world of video games to applying its spatial reasoning capabilities to robotics. 🤖

The Gemini 2.0 era is here. And we’re excited for you to start building with it. A quick rewind of what we just released ⏪ Gemini 2.0 Flash ⚡ comes with low latency and better performance. 🔵 You can now access an experimental version in G3mini on the web, while Gemini Advanced users can try Deep Research, a new AI research assistant. 🔵 Developers can begin building through the Gemini API in Google AI Studio and Vertex AI 2.0 is also enabling new research prototypes of AI agents, including: 🔵 Project Astra, which explores future capabilities of a universal AI assistant 🔵 Project Mariner, which shows what’s possible for human-agent interaction, starting with your browser 🔵 Jules, an experimental AI-powered coding agent Finally, we’re exploring how 2.0 can be used in agents across domains — from navigating the virtual world of video games to applying its spatial reasoning capabilities to robotics. 🤖

231,798 views

Since launching Veo 2, we’ve built new capabilities and addressed a few pain points to help filmmakers and creatives. 📽️✨ Here’s a quick rundown. 🧵

Since launching Veo 2, we’ve built new capabilities and addressed a few pain points to help filmmakers and creatives. 📽️✨ Here’s a quick rundown. 🧵

170,001 views

Introducing MedGemma, our most capable open model for multimodal medical text and image comprehension. 🩻 MedGemma is available now as part of Health AI Developer Foundations →

Introducing MedGemma, our most capable open model for multimodal medical text and image comprehension. 🩻 MedGemma is available now as part of Health AI Developer Foundations →

144,253 views

How could robotics soon help us in our daily lives? 🤖 Today, we’re announcing a suite of research advances that enable robots to make decisions faster as well as better understand and navigate their environments. Here's a snapshot of the work. 🧵

How could robotics soon help us in our daily lives? 🤖 Today, we’re announcing a suite of research advances that enable robots to make decisions faster as well as better understand and navigate their environments. Here's a snapshot of the work. 🧵

281,687 views

Videos