Google DeepMind

@GoogleDeepMind • 1,458,567 subscribers

The engine room of @Google. Building AI safely and responsibly to solve the world’s most complex problems. Join us: https://t.co/jUHQA27iBL

Shorts

1,482,072 views

1,056,893 views

340,209 views

1,467,405 views

397,710 views

555,191 views

271,462 views

266,797 views

57,823 views

529,287 views

55,368 views

335,202 views

267,767 views

45,845 views

93,938 views

297,118 views

40,755 views

231,798 views

170,034 views

144,406 views

Videos

sweetdream.ai

Watch Anya Live

Anya is streaming live right now! Join her private show and enjoy exclusive content.

Private Show

Join now for exclusive access

Free preview available • Premium content

5,501,158 views • 1 month ago

14,572,867 views • 4 months ago

13,467,402 views • 5 months ago

201,539 views • 8 days ago

1,631,465 views • 2 months ago

1,670,621 views • 2 months ago

3,193,829 views • 5 months ago

2,217,491 views • 5 months ago

3,734,106 views • 11 months ago

153,019 views • 18 days ago

1,920,761 views • 8 months ago

839,487 views • 3 months ago

1,693,329 views • 8 months ago

1,462,399 views • 8 months ago

553,394 views • 3 months ago

308,764 views • 1 month ago

469,510 views • 3 months ago

111,914 views • 23 days ago

110,615 views • 24 days ago

199,303 views • 1 month ago

Live Cam

Google DeepMind

Shorts

We built an AI model to simulate how a fruit fly walks, flies and behaves – in partnership with HHMI | Janelia. 🪰 Our computerized insect replicates realistic motion, and can even use its eyes to control its actions. Here’s how we developed it – and what it means for science. 🧵

We just dropped Lyria 3: our latest generative music model. 🔊 It can turn photos and text into dynamic tracks - complete with vocals and lyrics. 🧵

Announcing AlphaFold 3: our state-of-the-art AI model for predicting the structure and interactions of all life’s molecules. 🧬 Here’s how we built it with Isomorphic Labs and what it means for biology. 🧵

Powered by Gemini 3, the YouTube Playables Builder web app can help creators develop fun, bite-sized games with text, video or image prompts. 🕹️ Find out more →

Our new state-of-the-art AI model Aeneas transforms how historians connect the past. 📜 Ancient inscriptions often lack context – it's like solving a puzzle with 90% of the pieces lost to time. It helps researchers interpret and situate inscriptions in their past context. 🧵

SIMA 2 🤝 Genie 3 We tested SIMA 2’s abilities in simulated 3D worlds created by our world model Genie 3. It demonstrated unprecedented adaptability by navigating its surroundings and took meaningful steps toward goals.

Veo is getting new precision editing capabilities that let you easily add or remove elements from a scene - all while preserving the integrity of your original video. 🎥

Google Flow 🤝 Gemini Omni Create more cinematic stories with our latest model, which brings batch editing, improved character consistency and more. Here’s what else is new for Google Flow → #GoogleIO

We're sharing progress on our video-to-audio (V2A) generative technology. 🎥 It can add sound to silent clips that match the acoustics of the scene, accompany on-screen action, and more. Here are 4 examples - turn your sound on. 🧵🔊

Omni brings together an improved understanding of physics with Gemini's knowledge of history, biology, and culture, bridging the gap from photorealism to meaningful storytelling. Actions have consequences, environments respond to events, and narratives evolve logically.

Veo 2 is more than just a video generation tool in @GoogleCloud’s #VertexAI. 🎥 Here’s a rundown of its features. 🧵

You can even reimagine the action in a video you took by asking Gemini Omni. Transform your world instantly - change the environment, add new objects, or create something completely unexpected.

Introducing 2️⃣ new AI systems for robotics: 🤖 ALOHA Unleashed to perform two-armed manipulation tasks 🦾 DemoStart to control a multi-fingered robotic hand They learned to tackle a range of actions requiring dexterity. Here's how. 🧵

For decades, your mouse only tracked where you were pointing. AI helps it understand what you're pointing at. 💭 This means a photo of a scribbled note could turn into an interactive to-do list, or a paused video frame can become a restaurant booking link.

Since launching Veo 2, we’ve built new capabilities and addressed a few pain points to help filmmakers and creatives. 📽️✨ Here’s a quick rundown. 🧵

Introducing MedGemma, our most capable open model for multimodal medical text and image comprehension. 🩻 MedGemma is available now as part of Health AI Developer Foundations →

Videos

Watch Anya Live

SynthID, our imperceptible watermark for AI-generated content, is expanding to more partners. We’re also adding new ways to find out if content was generated using AI - just ask in the Google Gemini or in Google Search.

We used Gemini 3.1 Pro to build a realistic city planner app. 🏙️ Watch how the model tackles complex terrain, maps out infrastructure, and simulates traffic to generate a high-quality visualization.

Step inside Project Genie: our experimental research prototype that lets you create, edit, and explore virtual worlds. 🌎

We’re dropping Gemini Omni: our first step towards a model that can create anything from anything - starting with video. It combines Gemini’s intelligence with our generative media systems - representing a leap forward in world understanding, multimodality, and editing 🧵

We’re reimagining a 50-year-old interface - the mouse pointer - with AI. 🖱️ These experimental demos show how people can intuitively direct Gemini on their screens using motion, speech, and natural shorthand to get things done 🧵

We’ve upgraded our specialized reasoning mode Gemini 3 Deep Think to help solve modern science, research, and engineering challenges – pushing the frontier of intelligence. 🧠 Watch how the Wang Lab at Duke University is using it to design new semiconductor materials. 🧵

What if you could not only watch a generated video, but explore it too? 🌐 Genie 3 is our groundbreaking world model that creates interactive, playable environments from a single text prompt. From photorealistic landscapes to fantasy realms, the possibilities are endless. 🧵

Nano Banana 2 Lite delivers text-to-image outputs in just 4 seconds. It’s designed for quicker ideation and workflows where speed and cost are the primary roadblocks.

SIMA 2 is our most capable AI agent for virtual 3D worlds. 👾🌐 Powered by Gemini, it goes beyond following basic instructions to think, understand, and take actions in interactive environments – meaning you can talk to it through text, voice, or even images. Here’s how 🧵

Watch how fast Gemini 3.1 Flash-Lite can generate websites. ⚡ This browser creates each page in real-time as you click, search, and navigate. Give it a try →

This is Gemini 3: our most intelligent model that helps you learn, build and plan anything. It comes with state-of-the-art reasoning capabilities, world-leading multimodal understanding, and enables new agentic coding experiences. 🧵

We just dropped Nano Banana Pro, built on Gemini 3. 🍌 With state-of-the-art text rendering, vast world knowledge and studio-quality creative controls, Gemini 3 Pro Image can create and edit more complex visuals, infographics and more. Here’s what’s under the hood. 🧵

We’re rolling out an upgrade designed to help robots reason about the physical world. 🤖 Gemini Robotics-ER 1.6 has significantly better visual and spatial understanding in order to plan and complete more useful tasks. Here’s why this is important 🧵

Project Genie 🤝 @GoogleMaps Street View You can now take real U.S. places and transform them into new, interactive worlds. 🌍

Gemini 3.1 Flash TTS is our most controllable text-to-speech model yet. With new Audio Tags, you can easily direct vocal style, delivery, and pace through text commands. 🧵

Gemini 3.5 Flash now supports native computer use. This built-in tool lets developers build custom agents that can see and take action across browser, mobile, and desktop interfaces. Find out more →

We believe AI can be a dedicated research partner to help discover the next breakthrough. Enter Co-Scientist: our latest Gemini-based multi-agent system that can generate, debate and evolve novel hypotheses for complex scientific problems 🧵