Google AI's banner
Google AI's profile picture

Google AI

@GoogleAI2,416,729 subscribers

Making AI helpful for everyone. Show thinking ↓

Shorts

By now, you've probably heard about Gemini Omni, our new model designed to create anything from any input, starting with video. But... what's the big deal? Let’s break it down 🧵👇

By now, you've probably heard about Gemini Omni, our new model designed to create anything from any input, starting with video. But... what's the big deal? Let’s break it down 🧵👇

221,793 просмотров

Today, we launched a brand-new intelligent Search box. Here's what that means: An upgrade to the Search experience with our most advanced Gemini 3.5 models, bringing with them our latest agentic capabilities You can ask across modalities (text, images, files, and videos) and Search can reason across them all We're combining AI Overviews and AI Mode into one, seamless AI Search experience. So you can ask follow-up questions, build context, and received even more tailored and personalized responses This new AI Search experience is live today across desktop and mobile, worldwide.

Today, we launched a brand-new intelligent Search box. Here's what that means: An upgrade to the Search experience with our most advanced Gemini 3.5 models, bringing with them our latest agentic capabilities You can ask across modalities (text, images, files, and videos) and Search can reason across them all We're combining AI Overviews and AI Mode into one, seamless AI Search experience. So you can ask follow-up questions, build context, and received even more tailored and personalized responses This new AI Search experience is live today across desktop and mobile, worldwide.

43,111 просмотров

Beyond generating high-fidelity visuals, we wanted to test the limits of what Nano Banana Pro can do. We worked with design partners Porto Rocha to build out a hypothetical brand called YOYOYO to see how the model would handle the task. Here’s what we found: 🎨Brand consistency: Across logos, colors, and typography, the model maintained a strict, cohesive brand identity (even for wildly diverse concepts) 🛍️Environmental realism: We asked to see the products in storefront and studio mockups. It nailed accurate lighting, shadows, and physical proportions - even when upscaled for massive retail displays 🪀Spatial accuracy: We tested spatial volumes for physical packaging. The generated proportions were so precise that we were able to 3D-print the functional yo-yo How have you been pushing the limits of Nano Banana Pro? Let us know in the replies below!

Beyond generating high-fidelity visuals, we wanted to test the limits of what Nano Banana Pro can do. We worked with design partners Porto Rocha to build out a hypothetical brand called YOYOYO to see how the model would handle the task. Here’s what we found: 🎨Brand consistency: Across logos, colors, and typography, the model maintained a strict, cohesive brand identity (even for wildly diverse concepts) 🛍️Environmental realism: We asked to see the products in storefront and studio mockups. It nailed accurate lighting, shadows, and physical proportions - even when upscaled for massive retail displays 🪀Spatial accuracy: We tested spatial volumes for physical packaging. The generated proportions were so precise that we were able to 3D-print the functional yo-yo How have you been pushing the limits of Nano Banana Pro? Let us know in the replies below!

121,848 просмотров

Announcing Personal Intelligence, a more personalized Google Gemini designed just for you. How it works: — Customized: With your permission, it reasons across your Gmail, YouTube, Google Photos, and Search apps to share hyper-relevant and context-aware responses — Secure: If enabled, you control which Google apps to connect to. This setting is off by default — Useful: From travel plans based on your Google Photos to gym recommendations based on goals you’ve shared with Gemini, you get help tailored to your world Personal Intelligence in beta is rolling out to Google AI Pro and AI Ultra subscribers in the U.S., with expansions to the free tier, more countries, and AI Mode in Search to come. Take a look at the Gemini app's personalized assistance in the clip below, then let us know what you would use it for!

Announcing Personal Intelligence, a more personalized Google Gemini designed just for you. How it works: — Customized: With your permission, it reasons across your Gmail, YouTube, Google Photos, and Search apps to share hyper-relevant and context-aware responses — Secure: If enabled, you control which Google apps to connect to. This setting is off by default — Useful: From travel plans based on your Google Photos to gym recommendations based on goals you’ve shared with Gemini, you get help tailored to your world Personal Intelligence in beta is rolling out to Google AI Pro and AI Ultra subscribers in the U.S., with expansions to the free tier, more countries, and AI Mode in Search to come. Take a look at the Gemini app's personalized assistance in the clip below, then let us know what you would use it for!

320,011 просмотров

Meet Gemma 3n, a model that runs on as little as 2GB of RAM 🤯 It shares the same architecture as Gemini Nano, and is engineered for incredible performance. We added audio understanding, so now it’s multimodal, fast and lean, and runs on-device (no cloud connection required!)

Meet Gemma 3n, a model that runs on as little as 2GB of RAM 🤯 It shares the same architecture as Gemini Nano, and is engineered for incredible performance. We added audio understanding, so now it’s multimodal, fast and lean, and runs on-device (no cloud connection required!)

228,502 просмотров

Graph clustering merges similar items into groups to better understand relationships in data. Today, read about our recent works, including key techniques that enabled us to scale a high-quality algorithm that can cluster trillion-edge graphs. Read more →

Graph clustering merges similar items into groups to better understand relationships in data. Today, read about our recent works, including key techniques that enabled us to scale a high-quality algorithm that can cluster trillion-edge graphs. Read more →

265,386 просмотров

Introducing SEEDS, our newest generative AI technology that advances medium-range weather forecasting. We can now generate ensemble forecasts more efficiently, helping us better predict rare and extreme weather events. 🌩️ #WeatherForecasting Learn more at

Introducing SEEDS, our newest generative AI technology that advances medium-range weather forecasting. We can now generate ensemble forecasts more efficiently, helping us better predict rare and extreme weather events. 🌩️ #WeatherForecasting Learn more at

205,705 просмотров

Quantum computers offer many promising applications dependent on greatly improved performance. Read how we’ve combined quantum error correction w/ our latest superconducting processor, Willow, exponentially reducing error rates w/ increasing qubit scale →

Quantum computers offer many promising applications dependent on greatly improved performance. Read how we’ve combined quantum error correction w/ our latest superconducting processor, Willow, exponentially reducing error rates w/ increasing qubit scale →

146,751 просмотров

Today we introduced AlphaGenome, a new tool that can more comprehensively predict the impact of single variants or mutations in DNA 🧬 How, you ask? 🤔 tldr; Our AlphaGenome model takes a long DNA sequence as input, processes that data, and predicts thousands of molecular properties by characterizing its regulatory activity. For the full read ➡️

Today we introduced AlphaGenome, a new tool that can more comprehensively predict the impact of single variants or mutations in DNA 🧬 How, you ask? 🤔 tldr; Our AlphaGenome model takes a long DNA sequence as input, processes that data, and predicts thousands of molecular properties by characterizing its regulatory activity. For the full read ➡️

83,048 просмотров

What’s MedGemma? 🤔 It’s our collection of open, multimodal medical models that are designed to help developers build AI tools for healthcare, such as analyzing radiology images or summarizing notes for physicians. We built this demo using MedGemma to help showcase the possibilities of the model. What other use cases can you foresee for this technology?

What’s MedGemma? 🤔 It’s our collection of open, multimodal medical models that are designed to help developers build AI tools for healthcare, such as analyzing radiology images or summarizing notes for physicians. We built this demo using MedGemma to help showcase the possibilities of the model. What other use cases can you foresee for this technology?

65,083 просмотров

That dog in your photo? He's got something to say. 🐶 Turn your images into eight-second video clips with sound effects and speech in the Google Gemini and Flow from Google Labs. This feature uses Veo 3 to generate motion that reflects real world physics and includes a new experimental audio capability so you can really bring your images to life. Try it at and

That dog in your photo? He's got something to say. 🐶 Turn your images into eight-second video clips with sound effects and speech in the Google Gemini and Flow from Google Labs. This feature uses Veo 3 to generate motion that reflects real world physics and includes a new experimental audio capability so you can really bring your images to life. Try it at and

55,236 просмотров

We trained our Large Sensor Model (LSM) on over 40 million hours of de-identified multimodal sensor data from 165K users to demonstrate how it could improve performance in wearable tasks like exercise and activity recognition. Here’s what we found →

We trained our Large Sensor Model (LSM) on over 40 million hours of de-identified multimodal sensor data from 165K users to demonstrate how it could improve performance in wearable tasks like exercise and activity recognition. Here’s what we found →

63,580 просмотров

Chrome 🧵4/5 We’ve also built a deeper integration between Gemini in Chrome and your favorite Google apps, like Calendar, YouTube and Maps, so you can schedule meetings, see location details and more without leaving the page you’re on.

Chrome 🧵4/5 We’ve also built a deeper integration between Gemini in Chrome and your favorite Google apps, like Calendar, YouTube and Maps, so you can schedule meetings, see location details and more without leaving the page you’re on.

38,231 просмотров

Here are some really tactical ways to optimize the First and last frame capability: — Make sure your prompt includes precise camera motion descriptions for smooth and creative transitions between the first and last frames. — Or, you can simply include the word “transform” in your prompt for a smooth transition — Starting with a closeup in your first frame and then zooming out to a wide-shot in the final frame is an effective way to create a dramatic reveal.

Here are some really tactical ways to optimize the First and last frame capability: — Make sure your prompt includes precise camera motion descriptions for smooth and creative transitions between the first and last frames. — Or, you can simply include the word “transform” in your prompt for a smooth transition — Starting with a closeup in your first frame and then zooming out to a wide-shot in the final frame is an effective way to create a dramatic reveal.

28,595 просмотров

It’s no secret the human brain is a complex structure. Even so, #AI has emerged as a powerful tool to map out its complicated pathways. Discover the advancements our Connectomics team & Harvard University University researchers are making to understand the brain →

It’s no secret the human brain is a complex structure. Even so, #AI has emerged as a powerful tool to map out its complicated pathways. Discover the advancements our Connectomics team & Harvard University University researchers are making to understand the brain →

62,661 просмотров

Alright, now that we know *what* an agent is, how does it actually work? When you ask for help on a task, the agent plans a series of steps and executes them directly in the application on your behalf, using the tools it has access to. Say you are booking a local service or trying to organize your inbox (which typically takes multiple steps): the AI model first plans how to achieve the task using its existing knowledge and then interacts with your inbox to execute the task. The agent will continue until it is confident the task has been successfully completed.

Alright, now that we know *what* an agent is, how does it actually work? When you ask for help on a task, the agent plans a series of steps and executes them directly in the application on your behalf, using the tools it has access to. Say you are booking a local service or trying to organize your inbox (which typically takes multiple steps): the AI model first plans how to achieve the task using its existing knowledge and then interacts with your inbox to execute the task. The agent will continue until it is confident the task has been successfully completed.

22,487 просмотров

We’re proud to highlight Google Research’s contributions to improving Clear Calling, the background noise reduction feature on Pixel, which can now handle full-band audio & is powered by an audio-to-audio ML model that was optimized to run at low latency on Google Tensor.

We’re proud to highlight Google Research’s contributions to improving Clear Calling, the background noise reduction feature on Pixel, which can now handle full-band audio & is powered by an audio-to-audio ML model that was optimized to run at low latency on Google Tensor.

69,420 просмотров

We translated the endurance of competitive hot dog eating into a game by prompting Gemini to “Create a HTML, CSS, Javascript hot dog eating contest game. Game mechanics is user needs to click super fast to eat each hotdog. Add a glass of water to help digest and allowing for faster eating when a user eats too many hotdogs. Timer is 1 minute.“ The Google Gemini App built the game mechanics in a single prompt, so the rest of our vibe coding focused on refining UI design with Gemini. Play here:

We translated the endurance of competitive hot dog eating into a game by prompting Gemini to “Create a HTML, CSS, Javascript hot dog eating contest game. Game mechanics is user needs to click super fast to eat each hotdog. Add a glass of water to help digest and allowing for faster eating when a user eats too many hotdogs. Timer is 1 minute.“ The Google Gemini App built the game mechanics in a single prompt, so the rest of our vibe coding focused on refining UI design with Gemini. Play here:

31,465 просмотров

Population dynamics can provide insights into domains ranging from health to environmental science. Here we introduce a geospatial foundation model (plus embeddings and code recipes) that could be employed for a variety of downstream tasks. →

Population dynamics can provide insights into domains ranging from health to environmental science. Here we introduce a geospatial foundation model (plus embeddings and code recipes) that could be employed for a variety of downstream tasks. →

41,712 просмотров

With updated coding and reasoning capabilities, it’s especially strong at building interactive web apps. What used to take 50+ prompts a year ago with 1.5 Flash (left) now takes <10 with far better quality and much more complex UI (right).

With updated coding and reasoning capabilities, it’s especially strong at building interactive web apps. What used to take 50+ prompts a year ago with 1.5 Flash (left) now takes <10 with far better quality and much more complex UI (right).

31,800 просмотров

Videos