Philipp Schmid's banner
Philipp Schmid's profile picture

Philipp Schmid

@_philschmid80,578 subscribers

Agents & Gemini API, MTS @GoogleDeepMind | prev: Tech Lead at @huggingface, AWS ML Hero 🤗 Sharing my own views and AI News 🧑🏻‍💻 https://t.co/7IosdlO6RA

Shorts

MANUS AI: HYPE VS. REALITY 🔍 Yichao 'Peak' Ji (co-founder of ) confirmed rumors: ✅ Built on Anthropic Claude Sonnet, not their own foundation model ✅Has access to 29 tools and uses Browser Use open-source for browser control ✅User communicates with executor agent and not planner or other agents. ✅Each user gets isolated sandbox environment ✅Outperforms OpenAI Deep Research on GAIA benchmark Building AI products doesn't require training your own foundation models. We're probably just scratching the surface of what existing models can do with the right tooling and integration!

MANUS AI: HYPE VS. REALITY 🔍 Yichao 'Peak' Ji (co-founder of ) confirmed rumors: ✅ Built on Anthropic Claude Sonnet, not their own foundation model ✅Has access to 29 tools and uses Browser Use open-source for browser control ✅User communicates with executor agent and not planner or other agents. ✅Each user gets isolated sandbox environment ✅Outperforms OpenAI Deep Research on GAIA benchmark Building AI products doesn't require training your own foundation models. We're probably just scratching the surface of what existing models can do with the right tooling and integration!

202,842 Aufrufe

Holy Shit Gemini 2.5 Pro Exp 0-shot @levelsio flight simulator: “In pure three.js, without downloading any assets or textures, create a flight simulator game where i can fly an airplane. Make sure it runs in the browser.”

Holy Shit Gemini 2.5 Pro Exp 0-shot @levelsio flight simulator: “In pure three.js, without downloading any assets or textures, create a flight simulator game where i can fly an airplane. Make sure it runs in the browser.”

165,632 Aufrufe

You haven’t tried Google AI Studio yet?👀 We made it simpler! When you come to AIS for the first time, you will have a Default Gemini Project & API Key waiting for you! This should reduce time to first prompt, and help you start building faster! Give it a try!

You haven’t tried Google AI Studio yet?👀 We made it simpler! When you come to AIS for the first time, you will have a Default Gemini Project & API Key waiting for you! This should reduce time to first prompt, and help you start building faster! Give it a try!

72,605 Aufrufe

Gemini 3.1 Flash-Lite can generate and imagine websites on the fly while you browse. Each click leads to a newly generated site. See how it envisions "facebook in 2004" 🌐🔦 Link below to test. ⬇️

Gemini 3.1 Flash-Lite can generate and imagine websites on the fly while you browse. Each click leads to a newly generated site. See how it envisions "facebook in 2004" 🌐🔦 Link below to test. ⬇️

18,872 Aufrufe

Gemini Diffusion ~1000 tokens per second!⚡Text Diffusion doing bouncing balls. ⚽️

Gemini Diffusion ~1000 tokens per second!⚡Text Diffusion doing bouncing balls. ⚽️

56,674 Aufrufe

No plans in slowing down! 🤝

No plans in slowing down! 🤝

10,864 Aufrufe

Character Consistency with Google Veo 3 now in Gemini API! 🤯 Use Images as starting frame to keep character consistency! Here is an python script on how to make consistent viral videos, like you see on TikTok or Youtube shorts: 1. Based on an idea, it generates a series of scene prompts using Gemini 2.5. 2. Generates a Image based on the first scene using Imagen 3 3. For each scene prompt Veo 3 (fast) generates a video clip. 4. Uses Gemini 2.0 image editing to make sure the starting images fits the scenes 5. Combine the individual video clips into a single final video using MoviePy Veo 3 starts at $0.75 / second and Veo 3 at $0.40 / second with audio. ! 📹 🔉 Prompt: “A realistic energy drink commercial for athletes.”

Character Consistency with Google Veo 3 now in Gemini API! 🤯 Use Images as starting frame to keep character consistency! Here is an python script on how to make consistent viral videos, like you see on TikTok or Youtube shorts: 1. Based on an idea, it generates a series of scene prompts using Gemini 2.5. 2. Generates a Image based on the first scene using Imagen 3 3. For each scene prompt Veo 3 (fast) generates a video clip. 4. Uses Gemini 2.0 image editing to make sure the starting images fits the scenes 5. Combine the individual video clips into a single final video using MoviePy Veo 3 starts at $0.75 / second and Veo 3 at $0.40 / second with audio. ! 📹 🔉 Prompt: “A realistic energy drink commercial for athletes.”

15,863 Aufrufe

Google Gemini 2.5 Pro Exp: “Write a p5.js script that simulates 25 particles in a vacuum space of a cylindrical container, bouncing within its boundaries. Use different colors for each ball and ensure they leave a trail showing their movement. Add a slow rotation of the container to give better view of what's going on in the scene. Make sure to create proper collision detection and physic rules to ensure particles remain in the container. Add an external spherical container. Add a slow zoom in and zoom out effect to the whole scene.” AK

Google Gemini 2.5 Pro Exp: “Write a p5.js script that simulates 25 particles in a vacuum space of a cylindrical container, bouncing within its boundaries. Use different colors for each ball and ensure they leave a trail showing their movement. Add a slow rotation of the container to give better view of what's going on in the scene. Make sure to create proper collision detection and physic rules to ensure particles remain in the container. Add an external spherical container. Add a slow zoom in and zoom out effect to the whole scene.” AK

13,619 Aufrufe

Videos