The most underutilized AI API is Gemini 2.5 Pro with video input. This app is for our first-ever intern, Kevin, who makes content. So I built an app that analyzes his videos before he posts. And it doesn’t just analyze the script. It literally “watches” the video in...

66,707 views • 1 year ago • via X (Twitter)

10 comments

Daniel Mulec • 1 year ago

@vibecodeapp How on earth did I miss that Gemini 2.5 Pro allows video input :O Also in the Gemini app or only via API?

Riley Brown • 1 year ago

@vibecodeapp It works on Google AI Studio which I use instead of Gemini app. @OfficialLoganK is video input in the Gemini app?

Stephen • 1 year ago

@vibecodeapp Video input is really cool. I explored using it for interviews, finding signs of lying via body language, but it’s not so great, and it's hard to do few-shot with video. Vision in models is generally slept on, imo, by both labs and the public.

nicholas ⛱ • 1 year ago

@vibecodeapp my father wants the ai to help diagnose difference between his golf swing on the driving range vs the course. can i get beta access to vibe code this app for him? video analysis comparing swings in each location is the answer.

Riley Brown • 1 year ago

@vibecodeapp Great app idea

Kevin X • 1 year ago

@vibecodeapp ok this is 🔥but where did you get the audio from 😭

Riley Brown • 1 year ago

@vibecodeapp I have other skills bro

Joshua Johnson • 1 year ago

@vibecodeapp Building some crazy stuff with this atm. 💯

Tom Osman 🐦‍⬛ • 1 year ago

@vibecodeapp agree with you. pretty sure this is because the docs are hard to work with too. need a way simpler way to build multimodal / live realtime apps with Flash etc

Sw3paz • 1 year ago

@vibecodeapp Classy
