
Trudy Painter
@trudypainter • 3,307 subscribers
Currently @MakeGizmos.... previously @Google Creative Lab + @MIT
Shorts
Videos

I’ve been exploring Gemini 2.0’s new native audio output capability, which is available for early testers. I’m a developer at Google Creative Lab, and wanted to share one of my favorite experiments so far called ✨ VoiceCursor (🔊 sound on for video) Unlike traditional TTS, native audio lets you prompt the model with expressive styles, ie “Say this like a disgruntled pirate…” So I made ✨VoiceCursor… it lets you rapidly try different prompts. Just type, highlight your phrase, then hear it spoken in different ways! My code is open-sourced here: Here’s a thread 🧵
Trudy Painter67,567 просмотров • 1 год назад
Больше нет контента для загрузки