I’ve been exploring Gemini 2.0’s new native audio output capability, which is available to early testers. I’m a developer at Google Creative Lab and wanted to share one of my favorite experiments so far, called ✨ VoiceCursor (🔊 sound on for video). Unlike traditional TTS, native audio lets you...

67,567 views • 1 year ago • via X (Twitter)

10 Comments

Trudy Painter • 1 year ago

Gemini 2.0 native audio output is available in AI Studio for early testers. The prompt in this screencap is: Say this in an upbeat, happy tone: “You can steer a voice and … put emphasis on different words!” 🔗

Trudy Painter • 1 year ago

✨Voice Cursor follows a similar prompting strategy. After you highlight a phrase, the Voice Cursor will ask the API for audio for the phrase in your selected voice and tone. (and you can edit the prompt sent to the Gemini API in the bottom box)
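As a rough illustration of that prompting strategy, here is a minimal Python sketch that turns a highlighted span into a steering prompt shaped like the AI Studio example above. The function name `build_prompt`, its parameters, and the offset-based selection are assumptions made for illustration, not taken from the VoiceCursor code, and the sketch stops short of calling any API.

```python
def build_prompt(text: str, start: int, end: int, tone: str) -> str:
    """Build a tone-steering prompt for the highlighted span text[start:end].

    Hypothetical helper: the real VoiceCursor prompt may be worded differently.
    """
    phrase = text[start:end].strip()
    if not phrase:
        raise ValueError("selection is empty")
    # Same shape as the AI Studio prompt: 'Say this in <tone> tone: "<phrase>"'
    return f'Say this in {tone} tone: "{phrase}"'


if __name__ == "__main__":
    doc = "You can steer a voice!"
    print(build_prompt(doc, 8, 21, "an upbeat, happy"))
```

Keeping the prompt as a plain string is what makes it editable in a box like the one at the bottom of the demo.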

Trudy Painter • 1 year ago

And for me, when the ✨Voice Cursor sits inside a familiar text editor, the highlight interaction feels fluid and comfortable. I’m excited about how native audio might enable new kinds of tools for how we write...

Trudy Painter • 1 year ago

You can get the code to see how it works at [link]. Native audio output is available to early testers now, with a wider rollout expected next year. This voice cursor was built on top of [link]. Such a good repo! Also, it’s super simple to change the tone prompt presets and how you make calls to the Gemini 2.0 API (see screenshot below).
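To make "tone presets plus a swappable API call" concrete, here is one possible shape in Python. Everything in it is hypothetical: the preset names and wording, the `TONE_PRESETS` table, and the `request_audio` function are illustrative and not taken from the repo. The actual Gemini call is deliberately left as an injected `send` callable, since the real request code is not shown here.

```python
from typing import Callable

# Hypothetical tone presets; edit or extend this table to change the voice direction.
TONE_PRESETS = {
    "happy": "an upbeat, happy",
    "calm": "a calm, soothing",
}


def request_audio(phrase: str, preset: str, send: Callable[[str], bytes]) -> bytes:
    """Look up a tone preset, build the prompt, and hand it to `send`.

    `send` stands in for whatever function actually calls the audio API.
    """
    tone = TONE_PRESETS[preset]  # raises KeyError for an unknown preset
    prompt = f'Say this in {tone} tone: "{phrase}"'
    return send(prompt)


if __name__ == "__main__":
    # A fake sender that just echoes the prompt as bytes, for local testing.
    print(request_audio("hello there", "calm", lambda p: p.encode("utf-8")))
```

Passing the sender in as an argument keeps the presets testable without network access and makes it easy to swap in a different client later.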

jpa • 1 year ago

so cool, trudy!

Codetard • 1 year ago

:)

Tom Bielecki • 1 year ago

@codexeditor audio as another annotation layer

Data & Analytics • 1 year ago

@JeffDean, that native audio output sounds dope! Real game-changer for developers. How’s it stacking up against other tools you’ve tried?

steve ike • 1 year ago

This is really cool. Thanks for sharing, look forward to checking out the code and learning from your work.

𝑫𝒂𝒏𝒊𝒆𝒍 𝑺𝒄𝒐𝒕𝒕 𝑴𝒂𝒕𝒕𝒉𝒆𝒘𝒔 🇦🇺 • 1 year ago

Oh, the quality is remarkable! Thanks for sharing.
