Loading video...

Video Failed to Load

Go Home

Introducing WhisperKit

98,394 views • 2 years ago •via X (Twitter)

10 Comments

batuhan the fal guy's profile picture
batuhan the fal guy2 years ago

looks 🔥

murat 🍥's profile picture
murat 🍥2 years ago

how is this different from whisper.cpp's metal/coreml implementation?

Miguel Piedrafita ✨'s profile picture
Miguel Piedrafita ✨2 years ago

oooo this is really cool!

Alexandre Berriche's profile picture
Alexandre Berriche2 years ago

Super cool !

BLENDER SUSHI 🫶 X - 24/7 Blenderian's profile picture
BLENDER SUSHI 🫶 X - 24/7 Blenderian2 years ago

I tried it so far I like the realtime streaming and translation. I test it while YouTube running on TV and translation works 50-60% some times it's buffering. Can it tend later audio from say running audio app like YouTube or Twitter video?

Tomas Maixner | Neuralbyte's profile picture
Tomas Maixner | Neuralbyte2 years ago

I have problems without loading

Anton Panasenko's profile picture
Anton Panasenko2 years ago

Any plan for diarization?

argmax's profile picture
argmax2 years ago

Yes

Mark Lord's profile picture
Mark Lord1 year ago

Hi! 👋 Quick qu; is my understanding right that WhisperKit processes live audio in chunks and stitches the transcript together? Or does it keep a run-on KV cache of encoded audio which grows as more audio is streamed in + encoded?

Tony Raha's profile picture
Tony Raha2 years ago

@osanseviero This is so cool! Can you please add an action in iOS shortcuts app so we can integrate it with other iOS native apps?

Related Videos

Introducing.
0:25

Sensitive content

Introducing.

Useless Utility

70,007 views • 2 years ago