正在加载视频...

视频加载失败

Introducing WhisperKit

98,394 次观看 • 2 年前 •via X (Twitter)

10 条评论

batuhan the fal guy 的头像
batuhan the fal guy2 年前

looks 🔥

murat 🍥 的头像
murat 🍥2 年前

how is this different from whisper.cpp's metal/coreml implementation?

Miguel Piedrafita ✨ 的头像
Miguel Piedrafita ✨2 年前

oooo this is really cool!

Alexandre Berriche 的头像
Alexandre Berriche2 年前

Super cool !

BLENDER SUSHI 🫶 X - 24/7 Blenderian 的头像
BLENDER SUSHI 🫶 X - 24/7 Blenderian2 年前

I tried it so far I like the realtime streaming and translation. I test it while YouTube running on TV and translation works 50-60% some times it's buffering. Can it tend later audio from say running audio app like YouTube or Twitter video?

Tomas Maixner | Neuralbyte 的头像
Tomas Maixner | Neuralbyte2 年前

I have problems without loading

Anton Panasenko 的头像
Anton Panasenko2 年前

Any plan for diarization?

argmax 的头像
argmax2 年前

Yes

Mark Lord 的头像
Mark Lord1 年前

Hi! 👋 Quick qu; is my understanding right that WhisperKit processes live audio in chunks and stitches the transcript together? Or does it keep a run-on KV cache of encoded audio which grows as more audio is streamed in + encoded?

Tony Raha 的头像
Tony Raha2 年前

@osanseviero This is so cool! Can you please add an action in iOS shortcuts app so we can integrate it with other iOS native apps?

相关视频

Introducing.
0:25

Sensitive content

Introducing.

Useless Utility

70,007 次观看 • 2 年前