Announcing Kontext Realtime, an open-source web app for editing images with voice commands.

46,022 views • 1 year ago • via X (Twitter)

12 comments

Zeke Sikelianos • 1 year ago

It's powered by OpenAI's Realtime API over WebRTC for voice commands. The image generation and editing uses Flux Schnell and Flux Kontext running on Replicate. You can run it locally or deploy it to Cloudflare. Here's the repo:

Zeke Sikelianos • 1 year ago

Built last weekend at @replicate's Kontext hackathon with @bfl_ml 🖤.

Zeke Sikelianos • 1 year ago

I put it on YouTube too:

Mobile Scanner • 1 year ago

Scan any documents, convert images into text, PDF files, etc. 👍

Heather Cooper • 1 year ago

Very cool app, Zeke! Thanks for the demo and code walkthrough

Zeke Sikelianos • 1 year ago

Thanks! Let me know if you build anything cool on top of it! Also, pull requests welcome :) Planning to add more tools to it very soon.

prompter • 1 year ago

Been cooking up a voice intuitive canvas for a few months. And will launch soon. Love this

clem 🤗 • 1 year ago

super cool, you should add it to

Zeke Sikelianos • 1 year ago

Thanks! I'm a Spaces newbie. What would make the most sense for that... a blank Docker template? It's a Cloudflare Workers app with a static frontend and a few serverless backend functions.

Ryan Daigle • 1 year ago

you're the king of demos, my old friend🤗

Bharat • 1 year ago

Looks great! How are the kontext edits near realtime?!

Zeke Sikelianos • 1 year ago

Camera tricks to keep the video snappy. Actual generation times are around 4 to 5 seconds for Kontext Pro. See examples here:
