Announcing Kontext Realtime, an open-source web app for editing images with voice commands.

46,022 views • 1 year ago • via X (Twitter)

12 Comments

Zeke Sikelianos • 1 year ago

It's powered by OpenAI's Realtime API over WebRTC for voice commands. The image generation and editing uses Flux Schnell and Flux Kontext running on Replicate. You can run it locally or deploy it to Cloudflare. Here's the repo:
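To make the pipeline in this comment concrete, here is a minimal sketch (not code from the repo) of how a transcribed voice command might be routed to Replicate: Flux Schnell when there is no image yet, Flux Kontext when one is being edited. The names `VoiceCommand`, `PredictionRequest`, and `buildPredictionRequest` are hypothetical, and the model slugs and the `input_image` field are assumptions that should be checked against each model's page on Replicate.

```typescript
// Hypothetical sketch: none of these names come from the Kontext Realtime repo.

interface VoiceCommand {
  prompt: string;            // instruction transcribed from the user's speech
  currentImageUrl?: string;  // present once an image exists to edit
}

interface PredictionRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: string;
}

const REPLICATE_API = "https://api.replicate.com/v1";
// Assumed model slugs; verify them on Replicate before use.
const GENERATE_MODEL = "black-forest-labs/flux-schnell";
const EDIT_MODEL = "black-forest-labs/flux-kontext-pro";

// Build (but do not send) the HTTP request for one voice command.
function buildPredictionRequest(cmd: VoiceCommand, apiToken: string): PredictionRequest {
  // No image yet: generate from scratch. Otherwise: edit the current image.
  const model = cmd.currentImageUrl === undefined ? GENERATE_MODEL : EDIT_MODEL;

  const input: Record<string, string> = { prompt: cmd.prompt };
  if (cmd.currentImageUrl !== undefined) {
    // Assumed input field name for the image being edited.
    input.input_image = cmd.currentImageUrl;
  }

  return {
    url: `${REPLICATE_API}/models/${model}/predictions`,
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiToken}`,
      "Content-Type": "application/json",
      Prefer: "wait", // ask Replicate to hold the connection until the result is ready
    },
    body: JSON.stringify({ input }),
  };
}
```

The returned object can be passed to `fetch(req.url, req)` from a serverless function, which keeps the API token off the client.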

Zeke Sikelianos • 1 year ago

Built last weekend at @replicate's Kontext hackathon with @bfl_ml 🖤.

Zeke Sikelianos • 1 year ago

I put it on YouTube too:

Mobile Scanner • 1 year ago

Scan any documents, convert images into text, PDF files, etc. 👍

Heather Cooper • 1 year ago

Very cool app, Zeke! Thanks for the demo and code walkthrough

Zeke Sikelianos • 1 year ago

Thanks! Let me know if you build anything cool on top of it! Also, pull requests welcome :) Planning to add more tools to it very soon.

prompter • 1 year ago

Been cooking up a voice-intuitive canvas for a few months, and will launch soon. Love this.

clem 🤗 • 1 year ago

super cool, you should add it to

Zeke Sikelianos • 1 year ago

Thanks! I'm a Spaces newbie. What would make the most sense for that... a blank Docker template? It's a Cloudflare Workers app with a static frontend and a few serverless backend functions.

Ryan Daigle • 1 year ago

you're the king of demos, my old friend 🤗

Bharat • 1 year ago

Looks great! How are the Kontext edits near-realtime?!

Zeke Sikelianos • 1 year ago

Camera tricks to keep the video snappy. Actual generation times are around 4 to 5 seconds for Kontext Pro. See examples here:
