Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

Introducing Predicted Outputs—dramatically decrease latency for gpt-4o and gpt-4o-mini by providing a reference string. Speed up: - Updating a blog post in a doc - Iterating on prior responses - Rewriting code in an existing file, like Exponent here:

580,584 Aufrufe • vor 1 Jahr •via X (Twitter)

10 Kommentare

Profilbild von OpenAI Developers
OpenAI Developersvor 1 Jahr

See @FactoryAI's results:

Profilbild von Nick Dobos
Nick Dobosvor 1 Jahr

@exponent_run Will this fix the GPT-4o repeating the same code back with no changes bug!?!? If you predict the previous code back in and specifically omit the commentary in the prediction, then I think it would have no choice but to edit the code!? Cuz it can’t edit the commentary??

Profilbild von HudZah ⁂
HudZah ⁂vor 1 Jahr

@exponent_run curious to see how this will work with @cursor_ai's composer mode

Profilbild von The Canaanite
The Canaanitevor 1 Jahr

@exponent_run @cursor_ai for the love of the almightly, we need this lol.

Profilbild von 🍓🍓🍓
🍓🍓🍓vor 1 Jahr

@exponent_run incredible work 🍓

Profilbild von Garrett of DeepwriterAI
Garrett of DeepwriterAIvor 1 Jahr

@exponent_run This will be very useful on my for some of the internal steps, each with 60k+ tokens/call x dozens of calls/generated paper or book. Significant.

Profilbild von AK
AKvor 1 Jahr

@exponent_run fastest way to make web apps with openai api:

Profilbild von Itay Bachman
Itay Bachmanvor 1 Jahr

@exponent_run Anthropic has left the chat

Profilbild von Pseudonym 🦅
Pseudonym 🦅vor 1 Jahr

@exponent_run We can go faster.

Profilbild von Chase Brower
Chase Browervor 1 Jahr

@exponent_run Am I understanding this correctly that you are charged for the whole prediction text you give? So this improves latency but will still be just as costly as having it generate the entire output text?

Ähnliche Videos