正在加载视频...

视频加载失败

Introducing Predicted Outputs—dramatically decrease latency for gpt-4o and gpt-4o-mini by providing a reference string. Speed up: - Updating a blog post in a doc - Iterating on prior responses - Rewriting code in an existing file, like Exponent here:

580,584 次观看 • 1 年前 •via X (Twitter)

10 条评论

OpenAI Developers 的头像
OpenAI Developers1 年前

See @FactoryAI's results:

Nick Dobos 的头像
Nick Dobos1 年前

@exponent_run Will this fix the GPT-4o repeating the same code back with no changes bug!?!? If you predict the previous code back in and specifically omit the commentary in the prediction, then I think it would have no choice but to edit the code!? Cuz it can’t edit the commentary??

HudZah ⁂ 的头像
HudZah ⁂1 年前

@exponent_run curious to see how this will work with @cursor_ai's composer mode

The Canaanite 的头像
The Canaanite1 年前

@exponent_run @cursor_ai for the love of the almightly, we need this lol.

🍓🍓🍓 的头像
🍓🍓🍓1 年前

@exponent_run incredible work 🍓

Garrett of DeepwriterAI 的头像
Garrett of DeepwriterAI1 年前

@exponent_run This will be very useful on my for some of the internal steps, each with 60k+ tokens/call x dozens of calls/generated paper or book. Significant.

AK 的头像
AK1 年前

@exponent_run fastest way to make web apps with openai api:

Itay Bachman 的头像
Itay Bachman1 年前

@exponent_run Anthropic has left the chat

Pseudonym 🦅 的头像
Pseudonym 🦅1 年前

@exponent_run We can go faster.

Chase Brower 的头像
Chase Brower1 年前

@exponent_run Am I understanding this correctly that you are charged for the whole prediction text you give? So this improves latency but will still be just as costly as having it generate the entire output text?

相关视频