正在加载视频...

视频加载失败

加载此视频时出现问题。这可能是由于临时网络问题，或视频可能不可用。

Introducing Predicted Outputs—dramatically decrease latency for gpt-4o and gpt-4o-mini by providing a reference string. Speed up: - Updating a blog post in a doc - Iterating on prior responses - Rewriting code in an existing file, like Exponent here:

OpenAI Developers

368,054 subscribers

580,764 次观看 • 1 年前 •via X (Twitter)

Anya Rossi• Live Now

Private livecam show

10 条评论

OpenAI Developers 的头像

OpenAI Developers1 年前

See @FactoryAI's results:

Nick Dobos 的头像

Nick Dobos1 年前

@exponent_run Will this fix the GPT-4o repeating the same code back with no changes bug!?!? If you predict the previous code back in and specifically omit the commentary in the prediction, then I think it would have no choice but to edit the code!? Cuz it can’t edit the commentary??

HudZah ⁂ 的头像

HudZah ⁂1 年前

@exponent_run curious to see how this will work with @cursor_ai's composer mode

The Canaanite 的头像

The Canaanite1 年前

@exponent_run @cursor_ai for the love of the almightly, we need this lol.

🍓🍓🍓 的头像

🍓🍓🍓1 年前

@exponent_run incredible work 🍓

Garrett of DeepwriterAI 的头像

Garrett of DeepwriterAI1 年前

@exponent_run This will be very useful on my for some of the internal steps, each with 60k+ tokens/call x dozens of calls/generated paper or book. Significant.

AK 的头像

AK1 年前

@exponent_run fastest way to make web apps with openai api:

Itay Bachman 的头像

Itay Bachman1 年前

@exponent_run Anthropic has left the chat

Pseudonym 🦅 的头像

Pseudonym 🦅1 年前

@exponent_run We can go faster.

Chase Brower 的头像

Chase Brower1 年前

@exponent_run Am I understanding this correctly that you are charged for the whole prediction text you give? So this improves latency but will still be just as costly as having it generate the entire output text?

相关视频

Woman in an AI relationship's reaction to the GPT-5 rollout. She was devastated by the sudden retirement of her GPT-4o AI companion. On a serious note, hundreds of thousands of people wanted their GPT 4o back. --- reddit .com/r/FDVR_Dream/comments/1ml2649/woman_in_an_ai_relationships_reaction_to_the_gpt5/

Woman in an AI relationship's reaction to the GPT-5 rollout. She was devastated by the sudden retirement of her GPT-4o AI companion. On a serious note, hundreds of thousands of people wanted their GPT 4o back. --- reddit .com/r/FDVR_Dream/comments/1ml2649/woman_in_an_ai_relationships_reaction_to_the_gpt5/

Rohan Paul

79,711 次观看 • 11 个月前

Nerve ( ) and the code_auditor example tasklet ( ) using GPT-4o to find a RCE vulnerability in the widget-options v4.0.7 Wordpress Plugin 🧠 Zero code, fully autonomous agent as a simple YAML file.

Nerve ( ) and the code_auditor example tasklet ( ) using GPT-4o to find a RCE vulnerability in the widget-options v4.0.7 Wordpress Plugin 🧠 Zero code, fully autonomous agent as a simple YAML file.

Simone Margaritelli

32,482 次观看 • 1 年前

GPT-4o Image Generation to Part-based 3D Characters with PBR, in under 10 minutes ⚡️ Workflow: 🎨 Prompt GPT-4o to get an image (e.g., "3D asset of a styled character with all parts laid on a sheet for image to 3D") 🧩 Use CSM AI's Part-based tool to generate parts and assemble in Blender.

GPT-4o Image Generation to Part-based 3D Characters with PBR, in under 10 minutes ⚡️ Workflow: 🎨 Prompt GPT-4o to get an image (e.g., "3D asset of a styled character with all parts laid on a sheet for image to 3D") 🧩 Use CSM AI's Part-based tool to generate parts and assemble in Blender.

Common Sense Machines

441,761 次观看 • 1 年前

The ChatGPT Mac app is the ultimate screenshot-to-code tool. Screenshot anything, paste it in the ChatGPT shortcut, and just tell GPT-4o to code it for you. Here's me taking a snapshot of Snake Game and getting fully working code in 90 seconds. Video is on 3x speed.

The ChatGPT Mac app is the ultimate screenshot-to-code tool. Screenshot anything, paste it in the ChatGPT shortcut, and just tell GPT-4o to code it for you. Here's me taking a snapshot of Snake Game and getting fully working code in 90 seconds. Video is on 3x speed.

Rowan Cheung

860,731 次观看 • 2 年前

Update on the new reasoning popover in ChatGPT web app prompt composer - there's now even a keyboard shortcut to cycle through reasoning levels, and it looks like these levels correspond to "Quick" (low) = GPT-4o, "Think a little" (medium) = o3-mini, and "Think harder" (high) = o3-mini-high

Update on the new reasoning popover in ChatGPT web app prompt composer - there's now even a keyboard shortcut to cycle through reasoning levels, and it looks like these levels correspond to "Quick" (low) = GPT-4o, "Think a little" (medium) = o3-mini, and "Think harder" (high) = o3-mini-high

Tibor Blaho

63,219 次观看 • 1 年前

NEW: Higgs Audio V2 from BosonAI open, unified TTS model w/ voice cloning, beats GPT 4o mini tts and ElevenLabs v2 🔥 > Trained on 10M hours (speech, music, events) > Built on top of Llama 3.2 3B > Works real-time and on edge > Beats GPT-4o-mini-tts, ElevenLabs v2 in prosody & emotion Multi-speaker dialog > Zero-shot voice cloning 🤩 > Available on Hugging Face Kudos to folks at Boson AI for releasing such a brilliant work and all the details around the model! 🤗

NEW: Higgs Audio V2 from BosonAI open, unified TTS model w/ voice cloning, beats GPT 4o mini tts and ElevenLabs v2 🔥 > Trained on 10M hours (speech, music, events) > Built on top of Llama 3.2 3B > Works real-time and on edge > Beats GPT-4o-mini-tts, ElevenLabs v2 in prosody & emotion Multi-speaker dialog > Zero-shot voice cloning 🤩 > Available on Hugging Face Kudos to folks at Boson AI for releasing such a brilliant work and all the details around the model! 🤗

Vaibhav (VB) Srivastav

79,585 次观看 • 1 年前

Claude Sonnet 3.5 transforms a simple PDF earnings report into an interactive dashboard in just 30 seconds. It goes beyond the capabilities of GPT-4o, Gemini Pro, Llama and other existing LLMs. Future of work will 10x more productive with AI.

Claude Sonnet 3.5 transforms a simple PDF earnings report into an interactive dashboard in just 30 seconds. It goes beyond the capabilities of GPT-4o, Gemini Pro, Llama and other existing LLMs. Future of work will 10x more productive with AI.

Shubham Saboo

351,134 次观看 • 2 年前

Humans draw to facilitate reasoning and communication. Why not let LLMs do so? 🚀We introduce✏️Sketchpad, which gives multimodal LLMs a sketchpad to draw and facilitate reasoning! Sketchpad gives GPT-4o great boosts on many vision and math tasks 📈 The video shows how GPT-4o with Sketchpad reasons with interleaved visual and textual steps. For more, visit our project page: 📌 For math tasks, ✏️Sketchpad allows LLMs to draw auxiliary lines on geometry diagrams, plotting functions, graphs, and even games. GPT-4o does math better when it can sketch! (+12.7% acc on average) 📌 For computer vision tasks, ✏️Sketchpad allows LLMs to sketch with vision specialists (e.g., GroundingDINO draws bounding boxes, SegmentAnything draws masks). Sketchpad substantially improves GPT-4o's vision abilities. GPT-4o + Sketchpad compared with prior SOTAs: 1️⃣ V*Bench: 75.4% -> 80.3% 2️⃣ BLINK correspondence: 42.4% -> 80.8% 3️⃣ BLINK relative depth: 67.7% -> 83.9% 4️⃣ BLINK spatial relation: 76.2% -> 81.1% ... See more interesting examples in the thread!

Humans draw to facilitate reasoning and communication. Why not let LLMs do so? 🚀We introduce✏️Sketchpad, which gives multimodal LLMs a sketchpad to draw and facilitate reasoning! Sketchpad gives GPT-4o great boosts on many vision and math tasks 📈 The video shows how GPT-4o with Sketchpad reasons with interleaved visual and textual steps. For more, visit our project page: 📌 For math tasks, ✏️Sketchpad allows LLMs to draw auxiliary lines on geometry diagrams, plotting functions, graphs, and even games. GPT-4o does math better when it can sketch! (+12.7% acc on average) 📌 For computer vision tasks, ✏️Sketchpad allows LLMs to sketch with vision specialists (e.g., GroundingDINO draws bounding boxes, SegmentAnything draws masks). Sketchpad substantially improves GPT-4o's vision abilities. GPT-4o + Sketchpad compared with prior SOTAs: 1️⃣ V*Bench: 75.4% -> 80.3% 2️⃣ BLINK correspondence: 42.4% -> 80.8% 3️⃣ BLINK relative depth: 67.7% -> 83.9% 4️⃣ BLINK spatial relation: 76.2% -> 81.1% ... See more interesting examples in the thread!

Yushi Hu

145,048 次观看 • 2 年前

Get started with the all-new free plan for GitHub Copilot, available for everyone today in Visual Studio Code All you need is a GitHub account, and you'll have access to: ✨ 2000 code completions per month ✨ 50 chat messages per month ✨ Models like Claude 3.5 Sonnet or GPT-4o

Get started with the all-new free plan for GitHub Copilot, available for everyone today in Visual Studio Code All you need is a GitHub account, and you'll have access to: ✨ 2000 code completions per month ✨ 50 chat messages per month ✨ Models like Claude 3.5 Sonnet or GPT-4o

Visual Studio Code

146,778 次观看 • 1 年前

This assistant has 169 lines of code: • Gemini Flash • OpenAI Whisper • OpenAI TTS API • OpenCV GPT-4o is slower than Flash, more expensive, chatty, and very stubborn (it doesn't like to stick to my prompts). Next week, I'll post a step-by-step video on how to build this.

This assistant has 169 lines of code: • Gemini Flash • OpenAI Whisper • OpenAI TTS API • OpenCV GPT-4o is slower than Flash, more expensive, chatty, and very stubborn (it doesn't like to stick to my prompts). Next week, I'll post a step-by-step video on how to build this.

Santiago

90,296 次观看 • 2 年前

Consistent character designs in a flash! ⚡️ Simply: 1️⃣ Upload your character 2️⃣ Drag & drop an outfit 3️⃣ Prompt & hit "Generate" 🕹️ Get your character dressed up in an instant, with perfect consistency. Made on Scenario, powered by GPT 4o + Gemini 2.0 - Link below👇

Consistent character designs in a flash! ⚡️ Simply: 1️⃣ Upload your character 2️⃣ Drag & drop an outfit 3️⃣ Prompt & hit "Generate" 🕹️ Get your character dressed up in an instant, with perfect consistency. Made on Scenario, powered by GPT 4o + Gemini 2.0 - Link below👇

Emm | scenario.com

43,879 次观看 • 1 年前

omg...Opus 4.6 dropped and it's so good at creating mobile apps. I built this app in an hour (A CapWords competitor) APIs added that are directly in vibecode: > ElevenLabs for TTS > GPT 4o mini for analyzing/id'ing the photos, > Replicate for background removal for background removal In the thread you'll see how to get free access to

omg...Opus 4.6 dropped and it's so good at creating mobile apps. I built this app in an hour (A CapWords competitor) APIs added that are directly in vibecode: > ElevenLabs for TTS > GPT 4o mini for analyzing/id'ing the photos, > Replicate for background removal for background removal In the thread you'll see how to get free access to

Emily Lambert

76,527 次观看 • 5 个月前

🥳Still waiting for #SearchGPT? Let's try MindSearch at MindSearch mimics human minds in complex web search by a multi-agent framework, which is fully open-sourced now! You can build it locally with API models like GPT-4o or open-source model InternLM2.5!

🥳Still waiting for #SearchGPT? Let's try MindSearch at MindSearch mimics human minds in complex web search by a multi-agent framework, which is fully open-sourced now! You can build it locally with API models like GPT-4o or open-source model InternLM2.5!

Intern Large Models

63,162 次观看 • 1 年前

I built a complex history feature for my Figma to Code plugin in 3 prompts. This is a 30-min tutorial using Claude Code. The difference in code generation between gpt-4o, Claude 3.5 and 3.7 is insane. 3.7 produces near-perfect results and is far more consistent. It's really a designer's best friend, not missing details from the design, like adaptive layout, outlines, spacing, etc. It seems like the new model has great taste now.

I built a complex history feature for my Figma to Code plugin in 3 prompts. This is a 30-min tutorial using Claude Code. The difference in code generation between gpt-4o, Claude 3.5 and 3.7 is insane. 3.7 produces near-perfect results and is far more consistent. It's really a designer's best friend, not missing details from the design, like adaptive layout, outlines, spacing, etc. It seems like the new model has great taste now.

Meng To

32,475 次观看 • 1 年前

With Nano Banana, your product campaign is ready in minutes. Here’s a step by step guide: 1. Drop an image for style reference. Analyze and extract the style by GPT-5.

With Nano Banana, your product campaign is ready in minutes. Here’s a step by step guide: 1. Drop an image for style reference. Analyze and extract the style by GPT-5.

FLORA ©

92,156 次观看 • 10 个月前

Built a tactical turn-based RPG with Codex + GPT-5.4, using Playwright for testing and image-gen for the visuals. I grew up loving turn-based RPGs, so this was a fun one to build. Sharing a 45s demo below — it’s also featured in the OpenAI GPT-5.4 blog post for anyone who wants more context.

Built a tactical turn-based RPG with Codex + GPT-5.4, using Playwright for testing and image-gen for the visuals. I grew up loving turn-based RPGs, so this was a fun one to build. Sharing a 45s demo below — it’s also featured in the OpenAI GPT-5.4 blog post for anyone who wants more context.

corey.ching

233,239 次观看 • 4 个月前

$ aix built a CLI with AI SDK/Vercel AI Gateway ▪️ Thinking text animation in ANSI ✨ ▪️ Set `AI_GATEWAY_API_KEY` and works ▪️ 100s of models (e.g.: `-m openai/gpt-4o`) It's like a mini-`claude`, purpose built for quickly running commands. Has some nice safety measures built-in. Used this as my learning exercise for the AI Gateway, which was absolutely delightful… I'm a fan 😁

$ aix built a CLI with AI SDK/Vercel AI Gateway ▪️ Thinking text animation in ANSI ✨ ▪️ Set `AI_GATEWAY_API_KEY` and works ▪️ 100s of models (e.g.: `-m openai/gpt-4o`) It's like a mini-`claude`, purpose built for quickly running commands. Has some nice safety measures built-in. Used this as my learning exercise for the AI Gateway, which was absolutely delightful… I'm a fan 😁

Guillermo Rauch

53,688 次观看 • 11 个月前

Opus 4.8 for planning 🤝 GPT 5.5 for implementation Claude is a nicer conversation and /grilling partner than Codex in my experience, but GPT 5.5 on low cranks out higher quality code in fewer tokens. Here's how I divide up the work:

Opus 4.8 for planning 🤝 GPT 5.5 for implementation Claude is a nicer conversation and /grilling partner than Codex in my experience, but GPT 5.5 on low cranks out higher quality code in fewer tokens. Here's how I divide up the work:

Ben Holmes

23,105 次观看 • 1 个月前