Loading video...

Video Failed to Load

There was a problem loading this video. This could be due to a temporary network issue or the video might be unavailable.

MIPROv2, our new state-of-the-art optimizer for LM programs, is live in DSPy Stanford NLP Group! It's even faster, cheaper, and more accurate than MIPRO. MIPROv2 proposes instructions, bootstraps demonstrations, and optimizes combinations. Let’s dive into a visual 🧵of how it works!

Michael Ryan

2,396 subscribers

156,809 views • 2 years ago •via X (Twitter)

Science & Technology Education

Anya Rossi• Live Now

Private livecam show

10 Comments

Michael Ryan2 years ago

First, MIPROv2 tries to understand your task. It reads your DSPy code, analyzes your dataset, and runs your program a few times to produce example traces. These will inform the proposal LM to write better and more grounded instructions!

Michael Ryan2 years ago

Next, MIPROv2 generates demonstrations of your program. It runs your program several times and keeps traces whose output is scored highly. These traces form the basis for optimizing few-shot demonstrations. Besides what your metric requires, no labels necessary!

Michael Ryan2 years ago

Given the instructions & demonstrations it proposed, the MIPROs build a Bayesian surrogate model to sample combinations and assign a belief over their utility. To make MIPROv2 faster & cheaper than v1, the updates happen on small *mini-batches* of your data.

Michael Ryan2 years ago

MIPROv2 is already live in DSPy 2.4.10+! Check out this notebook, where we show how to optimize a program for ScoNe, an NLI benchmark with just over 1000 LM calls — an order of magnitude fewer than MIPROv1!

Michael Ryan2 years ago

If you want more details, read our paper release thread or join the community at Joint work with an amazing team: @kristahopsalong @JoshPurtell @DavidKarlBroman @matei_zaharia @ChrisGPotts @lateinteraction

Omar Khattab2 years ago

Amazing release, @michaelryan207. These are some of the best visuals I saw in a very long time!

Mike Taylor2 years ago

@stanfordnlp Can't wait to try this

Nirant2 years ago

@stanfordnlp What tooling did you use to make these visuals? The edit and story is quite high quality

Michael Ryan2 years ago

@stanfordnlp Thanks Nirant! The visuals were all produced in Keynote

Karthik Kalyanaraman2 years ago

@stanfordnlp Awesome work as always! Looking forward to trying this out.

Related Videos

New short course: DSPy: Build and Optimize Agentic Apps DSPy is a powerful open-source framework for automatically tuning prompts for GenAI applications. In this course, you'll learn to use DSPy, together with MLflow. This is built in partnership with Databricks and taught by Chen Qian, co-lead of the DSPy framework. Many AI builders spend hours hand-tuning prompts. When given a set of evals, DSPy automates this process. It’s especially useful for optimizing prompts, including few-shot prompts, in complex agentic AI workflows. Further, if you switch an application to a newer LLM, performance can degrade if your prompts were optimized to the previous model. DSPy automatically optimizes the entire system for the new LLM as well, using just a few evaluation examples. This course teaches DSPy works, and best practices for using it. You’ll write programs using DSPy’s signature-based programming model, debug them with MLflow tracing -- to gain visibility into how different parts of a pipeline, as well as how the overall system, are performing -- and automatically improve their accuracy with DSPy Optimizer. Please sign up here:

New short course: DSPy: Build and Optimize Agentic Apps DSPy is a powerful open-source framework for automatically tuning prompts for GenAI applications. In this course, you'll learn to use DSPy, together with MLflow. This is built in partnership with Databricks and taught by Chen Qian, co-lead of the DSPy framework. Many AI builders spend hours hand-tuning prompts. When given a set of evals, DSPy automates this process. It’s especially useful for optimizing prompts, including few-shot prompts, in complex agentic AI workflows. Further, if you switch an application to a newer LLM, performance can degrade if your prompts were optimized to the previous model. DSPy automatically optimizes the entire system for the new LLM as well, using just a few evaluation examples. This course teaches DSPy works, and best practices for using it. You’ll write programs using DSPy’s signature-based programming model, debug them with MLflow tracing -- to gain visibility into how different parts of a pipeline, as well as how the overall system, are performing -- and automatically improve their accuracy with DSPy Optimizer. Please sign up here:

Andrew Ng

181,457 views • 1 year ago

🧵 Unleashing the Power of $LRDS in BLOCKLORDS In the dynamic world of gaming, $LRDS is more than a token—it's the heartbeat of the BLOCKLORDS ecosystem. Let’s dive into how $LRDS is transforming gaming and why it’s set to become your ultimate BLOCKLORDS asset. 👇🌟

🧵 Unleashing the Power of $LRDS in BLOCKLORDS In the dynamic world of gaming, $LRDS is more than a token—it's the heartbeat of the BLOCKLORDS ecosystem. Let’s dive into how $LRDS is transforming gaming and why it’s set to become your ultimate BLOCKLORDS asset. 👇🌟

BLOCKLORDS

227,785 views • 1 year ago

4. Stanford CS 224N An introduction to natural language processing (NLP) and how it works.

4. Stanford CS 224N An introduction to natural language processing (NLP) and how it works.

Rowan Cheung

67,710 views • 3 years ago

NEW: Europe’s biggest military IPO is combusting. Our months-long dive into the Czechoslovak Group — and the question of just how much ammunition it really produces.

NEW: Europe’s biggest military IPO is combusting. Our months-long dive into the Czechoslovak Group — and the question of just how much ammunition it really produces.

Hunterbrook

646,077 views • 1 month ago

#b3d tip: Since Blender 4.0 snapping has gotten a lot faster and more accurate, thanks to the new Base Point feature. In this video we look at how it works, and how it can help us.

#b3d tip: Since Blender 4.0 snapping has gotten a lot faster and more accurate, thanks to the new Base Point feature. In this video we look at how it works, and how it can help us.

Jan van den Hemel

254,427 views • 2 years ago

Taking a deep dive into our brand new state of the art performance center here at CrossCountry Mortgage Campus

Cleveland Browns

59,311 views • 1 year ago

Claude Tag is the next evolution of agents. It's a proactive, multiplayer agent with memory and identity, built on top of Claude Code. Learn more about how Claude Tag works and best practices for using it in this deep dive.

Claude Tag is the next evolution of agents. It's a proactive, multiplayer agent with memory and identity, built on top of Claude Code. Learn more about how Claude Tag works and best practices for using it in this deep dive.

ClaudeDevs

276,570 views • 17 hours ago

I get LOTS of questions about deploying DSPy programs, so isaac 🧩 and I built dspy-cli: a tool that serves DSPy programs as HTTP APIs with Docker config, OpenAPI specs, MCP support, and more. Here's a quick intro:

I get LOTS of questions about deploying DSPy programs, so isaac 🧩 and I built dspy-cli: a tool that serves DSPy programs as HTTP APIs with Docker config, OpenAPI specs, MCP support, and more. Here's a quick intro:

Drew Breunig

18,676 views • 7 months ago

Catch hundreds of fish, discover rare mutations, and explore islands full of secrets in Catch It! 🎣 Built in Fortnite by Epic MegaGrant recipient CATCH IT! (FISHING), this progression-driven fishing adventure features persistent inventory, gear upgrades, live events, and more. Let’s dive into how it was built 🧵:

Catch hundreds of fish, discover rare mutations, and explore islands full of secrets in Catch It! 🎣 Built in Fortnite by Epic MegaGrant recipient CATCH IT! (FISHING), this progression-driven fishing adventure features persistent inventory, gear upgrades, live events, and more. Let’s dive into how it was built 🧵:

Fortnite Developers

26,316 views • 3 months ago

Our new site is live! It's not only a visual refresh, but a new approach to how we explain session backends and how they fit into your application.

Our new site is live! It's not only a visual refresh, but a new approach to how we explain session backends and how they fit into your application.

Jamsocket

11,980 views • 2 years ago

🚨 @Opera just feels like a brand new browser with the R3 update. The revamp is real! → Faster AI and tighter integration → Fully customizable Tabs → Proper multi-split for multitasking → A ton of UX details that add up Let’s dive in 🧵↓

Charly Wargnier

18,370 views • 4 months ago

The Angstrom v1 Whitepaper is live! Karthik Srinivasan, Ludwig, and ciamac moallemi provide a technical deep dive into the many intertwined mechanisms that collectively form Angstrom. Let’s dive in 🧵

The Angstrom v1 Whitepaper is live! Karthik Srinivasan, Ludwig, and ciamac moallemi provide a technical deep dive into the many intertwined mechanisms that collectively form Angstrom. Let’s dive in 🧵

Angstrom

31,902 views • 10 months ago

LET’S DITCH THOSE FLOPPY DISKS ❌ As a part of our air traffic control modernization, we are getting RID of old floppy disks and implementing STATE OF THE ART technology to help keep our skies moving SAFER and FASTER than ever before ✈️

LET’S DITCH THOSE FLOPPY DISKS ❌ As a part of our air traffic control modernization, we are getting RID of old floppy disks and implementing STATE OF THE ART technology to help keep our skies moving SAFER and FASTER than ever before ✈️

Secretary Sean Duffy

40,088 views • 1 month ago

Not one, but two huge announcements today: Our User Mainnet is officially live AND the Lisk Airdrop is finally here! 🎉 Want to learn more? Let’s dive in 🧵

Not one, but two huge announcements today: Our User Mainnet is officially live AND the Lisk Airdrop is finally here! 🎉 Want to learn more? Let’s dive in 🧵

Lisk

461,703 views • 1 year ago

WHAM defines the new state of the art in 3D human pose estimation from video. By a large margin. It’s fast, accurate, and it computes human pose in world coordinates. It’s also the first video-based method to be more accurate than single-image methods. 1/8

WHAM defines the new state of the art in 3D human pose estimation from video. By a large margin. It’s fast, accurate, and it computes human pose in world coordinates. It’s also the first video-based method to be more accurate than single-image methods. 1/8

Michael Black

118,425 views • 2 years ago

The Future of Bitcoin is Here! BitcoinOS transforms Bitcoin into a superchain, making it faster, cheaper, and more efficient. We are unlocking the true potential of Bitcoin and driving mainstream adoption. Join us in revolutionizing Bitcoin and shaping the future of finance.

The Future of Bitcoin is Here! BitcoinOS transforms Bitcoin into a superchain, making it faster, cheaper, and more efficient. We are unlocking the true potential of Bitcoin and driving mainstream adoption. Join us in revolutionizing Bitcoin and shaping the future of finance.

BOS

49,452 views • 1 year ago

Introducing our new frontier video model, Runway Gen-4.5. Previously known as Whisper Thunder (aka) David. Gen-4.5 is state-of-the-art and sets a new standard for video generation motion quality, prompt adherence and visual fidelity. Learn more below.

Introducing our new frontier video model, Runway Gen-4.5. Previously known as Whisper Thunder (aka) David. Gen-4.5 is state-of-the-art and sets a new standard for video generation motion quality, prompt adherence and visual fidelity. Learn more below.

Runway

851,864 views • 6 months ago

Twenty years ago, NOAA satellites helped monitor and track one of the deadliest hurricanes in U.S. history—#HurricaneKatrina—as it devastated New Orleans and the Mississippi coast. Today, NOAA's latest generation of satellites provide faster and more accurate forecasts and warnings than ever before. Learn more in our latest #EarthFromOrbit video:

Twenty years ago, NOAA satellites helped monitor and track one of the deadliest hurricanes in U.S. history—#HurricaneKatrina—as it devastated New Orleans and the Mississippi coast. Today, NOAA's latest generation of satellites provide faster and more accurate forecasts and warnings than ever before. Learn more in our latest #EarthFromOrbit video:

NOAA Satellites

12,689 views • 10 months ago

"THE RISE OF ORDINALS AND NFTS ON THE MEDIUM OF BITCOIN" is now pubished in Bitcoin Magazine! The Ordinals protocol enables many things on Bitcoin. My article explores how it unlocks substantial value and catalyzes a wave of innovation. I dive into the new properties of NFTs on Bitcoin and the future of this exciting new ecosystem for art and innovation 🔥

"THE RISE OF ORDINALS AND NFTS ON THE MEDIUM OF BITCOIN" is now pubished in Bitcoin Magazine! The Ordinals protocol enables many things on Bitcoin. My article explores how it unlocks substantial value and catalyzes a wave of innovation. I dive into the new properties of NFTs on Bitcoin and the future of this exciting new ecosystem for art and innovation 🔥

danny huuep

215,397 views • 2 years ago