正在加载视频...

视频加载失败

加载此视频时出现问题。这可能是由于临时网络问题，或视频可能不可用。

A conversation on the optimal reward for coding agents, infinite context models, and real-time RL

Cursor

389,074 subscribers

317,814 次观看 • 1 年前 •via X (Twitter)

健康养生科学技术教育

Anya Rossi• Live Now

Private livecam show

10 条评论

Han 的头像

Han1 年前

Found YouTube link

Moescape AI 的头像

Moescape AI1 年前

Sign up & effortlessly find YOUR perfect character to chat with on Moescape AI: #MoescapeTavern #aichatbot

kipply 的头像

kipply1 年前

stan SEASNELL

martin.p 的头像

martin.p1 年前

how about direct user feedback below the agent answer? simple 1-5 rating, maybe hidable in settings. I'd gladly give feedback, especially on the answer of the initial request

Luke Igel 的头像

Luke Igel1 年前

Omg (Snell 2024) and Jacob ??

Yacine Mahdid 的头像

Yacine Mahdid1 年前

we need more women in ai

morgan — 的头像

morgan —1 年前

youtube?

Pedro Ramos 的头像

Pedro Ramos1 年前

@mntruell Wrote about Revenue Sharing as RL for Agents:

Kalash 的头像

Kalash1 年前

this sounds like a wild debate

Edoardo Contente 的头像

Edoardo Contente1 年前

Looks more homely than @OpenAI

相关视频

AI coding agents hit a wall when codebases get massive. Even with 2M token context windows, a 10M line codebase needs 100M tokens. The real bottleneck isn't just ingesting code - it's getting models to actually pay attention to all that context effectively.

AI coding agents hit a wall when codebases get massive. Even with 2M token context windows, a 10M line codebase needs 100M tokens. The real bottleneck isn't just ingesting code - it's getting models to actually pay attention to all that context effectively.

Garry Tan

976,161 次观看 • 1 年前

Nia by @NozomioAI is an MCP that gives coding agents 10x more developer context. It indexes external repos and docs, so any coding agent always has the right context. Congrats on the launch, Arlan!

Nia by @NozomioAI is an MCP that gives coding agents 10x more developer context. It indexes external repos and docs, so any coding agent always has the right context. Congrats on the launch, Arlan!

Y Combinator

33,333 次观看 • 10 个月前

The Design OS for your coding agents $ npx skills add superdesigndev/superdesign-skill - Design w/ full codebase context - Infinite canvas exploration - Built in prompt library Here is how it works 🧵👇

The Design OS for your coding agents $ npx skills add superdesigndev/superdesign-skill - Design w/ full codebase context - Infinite canvas exploration - Built in prompt library Here is how it works 🧵👇

Jason Zhou

67,679 次观看 • 5 个月前

A conversation with Claude Code's creator Boris Cherny on the future of agentic coding, the evolution of coding models, and how he uses Claude Code to build Claude Code.

A conversation with Claude Code's creator Boris Cherny on the future of agentic coding, the evolution of coding models, and how he uses Claude Code to build Claude Code.

Alex Albert

81,455 次观看 • 9 个月前

inside Smallest AI office, Sudarshan Kamath on what's the missing piece for voice agents!! why Lightning TTS, ultra-low latency voice models needs HydraDB the knowledge and context layer for AI Agents!!

inside Smallest AI office, Sudarshan Kamath on what's the missing piece for voice agents!! why Lightning TTS, ultra-low latency voice models needs HydraDB the knowledge and context layer for AI Agents!!

Harnoor Singh

32,486 次观看 • 1 个月前

2.5 models are thinking models, capable of reasoning through thoughts before responding. The result is enhanced performance and improved accuracy. This means Gemini 2.5 can handle more complex problems in coding, science and math, and support more context-aware agents.

2.5 models are thinking models, capable of reasoning through thoughts before responding. The result is enhanced performance and improved accuracy. This means Gemini 2.5 can handle more complex problems in coding, science and math, and support more context-aware agents.

Google

31,023 次观看 • 1 年前

The Context Company (The Context Company) is redefining observability for AI agents. Catch silent failures like bad tool calls, infinite loops, and hallucinations in just 10 lines of code. Congrats on the launch, Rohil Agarwal & arman!

The Context Company (The Context Company) is redefining observability for AI agents. Catch silent failures like bad tool calls, infinite loops, and hallucinations in just 10 lines of code. Congrats on the launch, Rohil Agarwal & arman!

Y Combinator

49,351 次观看 • 7 个月前

The data analytics bottleneck is real. Questions stack up, context gets lost, and by the time you get an answer, you forgot why you even asked. How to turn coding agents into 24/7 data analysts👇

The data analytics bottleneck is real. Questions stack up, context gets lost, and by the time you get an answer, you forgot why you even asked. How to turn coding agents into 24/7 data analysts👇

Cognition

251,117 次观看 • 9 个月前

The Copilot CLI now supports builtin subagents for parallelizing search and coding tasks within separate context windows. Uniquely, the outer agent can invoke different models. Ask Opus, Codex, Gemini and a dozen other models for answers in parallel.

The Copilot CLI now supports builtin subagents for parallelizing search and coding tasks within separate context windows. Uniquely, the outer agent can invoke different models. Ask Opus, Codex, Gemini and a dozen other models for answers in parallel.

Evan Boyle

10,602 次观看 • 5 个月前

A deep conversation with Nikolay Savinov, the Gemini long context pre-training co-lead… We go from the basics to what is needed to scale to infinite context to long context best practices for devs:

A deep conversation with Nikolay Savinov, the Gemini long context pre-training co-lead… We go from the basics to what is needed to scale to infinite context to long context best practices for devs:

Logan Kilpatrick

252,322 次观看 • 1 年前

Hoping your coding agents could understand you and adapt to your preferences? Meet TOM-SWE, our new framework for coding agents that don’t just write code, but model the user's mind persistently (ranging from general preferences to small details) arxiv: ❓Motivation: Most coding agents today can plan, edit, run, and test code. But they still fail at a key part of real-world development, understanding the user! Underspecified, shifting, or context-dependent instructions can easily break them. You must have those moments when coding agents were running for 10 minutes and ended up producing things largely misaligned. (1/)

Hoping your coding agents could understand you and adapt to your preferences? Meet TOM-SWE, our new framework for coding agents that don’t just write code, but model the user's mind persistently (ranging from general preferences to small details) arxiv: ❓Motivation: Most coding agents today can plan, edit, run, and test code. But they still fail at a key part of real-world development, understanding the user! Underspecified, shifting, or context-dependent instructions can easily break them. You must have those moments when coding agents were running for 10 minutes and ended up producing things largely misaligned. (1/)

Xuhui Zhou

39,728 次观看 • 7 个月前

∞ Introducing World #1 Infinite Agent - infinite steps, infinite context, infinite output on cloud. See for yourself:

∞ Introducing World #1 Infinite Agent - infinite steps, infinite context, infinite output on cloud. See for yourself:

Flowith

446,943 次观看 • 1 年前

Introducing Repo2RLEnv Turn any repository into runnable, verifiable coding environments built from real PRs and commits for coding-agent evaluation or RL training > uv pip install repo2rlenv

Introducing Repo2RLEnv Turn any repository into runnable, verifiable coding environments built from real PRs and commits for coding-agent evaluation or RL training > uv pip install repo2rlenv

Adithya S K

66,475 次观看 • 25 天前

Behavioral Foundation Models (BFMs) trained with RL are secretly more powerful than we think. BFM’s directly output a policy believed to be near-optimal given any reward function. Our new work shows that they can actually do much better:

Behavioral Foundation Models (BFMs) trained with RL are secretly more powerful than we think. BFM’s directly output a policy believed to be near-optimal given any reward function. Our new work shows that they can actually do much better:

Harshit Sikchi

44,182 次观看 • 1 年前

For a long time, software was limited by how fast people could write code, and how good that code was. As models have improved, that constraint has largely disappeared. Now the bottleneck is access: what surfaces can your agents actually reach? Those interaction layers sit on top of coding agents, the kernel that turns prompts into real-world impact. With Notion's custom agents, we pushed this further. They adapt to your work style as you collaborate with them, using the same deterministic logic that powers coding agents. Simon Last sarah guo

For a long time, software was limited by how fast people could write code, and how good that code was. As models have improved, that constraint has largely disappeared. Now the bottleneck is access: what surfaces can your agents actually reach? Those interaction layers sit on top of coding agents, the kernel that turns prompts into real-world impact. With Notion's custom agents, we pushed this further. They adapt to your work style as you collaborate with them, using the same deterministic logic that powers coding agents. Simon Last sarah guo

Notion Developers

25,065 次观看 • 2 个月前

📹 Context Engineering & Coding Agents with Cursor From OpenAI DevDay

📹 Context Engineering & Coding Agents with Cursor From OpenAI DevDay

Lee Robinson

159,074 次观看 • 8 个月前

shadow is an open-source background coding agent with a real-time interface and codebase context system here's a breakdown of how it works ⬇️

shadow is an open-source background coding agent with a real-time interface and codebase context system here's a breakdown of how it works ⬇️

ishaan dey

21,582 次观看 • 10 个月前

AI coding without systems thinking is just tech debt on speedrun. Delty (@delty_ai) is an AI Staff Engineer for your team. With deep expertise, it designs software systems, evaluates tradeoffs and makes AI coding agents smarter with your engineering context. Congrats on the launch, Lalit Kundu and Catherine!

AI coding without systems thinking is just tech debt on speedrun. Delty (@delty_ai) is an AI Staff Engineer for your team. With deep expertise, it designs software systems, evaluates tradeoffs and makes AI coding agents smarter with your engineering context. Congrats on the launch, Lalit Kundu and Catherine!

Y Combinator

73,297 次观看 • 1 年前

Introducing Agentkit: A production-ready, model-agnostic framework for building AI agents with infinite onchain and web2 functionality, powered by @coinbaseDev and Based Agents on Base were just the beginning. its time to change the way we interact onchain. 🧵

Introducing Agentkit: A production-ready, model-agnostic framework for building AI agents with infinite onchain and web2 functionality, powered by @coinbaseDev and Based Agents on Base were just the beginning. its time to change the way we interact onchain. 🧵

lincoln.base.eth

409,263 次观看 • 1 年前