A conversation on the optimal reward for coding agents, infinite context models, and real-time RL
10 comments

Han (1 year ago)
Found YouTube link

Moescape AI (1 year ago)
Sign up & effortlessly find YOUR perfect character to chat with on Moescape AI: #MoescapeTavern #aichatbot

kipply (1 year ago)
stan SEASNELL

martin.p (1 year ago)
how about direct user feedback below the agent answer? simple 1-5 rating, maybe hidable in settings. I'd gladly give feedback, especially on the answer of the initial request

Luke Igel (1 year ago)
Omg (Snell 2024) and Jacob ??

Yacine Mahdid (1 year ago)
we need more women in ai

morgan — (1 year ago)
youtube?

Pedro Ramos (1 year ago)
@mntruell Wrote about Revenue Sharing as RL for Agents:

Kalash (1 year ago)
this sounds like a wild debate

Edoardo Contente (1 year ago)
Looks more homely than @OpenAI



