正在加载视频...

视频加载失败

Watch Gemini 2.5 Pro implement a landmark Google DeepMind research paper. 🕹️ It codes the reinforcement learning algorithm, visualizes the training live and even debugs errors. ↓

369,902 次观看 • 1 年前 •via X (Twitter)

11 条评论

elvis 的头像
elvis1 年前

Really appreciate you all included the errors and how to potentially fix them. I am working on a similar workflow and figuring out how to provide error more efficiently as feedback to agents.

Rainmaker 的头像
Rainmaker2 年前

💡 Learn how Reinforcement Learning can boost your trading performance! In this free Substack article I share full code of a trading algorithm based on Reinforcement Learning that beats other Machine Learning models as well as simply buying and holding the stock.

chipko 的头像
chipko1 年前

We'll you just saved me time! is faster to type than

AI Pro Workflow 的头像
AI Pro Workflow1 年前

The future of AI x coding is here. Gemini 2.5 Pro not only writes reinforcement learning algorithms — it runs them, visualizes training, and debugs in real-time. 🧠🕹️ Watching this feels like a glimpse into the next generation of engineering workflows.

Asset investing made simple 的头像
Asset investing made simple1 年前

You have a better product than Open ai. Its time to focus on usage

Munish 的头像
Munish1 年前

Can you guys improve the UI? Most of the time it just renders badly when it comes to changes, syntax, conversation names and also why isn't Rmd and md code blocks doesn't have syntax highlighting.

Apollo 的头像
Apollo1 年前

Nice RL feedback loop. Video did a good job showing how this works

Arpit Sharma 的头像
Arpit Sharma1 年前

The ability to debug errors in real-time is mind-blowing.

Arunachalam B 的头像
Arunachalam B1 年前

Absolutely mind blowing. I think @Google should build an IDE like Cursor.

Mzml Rafiq 的头像
Mzml Rafiq1 年前

code(44).html have you tried 44 times

Ghost of Bear Jew 的头像
Ghost of Bear Jew1 年前

that's pretty incredible but is it repeatable and scalable

相关视频