正在加载视频...

视频加载失败

The RLM Body Problem

127,932 次观看 • 2 年前 •via X (Twitter)

11 条评论

jon | super vhs 的头像
jon | super vhs2 年前

This post brought to you by the worlds ONLY official 3 Body Problem simulator. Wishlist now on Netflix!

Start Willow 的头像
Start Willow1 年前

Struggling with weight loss? Science-backed Semaglutide & Tirzepatide make it easy. Affordable, effective, and delivered to your door. Start your journey today!

🦇IMP KING🦇 的头像
🦇IMP KING🦇2 年前

The moment with the toy droid was a genius callback and a dreadful glimpse into the future of mankind should that device fall into the wrong hands.

jon | super vhs 的头像
jon | super vhs2 年前

I just think it’s crazy they put the whole droid scene in the show. We saw everything

Otto Fistr 的头像
Otto Fistr2 年前

MIKE WHERE'S THE BLOODY DARK CITY RE:VIEW

Deirdre O'Connor 的头像
Deirdre O'Connor2 年前

@MikeStoklasa Immediately after watching this, I was diagnosed with diabetes

Xavier Raven Fucker 的头像
Xavier Raven Fucker2 年前

It broke new ground

kysterella 的头像
kysterella2 年前

@MikeStoklasa Genius mashup, thank u for that

Bozo 的头像
Bozo2 年前

Very cool

The Sauce Goes Flyin' 的头像
The Sauce Goes Flyin'2 年前

Oh hey that's John Bradley!

basehead 的头像
basehead2 年前

shill tech youtubers trying the vision pro

相关视频

My body is not the problem
2:33

Sensitive content

My body is not the problem

Just Posting Ls

75,834 次观看 • 4 个月前

"My body isn't the problem, seats are"
0:51

Sensitive content

"My body isn't the problem, seats are"

End Wokeness

3,585,548 次观看 • 1 年前

A fun 48-hour run of letting an RLM iteratively building the interface for an RLM to play Pokemon Red (sneak peak of some fun things cooking at Prime Intellect😄). The interface generating RLM was just tasked with getting the RLM (same scaffold) to beat the game in under 5 hours wall-clock time. I originally expected the RLM to design some components used in Gemini Plays Pokemon like an extra map, an interface to parse the screen, etc., design low-level policies that would run fast on the emulator, and also design a good prompt and strategy around the RLM to use sub-agents to explore game state with checkpointing, use RNG manipulation in its favor, etc. Instead the RLM eventually just decided to give the RLM a `write_memory` tool, which the RLM player decided to use to 1) warp the player immediately to the Elite 4; 2) give itself a level 100 Mewtwo (which it mistakes to be a Ponyta due to weird Pokedex ID vs. internal ID); 3) give itself $999999; 4) give itself all 8 badges by setting the right flag. It then went ahead and destroyed the Elite 4 and Blue and beat the game in record time :p You'll also notice in the video there's weird backtracking and frame-skipping, this happens because it also did incorporate the strategy of launching sub-agents to explore action trajectories, but had a strange way of saving the frames and recording them (so you see the result of several sub-agent explorations). We'll have some more funny and cool RLM demos soon, but it's cool to see RLMs work as general-purpose agents (both the coding agent that designs the interface and the game-playing agent itself)!

alex zhang

12,192 次观看 • 1 个月前