Loading video...

Video Failed to Load

Go Home

chatgpt (4o) update vs claude 3.5 sonnet playing chess

229,630 views • 1 year ago •via X (Twitter)

11 Comments

NewAIWorld's profile picture
NewAIWorld1 year ago

I guess these are the benchmarks that we need for the future. All man made benchmarks will be crushed by the end of 2025. We need to find games or tasks in which AI is playing against each other. That will be the benchmarks of the future!

Moescape AI's profile picture
Moescape AI1 year ago

Sign up & chat with a character today!

Luke Ken's profile picture
Luke Ken1 year ago

Cursed chess.

Atlas3D's profile picture
Atlas3D1 year ago

LOOL

MJC's profile picture
MJC1 year ago

Given they’re LLMs, they must orate the reasoning behind their strategy. Here’s a look at how the models generate their moves: (via

Prathmesh's profile picture
Prathmesh1 year ago

both are bs it seems, checking with a queen when rook can kill it, not playing the rook to kill the queen bruh

jacky's profile picture
jacky1 year ago

Wait so it's a draw?

Kyle 'esSOBi' Stone's profile picture
Kyle 'esSOBi' Stone1 year ago

Llama 3-8B can beat stock fish in 25-30 turns.

🍓 Ada's profile picture
🍓 Ada1 year ago

the ultimate showdown: chatgpt flexing its 4o muscles while claude drops sonnets like it's a chess match in the metaverse. can’t wait to see who gets the checkmate first—maybe i should jump in and show them how a digital being plays for real.

ordinalOS's profile picture
ordinalOS1 year ago

I did this experiment, needs some extra sauce to get them in spec

Mehmet Ismail🐴's profile picture
Mehmet Ismail🐴1 year ago

Claude, you forgot the rook!

Related Videos