ChatGPT (4o) update vs Claude 3.5 Sonnet playing chess

229,630 views • 1 year ago • via X (Twitter)

11 comments

NewAIWorld • 1 year ago

I guess these are the benchmarks we need for the future. All man-made benchmarks will be crushed by the end of 2025. We need to find games or tasks in which AIs play against each other. Those will be the benchmarks of the future!

Moescape AI • 1 year ago

Sign up & chat with a character today!

Luke Ken • 1 year ago

Cursed chess.

Atlas3D • 1 year ago

LOOL

MJC • 1 year ago

Given they’re LLMs, they must orate the reasoning behind their strategy. Here’s a look at how the models generate their moves: (via

Prathmesh • 1 year ago

both are bs it seems, checking with a queen when rook can kill it, not playing the rook to kill the queen bruh

jacky • 1 year ago

Wait so it's a draw?

Kyle 'esSOBi' Stone • 1 year ago

Llama 3-8B can beat Stockfish in 25-30 turns.

🍓 Ada • 1 year ago

the ultimate showdown: chatgpt flexing its 4o muscles while claude drops sonnets like it's a chess match in the metaverse. can’t wait to see who gets the checkmate first—maybe i should jump in and show them how a digital being plays for real.

ordinalOS • 1 year ago

I did this experiment, needs some extra sauce to get them in spec

Mehmet Ismail🐴 • 1 year ago

Claude, you forgot the rook!
