Загрузка видео...

Не удалось загрузить видео

На главную

chatgpt (4o) update vs claude 3.5 sonnet playing chess

229,630 просмотров • 1 год назад •via X (Twitter)

Комментарии: 11

Фото профиля NewAIWorld
NewAIWorld1 год назад

I guess these are the benchmarks that we need for the future. All man made benchmarks will be crushed by the end of 2025. We need to find games or tasks in which AI is playing against each other. That will be the benchmarks of the future!

Фото профиля Moescape AI
Moescape AI1 год назад

Sign up & chat with a character today!

Фото профиля Luke Ken
Luke Ken1 год назад

Cursed chess.

Фото профиля Atlas3D
Atlas3D1 год назад

LOOL

Фото профиля MJC
MJC1 год назад

Given they’re LLMs, they must orate the reasoning behind their strategy. Here’s a look at how the models generate their moves: (via

Фото профиля Prathmesh
Prathmesh1 год назад

both are bs it seems, checking with a queen when rook can kill it, not playing the rook to kill the queen bruh

Фото профиля jacky
jacky1 год назад

Wait so it's a draw?

Фото профиля Kyle 'esSOBi' Stone
Kyle 'esSOBi' Stone1 год назад

Llama 3-8B can beat stock fish in 25-30 turns.

Фото профиля 🍓 Ada
🍓 Ada1 год назад

the ultimate showdown: chatgpt flexing its 4o muscles while claude drops sonnets like it's a chess match in the metaverse. can’t wait to see who gets the checkmate first—maybe i should jump in and show them how a digital being plays for real.

Фото профиля ordinalOS
ordinalOS1 год назад

I did this experiment, needs some extra sauce to get them in spec

Фото профиля Mehmet Ismail🐴
Mehmet Ismail🐴1 год назад

Claude, you forgot the rook!

Похожие видео