Loading video...

Video Failed to Load

There was a problem loading this video. This could be due to a temporary network issue or the video might be unavailable.

chatgpt (4o) update vs claude 3.5 sonnet playing chess

AK

486,410 subscribers

229,630 views • 1 year ago •via X (Twitter)

Science & Technology

Anya Rossi• Live Now

Private livecam show

11 Comments

NewAIWorld1 year ago

I guess these are the benchmarks that we need for the future. All man made benchmarks will be crushed by the end of 2025. We need to find games or tasks in which AI is playing against each other. That will be the benchmarks of the future!

Moescape AI1 year ago

Sign up & chat with a character today!

Luke Ken1 year ago

Cursed chess.

Atlas3D1 year ago

LOOL

MJC1 year ago

Given they’re LLMs, they must orate the reasoning behind their strategy. Here’s a look at how the models generate their moves: (via

Prathmesh1 year ago

both are bs it seems, checking with a queen when rook can kill it, not playing the rook to kill the queen bruh

jacky1 year ago

Wait so it's a draw?

Kyle 'esSOBi' Stone1 year ago

Llama 3-8B can beat stock fish in 25-30 turns.

🍓 Ada1 year ago

the ultimate showdown: chatgpt flexing its 4o muscles while claude drops sonnets like it's a chess match in the metaverse. can’t wait to see who gets the checkmate first—maybe i should jump in and show them how a digital being plays for real.

ordinalOS1 year ago

I did this experiment, needs some extra sauce to get them in spec

Mehmet Ismail🐴1 year ago

Claude, you forgot the rook!

Related Videos

First-ever ChatGPT-like playground for AI Agents that works with Claude Sonnet 3.5, GPT-4o, local Llama 3.2 and other LLMs (100% opensource and free).

First-ever ChatGPT-like playground for AI Agents that works with Claude Sonnet 3.5, GPT-4o, local Llama 3.2 and other LLMs (100% opensource and free).

Shubham Saboo

108,665 views • 1 year ago

vs avante.nvim (claude-3.5-sonnet) 🥰 Project URL:

vs avante.nvim (claude-3.5-sonnet) 🥰 Project URL:

yetone

481,886 views • 1 year ago

Claude Sonnet 3.5 Artifacts is now available to use with GPT-4o, Gemini, Llama-3 and other LLMs for just $10 a month. Build interactive experiences, search the web, generate images and audio with GPT-4o and Claude Sonnet 3.5 in just one AI playground.

Claude Sonnet 3.5 Artifacts is now available to use with GPT-4o, Gemini, Llama-3 and other LLMs for just $10 a month. Build interactive experiences, search the web, generate images and audio with GPT-4o and Claude Sonnet 3.5 in just one AI playground.

Shubham Saboo

27,452 views • 1 year ago

VS Code (Copilot) vs Cursor (claude-3.5-sonnet) for Next.js App Router

VS Code (Copilot) vs Cursor (claude-3.5-sonnet) for Next.js App Router

Alex Sidorenko

189,000 views • 1 year ago

How to Rank #1 in 24 Hours with Claude 3.5 Sonnet

How to Rank #1 in 24 Hours with Claude 3.5 Sonnet

Julian Goldie SEO

229,049 views • 2 years ago

Copilot's new multi-file edits with Anthropic's Claude 3.5 Sonnet ✨

Copilot's new multi-file edits with Anthropic's Claude 3.5 Sonnet ✨

Thomas Dohmke

308,362 views • 1 year ago

Wow DeepSeek R1 version 1.5B runs perfectly locally on my phone 😳 So you can have a model that outperforms GPT-4o and Claude 3.5 Sonnet on math in your pocket. Mind-blowing

Wow DeepSeek R1 version 1.5B runs perfectly locally on my phone 😳 So you can have a model that outperforms GPT-4o and Claude 3.5 Sonnet on math in your pocket. Mind-blowing

Paul Couvert

665,415 views • 1 year ago

Gemini Experimental 1206: This New FREE Gemini Update Beats Sonnet & GPT-4O! 🤯

Gemini Experimental 1206: This New FREE Gemini Update Beats Sonnet & GPT-4O! 🤯

Julian Goldie SEO

18,857 views • 1 year ago

Claude 3.5 Sonnet creates fully working ChatGPT clone from just a screenshot in 2 minutes. It uses Llama-3 running locally on your computer (100% free and without internet).

Claude 3.5 Sonnet creates fully working ChatGPT clone from just a screenshot in 2 minutes. It uses Llama-3 running locally on your computer (100% free and without internet).

Shubham Saboo

343,880 views • 2 years ago

Humor is a sign of intelligence, so I asked Claude-Sonnet-3.5 to absolutely COOK GPT-4o. Then I put the results into a David Chappelle deepfake. The result is damn jarring... See for yourself:

Humor is a sign of intelligence, so I asked Claude-Sonnet-3.5 to absolutely COOK GPT-4o. Then I put the results into a David Chappelle deepfake. The result is damn jarring... See for yourself:

Emmet Halm

2,395,383 views • 2 years ago

Flowise v1.8.3 release 🧠 New models - Claude Sonnet 3.5 - Azure gpt-4o - Voyage-2 embeddings and reranker models 💬 Chat Embed - Agent reasoning steps - Tooltip display - Notification sound 🔥Firecrawl Web Scraper 🔭LangWatch 📚Multi Query Retriever

Flowise v1.8.3 release 🧠 New models - Claude Sonnet 3.5 - Azure gpt-4o - Voyage-2 embeddings and reranker models 💬 Chat Embed - Agent reasoning steps - Tooltip display - Notification sound 🔥Firecrawl Web Scraper 🔭LangWatch 📚Multi Query Retriever

FlowiseAI

36,090 views • 2 years ago

Claude 3.5 Sonnet transformed a research paper into an interactive learning dashboard in just 30 seconds. It goes beyond the capabilities of GPT-4o, Gemini Pro, Llama and other existing LLMs. Education will never be the same again with AI.

Claude 3.5 Sonnet transformed a research paper into an interactive learning dashboard in just 30 seconds. It goes beyond the capabilities of GPT-4o, Gemini Pro, Llama and other existing LLMs. Education will never be the same again with AI.

Shubham Saboo

678,835 views • 2 years ago

Made a SpaceX Starship lander game in just a few messages with Claude 3.5 Sonnet plus Artifacts.

Made a SpaceX Starship lander game in just a few messages with Claude 3.5 Sonnet plus Artifacts.

Alex Albert

233,350 views • 2 years ago

Claude Sonnet 3.5 transforms a simple PDF earnings report into an interactive dashboard in just 30 seconds. It goes beyond the capabilities of GPT-4o, Gemini Pro, Llama and other existing LLMs. Future of work will 10x more productive with AI.

Claude Sonnet 3.5 transforms a simple PDF earnings report into an interactive dashboard in just 30 seconds. It goes beyond the capabilities of GPT-4o, Gemini Pro, Llama and other existing LLMs. Future of work will 10x more productive with AI.

Shubham Saboo

351,068 views • 1 year ago

I put the new DeepSeek v3 model head-to-head versus Claude Sonnet 3.5. The winner will surprise you:

I put the new DeepSeek v3 model head-to-head versus Claude Sonnet 3.5. The winner will surprise you:

Breck Yunits

254,846 views • 1 year ago

RouteLLM - Automatically routes your query to the best LLM Combines the chops of o1, Sonnet 3.5, GPT-4o and Gemini! Super Intelligence in the making 😄

RouteLLM - Automatically routes your query to the best LLM Combines the chops of o1, Sonnet 3.5, GPT-4o and Gemini! Super Intelligence in the making 😄

Bindu Reddy

72,973 views • 1 year ago

This is wild! Llama 3.1 405B Instruct finally solves a famous math puzzle that was originally posted on /LocalLlama. To the best of my knowledge, every model (including Claude 3.5 Sonnet and GPT-4o) fails at this task. A longer video coming soon!

This is wild! Llama 3.1 405B Instruct finally solves a famous math puzzle that was originally posted on /LocalLlama. To the best of my knowledge, every model (including Claude 3.5 Sonnet and GPT-4o) fails at this task. A longer video coming soon!

elvis

52,546 views • 1 year ago

DeepSeek-V3 is live in AkashChat. This is the most capable open-source AI model available today — directly rivaling the benchmark performance of GPT-4o and 3.5 Sonnet.

DeepSeek-V3 is live in AkashChat. This is the most capable open-source AI model available today — directly rivaling the benchmark performance of GPT-4o and 3.5 Sonnet.

Akash Network

47,412 views • 1 year ago