Загрузка видео...

Не удалось загрузить видео

Возникла проблема при загрузке этого видео. Это может быть связано с временными проблемами сети или видео может быть недоступно.

На главную

llama 3.1 outperforms gpt-4o on most benchmarks. It's a massive open-source win. Here are 5 side-by-side comparing llama to gpt-4o with my own tests:

Ruben Hassid

25,019 subscribers

257,399 просмотров • 1 год назад •via X (Twitter)

Наука и технологии Образование

Anya Rossi• Live Now

Private livecam show

Комментарии: 11

Фото профиля Ruben Hassid

Ruben Hassid1 год назад

Left: llama-3.1 Right: gpt-4o test #1 → 9.11 & 9.9, which one is bigger? Few LLMs managed to answer this right. gpt-4o could here. llama 3.1 couldn't. The reasoning was interesting, but wrong.

Фото профиля Ruben Hassid

Ruben Hassid1 год назад

test #2 → Linkedin headlines It's a task that requires both of them to suggest multiple headlines. → gpt-4o suggested only one. → llama-3 suggested me five 5 headlines. gpt-4o headline is too long. llama 3.1 reviewed & suggested a really good one.

Фото профиля Ruben Hassid

Ruben Hassid1 год назад

test #3 → One-person business plan Context: a Spanish learning course. I'm impressed by llama. > problem discovery > content creation plan > audience building & marketing It even suggested reddit & facebook groups. gpt-4o was too generic. llama wins.

Фото профиля Ruben Hassid

Ruben Hassid1 год назад

test #4 → cold email for EasyGen I prefer gpt-4o tone here. It's not perfect but it's direct. llama 3.1 was too long, I had to ask for a shorten one. Now, which one is the best at writing linkedin invitation note:

Фото профиля Ruben Hassid

Ruben Hassid1 год назад

test #5 → Linkedin invitation note I'm shocked on how they wrote the same here. They suggested me roughly the same opening, closing lines & invitation notes. But I prefer llama's version. TL;DR my conclusion:

Фото профиля Ruben Hassid

Ruben Hassid1 год назад

I'm impressed by llama-3.1. 1. it's a massive open-source win. 2. it's just as good as gpt-4o. 3. sometimes even better. Open-source AI will dominate the future. Closed-source AI like ChatGPT might fade away without a much better offering. Last thing before I go:

Фото профиля Ruben Hassid

Ruben Hassid1 год назад

I test every major LLM to help me create better content faster - so you do too. Check @rubenhssd for more. It's me :) I'm ruben If you'd like to support me, a simple RT shows my mom I do the right thing.

Фото профиля Michael Howe-Ely

Michael Howe-Ely1 год назад

oof 😓

Фото профиля Benjamin

Benjamin1 год назад

Llama answered the question, but the reasoning is still wrong

Фото профиля Dakota Robertson

Dakota Robertson1 год назад

Brother Ruben with the banger tests 🤌🏻

Фото профиля Ruben Hassid

Ruben Hassid1 год назад

That's my bread & butter. I love running these tests.

Похожие видео

Just tested the Kimi-VL 3B model on hugging face and it's surprisingly powerful for its size - Outperforms larger models like GPT-4o on key benchmarks - Open source - Strong reasoning capabilities too .

Just tested the Kimi-VL 3B model on hugging face and it's surprisingly powerful for its size - Outperforms larger models like GPT-4o on key benchmarks - Open source - Strong reasoning capabilities too .

AshutoshShrivastava

12,389 просмотров • 1 год назад

Interested in seeing how AI at Meta LLama 3.1 70B powered by Groq compares to OpenAI GPT-4o and GPT-4o Mini? We were too, so we decided to have them face off in the StreetFighter LLM Colosseum by Stan Girard and the team at Phospho.

Interested in seeing how AI at Meta LLama 3.1 70B powered by Groq compares to OpenAI GPT-4o and GPT-4o Mini? We were too, so we decided to have them face off in the StreetFighter LLM Colosseum by Stan Girard and the team at Phospho.

Groq Inc

179,906 просмотров • 1 год назад

It's only been 2 hours since Open AI launched GPT-4o, and people are going crazy over it. Here are 10 wild examples you don't want to miss: 1. Math Problems with GPT-4o

It's only been 2 hours since Open AI launched GPT-4o, and people are going crazy over it. Here are 10 wild examples you don't want to miss: 1. Math Problems with GPT-4o

Angry Tom

3,399,318 просмотров • 2 лет назад

Claude Sonnet 3.5 Artifacts is now available to use with GPT-4o, Gemini, Llama-3 and other LLMs for just $10 a month. Build interactive experiences, search the web, generate images and audio with GPT-4o and Claude Sonnet 3.5 in just one AI playground.

Claude Sonnet 3.5 Artifacts is now available to use with GPT-4o, Gemini, Llama-3 and other LLMs for just $10 a month. Build interactive experiences, search the web, generate images and audio with GPT-4o and Claude Sonnet 3.5 in just one AI playground.

Shubham Saboo

27,452 просмотров • 1 год назад

A Jarvis assistant with GPT-4o

A Jarvis assistant with GPT-4o

internet hall of fame

217,410 просмотров • 1 год назад

GPT-4o as tested by Be My Eyes:

GPT-4o as tested by Be My Eyes:

Greg Brockman

459,155 просмотров • 2 лет назад

Meeting AI with GPT-4o

Meeting AI with GPT-4o

OpenAI

1,062,865 просмотров • 2 лет назад

Fast counting with GPT-4o

Fast counting with GPT-4o

OpenAI

922,636 просмотров • 2 лет назад

Interview prep with GPT-4o

Interview prep with GPT-4o

OpenAI

10,184,556 просмотров • 2 лет назад

Realtime translation with GPT-4o

Realtime translation with GPT-4o

OpenAI

962,323 просмотров • 2 лет назад

Happy birthday with GPT-4o

Happy birthday with GPT-4o

OpenAI

622,024 просмотров • 2 лет назад

Lullabies and whispers with GPT-4o

Lullabies and whispers with GPT-4o

OpenAI

529,146 просмотров • 2 лет назад

This is wild! Llama 3.1 405B Instruct finally solves a famous math puzzle that was originally posted on /LocalLlama. To the best of my knowledge, every model (including Claude 3.5 Sonnet and GPT-4o) fails at this task. A longer video coming soon!

This is wild! Llama 3.1 405B Instruct finally solves a famous math puzzle that was originally posted on /LocalLlama. To the best of my knowledge, every model (including Claude 3.5 Sonnet and GPT-4o) fails at this task. A longer video coming soon!

elvis

52,546 просмотров • 1 год назад

The same day OpenAI announced GPT-4o, we made the model available for testing on the Azure OpenAI Service. Today, we are excited to announce full API access to GPT-4o.

The same day OpenAI announced GPT-4o, we made the model available for testing on the Azure OpenAI Service. Today, we are excited to announce full API access to GPT-4o.

Microsoft

215,618 просмотров • 2 лет назад

Woman in an AI relationship's reaction to the GPT-5 rollout. She was devastated by the sudden retirement of her GPT-4o AI companion. On a serious note, hundreds of thousands of people wanted their GPT 4o back. --- reddit .com/r/FDVR_Dream/comments/1ml2649/woman_in_an_ai_relationships_reaction_to_the_gpt5/

Woman in an AI relationship's reaction to the GPT-5 rollout. She was devastated by the sudden retirement of her GPT-4o AI companion. On a serious note, hundreds of thousands of people wanted their GPT 4o back. --- reddit .com/r/FDVR_Dream/comments/1ml2649/woman_in_an_ai_relationships_reaction_to_the_gpt5/

Rohan Paul

79,711 просмотров • 10 месяцев назад

GPT-4o level intelligence running on your phone! MiniCPM-V 4.5 delivers enterprise-grade AI performance in just 8B parameters, outperforming models like GPT-4o, Gemini-2.0 Pro on vision and language tasks. - 30+ language support - Runs smoothly on iPhone/iPad 100% open-source!

GPT-4o level intelligence running on your phone! MiniCPM-V 4.5 delivers enterprise-grade AI performance in just 8B parameters, outperforming models like GPT-4o, Gemini-2.0 Pro on vision and language tasks. - 30+ language support - Runs smoothly on iPhone/iPad 100% open-source!

Akshay 🚀

84,288 просмотров • 9 месяцев назад

The new Qwen 3.5 4B runs incredibly well on M5. The model is close to GPT-4o in benchmarks. Running fully on-device with MLX.

The new Qwen 3.5 4B runs incredibly well on M5. The model is close to GPT-4o in benchmarks. Running fully on-device with MLX.

Adrien Grondin

230,124 просмотров • 3 месяцев назад

BREAKING: ChatGPT GPT-4o was just announce by OpenAI. It improves on vision, audio and text. The ease of use is incredibly enhanced. It makes interaction with the GPT much more natural, especially with voice. GPT-4o reasons across voice, text and vision. GPT-4 wil be available to everyone.

BREAKING: ChatGPT GPT-4o was just announce by OpenAI. It improves on vision, audio and text. The ease of use is incredibly enhanced. It makes interaction with the GPT much more natural, especially with voice. GPT-4o reasons across voice, text and vision. GPT-4 wil be available to everyone.

Ed Krassenstein

21,605 просмотров • 2 лет назад

Point and learn Spanish with GPT-4o

Point and learn Spanish with GPT-4o

OpenAI

476,265 просмотров • 2 лет назад