Video yükleniyor...

Video Yüklenemedi

Ana Sayfaya Dön

Llama 3 was *still* learning when Meta stopped training it. They only stopped because they decided they needed the GPUs to start testing for Llama 4. AI scaling laws are insane.

414,180 görüntüleme • 2 yıl önce •via X (Twitter)

11 Yorum

Mckay Wrigley profil fotoğrafı
Mckay Wrigley2 yıl önce

“The models just want to learn.” - Ilya

PowerBeatsVR profil fotoğrafı
PowerBeatsVR1 yıl önce

Box, dodge, and squat your way through PowerBeatsVR - Now 40% OFF on Meta Quest for a limited time 🔥

Justin Halford profil fotoğrafı
Justin Halford2 yıl önce

Even crazier is that they only trained with 16k H100s despite having ordered 600k. So only 2.6% of the compute from that order was used to create a GPT-4 level model. Wild times ahead.

Mckay Wrigley profil fotoğrafı
Mckay Wrigley2 yıl önce

I’m only halfway through this so far, but Zuck even says he thinks energy will be a bottleneck before compute. Wild!

Magic Carpet 🇺🇸 profil fotoğrafı
Magic Carpet 🇺🇸2 yıl önce

Almost all LLMs are undertrained relative to size

miru profil fotoğrafı
miru2 yıl önce

@HlibIvanov source of the clip

ASI - Tech Gone Wild 🤖❤️‍🔥⚔️ profil fotoğrafı
ASI - Tech Gone Wild 🤖❤️‍🔥⚔️2 yıl önce

Very Very promising for FSD and Teslas Fleet advantage

λy profil fotoğrafı
λy2 yıl önce

llama 3 still *is* learning there’s a checkpoint on the biggest 400B model today, but they’re still training it

Luci Pars profil fotoğrafı
Luci Pars2 yıl önce

AGI 💖 Mark 🤣

Josh Whiton profil fotoğrafı
Josh Whiton2 yıl önce

“There are these meta reasoning questions…”

EMAD profil fotoğrafı
EMAD2 yıl önce

So maybe scale actually is all you need…?

Benzer Videolar