Загрузка видео...

Не удалось загрузить видео

На главную

LLM Visualization This is actually pretty amazing! It helps to visualize the core components of LLMs like nano-gpt and GPT-3.

39,431 просмотров • 1 год назад •via X (Twitter)

Комментарии: 9

Фото профиля AI Advocate Arif
AI Advocate Arif1 год назад

I'd love to see more visualizations like this make complex LLM concepts more accessible

Фото профиля Mehrdad Yazdani
Mehrdad Yazdani1 год назад

Does anyone actually find visualizations useful? I don’t mean to hate, but besides educational purposes is there any other use? Code/equations are so much easier for me to understand.

Фото профиля elvis
elvis1 год назад

Fair comment. I think, and as you said, they are pretty useful for educational purposes. I don't know that this particular one can lead to any new research insights as is. I really liked the other Transformer Explainer visualization as it does show more things like the transformation of the data, probs, and so on.

Фото профиля 通往AGI之路
通往AGI之路1 год назад

Mind blown 🤯

Фото профиля elbouz
elbouz1 год назад

It's insane how easier this makes it to understand complex architecture. Also great for efficient refreshers.

Фото профиля GPT.Biz
GPT.Biz1 год назад

This looks like a great resource to understand LLMs better, definitely worth checking out!

Фото профиля Clancy
Clancy1 год назад

That is awesome

Фото профиля AIxBlock
AIxBlock1 год назад

Nice breakdown and visualization! Tks so much for sharing 👏

Фото профиля Rajesh David
Rajesh David1 год назад

Truly amazing stuff. You know what could be cooler ? Going step-by-step and assembling them part-by-part with some details about why that is needed ? Does it already do that ?

Похожие видео

GPT-5.6 vs GPT-5.5 on my custom spaceship prompt. I gave both models the exact same custom prompt. This is also the same prompt I previously gave to Fable 5. For context, GPT-5.6 Pro worked for 87 minutes, while GPT-5.5 Extra High worked for 34 minutes and 42 seconds. As I’ve said before, based on great authority GPT-5.6 will be an incremental/soldi improvement over GPT-5.5, not a “Fable killer.” My rough expectation has been that it would trade blows with Fable 5 on some benchmarks, maybe win around half depending on the category, but not clearly surpass it overall. And again fable five will have bigger model smell, but this was expected. After testing this coding output, that view feels pretty accurate. GPT-5.6 is clearly better than GPT-5.5 in several visual areas. The lighting, shading, chairs, object details, and exterior of the spaceship looked noticeably stronger. The scene was also easier to test. I do want to give GPT-5.5 credit though. It built out the rooms much much better and the planets looked better than GPT-5.6’s. It was also interesting that both GPT-5.5 and GPT-5.6 produced better-looking planets than Fable 5 in this specific test. The downside with GPT-5.5 was stability. The game was much glitchier and harder to test compared to GPT-5.6. But when it comes to the core of the demo, which is the spaceship itself, Fable 5 still beat both models pretty comfortably. GPT-5.6 is impressive, but from this test, it looks exactly like what I expected which was a meaningful incremental improvement over GPT-5.5, at least for indie game demos, but not something that replaces Fable 5. In collaboration with Chetaslua

Chris

222,849 просмотров • 13 дней назад