Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

LLM Visualization This is actually pretty amazing! It helps to visualize the core components of LLMs like nano-gpt and GPT-3.

39,431 Aufrufe • vor 1 Jahr •via X (Twitter)

9 Kommentare

Profilbild von AI Advocate Arif
AI Advocate Arifvor 1 Jahr

I'd love to see more visualizations like this make complex LLM concepts more accessible

Profilbild von Mehrdad Yazdani
Mehrdad Yazdanivor 1 Jahr

Does anyone actually find visualizations useful? I don’t mean to hate, but besides educational purposes is there any other use? Code/equations are so much easier for me to understand.

Profilbild von elvis
elvisvor 1 Jahr

Fair comment. I think, and as you said, they are pretty useful for educational purposes. I don't know that this particular one can lead to any new research insights as is. I really liked the other Transformer Explainer visualization as it does show more things like the transformation of the data, probs, and so on.

Profilbild von 通往AGI之路
通往AGI之路vor 1 Jahr

Mind blown 🤯

Profilbild von elbouz
elbouzvor 1 Jahr

It's insane how easier this makes it to understand complex architecture. Also great for efficient refreshers.

Profilbild von GPT.Biz
GPT.Bizvor 1 Jahr

This looks like a great resource to understand LLMs better, definitely worth checking out!

Profilbild von Clancy
Clancyvor 1 Jahr

That is awesome

Profilbild von AIxBlock
AIxBlockvor 1 Jahr

Nice breakdown and visualization! Tks so much for sharing 👏

Profilbild von Rajesh David
Rajesh Davidvor 1 Jahr

Truly amazing stuff. You know what could be cooler ? Going step-by-step and assembling them part-by-part with some details about why that is needed ? Does it already do that ?

Ähnliche Videos

GPT-5.6 vs GPT-5.5 on my custom spaceship prompt. I gave both models the exact same custom prompt. This is also the same prompt I previously gave to Fable 5. For context, GPT-5.6 Pro worked for 87 minutes, while GPT-5.5 Extra High worked for 34 minutes and 42 seconds. As I’ve said before, based on great authority GPT-5.6 will be an incremental/soldi improvement over GPT-5.5, not a “Fable killer.” My rough expectation has been that it would trade blows with Fable 5 on some benchmarks, maybe win around half depending on the category, but not clearly surpass it overall. And again fable five will have bigger model smell, but this was expected. After testing this coding output, that view feels pretty accurate. GPT-5.6 is clearly better than GPT-5.5 in several visual areas. The lighting, shading, chairs, object details, and exterior of the spaceship looked noticeably stronger. The scene was also easier to test. I do want to give GPT-5.5 credit though. It built out the rooms much much better and the planets looked better than GPT-5.6’s. It was also interesting that both GPT-5.5 and GPT-5.6 produced better-looking planets than Fable 5 in this specific test. The downside with GPT-5.5 was stability. The game was much glitchier and harder to test compared to GPT-5.6. But when it comes to the core of the demo, which is the spaceship itself, Fable 5 still beat both models pretty comfortably. GPT-5.6 is impressive, but from this test, it looks exactly like what I expected which was a meaningful incremental improvement over GPT-5.5, at least for indie game demos, but not something that replaces Fable 5. In collaboration with Chetaslua

Chris

195,739 Aufrufe • vor 9 Tagen