Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

llm.c by Hand✍️ C programming + matrix multiplication by hand This combination is perhaps as low as we can get to explain how the Transformer works. Special thanks to Andrej Karpathy for encouraging early feedback and tetsuo //: 👾 for helping me understand the pragma magic. I hope this...

302,625 Aufrufe • vor 2 Jahren •via X (Twitter)

11 Kommentare

Profilbild von Yuchen Jin
Yuchen Jinvor 2 Jahren

@karpathy @7etsuo Awesome! Looking forward to an llm.c Attention episode

Profilbild von Tom Yeh
Tom Yehvor 2 Jahren

@karpathy @7etsuo This is a great suggestion! Thanks!

Profilbild von Antonio Linares
Antonio Linaresvor 2 Jahren

@karpathy @7etsuo Is there a way to load a single layer of a LLM (in example Llama3-70B) and process it layer by layer so it fits on a normal PC memory ? Could you explain how to do it ?

Profilbild von Tom Yeh
Tom Yehvor 2 Jahren

@karpathy @7etsuo It is possible. But it would be quite slow. To generate each token, the model has to evaluate every one of the hundreds of layers. Say loading each layer takes 1 second. It would take 100 seconds to load 100 layers for one token. Then you multiple that by 1000 tokens. Very slow.

Profilbild von rakesh
rakeshvor 2 Jahren

@karpathy @7etsuo Goated acc

Profilbild von ed
edvor 2 Jahren

@karpathy @7etsuo Amazing 🔥

Profilbild von Cuhmunity Notes
Cuhmunity Notesvor 2 Jahren

@karpathy @7etsuo 🔥

Profilbild von Byron Hsu
Byron Hsuvor 2 Jahren

@karpathy @7etsuo This is awesome

Profilbild von Filippo Broggini
Filippo Brogginivor 2 Jahren

@karpathy @7etsuo Is this also available elsewhere online?

Profilbild von George D Gregory
George D Gregoryvor 2 Jahren

@karpathy @7etsuo Can these be made into the notebooks!? That would be awesome.

Profilbild von Tom Yeh
Tom Yehvor 2 Jahren

@karpathy @7etsuo I am waiting for a notebook ninja to step up to help.😄

Ähnliche Videos