llm.c by Hand ✍️ C programming + matrix multiplication by hand. This combination is perhaps as low as we can get to explain how the Transformer works. Special thanks to Andrej Karpathy for encouraging early feedback and tetsuo //: 👾 for helping me understand the pragma magic. I hope this...
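To make "C programming + matrix multiplication" concrete, here is a minimal sketch of a naive matrix multiply in C. It is not code taken from llm.c: the function name `matmul_naive`, its parameters, and the row-major layout are assumptions chosen for illustration. The "pragma magic" mentioned in the post is assumed here to mean an OpenMP directive that spreads the loops across threads; the post itself does not say which pragma is meant.

```c
// Hypothetical helper (not from llm.c): out = a * b, all matrices row-major.
// a is n x k, b is k x m, out is n x m.
void matmul_naive(const float *a, const float *b, float *out,
                  int n, int k, int m) {
    // Assumed "pragma magic": with an OpenMP-enabled build (e.g. -fopenmp)
    // the two outer loops are split across threads. Without OpenMP the
    // pragma is ignored and the loops simply run serially.
    #pragma omp parallel for collapse(2)
    for (int i = 0; i < n; i++) {
        for (int j = 0; j < m; j++) {
            // Dot product of row i of a with column j of b.
            float acc = 0.0f;
            for (int p = 0; p < k; p++) {
                acc += a[i * k + p] * b[p * m + j];
            }
            out[i * m + j] = acc;
        }
    }
}
```

Calling `matmul_naive(a, b, out, 2, 3, 2)` with a 2 x 3 matrix `a` and a 3 x 2 matrix `b` fills `out` with the same four dot products you would compute by hand, one row-times-column pair per output cell.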

302,657 views • 2 years ago • via X (Twitter)

Comments: 11

Yuchen Jin • 2 years ago

@karpathy @7etsuo Awesome! Looking forward to an llm.c Attention episode

Tom Yeh • 2 years ago

@karpathy @7etsuo This is a great suggestion! Thanks!

Antonio Linares • 2 years ago

@karpathy @7etsuo Is there a way to load a single layer of an LLM (for example, Llama3-70B) and process it layer by layer so that it fits in a normal PC's memory? Could you explain how to do it?

Tom Yeh • 2 years ago

@karpathy @7etsuo It is possible, but it would be quite slow. To generate each token, the model has to evaluate every one of the hundreds of layers. Say loading each layer takes 1 second. It would take 100 seconds to load 100 layers for one token. Then you multiply that by 1000 tokens. Very slow.
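The arithmetic in the reply above can be written out as a small C sketch. The function name `layer_load_seconds` and its parameters are hypothetical, and the inputs (1 second per layer, 100 layers, 1000 tokens) are the illustrative figures from the reply, not measurements. The estimate counts only the time spent loading layers and ignores compute.

```c
// Hypothetical helper: estimated seconds spent only on loading layer weights
// when every layer must be reloaded for every generated token.
double layer_load_seconds(int num_layers, double secs_per_layer,
                          int num_tokens) {
    // Per-token cost is num_layers * secs_per_layer; the whole generation
    // pays that cost once per token.
    return (double)num_layers * secs_per_layer * (double)num_tokens;
}
```

With the reply's figures, `layer_load_seconds(100, 1.0, 1000)` gives 100,000 seconds, which is roughly 28 hours for 1000 tokens.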

rakesh • 2 years ago

@karpathy @7etsuo Goated acc

ed • 2 years ago

@karpathy @7etsuo Amazing 🔥

Cuhmunity Notes • 2 years ago

@karpathy @7etsuo 🔥

Byron Hsu • 2 years ago

@karpathy @7etsuo This is awesome

Filippo Broggini • 2 years ago

@karpathy @7etsuo Is this also available elsewhere online?

George D Gregory • 2 years ago

@karpathy @7etsuo Can these be made into notebooks?! That would be awesome.

Tom Yeh • 2 years ago

@karpathy @7etsuo I am waiting for a notebook ninja to step up to help.😄
