
Andrej Karpathy
@karpathy • 2,833,581 subscribers
I like training large deep neural nets.

I'm playing around with generative AI tools and stitching them together into visual stories. Here I took the first few sentences of Pride and Prejudice and made them into a video. The gen stack used for this one:
- Anthropic Claude took the first chapter and generated the scenes and the individual prompts for the image generator
- Ideogram took the prompts and generated the images
- Luma took the images and animated them
- for narration
- VEED | AI Video Creation to stitch it together
(Many of these choices are just what I happened to use for this one while exploring a bunch of things.)
Anyway, honestly it was pretty messy and there is a ton of copy pasting between all of the tools, and even this little video with 3 scenes took me about an hour. There is a huge storytelling opportunity here for whoever can make this convenient. Who is building the first 100% AI-native movie maker?
Andrej Karpathy • 608,316 views • 1 year ago

We're vibing this nice Sunday morning. Added more functionality. Using the approximation 3500 kcal ~= 1 lb of fat, we now show a really cool animated ring that fills up to 3500 in either +/- direction, and completing the circle adds it on the bottom. So e.g. 3 green circles = 3 lb lighter, in theory :). 3 conversations were used:
- Refactor the AppStorage to be better / cleaner and shuffle elements around a bit
- Clamp the display to always be in range [-3500, 3500], which is 1 lb of fat, and show lb of fat as circles on bottom
- Make the calorie counter have a nice ring that fills up
Andrej Karpathy • 356,066 views • 1 year ago
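
The ring logic described in the calorie counter post above can be sketched roughly as follows. This is not the app's code (the app uses SwiftUI's AppStorage; this is plain Python for illustration). The function name `ring_state` and the sign convention (negative balance = deficit = lighter) are assumptions; only the 3500 kcal ~= 1 lb rule of thumb and the [-3500, 3500] display range come from the post.

```python
import math

KCAL_PER_LB = 3500  # rough rule of thumb quoted in the post


def ring_state(net_kcal):
    """Split a running calorie balance into completed circles and ring fill.

    net_kcal is a hypothetical running balance: negative means a deficit.
    Returns (circles, fill), where circles is the number of whole pounds
    completed and fill is the signed fraction of the current ring.
    """
    circles = int(abs(net_kcal) // KCAL_PER_LB)
    # fmod keeps the sign of net_kcal, so the ring fills in the +/- direction
    remainder = math.fmod(net_kcal, KCAL_PER_LB)
    # Defensive clamp to the displayed range; fmod already bounds the value
    remainder = max(-KCAL_PER_LB, min(KCAL_PER_LB, remainder))
    return circles, remainder / KCAL_PER_LB


circles, fill = ring_state(-11000)
print(circles, round(fill, 3))
```

With a balance of -11000 kcal, three full circles are completed and the fourth ring is about 14% filled in the deficit direction.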

Amazing text to music generations from Suno, could easily see these taking over leaderboards. Personal favorite: this song I fished out of their Discord a few months ago, "Return to Monkey", which has been stuck in my head since :D [00:57] I wanna return to monkey, I wanna be wild and free, I wanna return to monkey, modern life is not for me. No more emails, no more bills, no more endless strife, Just the sound of the river, the heartbeat of life 😂
Andrej Karpathy • 553,524 views • 2 years ago

August 1, 2024: The Music Video
Fun hack just stitching up gen AI tools :), in this case to create a music video for today.
- copy paste the entire WSJ front page into Claude
- ask it to generate multiple scenes and give visual descriptions for them
- copy paste scene descriptions into image generator (Ideogram here)
- copy paste generated images into Runway Gen 3 Alpha to make each image into a 10-second video
- ask Claude to generate lyrics that depict that day
- copy paste lyrics into Suno to generate music
- stitch things up in iMovie
:D :D :D
Andrej Karpathy • 416,183 views • 1 year ago

I gave a talk at GPU MODE workshop last week on llm.c
- the origin story of llm.c
- being naked in the world without PyTorch and having to re-invent Array, Autograd, Device, Dtype, Compile, Distributed
- how to port a PyTorch layer to 1) explicit PyTorch
- and then to 2) write the backward pass
- 3) port forward & backward pass to C
- 4) string all the layers together
- achieving one file of C with no dependencies that compiles and runs ~instantly, where all memory is pre-planned and allocated a single time, fully deterministic, portable code that can run on a potato or a von Neumann probe
- how most of llm.c was built at 1am-7am in a water villa porch in Maldives and why this is the recommended way to develop software
- convert all of it to run in CUDA on GPU in fp32
- port matmul to cuBLAS
- port attention to cuDNN flash-attention
- introduce bfloat16 mixed precision
- introduce many more optimizations and features like kernel fusions, Packed128, stochastic rounding, full determinism
- add multi-GPU training, NCCL, sharded optimizer
- add multi-node with MPI or file system or socket
- reproduce GPT-2 (1.6B) on one 8XH100 node in 24 hours for $672 in llm.c, achieving (at the time) 29% less memory, 19% faster training than PyTorch nightly, and much faster compile & run
- how open source development attracts Avengers from the internet
- port to training Llama 3 imminent (branch exists)
- many other notable forks
- last thought: how software abstractions like Python/PyTorch and everything else really exist only because humans are finite in knowledge, IQ and attention, and how with increasing AI capability LLMs may export custom binaries like llm.c for any application directly, tearing apart and refactoring all abstractions as needed.
More links in reply
Andrej Karpathy • 335,718 views • 1 year ago
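
To make the "explicit forward pass, then hand-written backward pass" steps from the talk summary above concrete, here is a minimal sketch for a linear layer. It is not code from llm.c: the choice of layer, the function names, and the flat row-major buffers are assumptions for illustration, and it is written in dependency-free Python rather than C. The loops index into flat, caller-allocated buffers so that they could be translated to C almost line by line.

```python
def linear_forward(out, inp, weight, bias, B, C, OC):
    """out[b, o] = bias[o] + sum_i inp[b, i] * weight[o, i], on flat buffers."""
    for b in range(B):
        for o in range(OC):
            val = bias[o]
            for i in range(C):
                val += inp[b * C + i] * weight[o * C + i]
            out[b * OC + o] = val


def linear_backward(dinp, dweight, dbias, dout, inp, weight, B, C, OC):
    """Accumulate gradients (+=) into buffers the caller allocated once."""
    for b in range(B):
        for o in range(OC):
            d = dout[b * OC + o]
            dbias[o] += d
            for i in range(C):
                dinp[b * C + i] += weight[o * C + i] * d
                dweight[o * C + i] += inp[b * C + i] * d


# Tiny hypothetical problem: batch 2, 3 inputs, 4 outputs
B, C, OC = 2, 3, 4
inp = [0.1 * (k + 1) for k in range(B * C)]
weight = [0.05 * (k - 5) for k in range(OC * C)]
bias = [0.01 * k for k in range(OC)]
dout = [1.0 + 0.5 * k for k in range(B * OC)]  # pretend upstream gradient

# All buffers are allocated up front, once
out = [0.0] * (B * OC)
dinp, dweight, dbias = [0.0] * (B * C), [0.0] * (OC * C), [0.0] * OC

linear_forward(out, inp, weight, bias, B, C, OC)
linear_backward(dinp, dweight, dbias, dout, inp, weight, B, C, OC)
```

A finite-difference check against `linear_forward` is a cheap way to validate a hand-written backward pass before porting it further.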

Yay, llama2.c can now load and inference the Meta released models! :) E.g. here inferencing the smallest 7B model at ~3 tokens/s on 96 OMP threads on a cloud Linux box. Still just CPU, fp32, one single .c file of 500 lines: expecting ~300 tok/s tomorrow :)
Andrej Karpathy • 409,212 views • 2 years ago
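
For a sense of scale behind the ~3 tokens/s figure in the llama2.c post above, here is a back-of-the-envelope sketch. It assumes a round 7e9 parameters, 4 bytes per fp32 weight, and that generating one token reads every weight once; these are simplifying assumptions, not measurements from the post, and the function names are made up for illustration.

```python
def fp32_weight_gb(n_params, bytes_per_param=4):
    """Approximate size of the weights in GB (1 GB = 1e9 bytes)."""
    return n_params * bytes_per_param / 1e9


def implied_read_gbs(n_params, tok_per_s, bytes_per_param=4):
    """Weight bytes streamed per second if each token touches all weights once."""
    return fp32_weight_gb(n_params, bytes_per_param) * tok_per_s


n_params = 7e9  # assumed round number for the "7B" model
print(fp32_weight_gb(n_params))         # about 28 GB of fp32 weights
print(implied_read_gbs(n_params, 3.0))  # about 84 GB/s of weight reads
```

Under these assumptions, single-stream CPU inference in fp32 is mostly a matter of how fast the weights can be streamed from memory.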