Loading video...

Video Failed to Load

Go Home

timelapse #21 (12 hrs): - leetcode practice for xAI coding test - updated mnist-cuda (find repo in pinned) by adding a new CUDA training script w/ an extra hidden layer and a feature to show generalization - notes on LLM training datasets, architectures, training config, etc

216,069 views • 1 year ago •via X (Twitter)

10 Comments

calibur 🜲's profile picture
calibur 🜲1 year ago

The grind is so real. Congrats on your training

Maximilian Schilling's profile picture
Maximilian Schilling1 year ago

75k views is 5h is nuts! Awesome! You are the Timelapse grand master 😂

Elliot Arledge's profile picture
Elliot Arledge1 year ago

you inspire me Max

λux's profile picture
λux1 year ago

xAI LFG 🔥

Frieda Huang's profile picture
Frieda Huang1 year ago

Nice. How much leetcode prep did you do?

sλrthak's profile picture
sλrthak1 year ago

xAI coding test? enlighten me

Raen AI's profile picture
Raen AI1 year ago

Locked in

Elliot Arledge's profile picture
Elliot Arledge1 year ago

yessir

Nothing is something's profile picture
Nothing is something1 year ago

We're back with timelapses.... LFG!!!!

Lucas Lain's profile picture
Lucas Lain1 year ago

is there a way where I can peak a classic interview for these positions?

Related Videos

Introducing The AI CUDA Engineer: An agentic AI system that automates the production of highly optimized CUDA kernels. The AI CUDA Engineer can produce highly optimized CUDA kernels, reaching 10-100x speedup over common machine learning operations in PyTorch. Our system is also able to produce highly optimized CUDA kernels that are much faster than existing CUDA kernels commonly used in production. We believe that fundamentally, AI systems can and should be as resource-efficient as the human brain, and that the best path to achieve this efficiency is to use AI to make AI more efficient! We are excited to publish our paper, The AI CUDA Engineer: Agentic CUDA Kernel Discovery, Optimization and Composition. We also release a dataset of over 17,000 verified CUDA kernels produced by The AI CUDA Engineer. Paper: Kernel Archive Webpage: HuggingFace Dataset: The AI CUDA Engineer utilizes evolutionary LLM-driven code optimization to autonomously improve the runtime of machine learning operations. Our system is not only able to convert PyTorch code into CUDA kernels, but through the use of evolution, it can also optimize the runtime performance of CUDA kernels, fuse multiple operations, and even discover novel solutions for writing efficient CUDA operations by learning from past innovations! We believe The AI CUDA Engineer opens a new era of AI-driven acceleration of AI and automated inference time optimization. We (Robert Lange, Aaditya Prasad 🇺🇸, Suuun, Maxence Faldor, Yujin Tang, hardmaru) are excited to continue Sakana AI's mission of leveraging AI to improve AI.

Sakana AI

1,149,314 views • 1 year ago