Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

timelapse #21 (12 hrs): - leetcode practice for xAI coding test - updated mnist-cuda (find repo in pinned) by adding a new CUDA training script w/ an extra hidden layer and a feature to show generalization - notes on LLM training datasets, architectures, training config, etc

216,069 Aufrufe • vor 1 Jahr •via X (Twitter)

10 Kommentare

Profilbild von calibur 🜲
calibur 🜲vor 1 Jahr

The grind is so real. Congrats on your training

Profilbild von Maximilian Schilling
Maximilian Schillingvor 1 Jahr

75k views is 5h is nuts! Awesome! You are the Timelapse grand master 😂

Profilbild von Elliot Arledge
Elliot Arledgevor 1 Jahr

you inspire me Max

Profilbild von λux
λuxvor 1 Jahr

xAI LFG 🔥

Profilbild von Frieda Huang
Frieda Huangvor 1 Jahr

Nice. How much leetcode prep did you do?

Profilbild von sλrthak
sλrthakvor 1 Jahr

xAI coding test? enlighten me

Profilbild von Raen AI
Raen AIvor 1 Jahr

Locked in

Profilbild von Elliot Arledge
Elliot Arledgevor 1 Jahr

yessir

Profilbild von Nothing is something
Nothing is somethingvor 1 Jahr

We're back with timelapses.... LFG!!!!

Profilbild von Lucas Lain
Lucas Lainvor 1 Jahr

is there a way where I can peak a classic interview for these positions?

Ähnliche Videos

Introducing The AI CUDA Engineer: An agentic AI system that automates the production of highly optimized CUDA kernels. The AI CUDA Engineer can produce highly optimized CUDA kernels, reaching 10-100x speedup over common machine learning operations in PyTorch. Our system is also able to produce highly optimized CUDA kernels that are much faster than existing CUDA kernels commonly used in production. We believe that fundamentally, AI systems can and should be as resource-efficient as the human brain, and that the best path to achieve this efficiency is to use AI to make AI more efficient! We are excited to publish our paper, The AI CUDA Engineer: Agentic CUDA Kernel Discovery, Optimization and Composition. We also release a dataset of over 17,000 verified CUDA kernels produced by The AI CUDA Engineer. Paper: Kernel Archive Webpage: HuggingFace Dataset: The AI CUDA Engineer utilizes evolutionary LLM-driven code optimization to autonomously improve the runtime of machine learning operations. Our system is not only able to convert PyTorch code into CUDA kernels, but through the use of evolution, it can also optimize the runtime performance of CUDA kernels, fuse multiple operations, and even discover novel solutions for writing efficient CUDA operations by learning from past innovations! We believe The AI CUDA Engineer opens a new era of AI-driven acceleration of AI and automated inference time optimization. We (Robert Lange, Aaditya Prasad 🇺🇸, Suuun, Maxence Faldor, Yujin Tang, hardmaru) are excited to continue Sakana AI's mission of leveraging AI to improve AI.

Sakana AI

1,149,314 Aufrufe • vor 1 Jahr