
Robert Lange
@RobertTLange • 9,810 subscribers
Staff Research Scientist & Founding Member @SakanaAILabs 🎏 The AI Scientist, ShinkaEvolve, gymnax, evosax Prev: SR & Intern @GoogleDeepMind
Shorts
Videos

Doc-to-LoRA: What if you could online distill documents into your LLM weights without training? 🚀 Stoked to share our new work on instant LLM adaptation using meta-learned hypernetworks 📷📝 Building on our previous Text-to-LoRA work, we doc-condition a hypernetwork to output LoRA adapters, improving the base LLM's effective context window. The hypernetwork is meta-trained on 1000s of summarization tasks and shows remarkable compression capabilities at low latency 📈 🧑🔬 Work led by Rujikorn (Tan) Charakorn with Edoardo Cetin & Shin Useka at Sakana AI 📷
Robert Lange37,339 görüntüleme • 3 ay önce

🎉 Stoked to share The AI CUDA Engineer 👷 - our end-to-end approach for automating the design and optimization of CUDA Kernels using agentic systems. Blog 📰: Paper 📜: WebUI 📈: Dataset 💽: Awesome team work done with Aaditya Prasad 🇺🇸, Suuun, Maxence Faldor, Yujin Tang, hardmaru 🤗
Robert Lange42,174 görüntüleme • 1 yıl önce
Daha fazla içerik yok.