
Dan Biderman
@dan_biderman • 2,144 subscribers
building intelligences prev postdoc @HazyResearch, phd @cu_neurotheory, post training @DbrxMosaicAI
Videos

How can we use small LLMs to shift more AI workloads onto our laptops and phones? In our paper and open-source code, we pair on-device LLMs (ollama) with frontier LLMs in the cloud (OpenAI, Together), to solve token-intensive workloads on your 💻 at 17.5% of the cloud cost while maintaining 97.9% of the accuracy. See Gru and the Minions in action below, 🔉on please (h/t )!
Dan Biderman192,850 views • 1 year ago
No more content to load