Loading video...
Video Failed to Load
This Paper from Google DeepMind is a landmark one. 📚 "Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters" It may have contributed to the 01 Model from OpenAI or the principle may have been long known to OpenAI. The paper basically says - Searching... show more
48,980 views • 1 year ago •via X (Twitter)
6 Comments

📚

@GoogleDeepMind these podcasts generated by are so good btw, great way to stay up to date if you don't have much time 😄

@GoogleDeepMind Thanks Jonas. Yes great for a quick understanding within 5 minutes. In a single office commute, I can cover 4-5 papers.

@GoogleDeepMind This sounds fascinating! Definitely worth a read if you're interested in how compute strategies can boost LLM performance. Thanks for sharing!

@GoogleDeepMind Absolutely, it’s fascinating to see how these insights can reshape our understanding of model efficiency. The interplay between compute and parameters is such an important topic right now. Looking forward to diving deeper into the paper!

@GoogleDeepMind I don’t know who came first, Google or OpenAI, but rebalancing the compute load from training to inference is a great idea. (Just gotta run it on Groq or a new chip from @sama ).
