
Thomas Wolf

@Thom_Wolf • 117,933 subscribers

Co-founder at @HuggingFace - moonshots - angel

Shorts

favorite AGI/sci-fi vibe these days is coding a robot's code together with the robot. here, vibe-plugging ElevenLabs into reachymini for a talk later today

30,228 views

Look what just arrived in the mail 📬 So excited to hold the final printed version in my hands after many months in the making. And now it’s out for everyone! (shortlink in the video...)

93,885 views

being in a robotics lab in 2025: open-source hardware + good RL + impressive people => lot of fun

81,380 views

you pay for it in reliability/strength, but a thing i really love about low-cost 3d-printed hardware is how it can be so harmless by design (plastic, with low-cost motors). kids can just play and experiment with it without risks/supervision, and that’s such an amazing way to learn

41,605 views

Videos

There is a beautiful story that just happened in AI, so let me share it as a lighter-toned weekend post among all the doom stories in our AI field this week. It’s a story of people on three continents building and sharing in the open a new small, efficient, and state-of-the-art AI model.
It started a couple of months ago when a new team on the AI scene released their first model from their headquarters in Paris (France): Mistral 7B. An impressive model: small, with very strong performance on the benchmarks, better than all previous models of this size. And open source! So you could build on top of it.
Lewis in Bern (Switzerland) and Ed (in Lyon, in the South of France), both from the H4 team, a team of researchers in model fine-tuning and alignment, were talking about it over a coffee, in one of these gatherings that often happen at Hugging Face to break the distance between people (literal distance, as HF is a remote company). What about fine-tuning it using this new DPO method that a research team from Stanford in California just posted on arXiv, says one? Hey, that’s a great idea, replies the other. We've just built a great code base (with Nathan, Nazneen, Costa, Younes and all the H4 team and TRL community), let's use it!
The next day they start diving into the datasets openly shared on the HF hub and stumble upon two interesting, large, good-quality fine-tuning datasets recently open-sourced by OpenBMB, a Chinese team from Tsinghua: UltraFeedback and UltraChat. A few rounds of training experiments confirm the intuition: the resulting model is super strong, by far the strongest they have ever seen in their benchmarks from Berkeley and Stanford (LMSYS and Alpaca).
Enter Clementine, the big boss of the open evaluation leaderboard. Her deep dive into the model's capabilities confirms the results: impressive performance. But the H4 team also hosts a famous faculty member, Prof. Sasha Rush, Associate Professor at Cornell University by day, hacker at HF by night. Joining the conversation, he proposes to quickly draft a research paper to organize and share all the details with the community.
A few days later, the model, called Zephyr (a wind, like Mistral), the paper, and all the details are shared with the world. Quickly, other companies everywhere in the world start to use it. LlamaIndex, a famous data framework and community, shares how the model blew past their expectations on real-life use-case benchmarks, while researchers and practitioners discuss the paper and the work on the Hugging Face hub.
All this happened in just a few weeks, catalyzed by open access to knowledge, models, research, and datasets released all over the world (Europe, California, China) and by the idea that people can build upon one another's work in AI to bring real-world value with efficient and open models. Stories like this are numerous everywhere around us and make me really proud of the AI community and of how we can build amazingly useful things together.
[the video is just me reading this Friday post hahah]

Thomas Wolf

169,127 views • 2 years ago