
Thomas Wolf
@Thom_Wolf • 117,933 subscribers
Co-founder at @HuggingFace - moonshots - angel
Shorts
Videos

Thrilled to finally share what we've been working on for months at Hugging Face 🤝Pollen Robotics Our first robot: Reachy Mini A dream come true: cute and low priced, hackable yet easy to use, powered by open-source and the infinite community. Tiny price, small size, huge possibilities. A robot built to code, learn, share with AI builders of all ages, all around the globe, using the latest vision, speech and text AI model. A first robot for today's and tomorrow's AI builders. Read more and order now at First deliveries expected right after the summer.
Thomas Wolf1,254,840 views • 10 months ago

we've seen nothing yet! hosted a 9-13 yo vibe-coding event w. Robert Keus 👨🏼💻 this w-e (h/t Anton Osika Darky) takeaway? AI is unleashing a generation of wildly creative builders beyond anything I'd have imagined and they grow up *knowing* they can build anything!
Thomas Wolf987,187 views • 1 year ago

wow, total BoM cost $660, folks open-source community >> closed source hyped robots
Thomas Wolf192,435 views • 9 months ago

Wow! Super impressive work by the new Amazon FAR team (from Covariant acquisition). Mapping long sequences of human motion (>30 sec) on robots with a differing shapes or interating with objects (box, table, etc) of different size. Enabling easier in-simulation data-augmentation and zero-shoot transfer. Super impressive and huge help to reduce the need for human teleop data (which is very complex to gather for humanoids) Dataset trajectories on Hugging Face (search OmniRetarget), full code framework to come soon Project page has some pretty three.js interactive demos
Thomas Wolf149,337 views • 8 months ago

The kyutai fully end-to-end audio model demo of today is a huge deal that many people missed in the room Mostly irrelevant are the facts that: - they come a few week after OpenAI ChatGPT-4o - the demo was less polished than the 4o one (in terms of voice quality, voice timing…) Relevant: - the model training pipeline and model archi are simple and hugely scalable, with a tiny 8+ people team like Kyutai building it in 4 months. Synthetic data is a huge enabler here - laser focus on local devices: Moshi will soon be everywhere. Frontier model builders have low incentive to let you run smaller models locally (price per token…) but non-profits like Kyutai have very different incentives. The Moshi demo is already online while the OpenAI 4o one is still in limbo. - going under 300 ms of latency while keeping Llama 8B or above quality of answers is a key enabler in terms of interactivity, it’s game changing, This feeling when the model answer your question before you even finished asking is quite crazy or when you interrupt the model while it’s talking and it react… Predictive coding in a model, instantly updated model of what you’re about to say... Basically they nailed the fundamentals. It’s here. This interactive voice tech will be everywhere. It will soon be an obvious commodity.
Thomas Wolf339,432 views • 1 year ago

Reachy Mini starring in Jensen's CES keynote 🌟 really proud is was so prominently featured on stage and humbled that our product is getting so many AI builders excited and building you don't have to make humanoids just because everyone else is talking about them – be contrarian - build what you think is the right thing to create now
Thomas Wolf47,498 views • 4 months ago

There is a beautiful story that just happened in AI so let me share it for a lighter tone weekend post among all the doom stories in our AI field this week. It’s a story of people on three continents building and sharing in the open a new small efficient and state-of-the-art AI model. It started a couple of months ago when a new team in the AI scene released their first model from their headquarters in Paris (France): Mistral 7B. Impressive model, small and very strong performances in the benchmarks, better than all previous models of this size. And open source! So you could build on top of it. Lewis in Bern (Switzerland) and Ed (in Lyon, in the South of France) both from the H4 team, a team of researchers in model fine-tuning and alignment were talking about it over a coffee, in one of these gatherings that often happen at Hugging Face to break the distance between people (literal distance as HF is a remote company). What about fine-tuning it using this new DPO method that a research team from Stanford in California just posted on Arxiv, says one? Hey, that’s a great idea, replies the other. We've just build a great code base (with Nathan, Nazneen, Costa, Younes and all the H4 team and TRL community) let's use it! The next day they start diving in the datasets openly shared on the HF hub and stumble upon two interesting large and good quality fine-tuning datasets recently open-sourced by OpenBMB, a Chinese team from Tsinghua: UltraFeedback and UltraChat. A few rounds of training experiments confirm the intuition, the resulting model is super strong, by far the strongest they have ever seen in their benchmarks from Berkeley and Stanford (LMSYS and Alpaca). Join Clementine, the big boss of the open evaluation leaderboard. Her deep dive into the model capabilities confirms the results: impressive performance. But the H4 team also hosts a famous faculty member, Pr. Sasha Rush, Associate Professor at Cornell University in his daytime, hacker at HF in his nighttime. Joining the conversation, he proposes to quickly draft a research paper to organize and share all the details with the community. A few days later, the model, called Zephyr (a wind like Mistral), paper, and all details are shared with the world. Quickly other companies, everywhere in the world starts to use it. LlamaIndex, a famous data framework and community, shares how the model blew their expectations on real-life use-case benchmarks, while researchers and practitioners discuss the paper and work on the Hugging Face hub. All this happened in just a few weeks catalyzed by open access to knowledge, models, research, and datasets released all over the world (Europe, California, China) and by the idea that people can build upon one another work in AI to bring real-world value with efficient and open models. Stories like this are numerous everywhere around us and make me really proud of the AI community and see how we can build amazingly useful things together. [the video is just me reading this Friday post hahah]
Thomas Wolf169,127 views • 2 years ago

We've just released the new Spaces search and it's totally mind blowing Explore over 400k AI Apps in the most intuitive way background removal, image-to-3D, comic factory, sound transcription, image editing, clothes virtual try-on, etc All made by AI builders for AI builders
Thomas Wolf40,458 views • 1 year ago

briefly chatting on Bloomberg earlier today about DeepSeek and open-source AI
Thomas Wolf26,337 views • 1 year ago