
Harrison Kinsley
@Sentdex • 104,520 subscribers
gpus and tractors. Director of AI and Engineering @ https://t.co/H4St8dd1ip Neural networks from Scratch book: https://t.co/hyMkWyUP7R https://t.co/8WGZRkUGsn
Shorts
Videos

People often ask how did the Unitree robots get so good all of a sudden. It wasn't all of a sudden, and it's because they ship their hardware and open source their SDKs. Arguably these robots are nearly useless out of the box, but you have full dev control of them. Because of that, the hardware has become a very popular R&D platform with an ecosystem around it and the Unitree G1 is undoubtedly an order of magnitude better than it could ever be at this point if Unitree was instead just doing quiet internal dev of both the hardware and software. Too many hardware companies for really cool products that seek to be community-driven (robots, AR glasses...etc) desire to make a profitable walled garden and their greed just ends up walling out developers and their product gets outpaced by the G1s of the world.
Harrison Kinsley815,526 просмотров • 8 месяцев назад

I need a picker upper pupper in my twin toddler life.
Harrison Kinsley13,751 просмотров • 25 дней назад

just cooked up a new sprinter policy, do we attempt sim2real?
Harrison Kinsley57,912 просмотров • 7 месяцев назад

picking something up off the floor w/ a humanoid is more challenging than a backflip
Harrison Kinsley25,485 просмотров • 3 месяцев назад

In a world of PPO everything for reinforcement learning, I've been tinkering with SAC for training a quadruped gait. This gait is trained purely on CPU (training on one of the Dell GB10s) on a single environment. Training any particular run is obviously slower than PPO on an RTX Pro 6000 with 8092 envs, if you already know the exact hyperparams/rwd function for your PPO algo... but, if we're honest with ourselves, then we know we usually spend days tuning our PPO algo and fighting it to do what we want. In contrast, SAC has kind of been a breath of fresh air, very amenable to changing the reward function to tune behavior. So far, my first attempts to tune things have consistently just worked immediately rather than 15 different variations of reward hacking only to find previous tuned behaviors got lost in the process. There is also FastSAC, which I've not yet tried, but can speed things up potentially and introduce scale back into the equation. My main painpoint in getting SAC to work for gait was actually getting it to learn to step. It seems as though SAC is not as good as PPO at significant exploration on its own. I ended up starting with a sinusoidal gait (basically just a rule to make legs swing) as training wheels then blended it out through training as phase 1, then began working on smoothing things out after this. I think if we look at end to end dev time rather than any particular run that finally managed to work, SAC may actually be the "faster" algorithm to train. Quadruped gaits are inherently easier than bipedal and maybe there are areas where SAC falls short, but I'll definitely be spending more time with SAC.
Harrison Kinsley26,711 просмотров • 3 месяцев назад

This is a vertically integrated end to end deep neural network performing forward pass inference real-time, controlling individual actuator's torque output for bidpedal gait generation in adverse, GPS denied, envs. ok its standard PPO rl trained in mjlab, strapped to a tractor.
Harrison Kinsley34,118 просмотров • 5 месяцев назад

Ladies and gentlemen, we have our first successful sim2real transfer on Geoff the G1!
Harrison Kinsley28,619 просмотров • 6 месяцев назад

after many failed attempts to make a single model that walks and crouches with all the attributes I wanted, I ended up just splitting into 2 models with basic control logic to swap between them. Could just keep adding "skills" as models and tweak those specific models as needed
Harrison Kinsley19,212 просмотров • 4 месяцев назад

alright, finally we're almost ready to ship our new policy to the world
Harrison Kinsley20,756 просмотров • 7 месяцев назад
Больше нет контента для загрузки

