Uploaded: 2026-04-13T10:22:56.000Z
Duration: PT11.935S
Channel: David Bar

System identification (sysid) is the process of finding the... physical parameters that make a simulation match reality. If you're training an RL locomotion policy in simulation, the accuracy of your motor model directly affects how well the policy transfers to the real robot. A recent git commit by Kevin Zakka added a sysid toolbox to MuJoCo which automates this process: you provide recorded motor data and a MuJoCo model, and it optimizes the model parameters to minimize the difference between simulated and real trajectories. For my RobStride Dynamics RS02 QDD motors (17 Nm peak, 7.75:1 gear), I built a Rust tool that sends multi-sine torque excitation at 1 kHz and records position/velocity feedback. I then feed this data into MuJoCo's sysid optimizer.show more

David Bar

47,996 görüntüleme • 2 ay önce

I've gotten a mujoco sim RL training loop for... show more

kache

29,377 görüntüleme • 3 gün önce

System ID for legged robots is hard: (1) Discontinuous... dynamics and (2) many parameters to identify and hard to "excite" them. SPI-Active is a general tool for legged robot system ID. Key ideas: (1) massively parallel sampling-based optimization, (2) structured parameter space, and (3) active exploration based on Fisher Information to collect the most informative data in real. SPI-Active provides an accurate robot model and effectively reduces the sim2real gap. In sim2real policy learning setting, it outperforms baselines by 42-63% in various quadruped & humanoid tasks. Led by Nikhil Sobanbabu Guanqi Heshow more

Guanya Shi

21,442 görüntüleme • 1 yıl önce

A visualisation of different velocity factors in a PCB.... show more

Lukas Henkel

34,402 görüntüleme • 1 yıl önce

In flow matching, a coupling determines how noise and... data samples are paired during training. The choice of coupling is important because it influences the geometry of trajectories at inference time. The simplest choice is the independent coupling, where noise and data points are paired arbitrarily. This can lead to curved trajectories as the model averages over many conflicting pairings. However, if we use optimal transport on batches of pairs, this leads to fewer ambiguous intersections that the model must resolve, leading to straighter trajectories at inference time.show more

Alec Helbling

64,978 görüntüleme • 29 gün önce

hi all, excited to join! i'm building an expressive... mini "shoggoth" robot which will eventually be hooked up to gpt4o realtime voice. i'm currently working on the low-level policies, which are trained in a mujoco simulation with RL. to delay working on raw-pixels for now, i trained a pose-estimation model using deeplabcut and triangulate the position in 3d space using the stereo cameras. eventually, i'll use gpt4o's tool calling capabilities to activate several of these policies (closed and open loop) based on the dialog flow! captions: manual actuation of the tentacle / 3d pose estimation / target designshow more

Matthieu LC

45,506 görüntüleme • 1 yıl önce

A $50B market. A new model for data. And... show more

Pyth Network 🔮

13,863 görüntüleme • 5 ay önce

✨ Made a new mini feature on Photo AI:... [ Grab from 3d model ] So the problem is we're at that stage in time (typical for AI) where image-to-3d models are not good enough but are fun to play with, but we know they'll be good enough in 1-2 years With [ Make 3d model ] you already can turn any Photo AI pic into a 3d model but it still looks hyper clunky and deformed, but it works! One cool idea I had to make that more useful and made now: Let people make a 3d model then change the view of the it with the 3d viewer, then press [ o ] and it grabs a frame of the 3d That image you can then [ Remix ] (img2img), and it becomes a real photo again and that in turn you can then turn into a video again with [ Make video ] So that essentially gives you a fully freeform camera position control to take photos with One thing I need to fix is the background/skybox, I kinda need to take the original photo and remove the person and just get the background for the 3d model viewer, in this case it should be white, but it's a start!show more

@levelsio

119,210 görüntüleme • 11 ay önce

A Letter to Our Community: The Road Ahead for... Robotics To our Community and Partners, As we step into 2026, our mission at Axis is clearer than ever: Constructing the definitive End-to-End Scaling Layer for Robotics. Our goal is to accelerate the transfer of diverse human intelligence into Robotics General Intelligence (RGI). By owning the critical path of intelligence creation, we are turning the physical limitations of robotics into a scalable, software-driven future. Here is our strategic outlook and roadmap for the year ahead. The Core Thesis: Simulation is the Only Way Out The path to RGI is currently blocked by Data Scarcity, Generalization Fragility, and Hardware Fragmentation. At Axis, we believe Simulation is the only way out. Our Simulation Data Platform and Data Augmentation Engine transform raw data into "Synthetic Gold". Backed by academic milestones like Roboverse, Skill Blending, and GraspVLA, we have proven that pure simulation can achieve the generalization required for the real world. We don’t just collect data; we architect it. The Engine: Why Crypto? We believe RGI should come from all, not a few. Crypto is not just a feature; it is the primitive that powers our entire ecosystem flywheel: - Incentive Mechanism: Democratizing contribution and rewarding the trainers and developers. - Assetization: Turning proprietary data and refined models into liquid, ownable assets. - Verifiable Workflow: We are opening the "Black Box" of AI. By bringing total transparency to the Task Generation → Data Collection → Model Training pipeline, we ensure every byte of intelligence is verifiable, traceable, and secure. 2026 Strategic Deliverables This year, we are committed to delivering three foundational pillars: - The World's Largest Training Dataset for Robots: A robot training set—diverse, high-quality interaction data at an unprecedented scale. - A Robotics Foundation Model: A universal robotic brain trained on our pure simulation and synthetic data, capable of robust cross-embodiment transfer and open-world adaptability. - Evolvable Robot Hardware: Robots deployed with Axis models that autonomously evolve through continuous interaction, turning every deployment into a self-improving node within our RGI network. The Ultimate Vision We are building more than models; we are architecting the Distributed Machine Economy. A future where every dataset, model, and robotic embodiment is a verifiable asset in a global, autonomous network. Thank you for building the future of intelligence with us✌️📷show more

Axis Robotics

27,858 görüntüleme • 5 ay önce

Your model crushed the benchmark. Then it couldn’t pick... up a cup. That’s the reality nobody talks about. You train in simulation, it falls apart on real hardware. You collect real-world data instead (months of teleop, physical setups, safety protocols) and still can’t scale it. Meanwhile, your model outgrows every available benchmark, and you have no way to know if you’re actually getting better. That costs you iteration speed. Engineering hours. And confidence in every decision about when to ship. This year I’m collaborating with Lightwheel to cover robotics and embodied AI at NVIDIA GTC — Booth 1406, March 16–19, San Jose. Live demos of what happens when the physics ACTUALLY match. Worth a stop. I’ll be there. Saying hello to Steve Xie 👋show more

Ilir Aliu

10,575 görüntüleme • 3 ay önce

𝗣𝗼𝗽𝘂𝗹𝗮𝗿 𝗼𝗽𝗶𝗻𝗶𝗼𝗻: "𝗝𝘂𝘀𝘁 𝗴𝗲𝗻𝗲𝗿𝗮𝘁𝗲 𝗺𝗼𝗿𝗲 𝘀𝗶𝗺𝘂𝗹𝗮𝘁𝗶𝗼𝗻 𝗱𝗮𝘁𝗮." After working... with many 𝗿𝗼𝗯𝗼𝘁 𝗺𝗮𝗻𝗶𝗽𝘂𝗹𝗮𝘁𝗶𝗼𝗻 teams who've fallen into the simulation trap, here's what I've learned: Simulation teaches your robot to be really, really good at simulation. Unlike blind locomotion policies that can get away with sim-to-real transfer because they rely mainly on proprioception and contact forces, 𝘃𝗶𝘀𝗶𝗼𝗻-𝗴𝘂𝗶𝗱𝗲𝗱 𝗺𝗮𝗻𝗶𝗽𝘂𝗹𝗮𝘁𝗶𝗼𝗻 𝗶𝘀 𝗲𝘅𝘁𝗿𝗲𝗺𝗲𝗹𝘆 𝘀𝗲𝗻𝘀𝗶𝘁𝗶𝘃𝗲 𝘁𝗼 𝘃𝗶𝘀𝘂𝗮𝗹 𝗱𝗼𝗺𝗮𝗶𝗻 𝗴𝗮𝗽. The subtle differences accumulate: - Simulated friction vs real surface textures - Perfect lighting vs shadows, reflections, glare - Ideal object geometries vs manufacturing tolerances - Instantaneous sensor readings vs real-world noise and latency - Clean backgrounds vs cluttered, dynamic environments 𝗧𝗵𝗲 𝗰𝗹𝗮𝘀𝘀𝗶𝗰 𝗽𝗿𝗼𝗴𝗿𝗲𝘀𝘀𝗶𝗼𝗻: Week 1: "Our model works perfectly in sim!" Week 2: "Let's collect some real data to fine-tune." Week 3: "The real data completely contradicts what the sim taught..." Week 4: "Okay, let's collect way more real data." Month 2: "We basically need to retrain from scratch." 𝗧𝗵𝗲 𝗽𝗮𝗶𝗻𝗳𝘂𝗹 𝘁𝗿𝘂𝘁𝗵: There's no shortcut to real-world data collection for vision-based manipulation. Simulation is amazing for debugging, prototyping, safety testing, and of course to supplement your real data. But it's not a substitute for understanding how your robot actually behaves in the actual environment. 𝗪𝗵𝗮𝘁 𝘄𝗼𝗿𝗸𝘀: Use simulation strategically - for exploring edge cases, testing safety boundaries, and rapid iteration. But build your production models on real data from real environments. The teams that succeed treat simulation as a powerful tool, not a magic solution. This is why Neuracore focuses on making real-world data collection so much easier and faster. Because the physics of your actual environment can't be simulated away. 𝗪𝗼𝗿𝗹𝗱 𝗺𝗼𝗱𝗲𝗹𝘀, 𝘆𝗼𝘂 𝘀𝗮𝘆? 𝗪𝗲𝗹𝗹, 𝗽𝗲𝗿𝗵𝗮𝗽𝘀 𝗺𝗼𝗿𝗲 𝗼𝗻 𝘁𝗵𝗮𝘁 𝗶𝗻 𝗮𝗻𝗼𝘁𝗵𝗲𝗿 𝗽𝗼𝘀𝘁! 𝗪𝗵𝗮𝘁'𝘀 𝗯𝗲𝗲𝗻 𝘆𝗼𝘂𝗿 𝗲𝘅𝗽𝗲𝗿𝗶𝗲𝗻𝗰𝗲 𝘄𝗶𝘁𝗵 𝘀𝗶𝗺-𝘁𝗼-𝗿𝗲𝗮𝗹 𝘁𝗿𝗮𝗻𝘀𝗳𝗲𝗿? 𝗛𝗮𝘀 𝗶𝘁 𝘄𝗼𝗿𝗸𝗲𝗱 𝗮𝘀 𝘄𝗲𝗹𝗹 𝗮𝘀 𝗲𝘅𝗽𝗲𝗰𝘁𝗲𝗱?show more

Stephen James

31,009 görüntüleme • 10 ay önce

Your model can't zoom up as much as your... friends? Send this to your rigger❗️ - In the video, left side is adjusted exported param, right side is the default. - The left model has a more close up default state and can zoom up more than the right The export parameters that you should pay attention to - Center of model Y vertically affects the center of exported model - Canvas scale unit: How big the default state of your model is Personally I would recommend having - Center of model Y at 0.25 - 0.15 ( to focus on the face) - Canvas scale unit: 4.0 #live2d #live2dtip #vtubestudioshow more

ALKANimate | 2d rigging comm (closed for 2025) |

160,740 görüntüleme • 2 yıl önce

How do we create realistic models of dressed humans... show more

Qingqing Zhao

66,926 görüntüleme • 2 yıl önce

Now that I've recovered my model files I thought... show more

Iko ☯️💀【VTuber】

27,192 görüntüleme • 1 yıl önce

i trained this computer vision model just by asking... for "tennis players", using roboflow rapid and SAM3 the model picks out Alcaraz and Sinner, while correctly avoiding the ball boys and fans then I exported the model to python and wrote a script to filter the data, track movement history, and annotate the final video this was a quick experiment but I could expand the project to track player speeds, distance covered, shot count, etc. could use this for a sports analytics app that tracks live matches and calculates custom metricsshow more

293,409 görüntüleme • 6 ay önce

I present the Timeless Transitional Universe Theory V9 TUTT... - Space is not a VOID but a fluid dynamic fabric. I’ve unified strong and weak nuclear force, gravity, and electromagnetism my model functions between super activity and entropy. NEED PEER REVIEW HELP! You can see in the video clip, the activity of this fabric of space. My model uses the voyager probes to validate for empirical data. This is version nine of months of refinement. This model removes the man-made construct of temperature and time. Feel free to ask any questions. I recommend uploading these images to a GPT and then assessing its value :-) thank you for your time it is appreciated… Even though it is a man-made construct.show more

AskACapper

80,201 görüntüleme • 1 yıl önce

A viral paper "Language Model Represents Space and Time"... recently claims that LLMs learn "world models". As much as I like Max Tegmark's works, I disagree with their definition of world model. World model is a core concept in AI agent and decision making. It is our mental simulation of how the world works given interventions (or lack thereof). A world model captures causality and intuitive physics, telling the agent what is likely and what is impossible. It can and should be used for counterfactual reasoning, i.e. "what ifs": what would happen if I knock over a cup of water? Where would I have been if I had not taken that bus? Yann LeCun Yann LeCun says it well in his position paper ( I quote: "Using such world models, animals can learn new skills with very few trials. They can predict the consequences of their actions, they can reason, plan, explore, and imagine new solutions to problems. Importantly, they can also avoid making dangerous mistakes when facing an unknown situation." The first use of the term World Model in deep policy learning is attributed to hardmaru & Jürgen Schmidhuber: In their seminal paper, an agent masters shooting skills in the popular game Doom (demo below) by learning in imagination, using an internal world model as a "physics simulator". To put in a simple Python math formula, world model learns a function F(s[0:t-1], a) -> s[t:], which takes as input the observed past and current action, and outputs plausible future states. Now the definition of World Model in Tegmark's paper seems to be about predicting GPS coordinates and time eras. I see this as just a classification task with no causal learning and simulation going on. You cannot make meaningful interventions against that model, nor can you optimize any decision making in a closed feedback loop. As for the "space & time neurons", I think they are most similar to the "sentiment neuron" that OpenAI published in 2017: Predicting GPS is conceptually no different from predicting sentiment in my opinion. I don't think their experimental results are wrong - just that their conclusion is on shaky grounds. I welcome any debate! Paper link:show more

Jim Fan

593,909 görüntüleme • 2 yıl önce

i built an open world minigame that's controlled with... hand movements only here's a step-by-step tutorial: > started with my mediapipe + threejs template (see QT) > added a 3D model made by quaternius > sent a few prompts to gemini 3... --- prompt #1: repurpose the attached script, but now I want to import a gltf model (assets/model.gltf) and use that in the scene instead of the cube when the user moves their hand, a waypoint indicator should move around the scene, and the 3D model should smoothly move there when the user makes a fist, the model should jump the 3D model contains bundled animations in it. use the "idle", "run", and "jump" animations --- prompt #2: make this a procedural open world adventure. generate very simple procedural voxel terrain when the model gets near the edges of the screen, the camera should move, allowing the model to keep going in that direction --- prompt #3: add some floating glowing gems around the map that the user can collect --- i made some manual tweaks for styling and game feel, but that's the gist of it thanks for reading if you got all the way here full code is available at my link in bioshow more

131,799 görüntüleme • 6 ay önce

the fact that i can take an image of... show more

Jan

131,884 görüntüleme • 6 ay önce

Live Cam