Constructing interactive simulated worlds has been a challenging problem, requiring considerable manual effort for asset creation and articulation, and for composing assets to form full scenes. In our new work, DRAWER, we made the process of creating scenes in simulation as simple as taking a video of the scene...

Abhishek Gupta

11,566 subscribers

12,072 views • 1 year ago • via X (Twitter)

Science & Technology

10 Comments

Abhishek Gupta • 1 year ago

So DRAWER builds a “dual-representation” of a scene, integrating the strengths of Gaussian splatting and Neural SDFs to have both high fidelity and quickly rendered visuals (from GS), and accurate geometry (from Neural SDFs). This enables highly realistic scene creation that simulates efficiently enough for high-throughput applications like RL - with a combination of geometry, appearance, articulation and speed. (2/7)

Abhishek Gupta • 1 year ago

Given this static scene, DRAWER then uses physical reasoning from foundation models (3DOI/GPT-4) to articulate the scene, and fills in the inside of cabinets and drawers using techniques for amodal shape estimation with hidden region texturing. This allows the scene to be fully interactable for tasks like placing objects inside cabinets. (3/7)

Abhishek Gupta • 1 year ago

Ok neat, so what can we do with DRAWER? Firstly, we show that we can quickly get reconstructions of a diverse set of scenarios. Here are fully interactive kitchen simulations at both UW and UIUC, reconstructed from *just* videos of the scene taken from a phone using the same pipeline. (4/7)

Abhishek Gupta • 1 year ago

Secondly, you can use this for gaming applications. Here is a fun application made by @XHongchi97338 in Unreal Engine to interact with various elements of the environment. For instance, one can explore each environment, open cabinets to look for objects of interest, and shoot objects in the scene with dynamic realism! (5/7)

Abhishek Gupta • 1 year ago

Next, we can use this for training in robotics! We show that we can easily generate data in simulation for training robotic policies, and the resulting policies can transfer directly to the real world! The process of data generation becomes as simple as taking a video and then running a simple motion planner in simulation, with minimal human effort. (6/7)

Abhishek Gupta • 1 year ago

This work was a tour-de-force from @XHongchi97338! I was pretty mind-blown when he first showed us these results :) I learned a lot about 3D vision and simulation in the process :) Fun collaboration with @XHongchi97338, @EntongSu, @memmelma, @prodarhan, @yu_raymond5, @nums_ai, Ali Farhadi, @ShenlongWang, @weichiuma. Hoping this can be a useful tool for many in the community to build interactive simulation environments quickly and easily. Especially hoping it's going to make robotics a lot easier! Paper: Website: Code:

Abhishek Gupta • 1 year ago

@EntongSu @memmelma @prodarhan @yu_raymond5 @nums_ai Forgot to mention - this will be presented at #CVPR2025!

Kaixin Chai • 1 year ago

wow, really impressive!

Junshan Huang • 1 year ago

This is really useful! Thank you!

Related Videos

Here’s a simple breakdown of the hair simulation process we used for May in this shot. Watch the full video at #toanimate #rigging #blender

TOAnimate

20,109 views • 7 months ago

Robora Sim: A PyBullet-Powered Environment for Learning Robotic Physical Intelligence We are currently building our Robora simulation environment setup for our sim-based learning, leveraging PyBullet, an industry-standard physics engine widely used in AI-driven robotics research and development. The environment is optimized with GPU-accelerated learning algorithms, enabling high-speed imitation learning and reinforcement learning within a safe and controlled virtual setup before shipping out to the real world. This simulation platform allows our models to learn, adapt, and generalize across different robot morphologies, terrain types and task objectives - all before deployment to the real world. At its core, the system combines a VLA-powered high-level planner with low-level motion control algorithms, working cohesively to produce emergent, physically intelligent behaviors. This synergy between simulation, learning, and real-world transfer marks a major step forward in our pursuit of adaptive and intelligent robotic systems. Through advanced domain randomization and synthetic data generation, the Robora Simulation Environment ensures that policies trained in simulation transfer effectively to real-world robots, minimizing the sim-to-real gap. Moreover, users will be able to test and integrate their own hardware kits within selected simulation environments in the Robora Dapp, ensuring seamless compatibility and safer real-world implementation.

Robora

23,489 views • 9 months ago

This is how Strike Robot turns Simulation into Reality! One of the biggest challenges in robotics is ensuring that behaviors validated in simulation work reliably in the real world. For this experiment, we reconstructed part of a real laboratory at Eastworlds inside SR Platform. The generated layout was then deployed into MuJoCo. Using SR Agentic, the robot was tasked with finding abnormal objects in a cluttered environment and sending a Telegram notification when detected. Before deployment, everything is validated in simulation.

Strike Robot

15,025 views • 27 days ago

I prepared a few demos for the new simulation nodes. Here's a fun example of custom 2D curve simulation that I worked on for FX on our current short `Pet Projects` at Blender Studio 🔶 Check out everything new in Blender 3.6, and the demo files here: #b3d

Simon Thommes

296,188 views • 3 years ago

Introducing 📦𝗔𝗿𝘁𝗶𝗟𝗮𝘁𝗲𝗻𝘁🔧 (SIGGRAPH Asia 2025) — a high-quality 3D diffusion model that explicitly models object articulation, paving the way for richer, more realistic assets in embodied AI and simulation: – Generates fully articulated 3D objects – Physically plausible joints & motion – High-fidelity 3D Gaussian appearance – Supports generation from a single real image arXiv: Project: Code (coming soon):

Xingang Pan

11,517 views • 8 months ago

Foliage Tools simulation plug-in for Unreal Engine 5 from artbyrens has received a new beta version. The updated version received a renewed UI and features such as Packed level actor support and more:

80 LEVEL

14,696 views • 3 months ago

When I visited NASA Ames back in July I got a look at this supercomputer simulation showing why SLS has acquired a pair of strakes next to the SRBs. These work as vortex generators that inhibit the large oscillations in the airflow between the core and boosters

Scott Manley

249,954 views • 10 months ago

🚨 Uganda's general just demanded Turkey's most beautiful woman as a wife and that was the SECOND most insane part of his demands.. $1 billion. 30 days. or the embassy closes. Turkey has F-16s and NATO. Uganda has a general posting on X like it's a wedding registry. no diplomats. no summits. just vibes and a wife request. we are not in a simulation.

BuBBliK

2,097,770 views • 3 months ago

A Letter to Our Community: The Road Ahead for Robotics To our Community and Partners, As we step into 2026, our mission at Axis is clearer than ever: Constructing the definitive End-to-End Scaling Layer for Robotics. Our goal is to accelerate the transfer of diverse human intelligence into Robotics General Intelligence (RGI). By owning the critical path of intelligence creation, we are turning the physical limitations of robotics into a scalable, software-driven future. Here is our strategic outlook and roadmap for the year ahead. The Core Thesis: Simulation is the Only Way Out The path to RGI is currently blocked by Data Scarcity, Generalization Fragility, and Hardware Fragmentation. At Axis, we believe Simulation is the only way out. Our Simulation Data Platform and Data Augmentation Engine transform raw data into "Synthetic Gold". Backed by academic milestones like Roboverse, Skill Blending, and GraspVLA, we have proven that pure simulation can achieve the generalization required for the real world. We don’t just collect data; we architect it. The Engine: Why Crypto? We believe RGI should come from all, not a few. Crypto is not just a feature; it is the primitive that powers our entire ecosystem flywheel: - Incentive Mechanism: Democratizing contribution and rewarding the trainers and developers. - Assetization: Turning proprietary data and refined models into liquid, ownable assets. - Verifiable Workflow: We are opening the "Black Box" of AI. By bringing total transparency to the Task Generation → Data Collection → Model Training pipeline, we ensure every byte of intelligence is verifiable, traceable, and secure. 2026 Strategic Deliverables This year, we are committed to delivering three foundational pillars: - The World's Largest Training Dataset for Robots: A robot training set—diverse, high-quality interaction data at an unprecedented scale. 
- A Robotics Foundation Model: A universal robotic brain trained on our pure simulation and synthetic data, capable of robust cross-embodiment transfer and open-world adaptability. - Evolvable Robot Hardware: Robots deployed with Axis models that autonomously evolve through continuous interaction, turning every deployment into a self-improving node within our RGI network. The Ultimate Vision We are building more than models; we are architecting the Distributed Machine Economy. A future where every dataset, model, and robotic embodiment is a verifiable asset in a global, autonomous network. Thank you for building the future of intelligence with us✌️📷

Axis Robotics

27,858 views • 6 months ago

Spindrift has been pushing boundaries in product and brand for years. As one of our earliest partners, we are excited to share how they have leveraged simulation to accelerate their truly innovative company. Over the past several months, we’ve worked to test and develop new products, supported marketing and media efforts, and provided clarity on their toughest questions.

Aaru

89,907 views • 8 months ago

Claude Fable 5 / Mythos is absolutely insane. People are already one-shotting full games, simulations, worlds, physics systems, robots, and stuff that used to need entire teams. Top generations so far: Minecraft in browser Full universe simulation Skyrim-style browser world Fable beat Pokemon by itself Crysis-style destruction physics Level Devil remake Infinite gothic city Water in glass gravity simulation Designed a whole robot German submarine simulation 3D world building from text Neon racing game This feels like the first real glimpse of AI software creation going fully wild.

VORTEX: AI Bros & AI Arena, Peak AI Buzz

74,373 views • 1 month ago

There's a lot of speculation about whether OpenAI's video generation model Sora has a 'physics engine' (bolstered by OAI's own claims about 'world simulation'). Like the debate about world models in LLMs, this question is both genuinely interesting and somewhat ill-defined. 🧵1/

Raphaël Millière

617,441 views • 2 years ago

📢 Our lab has been exploring 3D world models for years — and we’re thrilled to share **PhysTwin**: a milestone that reconstructs object appearance, geometry, and dynamics from just a few seconds of interaction! Led by the amazing Hanxiao Jiang 👉 PhysTwin combines **Gaussian splatting** with **inverse dynamics optimization** based on simple **spring-mass** systems. ⚙️ The result? Real-time, action-conditioned 3D video prediction under novel interactions (i.e., 3D world models). 🔑 A few key takeaways: 1. Having the right structure (e.g., particles/masses) helps navigate the trade-off between sample efficiency, generalization, and broad applicability. 2. Visual foundation models (VFMs) have matured to the point where they can provide rich supervision for world modeling (e.g., tracking, shape completion). 3. Beyond VFMs, many crucial components have come together in recent years: Gaussian splats for rendering, NVIDIA Warp for high-performance simulation, and scene/asset generation from a wide range of labs and companies. The future of 3D world models is looking bright! ✨ 4. The resulting digital twin supports a wide range of downstream applications—especially in data generation and policy evaluation, thanks to its realistic rendering and simulation capabilities. 🎥 All code and data to reproduce the results, along with interactive demos, are available on the website. Check the following visualizations of: (1) observations, (2) reconstructed state/actions, (3) interactive digital twins, and (4) the overlays between real-world robot teleoperation and our model’s open-loop predictions.

Yunzhu Li

25,279 views • 1 year ago

Rare high view of the building left over from a previous simulation that humans here did not build and has false documentation of its "632 year" construction to cover its tracks so no one realizes it. In my humble opinion of course.

Biff

294,850 views • 6 months ago

System identification (sysid) is the process of finding the physical parameters that make a simulation match reality. If you're training an RL locomotion policy in simulation, the accuracy of your motor model directly affects how well the policy transfers to the real robot. A recent git commit by Kevin Zakka added a sysid toolbox to MuJoCo which automates this process: you provide recorded motor data and a MuJoCo model, and it optimizes the model parameters to minimize the difference between simulated and real trajectories. For my RobStride Dynamics RS02 QDD motors (17 Nm peak, 7.75:1 gear), I built a Rust tool that sends multi-sine torque excitation at 1 kHz and records position/velocity feedback. I then feed this data into MuJoCo's sysid optimizer.

David Bar

48,347 views • 3 months ago

🚀Major update to Blender MCP: 🌄 Generate high-quality 3D assets in Blender by just prompting, thanks to Hyper3D by Deemos's Rodin AI. 🎬 Demo: Creating a cozy scene with a cat. Just describe the assets you want, and it will show up right in Blender. No manual modeling needed!

siddharth ahuja

109,971 views • 1 year ago

With Hunyuan3D World Model 1.0 now released and open-sourced, we're excited to showcase the technical highlights behind this impressive innovation: ✅360° Panoramic Generation: Creates complete, immersive “world scenes”, far beyond localized views. ✅Explorable 3D Scene Generation: Generates diverse, spatially consistent 3D worlds from text/image for truly immersive exploration. ✅Interactive/Editable: Achieves separation of foreground objects, background terrain, ground, and sky, for seamless secondary editing. ✅Exportable Mesh: Generated scenes can be exported as 3D meshes for direct import into mainstream game engines and modeling software. ✅Industry-Leading SOTA Evaluation: Surpasses state-of-the-art open-source models in generation quality. As the industry's first open-source model for physical simulation and explorable world generation, Hunyuan3D World Model 1.0 aims to foster a collaborative community ecosystem with developers and enthusiasts. ✨ Try it now: 🤗 Hugging Face:

Tencent Hy

23,150 views • 11 months ago