NVIDIA AI Developer's banner
NVIDIA AI Developer's profile picture

NVIDIA AI Developer

@NVIDIAAIDev111,125 subscribers

This account is inactive. Follow @NVIDIAAI for the latest updates.

Shorts

Today, we released Lyra 2.0, a framework for generating persistent, explorable 3D worlds at scale, from NVIDIA Research. Generating large-scale, complex environments is difficult for AI models. Current models often “forget” what spaces look like and lose track of movement over time, causing objects to shift, blur, or appear inconsistent. This prevents them from creating the reliable 3D environments required for downstream simulations. Lyra 2.0 solves these issues by: ✅ Maintaining per-frame 3D geometry to retrieve past frames and establish spatial correspondences ✅ Using self-augmented training to correct its own temporal drifting. Lyra 2.0 turns an image into a 3D world you can walk through, look back, and drop a robot into for real-time rendering, simulation, and immersive applications. ➡️ Learn more: 📄 Read the paper:

Today, we released Lyra 2.0, a framework for generating persistent, explorable 3D worlds at scale, from NVIDIA Research. Generating large-scale, complex environments is difficult for AI models. Current models often “forget” what spaces look like and lose track of movement over time, causing objects to shift, blur, or appear inconsistent. This prevents them from creating the reliable 3D environments required for downstream simulations. Lyra 2.0 solves these issues by: ✅ Maintaining per-frame 3D geometry to retrieve past frames and establish spatial correspondences ✅ Using self-augmented training to correct its own temporal drifting. Lyra 2.0 turns an image into a 3D world you can walk through, look back, and drop a robot into for real-time rendering, simulation, and immersive applications. ➡️ Learn more: 📄 Read the paper:

432,726 views

Most 3DGS segmentation tools either pre‑train per scene or lock errors into a feature field you can’t undo. ArtisanGS instead turns a few 2D masks into editable 3D object selections via Cutie tracking + black‑box splat aggregation, then lets you iteratively correct mistakes with consistent 2D/3D selection modes. 📗 #NVIDIAResearch paper:

Most 3DGS segmentation tools either pre‑train per scene or lock errors into a feature field you can’t undo. ArtisanGS instead turns a few 2D masks into editable 3D object selections via Cutie tracking + black‑box splat aggregation, then lets you iteratively correct mistakes with consistent 2D/3D selection modes. 📗 #NVIDIAResearch paper:

42,107 views

🤖 One predictive backbone, three distinct tasks, consistent gains: a strong signal that investing in reusable world models is the right abstraction for robot learning. Toyota Research Institute (TRI) just ran the same idea three different ways—and it worked each time. Using our NVIDIA Cosmos Predict 2–style world models, they hit SOTA with dynamic view synthesis, teleop data augmentation (even up against MimicGen as an upper bound), and navigation world models. 🙌 📚 dynamic view synthesis 📚 teleop data augmentation (also compare to MimicGen as an upper bound) 📚 navigation world models 📸 from

🤖 One predictive backbone, three distinct tasks, consistent gains: a strong signal that investing in reusable world models is the right abstraction for robot learning. Toyota Research Institute (TRI) just ran the same idea three different ways—and it worked each time. Using our NVIDIA Cosmos Predict 2–style world models, they hit SOTA with dynamic view synthesis, teleop data augmentation (even up against MimicGen as an upper bound), and navigation world models. 🙌 📚 dynamic view synthesis 📚 teleop data augmentation (also compare to MimicGen as an upper bound) 📚 navigation world models 📸 from

38,100 views

A preview Into the future of #quantum-GPU computing: At GTC Washington, D.C., we announced NVIDIA NVQLink, an open system architecture for tightly coupling the extreme performance of GPU computing with quantum processors to build accelerated quantum supercomputers. In the expo hall, accelerated quantum computing was a centerpiece of the NVIDIA booth. See our live blog:

A preview Into the future of #quantum-GPU computing: At GTC Washington, D.C., we announced NVIDIA NVQLink, an open system architecture for tightly coupling the extreme performance of GPU computing with quantum processors to build accelerated quantum supercomputers. In the expo hall, accelerated quantum computing was a centerpiece of the NVIDIA booth. See our live blog:

54,981 views

While at #OpenSourceAIWeek, Andrej Karpathy received one of our first NVIDIA DGX Sparks, personally delivered by Nader Khalil🍊 and @Baxate_carter. 🙌 Excellent work by Andrej on introducing Nanochat -- an #opensource project that makes it easy for developers to explore and train their own LLMs from scratch. We can’t wait to see what you’ll do with your Spark! #SparkSomethingBig ✨

While at #OpenSourceAIWeek, Andrej Karpathy received one of our first NVIDIA DGX Sparks, personally delivered by Nader Khalil🍊 and @Baxate_carter. 🙌 Excellent work by Andrej on introducing Nanochat -- an #opensource project that makes it easy for developers to explore and train their own LLMs from scratch. We can’t wait to see what you’ll do with your Spark! #SparkSomethingBig ✨

48,804 views

🎉 Announcing NVIDIA DGX Spark (f.k.a. Project DIGITS). Are you ready to #SparkSomethingBig? 💫 ➡️ #GTC25

🎉 Announcing NVIDIA DGX Spark (f.k.a. Project DIGITS). Are you ready to #SparkSomethingBig? 💫 ➡️ #GTC25

58,868 views

Brain-Computer Interfaces for speech just overcame their largest challenge — latency. Researchers from UC Berkeley and UC San Francisco have developed a new streaming approach using AI models that allows near real-time speech, a massive breakthrough from the previous 8-second delay for a single sentence. Read ➡️

Brain-Computer Interfaces for speech just overcame their largest challenge — latency. Researchers from UC Berkeley and UC San Francisco have developed a new streaming approach using AI models that allows near real-time speech, a massive breakthrough from the previous 8-second delay for a single sentence. Read ➡️

18,952 views

✨Just released: NVIDIA Nemotron post-training multi-lingual dataset ✨ We expanded permissive post-training dataset with addition of synthetically translated reasoning traces. ✋ Five new languages 💪 World class reasoning traces 🤗 Learn more: 📥 Download:

✨Just released: NVIDIA Nemotron post-training multi-lingual dataset ✨ We expanded permissive post-training dataset with addition of synthetically translated reasoning traces. ✋ Five new languages 💪 World class reasoning traces 🤗 Learn more: 📥 Download:

13,800 views

Hello from #GTC25 🤖 Send us your best jokes for the robot below 👇

Hello from #GTC25 🤖 Send us your best jokes for the robot below 👇

17,669 views

Excited to share our ICRA’23 IEEE ICRA work by Adithya Murali We scale up neural collision detection for object rearrangement with procedurally generated synthetic data. Project: Video: 🧵👇

Excited to share our ICRA’23 IEEE ICRA work by Adithya Murali We scale up neural collision detection for object rearrangement with procedurally generated synthetic data. Project: Video: 🧵👇

26,238 views

🎉 Congratulations Google DeepMind on the launch of Gemma 3 🎊 Introducing lightweight, multimodal, multilingual models streamlined for optimal performance on all your NVIDIA hardware – from your #datacenter GPUs to Jetson and RTX AI PCs for Windows. ➡️

🎉 Congratulations Google DeepMind on the launch of Gemma 3 🎊 Introducing lightweight, multimodal, multilingual models streamlined for optimal performance on all your NVIDIA hardware – from your #datacenter GPUs to Jetson and RTX AI PCs for Windows. ➡️

13,828 views

Developers at groundbreaking start-ups like Alpha3D and BOOM Interactive are making it possible to quickly and accurately convert 2D images into high-quality #3D models with #OpenUSD. Read more:

Developers at groundbreaking start-ups like Alpha3D and BOOM Interactive are making it possible to quickly and accurately convert 2D images into high-quality #3D models with #OpenUSD. Read more:

16,879 views

Videos