Video yükleniyor...

Video Yüklenemedi

Ana Sayfaya Dön

We've released the code for LegoGPT. This autoregressive model generates physically stable and buildable designs from text prompts, by integrating physics laws and assembly constraints into LLM training and inference. This work is led by PhD students Ava Pun, Kangle Deng, Ruixuan Liu, and in collaboration with CMU faculty...

38,582 görüntüleme • 1 yıl önce •via X (Twitter)

7 Yorum

Or Patashnik profil fotoğrafı
Or Patashnik1 yıl önce

@AvaLovelace0 @kangle_deng Wow, really cool!

Rainmaker profil fotoğrafı
Rainmaker2 yıl önce

Here I share an XGBoost model that delivers a 25% CAGR with minimal drawdown on Visa stock. In this free Substack post I share code and commentary for a powerful Machine Learning strategy that delivers powerful returns.

Jason Liu profil fotoğrafı
Jason Liu1 yıl önce

@AvaLovelace0 @kangle_deng Awesome project 👍🏼. Some designs may require other orientation than from the ground up. I’m excited to learn about this!

Redcrown profil fotoğrafı
Redcrown1 yıl önce

@AvaLovelace0 @kangle_deng woah, this is soo cool

Ant A profil fotoğrafı
Ant A1 yıl önce

@AvaLovelace0 @kangle_deng So just GenAI every step/layer?

Aiden profil fotoğrafı
Aiden1 yıl önce

@AvaLovelace0 @kangle_deng Super interesting project! We're also big believers in using natural language to create. With jenova ai, anyone can build their own custom AI apps just by describing what they need.

Max Zhaoshuo Li 李赵硕 profil fotoğrafı
Max Zhaoshuo Li 李赵硕1 yıl önce

@AvaLovelace0 @kangle_deng Very interesting work! Congrats!

Benzer Videolar

This is THE moment of Physical AI! We are officially announcing Cosmos 3: Omnimodal World Models for Physical AI 🚀 - Cosmos 3 is an omnimodal world model: within a unified architecture, it can understand and generate language, images, video, audio, and actions. - It is not just a VLM, not just a video generator, not just an audio-visual generative model, and not just a physics simulator / world-action model. It can understand images and videos, generate images, videos, and audio, simulate future worlds, predict actions, and generate robot policies—enabling models to truly begin to “touch the world.” - Cosmos 3 is the #1 open-weight reasoner / T2I / I2V / robot policy across many benchmarks. Huge thanks to every teammate who fought side by side on this journey—from architecture, data, training, infra, serving, and evaluation to post-training. Every part of this project carries an incredible amount of hard work. This was my first time leading a project as Tech Lead, and I feel truly fortunate. The future of Physical AI needs models that can not only “see” and “describe” the world, but also “imagine,” “simulate,” and “act”—and eventually close the loop with the real world. I hope Cosmos 3 can become an important starting point for this direction, and I’m excited to push Physical AI into its next stage together with the open-source community. Welcome to the era of Physical AI. HuggingFace: Project Website: Code:

Max Zhaoshuo Li 李赵硕

1,071,971 görüntüleme • 10 gün önce