Chop the gradients ✂️! We found that truncating decoder...

Felix Heide

28,323 views • 2 months ago

NVIDIA just released a very impressive text-to-video paper. Video...

Lior Alexander

158,553 views • 3 years ago

Selected as a best paper finalist at #CVPR2026: PixelDiT...

NVIDIA AI

27,766 views • 28 days ago

Wonderland: Navigating 3D Scenes from a Single Image Contributions:...

MrNeRF

52,801 views • 1 year ago

DimensionX: Create Any 3D and 4D Scenes from a...

MrNeRF

17,039 views • 1 year ago

High-resolution image and video generation is hitting a wall...

Gordon Wetzstein

163,340 views • 3 months ago

1/ Happy to share VADER: Video Diffusion Alignment via...

Mihir Prabhudesai

13,368 views • 1 year ago

Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering discuss: The...

AK

19,101 views • 1 year ago

Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation paper page:...

AK

375,090 views • 3 years ago

🚀New paper out - We present Video-MSG (Multimodal Sketch...

Jialu Li

35,060 views • 1 year ago

Diffusions are excellent in creating fantastic images and videos...

Minkai Xu

50,434 views • 1 year ago

Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos...

MrNeRF

24,729 views • 11 months ago

LLaDA (the first Large Language Diffusion Model) is *just*...

apolinario (poli)

82,599 views • 1 year ago

The latent space of earlier generative models like GANs...

Amil Dravid

94,276 views • 2 years ago

1/ Happy to share UniDisc - Unified Multimodal Discrete...

Mihir Prabhudesai

104,862 views • 1 year ago

How can a visuomotor policy learn from internet videos?...

Ruoshi Liu

50,797 views • 2 years ago

Can you make a jigsaw puzzle with two different...

Daniel Geng

125,806 views • 2 years ago

I made a tool called Diffusion Explorer that lets...

Alec Helbling

73,113 views • 1 year ago

Depth Any Video with Scalable Synthetic Data AI physicists...

MrNeRF

27,428 views • 1 year ago

You can't 3D reconstruct glass from images... ...WRONG! Thanks...

Jonathan Stephens

17,712 views • 6 months ago

Diffusion models are sensitive to small changes in the...

Xingang Pan

42,538 views • 1 year ago