Loading video...
Video Failed to Load
RTFM can be seen as a learned renderer: it is an autoregressive diffusion transformer trained end-to-end on large-scale video data, and it learns to model 3D geometry, reflections, shadows and more just by observing them in its training set.
15,870 views • 7 months ago •via X (Twitter)
0 Comments
No comments available
Comments from the original post will appear here

