Video yükleniyor...
Video Yüklenemedi
Scaling up GANs for Text-to-Image Synthesis present our 1B-parameter GigaGAN, achieving lower FID than Stable Diffusion v1.5, DALL·E 2, and Parti-750M. It generates 512px outputs at 0.13s, orders of magnitude faster than diffusion and autoregressive models, and inherits the disentangled, continuous, and controllable latent space of GANs abs: project page:
278,115 görüntüleme • 3 yıl önce •via X (Twitter)
10 Yorum

Daniel Losey 🔀3 yıl önce
amazing

David Marx (@digthatdata.bsky.social)3 yıl önce
GANs are back baybee

Nicolay Mausz3 yıl önce
Adobe research - I guess this will be part of CC

Draz ⚛️3 yıl önce
The upscaling is quite insane on how it accurately fills in details

Nerdy Rodent 🐀🤓💻3 yıl önce
It’s been hours now, why isn’t it showing up? 😉

Asriel H3 yıl önce
It has the same schema of injecting latent vector into every scaling layer as StyleGAN has

okaris3 yıl önce
The examples provided don’t look as good as diffusion models. Some details obscured or looking weird.

Adhik Joshi3 yıl önce
Weights aren't open-source

Julien Genoud3 yıl önce
The 4k upsampler 🤯

Clarence Hu3 yıl önce
paging @gwern

