LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models

paper page: github:

Recent advancements in text-to-image generation with diffusion models have yielded remarkable results synthesizing highly realistic and diverse images. However, these models still encounter difficulties when generating images from prompts that demand spatial or...
83,657 views • 2 years ago • via X (Twitter)
6 Comments

Boyi Li • 2 years ago
Thanks @_akhaliq for sharing our work!

zorr0 (ττ) • 2 years ago
@replytensor

haareblond • 2 years ago
cool but still feels hacky

Takomo AI • 2 years ago
That's great progress!

Cavit Erginsoy • 2 years ago
@yuliangxiu I saw this about a month ago and had played around with it. Is this the same or a parallel dev? Wish someone built an extension for A1111

VIJAY KUMAR REDDY BOMMIREDDY • 2 years ago
Impressive work! Expanding the text-to-image domain with diffusion models showcases great potential. Looking forward to exploring the paper and GitHub repository. Keep up the great work! 👍
