LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models
paper page:
github:
Recent advancements in text-to-image generation with diffusion models have yielded remarkable results synthesizing highly realistic and diverse images. However, these models still encounter difficulties when generating images from prompts that demand spatial or...
83,657 views • 2 years ago • via X (Twitter)
6 comments

Boyi Li • 2 years ago
Thanks @_akhaliq for sharing our work!

zorr0 (ττ) • 2 years ago
@replytensor

haareblond • 2 years ago
cool but still feels hacky

Takomo AI • 2 years ago
That's great progress!

Cavit Erginsoy • 2 years ago
@yuliangxiu I saw this about a month ago and had played around with it, is the same or a parallel dev? Wish someone built an extension for A1111

VIJAY KUMAR REDDY BOMMIREDDY • 2 years ago
Impressive work! Expanding the text-to-image domain with diffusion models showcases great potential. Looking forward to exploring the paper and GitHub repository. Keep up the great work! 👍
