LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models
Recent advancements in text-to-image generation with diffusion models have yielded remarkable results synthesizing highly realistic and diverse images. However, these models still encounter difficulties when generating images from prompts that demand spatial or...
83,657 views • 2 years ago • via X (Twitter)
Comments: 6

Boyi Li • 2 years ago
Thanks @_akhaliq for sharing our work!

zorr0 (ττ) • 2 years ago
@replytensor

haareblond • 2 years ago
cool but still feels hacky

Takomo AI • 2 years ago
That's great progress!

Cavit Erginsoy • 2 years ago
@yuliangxiu I saw this about a month ago and played around with it. Is this the same or a parallel dev? Wish someone built an extension for A1111

VIJAY KUMAR REDDY BOMMIREDDY • 2 years ago
Impressive work! Expanding the text-to-image domain with diffusion models showcases great potential. Looking forward to exploring the paper and GitHub repository. Keep up the great work! 👍
