apolinario 🌐's banner
apolinario 🌐's profile picture

apolinario 🌐

@multimodalart15,630 subscribers

ML for Art and Creativity, working @HuggingFace ([email protected])

Shorts

NVidia just released PiD: super resolution in pixel space directly from model latents 🔎 4X resolution for any generated image, FAST! 🏎️💨 FLUX.1, 2 and Z-Image (Qwen Image coming) of course, i built a demo: generate 4K images with Z-Image

NVidia just released PiD: super resolution in pixel space directly from model latents 🔎 4X resolution for any generated image, FAST! 🏎️💨 FLUX.1, 2 and Z-Image (Qwen Image coming) of course, i built a demo: generate 4K images with Z-Image

29,594 views

Stable Audio 3 by Stability AI is just out It mainly comes with 3 open source variants: - Stable Audio 3 Medium (2B) - Stable Audio 3 Small (0.6B) - Music - Stable Audio 3 Small (0.6B) - VFX (and a "large" closed variant) The open models are really fast and high quality

Stable Audio 3 by Stability AI is just out It mainly comes with 3 open source variants: - Stable Audio 3 Medium (2B) - Stable Audio 3 Small (0.6B) - Music - Stable Audio 3 Small (0.6B) - VFX (and a "large" closed variant) The open models are really fast and high quality

41,249 views

Boring Reality LoRA just dropped for HunyuanVideo 🏙️🏞️ A fine-tune that lead not to cinematic shots, but to something that could've come out of your phone 📱

Boring Reality LoRA just dropped for HunyuanVideo 🏙️🏞️ A fine-tune that lead not to cinematic shots, but to something that could've come out of your phone 📱

407,400 views

Qwen Image Multiple Angles LoRA is an exquisitely trained LoRA! 📐˚₊‧꒰ა Keep character and scenes consistent, and flies the camera around! Open source got there! One of the best LoRAs I've come across lately 🙌

Qwen Image Multiple Angles LoRA is an exquisitely trained LoRA! 📐˚₊‧꒰ა Keep character and scenes consistent, and flies the camera around! Open source got there! One of the best LoRAs I've come across lately 🙌

121,386 views

testing out the Diffusers Image Fill demo capabilities on a random image

testing out the Diffusers Image Fill demo capabilities on a random image

274,327 views

I just built a demo for this Light Migration LoRA on Hugging Face the quality surprises me on every output 🤯

I just built a demo for this Light Migration LoRA on Hugging Face the quality surprises me on every output 🤯

80,276 views

Apply Texture Qwen Image Edit LoRA by tarn59 works with EVERYTHING! 👉🪵🧶, this model trains so well I've built this demo so you can apply *any* texture to *any* object on Hugging Face

Apply Texture Qwen Image Edit LoRA by tarn59 works with EVERYTHING! 👉🪵🧶, this model trains so well I've built this demo so you can apply *any* texture to *any* object on Hugging Face

67,659 views

Introducing Kontext Relight! 💡 ✨ A FLUX Kontext Relight LoRA + demo trained for state-of-the art relighting for subjects & landscapes

Introducing Kontext Relight! 💡 ✨ A FLUX Kontext Relight LoRA + demo trained for state-of-the art relighting for subjects & landscapes

76,022 views

LLaDA (the first Large Language Diffusion Model) is *just* out 💥 and I've built a demo, try out now 👨‍💻 It's mesmerizing to watch the diffusion process 🌀, and it being a diffusion model gives you superpowers like "the 4th word has to be pineapple" 🦸 Demo and weights 👇

LLaDA (the first Large Language Diffusion Model) is *just* out 💥 and I've built a demo, try out now 👨‍💻 It's mesmerizing to watch the diffusion process 🌀, and it being a diffusion model gives you superpowers like "the 4th word has to be pineapple" 🦸 Demo and weights 👇

81,047 views

Excited to introduce LEDITS++, a novel way to edit real images with precision ✏️ - Multiple edits ✂️🔁 - Automagic free masking 🪄🎭 - 🆕 DPM-Solver fast inversion 🔀⚡ 🤗 Try it: 🔗 Project: 📝 Paper

Excited to introduce LEDITS++, a novel way to edit real images with precision ✏️ - Multiple edits ✂️🔁 - Automagic free masking 🪄🎭 - 🆕 DPM-Solver fast inversion 🔀⚡ 🤗 Try it: 🔗 Project: 📝 Paper

131,559 views

this is not a drill 🚨, real-time open source video generation is here 🔥 Self-Forcing - a real-time video distilled model from Wan 2.1 by Adobe is out, and they open sourced it 🐐 I've built a live real time demo on Hugging Face Spaces 📹💨

this is not a drill 🚨, real-time open source video generation is here 🔥 Self-Forcing - a real-time video distilled model from Wan 2.1 by Adobe is out, and they open sourced it 🐐 I've built a live real time demo on Hugging Face Spaces 📹💨

52,153 views

GANs are so back?! Scientists from Brown and Cornell have published a paper with a ✨ modern architecture GAN ✨ that is 🗿 stable to train 🗿 and competitive with SOTA GANs and even diffusion models Paper and demo 👇

GANs are so back?! Scientists from Brown and Cornell have published a paper with a ✨ modern architecture GAN ✨ that is 🗿 stable to train 🗿 and competitive with SOTA GANs and even diffusion models Paper and demo 👇

64,122 views

Hunyuan-3D-2.1 image-to-3D is now out! ✨ Open weights, permissively licensed 🔓 2.1 improves on 2.0 by a LOT in generating high quality textures for the 3D assets 🔥 This level of detail from a single image 🖤💎

Sensitive content

Hunyuan-3D-2.1 image-to-3D is now out! ✨ Open weights, permissively licensed 🔓 2.1 improves on 2.0 by a LOT in generating high quality textures for the 3D assets 🔥 This level of detail from a single image 🖤💎

48,115 views

Editing facial expressions in real time now on Hugging Face Spaces 👨‍🎤🔀 A Grog converted Cog image to Gradio running a ComfyUI backend - magic of open source 🤝 ▶️

Editing facial expressions in real time now on Hugging Face Spaces 👨‍🎤🔀 A Grog converted Cog image to Gradio running a ComfyUI backend - magic of open source 🤝 ▶️

71,633 views

whoa, Remade just dropped 8 open source video LoRA effects for Wan 2.1 on Hugging Face 🤯 Squish 🥞, Cakefy 🍰, Inflate 🎈, Deflate 📉, Shooting 🔫, Rotate 🔄 and Muscle 💪 all available openly

whoa, Remade just dropped 8 open source video LoRA effects for Wan 2.1 on Hugging Face 🤯 Squish 🥞, Cakefy 🍰, Inflate 🎈, Deflate 📉, Shooting 🔫, Rotate 🔄 and Muscle 💪 all available openly

56,321 views

The Dream 7B (diffusion reasoning language model) is OUT! 🚨 I built a demo so you can test it out (and check the diffusion process live) 𖣯🔍

The Dream 7B (diffusion reasoning language model) is OUT! 🚨 I built a demo so you can test it out (and check the diffusion process live) 𖣯🔍

35,257 views

You can now finally create your own stock photo smiling while eating salad in seconds 👨‍🎤🥗 IP-Apdater-FaceID Plus was silently released last week - it's first inference technique time face really captures my likeness 🥸🦚 ▶️

You can now finally create your own stock photo smiling while eating salad in seconds 👨‍🎤🥗 IP-Apdater-FaceID Plus was silently released last week - it's first inference technique time face really captures my likeness 🥸🦚 ▶️

60,039 views

Video-to-video is now available in the official CogVideoX-5B Space 🔥 Try it out 🎥 ➡️🎥

Video-to-video is now available in the official CogVideoX-5B Space 🔥 Try it out 🎥 ➡️🎥

33,458 views

Demo for the first open SD3-like architecture model, HunyuanDiT Hugging Face Spaces demo is out! 🎨 First impressions: - Image quality seems very good! - Chunky and the research code isn't super optimized for inference speed (👋 diffusers 👀) ▶️

Demo for the first open SD3-like architecture model, HunyuanDiT Hugging Face Spaces demo is out! 🎨 First impressions: - Image quality seems very good! - Chunky and the research code isn't super optimized for inference speed (👋 diffusers 👀) ▶️

26,694 views

starting the week with a true groundbreaking work 💥 Large Language Diffusion Models the first billion-parameter scale diffusion model competitive with its pairs (8B model comparable to LLaMA 3 8B) it gets rid of the michael scott syndrome on existing LLMs

starting the week with a true groundbreaking work 💥 Large Language Diffusion Models the first billion-parameter scale diffusion model competitive with its pairs (8B model comparable to LLaMA 3 8B) it gets rid of the michael scott syndrome on existing LLMs

11,833 views

Videos

No more content to load