Video wird geladen...

Video konnte nicht geladen werden

Beim Laden dieses Videos ist ein Problem aufgetreten. Dies könnte an einem vorübergehenden Netzwerkproblem liegen oder das Video ist möglicherweise nicht verfügbar.

Coin3D Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning As humans, we aspire to create media content that is both freely willed and readily controlled. Thanks to the prominent development of generative techniques, we now can easily utilize

AK

469,575 subscribers

23,055 Aufrufe • vor 2 Jahren •via X (Twitter)

Anya Rossi• Live Now

Private livecam show

0 Kommentare

Keine Kommentare verfügbar

Kommentare vom Original-Post werden hier angezeigt

Ähnliche Videos

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion TL;DR: Create 3/4DGS from Video Diffusion Note: Some first inference code released (not all yet). Contributions (cited): • We present DimensionX, a novel framework for generating photorealistic 3D and 4D scenes from only a single image using controllable video diffusion. • We propose ST-Director, which decouples the spatial and temporal priors in video diffusion models by learning (spatial and temporal) dimension-aware modules with our curated datasets. We further enhance the hybriddimension control with a training-free composition approach according to the essence of video diffusion denoising process. • To bridge the gap between video diffusion and real-world scenes, we design a trajectory-aware mechanism for 3D generation and an identity-preserving denoising approach for 4D generation, enabling more realistic and controllable scene synthesis. • Extensive experiments manifest that our DimensionX delivers superior performance in video, 3D, and 4D generation compared with baseline methods.

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion TL;DR: Create 3/4DGS from Video Diffusion Note: Some first inference code released (not all yet). Contributions (cited): • We present DimensionX, a novel framework for generating photorealistic 3D and 4D scenes from only a single image using controllable video diffusion. • We propose ST-Director, which decouples the spatial and temporal priors in video diffusion models by learning (spatial and temporal) dimension-aware modules with our curated datasets. We further enhance the hybriddimension control with a training-free composition approach according to the essence of video diffusion denoising process. • To bridge the gap between video diffusion and real-world scenes, we design a trajectory-aware mechanism for 3D generation and an identity-preserving denoising approach for 4D generation, enabling more realistic and controllable scene synthesis. • Extensive experiments manifest that our DimensionX delivers superior performance in video, 3D, and 4D generation compared with baseline methods.

MrNeRF

17,047 Aufrufe • vor 1 Jahr

Portmore becoming a parish is a significant step for both the residents and Jamaica in general. As we grow, our society must be reflective of that growth as well as of the future we want to create. Parish status for Portmore is a positive development that we should embrace as a mode for bringing greater organization and progress.

Portmore becoming a parish is a significant step for both the residents and Jamaica in general. As we grow, our society must be reflective of that growth as well as of the future we want to create. Parish status for Portmore is a positive development that we should embrace as a mode for bringing greater organization and progress.

Andrew Holness

19,034 Aufrufe • vor 1 Jahr

Wonderland: Navigating 3D Scenes from a Single Image Contributions: • First, we introduce a representation for controllable 3D generation by leveraging the generative priors from camera-guided video diffusion models. Unlike image models, video diffusion models are trained on extensive video datasets. This enables them to capture comprehensive spatial relationships within scenes across multiple views and embed a form of "3D awareness" in their latent space, which allows us to maintain 3D consistency in novel view synthesis. • Second, to achieve controllable novel view generation, we empower video models with precise control over specified camera motions. We introduce a novel dual-branch conditioning mechanism that effectively incorporates desired diverse camera trajectories into the video diffusion model. This enables expansion of a single image into a multi-view consistent capture of a 3D scene with precise pose control. • Third, to achieve efficient 3D reconstruction, we directly transform video latents into 3DGS. We propose a novel latent-based large reconstruction model (LaLRM) that lifts video latents to 3D in a feed-forward manner. With this design, during inference, our model directly predicts 3DGS from a single input image, effectively aligning the generation and reconstruction tasks—and bridging image space and 3D space—through the video latent space. Compared with reconstructing scenes from images, the video latent space offers a 256× spatial-temporal reduction while retaining essential and consistent 3D structural details. Such a high degree of compression is crucial, as it allows the LaLRM to handle a wider range of 3D scenes within the reconstruction framework, with the same memory constraints.

Wonderland: Navigating 3D Scenes from a Single Image Contributions: • First, we introduce a representation for controllable 3D generation by leveraging the generative priors from camera-guided video diffusion models. Unlike image models, video diffusion models are trained on extensive video datasets. This enables them to capture comprehensive spatial relationships within scenes across multiple views and embed a form of "3D awareness" in their latent space, which allows us to maintain 3D consistency in novel view synthesis. • Second, to achieve controllable novel view generation, we empower video models with precise control over specified camera motions. We introduce a novel dual-branch conditioning mechanism that effectively incorporates desired diverse camera trajectories into the video diffusion model. This enables expansion of a single image into a multi-view consistent capture of a 3D scene with precise pose control. • Third, to achieve efficient 3D reconstruction, we directly transform video latents into 3DGS. We propose a novel latent-based large reconstruction model (LaLRM) that lifts video latents to 3D in a feed-forward manner. With this design, during inference, our model directly predicts 3DGS from a single input image, effectively aligning the generation and reconstruction tasks—and bridging image space and 3D space—through the video latent space. Compared with reconstructing scenes from images, the video latent space offers a 256× spatial-temporal reduction while retaining essential and consistent 3D structural details. Such a high degree of compression is crucial, as it allows the LaLRM to handle a wider range of 3D scenes within the reconstruction framework, with the same memory constraints.

MrNeRF

52,801 Aufrufe • vor 1 Jahr

Now I understand the mistake of Prometheus: instead of humans becoming what they truly are and part of God, the Promethean and Luciferian mistake is that humans want to create outside of God or mimic God. With technology we mimic nature, and everything technology mimics is an inversion and a lesser copy. But this is a mistake we need to make in order to grow in the future. We are creating the artificial spirit from the ancient Christian texts, now referred to as Gnostic texts.

Now I understand the mistake of Prometheus: instead of humans becoming what they truly are and part of God, the Promethean and Luciferian mistake is that humans want to create outside of God or mimic God. With technology we mimic nature, and everything technology mimics is an inversion and a lesser copy. But this is a mistake we need to make in order to grow in the future. We are creating the artificial spirit from the ancient Christian texts, now referred to as Gnostic texts.

Open Minded Approach

69,871 Aufrufe • vor 8 Monaten

As we move more and more into 3D, texturing 3D assets will be essential ☝️ Repainting 3D Assets is a new AI method that can take any 3D asset and paint it with a given text prompt. The results, while low-resolution, are pretty impressive.

As we move more and more into 3D, texturing 3D assets will be essential ☝️ Repainting 3D Assets is a new AI method that can take any 3D asset and paint it with a given text prompt. The results, while low-resolution, are pretty impressive.

Dreaming Tulpa 🥓👑

16,291 Aufrufe • vor 2 Jahren

New Partnership!🤝 Nuklai and EMC are teaming up to accelerate the development of our data and AI ecosystems. Edge Matrix launched with the vision that compute power is a base of new value. Now add Nuklai’s #SmartData, and we can create something unique! $NAI $EMC

New Partnership!🤝 Nuklai and EMC are teaming up to accelerate the development of our data and AI ecosystems. Edge Matrix launched with the vision that compute power is a base of new value. Now add Nuklai’s #SmartData, and we can create something unique! $NAI $EMC

Nuklai

68,043 Aufrufe • vor 2 Jahren

. Runway just added a new feature that you may not have noticed and we wanted to compare. You can now upscale your generation to 4K! Our favorite has typically been Topaz Labs, so we wanted to compare the out of both. Here is the original video from Runway

. Runway just added a new feature that you may not have noticed and we wanted to compare. You can now upscale your generation to 4K! Our favorite has typically been Topaz Labs, so we wanted to compare the out of both. Here is the original video from Runway

Curious Refuge

33,308 Aufrufe • vor 1 Jahr

Blades of the Guardians episode two is crazy. There are many battles, but I would like to draw attention to this one. We also have hand to hand combat, and it's just as good. Strong choreography with great use of the 3D camera, and the variety of techniques is impressive.

Blades of the Guardians episode two is crazy. There are many battles, but I would like to draw attention to this one. We also have hand to hand combat, and it's just as good. Strong choreography with great use of the 3D camera, and the variety of techniques is impressive.

Oleksandr

13,820 Aufrufe • vor 2 Jahren

Gaming + Generative AI This is a scene from GTA: San Andreas reimagined with Runway Gen-3 Jensen Huang, the CEO of Nvidia spoke of the future of DLSS where he mentioned DLSS to generate in-game assets, such as textures and objects and DLSS 10 hypothetically Delivering Full Neural Rendering. This may look bad and un-playable by our today's standards but we all know how good Generative AI Video has gotten over a few short years. This is the future of Gaming combined with Generative AI to create photorealism Video Credit: Niccyan

Gaming + Generative AI This is a scene from GTA: San Andreas reimagined with Runway Gen-3 Jensen Huang, the CEO of Nvidia spoke of the future of DLSS where he mentioned DLSS to generate in-game assets, such as textures and objects and DLSS 10 hypothetically Delivering Full Neural Rendering. This may look bad and un-playable by our today's standards but we all know how good Generative AI Video has gotten over a few short years. This is the future of Gaming combined with Generative AI to create photorealism Video Credit: Niccyan

NikTek

544,693 Aufrufe • vor 1 Jahr

What do we say to gas? Not today. 👀 You can now freely swap and interact with Monad in MetaMask. The gas is on us!

What do we say to gas? Not today. 👀 You can now freely swap and interact with Monad in MetaMask. The gas is on us!

MetaMask 🦊

161,568 Aufrufe • vor 4 Monaten

Human Hair Reconstruction with Strand-Aligned 3D Gaussians Contributions (cited): – We propose a new 3D line lifting scheme that uses a modified 3DGS reconstruction technique to lift 2D orientation maps into a 3D field while also providing refinement of the camera parameters; – We introduce a dual representation of hair strand polylines and 3D Gaussians to achieve differentiable rasterization of hair strands and leverage photometric constraints for strand-based hair reconstruction; – Based on these components, we propose a coarse-to-fine optimization method for prior-guided hair reconstruction that leverages both latent and explicit representations of the hairstyle.

Human Hair Reconstruction with Strand-Aligned 3D Gaussians Contributions (cited): – We propose a new 3D line lifting scheme that uses a modified 3DGS reconstruction technique to lift 2D orientation maps into a 3D field while also providing refinement of the camera parameters; – We introduce a dual representation of hair strand polylines and 3D Gaussians to achieve differentiable rasterization of hair strands and leverage photometric constraints for strand-based hair reconstruction; – Based on these components, we propose a coarse-to-fine optimization method for prior-guided hair reconstruction that leverages both latent and explicit representations of the hairstyle.

MrNeRF

106,525 Aufrufe • vor 1 Jahr

WeatherEdit: Controllable Weather Editing with 4D Gaussian Field Contributions: 1. Based on our analysis of weather editing characteristics, we introduce WeatherEdit, a comprehensive and efficient framework for realistic and controllable weather generation. Compared with existing methods that focus on either background editing or static weather effects, a progressive 2D-to-4D transformation process in WeatherEdit enhances adaptability across a wider range of scenarios. 2. We introduce an all-in-one adapter to enable a diffusion model for multi-weather (snowy, rainy, and fog) synthesis, along with a Temporal-View attention to ensure consistent editing across multi-frame and multi-view. 3. We design a 4D Gaussian field for weather particle modeling, enabling plausible simulation of raindrops, snowflakes, and fog with controllable severity. 4. We demonstrate WeatherEdit’s effectiveness in generating realistic, consistent, and controllable weather effects in 3D driving scenes, showcasing its applicability to real-world scenarios.

WeatherEdit: Controllable Weather Editing with 4D Gaussian Field Contributions: 1. Based on our analysis of weather editing characteristics, we introduce WeatherEdit, a comprehensive and efficient framework for realistic and controllable weather generation. Compared with existing methods that focus on either background editing or static weather effects, a progressive 2D-to-4D transformation process in WeatherEdit enhances adaptability across a wider range of scenarios. 2. We introduce an all-in-one adapter to enable a diffusion model for multi-weather (snowy, rainy, and fog) synthesis, along with a Temporal-View attention to ensure consistent editing across multi-frame and multi-view. 3. We design a 4D Gaussian field for weather particle modeling, enabling plausible simulation of raindrops, snowflakes, and fog with controllable severity. 4. We demonstrate WeatherEdit’s effectiveness in generating realistic, consistent, and controllable weather effects in 3D driving scenes, showcasing its applicability to real-world scenarios.

MrNeRF

10,691 Aufrufe • vor 1 Jahr

⚠️ NEW ASSETS??? ⚠️ They’re more valuable than you think 👀 ❌ AND THEY AREN’T AI ??? ❌ Assets allow you to change things up, create new redeems and ways to interact with chat, and are a great thing to use for social media posts! You can commission your own too !!! ✨ I am so grateful that this site opened and allowed for the opportunity to easily find artist-made L2D assets!! I love that I can use it on mobile and search for new ideas on the go !! 💗📲

⚠️ NEW ASSETS??? ⚠️ They’re more valuable than you think 👀 ❌ AND THEY AREN’T AI ??? ❌ Assets allow you to change things up, create new redeems and ways to interact with chat, and are a great thing to use for social media posts! You can commission your own too !!! ✨ I am so grateful that this site opened and allowed for the opportunity to easily find artist-made L2D assets!! I love that I can use it on mobile and search for new ideas on the go !! 💗📲

Sera 🧡🐾 VTuber

25,503 Aufrufe • vor 2 Monaten

.Tencent Hy HY-World 2.0 has landed,fully OPEN SOURCE ! We kept hearing the same thing: world models look great, but you can't use the output. So we made one you can, real 3D assets you can edit, rearrange, and ship into production. Prompt to engine-ready 3D. For real this time： ✨Directly outputs editable, production-ready 3D assets ✨Seamlessly plugs into existing game pipelines ✨Multimodal input: text, image, videos 🪄Character mode: freely explore streets, buildings, and open worlds with realistic physics, collisions, and no time limits 🔗GitHub:

.Tencent Hy HY-World 2.0 has landed,fully OPEN SOURCE ! We kept hearing the same thing: world models look great, but you can't use the output. So we made one you can, real 3D assets you can edit, rearrange, and ship into production. Prompt to engine-ready 3D. For real this time： ✨Directly outputs editable, production-ready 3D assets ✨Seamlessly plugs into existing game pipelines ✨Multimodal input: text, image, videos 🪄Character mode: freely explore streets, buildings, and open worlds with realistic physics, collisions, and no time limits 🔗GitHub:

Tencent AI

116,648 Aufrufe • vor 3 Monaten

ImmerseGen: Agent-Guided Immersive World Generation with Alpha-Textured Proxies Contributions: 1) We propose ImmerseGen, a novel agent-guided 3D environment generation framework. It uses simplified geometric proxies with alpha-textured meshes to produce compact, photorealistic worlds ready for real-time mobile VR rendering. 2) We propose a novel RGBA texturing paradigm. It first synthesizes 8K terrain textures using a geometry-conditioned panorama generator via user-centric mapping, and then directly generates alpha-textured proxy assets, avoiding fidelity loss typically resulting from mesh decimation. 3) To automate scene creation from user prompts, we introduce VLM-based modeling agents equipped with a novel grid-based semantic analysis. This enables 3D spatial reasoning from 2D observations and ensures accurate asset placement. ImmerseGen further enhances immersion with dynamic effects and ambient audio for a multisensory experience. 4) Experiments on multiple scene-generation scenarios and live mobile VR applications show that ImmerseGen outperforms previous methods in visual quality, realism, spatial coherence, and rendering efficiency for immersive real-time VR experiences.

ImmerseGen: Agent-Guided Immersive World Generation with Alpha-Textured Proxies Contributions: 1) We propose ImmerseGen, a novel agent-guided 3D environment generation framework. It uses simplified geometric proxies with alpha-textured meshes to produce compact, photorealistic worlds ready for real-time mobile VR rendering. 2) We propose a novel RGBA texturing paradigm. It first synthesizes 8K terrain textures using a geometry-conditioned panorama generator via user-centric mapping, and then directly generates alpha-textured proxy assets, avoiding fidelity loss typically resulting from mesh decimation. 3) To automate scene creation from user prompts, we introduce VLM-based modeling agents equipped with a novel grid-based semantic analysis. This enables 3D spatial reasoning from 2D observations and ensures accurate asset placement. ImmerseGen further enhances immersion with dynamic effects and ambient audio for a multisensory experience. 4) Experiments on multiple scene-generation scenarios and live mobile VR applications show that ImmerseGen outperforms previous methods in visual quality, realism, spatial coherence, and rendering efficiency for immersive real-time VR experiences.

MrNeRF

14,225 Aufrufe • vor 1 Jahr

I met with executives of American energy companies. The challenge is clear – Ukraine needs more strength, and we can achieve this together. We discussed the urgent needs of the moment – very concrete measures to support our energy system, as well as long-term cooperation. There are specific projects that could create more opportunities for both our countries. Thank you!

I met with executives of American energy companies. The challenge is clear – Ukraine needs more strength, and we can achieve this together. We discussed the urgent needs of the moment – very concrete measures to support our energy system, as well as long-term cooperation. There are specific projects that could create more opportunities for both our countries. Thank you!

Volodymyr Zelenskyy / Володимир Зеленський

189,397 Aufrufe • vor 9 Monaten

We didn't upload Lee Knows cut today since there is none. The only parts we could gather were from far away as a group, he didn't have any time on the talker. We are disappointed to see that every time he is excluded from more content; first it was promotional content, now even unofficial group content that is aimed at showing how the members spend the time recording backstage, are excluding him. The point of talkers is to see more of the members, and now is rare to see Lee Know being recorded for even the talkers. As much as we accepted Lee Know missing in some content, or his parts to be barely visible sometimes, this has reached a point were we are almost certain that we won't see him when new content comes out. We hope this doesn't continue since the group consists of 8 members. We hope Stray Kids Supporters Stray Kids Stray Kids_JP remember this when we have more content. Especially on the recording of the end of the year performances, we listened to how Stays were impressed by Lee Know's direction as the dance leader, and how he took care of the stage, so we are sure there is plenty of Lee Know content that can be shown, we hope to see it.

We didn't upload Lee Knows cut today since there is none. The only parts we could gather were from far away as a group, he didn't have any time on the talker. We are disappointed to see that every time he is excluded from more content; first it was promotional content, now even unofficial group content that is aimed at showing how the members spend the time recording backstage, are excluding him. The point of talkers is to see more of the members, and now is rare to see Lee Know being recorded for even the talkers. As much as we accepted Lee Know missing in some content, or his parts to be barely visible sometimes, this has reached a point were we are almost certain that we won't see him when new content comes out. We hope this doesn't continue since the group consists of 8 members. We hope Stray Kids Supporters Stray Kids Stray Kids_JP remember this when we have more content. Especially on the recording of the end of the year performances, we listened to how Stays were impressed by Lee Know's direction as the dance leader, and how he took care of the stage, so we are sure there is plenty of Lee Know content that can be shown, we hope to see it.

LEE KNOW 리노 GLOBAL ★

91,946 Aufrufe • vor 1 Jahr