Video yükleniyor...

Video Yüklenemedi

Ana Sayfaya Dön

Introducing Agentic Object Detection! Given a text prompt like “unripe strawberries” or “Kellogg’s branded cereal” and an image, we use an agentic workflow to reason at length and detect the specified objects. No need to label any training data. Watch the video for details.

397,751 görüntüleme • 1 yıl önce •via X (Twitter)

11 Yorum

Andrew Ng profil fotoğrafı
Andrew Ng1 yıl önce

You can also play with the demo here:

Breadcrumb profil fotoğrafı
Breadcrumb1 yıl önce

Looking to automate reporting? Use AI agents to turn spreadsheets to reports in minutes without any coding.

Edrick🕗 profil fotoğrafı
Edrick🕗1 yıl önce

Agentic workflows for computer vision makes so much sense

Inforida profil fotoğrafı
Inforida1 yıl önce

Agentic Object Detection sounds fascinating. The ability to reason without labeled data is a game changer. Imagine applying this to educational tools to enhance learning experiences, making AI-powered learning more intuitive. Keep pushing the boundaries of innovation.

jc_stack profil fotoğrafı
jc_stack1 yıl önce

Have you tested this against more complex scenarios like partially occluded objects or under varied lighting conditions? Really curious about edge cases and performance degradation patterns.

Marian Veteanu profil fotoğrafı
Marian Veteanu1 yıl önce

Super cool! This has lots of applications!

Lets go Seahawks 🇺🇦 profil fotoğrafı
Lets go Seahawks 🇺🇦1 yıl önce

i asked it to detect rectangle in batsman picture and it cant find it.

Lets go Seahawks 🇺🇦 profil fotoğrafı
Lets go Seahawks 🇺🇦1 yıl önce

and also, what's #23 wearing? isnt it a hat?

NEXUS AI Solutions profil fotoğrafı
NEXUS AI Solutions1 yıl önce

That's fascinating! Using agentic workflows to detect objects without labeled data could revolutionize how we approach image recognition tasks. How do you think this technology could be adapted for real-time applications like autonomous vehicles?

Nimaano profil fotoğrafı
Nimaano1 yıl önce

Its amazing

Andrew Ng profil fotoğrafı
Andrew Ng1 yıl önce

Thanks!

Benzer Videolar

Blended-NeRF: Zero-Shot Object Generation and Blending in Existing Neural Radiance Fields paper page: Editing a local region or a specific object in a 3D scene represented by a NeRF is challenging, mainly due to the implicit nature of the scene representation. Consistently blending a new realistic object into the scene adds an additional level of difficulty. We present Blended-NeRF, a robust and flexible framework for editing a specific region of interest in an existing NeRF scene, based on text prompts or image patches, along with a 3D ROI box. Our method leverages a pretrained language-image model to steer the synthesis towards a user-provided text prompt or image patch, along with a 3D MLP model initialized on an existing NeRF scene to generate the object and blend it into a specified region in the original scene. We allow local editing by localizing a 3D ROI box in the input scene, and seamlessly blend the content synthesized inside the ROI with the existing scene using a novel volumetric blending technique. To obtain natural looking and view-consistent results, we leverage existing and new geometric priors and 3D augmentations for improving the visual fidelity of the final result. We test our framework both qualitatively and quantitatively on a variety of real 3D scenes and text prompts, demonstrating realistic multi-view consistent results with much flexibility and diversity compared to the baselines. Finally, we show the applicability of our framework for several 3D editing applications, including adding new objects to a scene, removing/replacing/altering existing objects, and texture conversion.

AK

62,768 görüntüleme • 3 yıl önce