Loading video...

Video Failed to Load

Go Home

Microsoft just released an impressive tool OmniParser V2 can turn any LLM into an agent capable of using a computer 🔥 You can enable GPT-4o, DeepSeek R1, Sonnet 3.5, Qwen... to understand what's on your screen and take actions. 100% free & open source

459,919 views • 1 year ago •via X (Twitter)

11 Comments

Paul Couvert's profile picture
Paul Couvert1 year ago

Official blog post → Hugging Face → GitHub →

PDF GPT's profile picture
PDF GPT1 year ago

Everyone is getting ahead with AI. You should be too. Summarize documents, craft emails, and generate custom content instantly with this powerful tool. It's like having ChatGPT tailored for your job. Try it for free.

Science Degen's profile picture
Science Degen1 year ago

Thanks, I have been using this and fairly happy. Shall check out Omni.

Paul Couvert's profile picture
Paul Couvert1 year ago

Never tried it. UI looks clean, thanks for sharing!

Drohi - ReSSRection 🌄's profile picture
Drohi - ReSSRection 🌄1 year ago

I'm wondering if I should buy a completely separate computer, on a separate network, for these agents to fool around with. With no access to OneDrive.

Paul Couvert's profile picture
Paul Couvert1 year ago

It's not a tool directly related to the Microsoft ecosystem. More of an "independent" open source project.

Shushant Lakhyani's profile picture
Shushant Lakhyani1 year ago

Building agents is getting more and more accessible

Paul Couvert's profile picture
Paul Couvert1 year ago

True. We have more and more choice of frameworks.

Alamin's profile picture
Alamin1 year ago

Great share.

Paul Couvert's profile picture
Paul Couvert1 year ago

Thanks man! Always a pleasure to see new open source projects coming from the giants.

Anik Singal's profile picture
Anik Singal1 year ago

@iam_chonchol Agentic AI is seriously just reach away!

Related Videos