正在加载视频...

视频加载失败

Microsoft just released an impressive tool OmniParser V2 can turn any LLM into an agent capable of using a computer 🔥 You can enable GPT-4o, DeepSeek R1, Sonnet 3.5, Qwen... to understand what's on your screen and take actions. 100% free & open source

459,919 次观看 • 1 年前 •via X (Twitter)

11 条评论

Paul Couvert 的头像
Paul Couvert1 年前

Official blog post → Hugging Face → GitHub →

PDF GPT 的头像
PDF GPT1 年前

Everyone is getting ahead with AI. You should be too. Summarize documents, craft emails, and generate custom content instantly with this powerful tool. It's like having ChatGPT tailored for your job. Try it for free.

Science Degen 的头像
Science Degen1 年前

Thanks, I have been using this and fairly happy. Shall check out Omni.

Paul Couvert 的头像
Paul Couvert1 年前

Never tried it. UI looks clean, thanks for sharing!

Drohi - ReSSRection 🌄 的头像
Drohi - ReSSRection 🌄1 年前

I'm wondering if I should buy a completely separate computer, on a separate network, for these agents to fool around with. With no access to OneDrive.

Paul Couvert 的头像
Paul Couvert1 年前

It's not a tool directly related to the Microsoft ecosystem. More of an "independent" open source project.

Shushant Lakhyani 的头像
Shushant Lakhyani1 年前

Building agents is getting more and more accessible

Paul Couvert 的头像
Paul Couvert1 年前

True. We have more and more choice of frameworks.

Alamin 的头像
Alamin1 年前

Great share.

Paul Couvert 的头像
Paul Couvert1 年前

Thanks man! Always a pleasure to see new open source projects coming from the giants.

Anik Singal 的头像
Anik Singal1 年前

@iam_chonchol Agentic AI is seriously just reach away!

相关视频