OpenAI just released access to Computer Use Agent via an API. It combines GPT-4o vision with advanced reasoning for an AI Agent to simulate controlling computer interfaces and performing tasks just like humans. You can try it for free using this Agent Playground.

57,436 views • 1 year ago • via X (Twitter)

11 comments

Shubham Saboo • 1 year ago

I will be creating Agent tutorials using OpenAI Agents SDK. I have created 50+ AI Agents and RAG tutorials, 100% free and opensource. 1. Subscribe to Unwind AI (for free): 2. Star the repo: Follow me → @Saboo_Shubham_

Shubham Saboo • 1 year ago

Try it out here: If you find this useful, RT to share it with your friends. Don't forget to follow me @Saboo_Shubham_ for more such LLM tips and AI Agent, RAG tutorials.

MightyBot • 1 year ago

🧠 Unified Search. Smarter Meetings. Effortless CRM. MightyBot is your AI agent platform for seamless workflows—record meetings, automate CRM updates, and find answers across apps in seconds. 🌟 Focus on what matters. We'll handle the grind.

browserbase 🅱️ • 1 year ago

🔥🙌

Gargi • 1 year ago

This is really good!

Shubham Saboo • 1 year ago

+10000%

Kairos Data Labs • 1 year ago

This is so freakkin cool!

Shubham Saboo • 1 year ago

It is super smooth and fast!

Pallav Agarwal • 1 year ago

I also made an Open Source script that lets it control MacOS without Docker:

pelpa.sats | 🟧.pepe • 1 year ago

@KingBootoshi

Diamondswap 💎│ DiamondBot 🤖 • 1 year ago

GAME CHANGER!!! 🚀👏💻

Related videos

Our first short course with Anthropic! Building Towards Computer Use with Anthropic. This teaches you to build an LLM-based agent that uses a computer interface by generating mouse clicks and keystrokes. Computer Use is an important, emerging capability for LLMs that will let AI agents do many more tasks than were possible before, since it lets them interact with interfaces designed for humans to use, rather than only tools that provide explicit API access. I hope you will enjoy learning about it!

This course is taught by Anthropic's Head of Curriculum, Colt_Steele. You'll learn to apply image reasoning and tool use to "use" a computer as follows: a model processes an image of the screen, analyzes it to understand what's going on, and navigates the computer via mouse clicks and keystrokes. This course goes through the key building blocks, and culminates in a demo of an AI assistant that uses a web browser to search for a research paper, downloads the PDF, and finally summarizes the paper for you.

In detail, you’ll:
- Learn about Anthropic's family of models, when to use which one, and make API requests to Claude
- Use multi-modal prompts that combine text and image content blocks, and also work with streaming responses
- Improve your prompting by using prompt templates, using XML to structure prompts, and providing examples
- Implement prompt caching to reduce cost and latency
- Apply tool-use to build a chatbot that can call different tools to respond to queries
- See all these building blocks come together in Computer Use demo

Please sign up here:

Andrew Ng

170,211 views • 1 year ago