Loading video...

Video Failed to Load

Go Home

OpenAI just released access to Computer Use Agent via an API. It combines GPT-4o vision with advanced reasoning for an AI Agent to simulate controlling computer interfaces and performing tasks just like humans. You can try it for free using this Agent Playground.

57,436 views • 1 year ago •via X (Twitter)

11 Comments

Shubham Saboo's profile picture
Shubham Saboo1 year ago

I will be creating Agent tutorials using OpenAI Agents SDK. I have created 50+ AI Agents and RAG tutorials, 100% free and opensource. 1. Subscribe to Unwind AI (for free): 2. Star the repo: Follow me → @Saboo_Shubham_

Shubham Saboo's profile picture
Shubham Saboo1 year ago

Try it out here: If you find this useful, RT to share it with your friends. Don't forget to follow me @Saboo_Shubham_ for more such LLM tips and AI Agent, RAG tutorials.

MightyBot's profile picture
MightyBot1 year ago

🧠 Unified Search. Smarter Meetings. Effortless CRM. MightyBot is your AI agent platform for seamless workflows—record meetings, automate CRM updates, and find answers across apps in seconds. 🌟 Focus on what matters. We'll handle the grind.

browserbase 🅱️'s profile picture
browserbase 🅱️1 year ago

🔥🙌

Gargi's profile picture
Gargi1 year ago

This is really good!

Shubham Saboo's profile picture
Shubham Saboo1 year ago

+10000%

Kairos Data Labs's profile picture
Kairos Data Labs1 year ago

This is so freakkin cool!

Shubham Saboo's profile picture
Shubham Saboo1 year ago

It is super smooth and fast!

Pallav Agarwal's profile picture
Pallav Agarwal1 year ago

I also made an Open Source script that lets it control MacOS without Docker:

pelpa.sats | 🟧.pepe's profile picture
pelpa.sats | 🟧.pepe1 year ago

@KingBootoshi

Diamondswap 💎│ DiamondBot 🤖's profile picture
Diamondswap 💎│ DiamondBot 🤖1 year ago

GAME CHANGER!!! 🚀👏💻

Related Videos

Our first short course with Anthropic! Building Towards Computer Use with Anthropic. This teaches you to build an LLM-based agent that uses a computer interface by generating mouse clicks and keystrokes. Computer Use is an important, emerging capability for LLMs that will let AI agents do many more tasks than were possible before, since it lets them interact with interfaces designed for humans to use, rather than only tools that provide explicit API access. I hope you will enjoy learning about it! This course is taught by Anthropic's Head of Curriculum, Colt_Steele. You'll learn to apply image reasoning and tool use to "use" a computer as follows: a model processes an image of the screen, analyzes it to understand what's going on, and navigates the computer via mouse clicks and keystrokes. This course goes through the key building blocks, and culminates in a demo of an AI assistant that uses a web browser to search for a research paper, downloads the PDF, and finally summarizes the paper for you. In detail, you’ll: - Learn about Anthropic's family of models, when to use which one, and make API requests to Claude - Use multi-modal prompts that combine text and image content blocks, and also work with streaming responses - Improve your prompting by using prompt templates, using XML to structure prompts, and providing examples - Implement prompt caching to reduce cost and latency - Apply tool-use to build a chatbot that can call different tools to respond to queries - See all these building blocks come together in Computer Use demo Please sign up here:

Andrew Ng

170,211 views • 1 year ago