OpenAI just released access to Computer Use Agent via an API. It combines GPT-4o vision with advanced reasoning for an AI Agent to simulate controlling computer interfaces and performing tasks just like humans. You can try it for free using this Agent Playground.

64,206 views • 1 year ago • via X (Twitter)

11 comments

Shubham Saboo • 1 year ago

I will be creating Agent tutorials using OpenAI Agents SDK. I have created 50+ AI Agents and RAG tutorials, 100% free and open source. 1. Subscribe to Unwind AI (for free): 2. Star the repo: Follow me → @Saboo_Shubham_

Shubham Saboo • 1 year ago

Try it out here: If you find this useful, RT to share it with your friends. Don't forget to follow me @Saboo_Shubham_ for more such LLM tips and AI Agent, RAG tutorials.

MightyBot • 1 year ago

🧠 Unified Search. Smarter Meetings. Effortless CRM. MightyBot is your AI agent platform for seamless workflows—record meetings, automate CRM updates, and find answers across apps in seconds. 🌟 Focus on what matters. We'll handle the grind.

browserbase 🅱️ • 1 year ago

🔥🙌

Gargi • 1 year ago

This is really good!

Shubham Saboo • 1 year ago

+10000%

Kairos Data Labs • 1 year ago

This is so freakkin cool!

Shubham Saboo • 1 year ago

It is super smooth and fast!

Pallav Agarwal • 1 year ago

I also made an open-source script that lets it control macOS without Docker:

pelpa.sats | 🟧.pepe • 1 year ago

@KingBootoshi

Diamondswap 💎│ DiamondBot 🤖 • 1 year ago

GAME CHANGER!!! 🚀👏💻

Related videos

Our first short course with Anthropic! Building Towards Computer Use with Anthropic. This teaches you to build an LLM-based agent that uses a computer interface by generating mouse clicks and keystrokes.

Computer Use is an important, emerging capability for LLMs that will let AI agents do many more tasks than were possible before, since it lets them interact with interfaces designed for humans to use, rather than only tools that provide explicit API access. I hope you will enjoy learning about it!

This course is taught by Anthropic's Head of Curriculum, Colt_Steele. You'll learn to apply image reasoning and tool use to "use" a computer as follows: a model processes an image of the screen, analyzes it to understand what's going on, and navigates the computer via mouse clicks and keystrokes.

This course goes through the key building blocks, and culminates in a demo of an AI assistant that uses a web browser to search for a research paper, downloads the PDF, and finally summarizes the paper for you.

In detail, you’ll:
- Learn about Anthropic's family of models, when to use which one, and make API requests to Claude
- Use multi-modal prompts that combine text and image content blocks, and also work with streaming responses
- Improve your prompting by using prompt templates, using XML to structure prompts, and providing examples
- Implement prompt caching to reduce cost and latency
- Apply tool use to build a chatbot that can call different tools to respond to queries
- See all these building blocks come together in a Computer Use demo

Please sign up here:
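The observe, analyze, act loop described in this post can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `FakeScreen`, `stub_model`, `Action`, and `run_agent` are hypothetical names invented here, the "screenshot" is a plain dict rather than pixels, and the rule-based stub stands in for a real vision model. It is not the course's code or any vendor's API.

```python
from dataclasses import dataclass, field


@dataclass
class Action:
    kind: str        # "click", "type", or "done"
    x: int = 0
    y: int = 0
    text: str = ""


@dataclass
class FakeScreen:
    """Hypothetical stand-in for a desktop with a single search box."""
    focused: bool = False
    typed: str = ""
    log: list = field(default_factory=list)

    def screenshot(self) -> dict:
        # A real agent would capture an image; here the state is a dict.
        return {"focused": self.focused, "typed": self.typed}

    def apply(self, action: Action) -> None:
        # Record the action, then update the simulated UI state.
        self.log.append(action.kind)
        if action.kind == "click":
            self.focused = True
        elif action.kind == "type" and self.focused:
            self.typed += action.text


def stub_model(observation: dict, goal: str) -> Action:
    """Hypothetical rule-based policy standing in for a vision model."""
    if not observation["focused"]:
        return Action("click", x=100, y=40)   # focus the search box
    if observation["typed"] != goal:
        return Action("type", text=goal)      # enter the query
    return Action("done")


def run_agent(screen: FakeScreen, goal: str, max_steps: int = 10) -> bool:
    """Loop: observe the screen, ask the model for an action, apply it."""
    for _ in range(max_steps):
        action = stub_model(screen.screenshot(), goal)
        if action.kind == "done":
            return True
        screen.apply(action)
    return False
```

The step cap (`max_steps`) is a design choice for the sketch: an agent acting on a real interface can get stuck, so bounding the loop keeps a failed run from repeating forever.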

Andrew Ng

170,305 views • 1 year ago