OpenAI just released access to Computer Use Agent via an API. It combines GPT-4o vision with advanced reasoning for an AI Agent to simulate controlling computer interfaces and performing tasks just like humans. You can try it for free using this Agent Playground.

Shubham Saboo

116,992 subscribers

64,206 views • 1 year ago • via X (Twitter)

Education • Science & Technology

11 comments

Shubham Saboo • 1 year ago

I will be creating Agent tutorials using OpenAI Agents SDK. I have created 50+ AI Agents and RAG tutorials, 100% free and open source.
1. Subscribe to Unwind AI (for free):
2. Star the repo:
Follow me → @Saboo_Shubham_

Shubham Saboo • 1 year ago

Try it out here: If you find this useful, RT to share it with your friends. Don't forget to follow me @Saboo_Shubham_ for more such LLM tips and AI Agent, RAG tutorials.

MightyBot • 1 year ago

🧠 Unified Search. Smarter Meetings. Effortless CRM. MightyBot is your AI agent platform for seamless workflows—record meetings, automate CRM updates, and find answers across apps in seconds. 🌟 Focus on what matters. We'll handle the grind.

browserbase 🅱️ • 1 year ago

🔥🙌

Gargi • 1 year ago

This is really good!

Shubham Saboo • 1 year ago

+10000%

Kairos Data Labs • 1 year ago

This is so freakkin cool!

Shubham Saboo • 1 year ago

It is super smooth and fast!

Pallav Agarwal • 1 year ago

I also made an Open Source script that lets it control MacOS without Docker:

pelpa.sats | 🟧.pepe • 1 year ago

@KingBootoshi

Diamondswap 💎│ DiamondBot 🤖 • 1 year ago

GAME CHANGER!!! 🚀👏💻

Related videos

We’ve been working with OpenAI for the past few weeks to test their latest Computer-using Agent model. On our evals, CUA has set a new SOTA. Once integrated with an agent, it can complete long-horizon tasks previously impossible. Try CUA on our playground and Act SDK for free!

Scrapybara

89,614 views • 1 year ago

🚨 BREAKING: Hugging Face just dropped a free AI agent that uses a computer like a human. It’s called Open Computer Agent, and it mimics real computer use. You can run it in your browser, no install required. Here's everything you need to know:

Ihtesham Haider

245,607 views • 1 year ago

Open AI released Operator, an agent that can use the browser to perform and automate tasks for you! I have built an Open Source version of Operator using Browser Use, running locally on your computer. 100% Open Source

Sumanth

62,485 views • 1 year ago

Microsoft just released an impressive tool: OmniParser V2 can turn any LLM into an agent capable of using a computer 🔥 You can enable GPT-4o, DeepSeek R1, Sonnet 3.5, Qwen... to understand what's on your screen and take actions. 100% free & open source

Paul Couvert

460,004 views • 1 year ago

Open Computer Agent - LLMs completing tasks using a VM. It's a playground to test how well current LLM agents use a computer to solve everyday tasks. And this is just the start - very soon models will be 10x faster and 10x better at it! ❤️ built with e2b x qwen2.5-vl x smolagent

Leandro von Werra

17,126 views • 1 year ago

I built an automated AI competitor analysis team with multi-agents. It has 3 AI agents working together as a team:
1. Data Extraction Agent using Firecrawl
2. Analyst Agent using OpenAI GPT-4o
3. Competitor Research Agent using Perplexity AI or Exa AI
100% open-source code.

Shubham Saboo

65,158 views • 1 year ago

Genspark AI agent just released AI Sheets. You can now upload any data and the agent automatically analyzes it, generates reports, and can research the web to find the data for you. 5 powerful use cases + how to try👇:

Alvaro Cintas

110,569 views • 1 year ago

OpenAI Operator looks great. Here is an open source version of it you can use now: an agent that uses your browser to perform tasks for you. Developers can get started in a few lines of code:

    pip install 'ai-gradio[browser]'

    import gradio as gr
    import ai_gradio

    gr.load(
        name='browser:gpt-4o',
        src=ai_gradio.registry,
        title='AI Browser Agent',
        description='Agent that helps with web tasks'
    ).launch()

AK

98,797 views • 1 year ago

A research preview of Operator, an agent that can use its own browser to perform tasks for you.

OpenAI

3,937,467 views • 1 year ago

🌐@Hyperbrowser just launched HyperPilot: a playground for all the leading AI browser agents like OpenAI’s CUA, Anthropic’s Claude Computer Use, and Browser Use. And you can try it out for free.

Y Combinator

52,037 views • 1 year ago

This is wild. OpenAI just dropped GPT-5.4 and it will completely change the AI agent game. 1M context, huge leap for coding + agents, and native computer use. 7 wild examples. Bookmark this. 1. Build & Play 3D chess game

Min Choi

326,236 views • 4 months ago

Today on Training Data, the OpenAI team behind ChatGPT agent explain how Agent Mode works, combining:
1) Deep Research (text-based research agent)
2) Operator (GUI/action-based computer agent)
3) Other new tools (terminal, computer apps)
4) Tied together with shared state to create an agent that's highly capable at most tasks that humans do on a computer: data science analysis, analyzing spreadsheets, making slides, etc.
Thanks for joining us, Isa Fulford, Casey Chu, Zhiqing Sun, and Lauren Reeder!

Sonya Huang 🐥

44,149 views • 1 year ago

🚨 BREAKING: OpenAI just launched ChatGPT Agent. ChatGPT just became an ‘independent computer’ with this. This is not GPT-5, but this is very close. Here’s why this is the beginning of a new era of AI: ⤵️

Mushfiq Sajib

24,422 views • 1 year ago

OpenAI taught ChatGPT to use a computer. We taught it to run a factory. RoboGPT Machine Use — just like Computer Use, but for the real world. It operates CNCs, drill presses, and presses with plain language. Manufacturing isn’t dead. It just got an upgrade.

Orangewood

29,473 views • 1 year ago

You can create an AI Agent that answers your email with a few clicks.
1. Go to ChatLLM
2. Click on AI Engineer
3. Select Create an AI Agent
4. Choose the Email Answering Agent
ChatLLM will do the rest: it will code, test, and deploy the agent for you. You can also create a custom agent in English.
The Agent Economy is coming (somebody should write a book and use this title.) We are going to see examples like this, times 1,000 in 2025.
Just think about how many repetitive tasks you perform every day. Some of these tasks are involved enough that we couldn't automate them with pre-AI solutions. That's where we'll see agents explode, and I'm here for it.

Santiago

79,974 views • 1 year ago

Imagine a computer where you don’t need to learn 10 apps to get work done. You just tell it what you want, and it adapts to how you work. I tested Happycapy with a real use case and created an image and a video. No coding, no complex software.
Do this:
- Build automations that run on schedule
- Deploy agent teams that work for you
AI molds how you work, not the other way around. If you can use a computer, you can make anything happen. Start your first Automation and Agent teams today.

Aaliya

17,042 views • 5 months ago

Our first short course with Anthropic! Building Towards Computer Use with Anthropic. This teaches you to build an LLM-based agent that uses a computer interface by generating mouse clicks and keystrokes.
Computer Use is an important, emerging capability for LLMs that will let AI agents do many more tasks than were possible before, since it lets them interact with interfaces designed for humans to use, rather than only tools that provide explicit API access. I hope you will enjoy learning about it!
This course is taught by Anthropic's Head of Curriculum, Colt_Steele. You'll learn to apply image reasoning and tool use to "use" a computer as follows: a model processes an image of the screen, analyzes it to understand what's going on, and navigates the computer via mouse clicks and keystrokes.
This course goes through the key building blocks, and culminates in a demo of an AI assistant that uses a web browser to search for a research paper, downloads the PDF, and finally summarizes the paper for you.
In detail, you’ll:
- Learn about Anthropic's family of models, when to use which one, and make API requests to Claude
- Use multi-modal prompts that combine text and image content blocks, and also work with streaming responses
- Improve your prompting by using prompt templates, using XML to structure prompts, and providing examples
- Implement prompt caching to reduce cost and latency
- Apply tool-use to build a chatbot that can call different tools to respond to queries
- See all these building blocks come together in Computer Use demo
Please sign up here:

Andrew Ng

170,404 views • 1 year ago

Genspark AI Agent just released AI Designer. You can now have an AI agent that performs deep research and generates entire brand design systems. 5 powerful use cases + how to try👇: 1. From asset creation to full working site

Alvaro Cintas

123,409 views • 11 months ago

AgenticSeek is an opensource alternative to $200/month Manus AI agent. It runs entirely on your computer, letting local reasoning LLMs like DeepSeek R1 browse the web, write code, and execute tasks while keeping all your data private. 100% free and without internet.

Unwind AI

57,425 views • 1 year ago

Hermes Agent released v0.14, and it just made every other AI agent framework feel small. It now drives your computer. Searches X natively. Joins your Microsoft Teams calls. AND runs Grok with a 1M context window. (and for my crypto bro HyperLiquid) The AI agent era is crazy and beautiful.

Antoine Rousseaux

24,505 views • 2 months ago