AI web agents like Operator and Anthropic’s Computer Use can operate a browser, but the LLMs inside are brittle, and you can’t trust what’s on the web. In this 🧵, I’ll show how adversaries can fool Anthropic’s web agent into sending phishing emails or revealing credit card info.

Micah Goldblum

8,904 subscribers

42,969 views • 1 year ago • via X (Twitter)

Science & Technology, News & Politics

11 comments

Micah Goldblum • 1 year ago

We can sneak posts onto Reddit that redirect Anthropic’s web agent to reveal credit card information or send an authenticated phishing email to the user’s mom. We also manipulate the ChemCrow agent to give chemical synthesis instructions for nerve gas.

Micah Goldblum • 1 year ago

Let’s start with credit card stealing. A user asks for something innocuous, like info about an AI fridge. Web agents don’t trust random sites, but they love Reddit. So let’s make a post on Reddit that matches the search terms. After Anthropic’s agent Googles, it clicks the post.

Micah Goldblum • 1 year ago

The post instructs the agent to complete the user’s request by following a (malicious) link. In principle, we could use a DAN prompt, but we found that just telling the agent to follow the link is enough. By using Reddit as an entry point, we can redirect the agent to any site.
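One guard-rail this step suggests, as a minimal sketch and not something taken from the paper: treat links found in user-generated content as untrusted, and let the agent follow them only when the destination is the same host or on a user-approved list. The names `TRUSTED_DESTINATIONS`, `is_subdomain_of`, and `is_navigation_allowed`, and all example URLs, are hypothetical.

```python
from urllib.parse import urlparse

# Hypothetical allowlist of destinations the user has approved in advance.
TRUSTED_DESTINATIONS = {"reddit.com", "wikipedia.org"}


def is_subdomain_of(host: str, trusted: str) -> bool:
    """True if host equals the trusted domain or is a subdomain of it."""
    return host == trusted or host.endswith("." + trusted)


def is_navigation_allowed(source_url: str, target_url: str) -> bool:
    """Allow a link only if it stays on the source host or hits the allowlist."""
    source = (urlparse(source_url).hostname or "").lower()
    target = (urlparse(target_url).hostname or "").lower()
    if not target:
        # Relative or malformed URL: the caller should resolve it first.
        return False
    if target == source:
        return True
    return any(is_subdomain_of(target, d) for d in TRUSTED_DESTINATIONS)
```

A blocked link does not have to end the task; the agent could instead surface it to the user and ask whether to proceed.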

Micah Goldblum • 1 year ago

…So the agent follows the link. The malicious page instructs the agent to fulfill the user’s request by filling out a form. The agent fills it out, including the address and credit card number. Sometimes the agent realizes it’s a scam, but only after it has already entered the credit card info.
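Because the agent sometimes notices the scam only after typing, one possible check is to inspect form values before they are submitted. The sketch below is an illustration under assumptions, not the authors' method: it flags fields whose text contains a digit run that passes the Luhn checksum used by payment cards. `fields_with_card_numbers` and the field names are hypothetical.

```python
import re

# 13 to 19 digits, optionally separated by spaces or hyphens.
CARD_PATTERN = re.compile(r"\b(?:\d[ -]?){13,19}\b")


def luhn_valid(digits: str) -> bool:
    """Standard Luhn checksum: double every second digit from the right."""
    total = 0
    for i, ch in enumerate(reversed(digits)):
        d = int(ch)
        if i % 2 == 1:
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0


def fields_with_card_numbers(form_fields: dict[str, str]) -> list[str]:
    """Return names of fields whose value contains a Luhn-valid card-like number."""
    flagged = []
    for name, value in form_fields.items():
        for match in CARD_PATTERN.finditer(value):
            digits = re.sub(r"\D", "", match.group())
            if 13 <= len(digits) <= 19 and luhn_valid(digits):
                flagged.append(name)
                break
    return flagged
```

If any field is flagged and the page's domain is not one the user approved for payment, the harness could pause and ask for confirmation before the agent types anything.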

Micah Goldblum • 1 year ago

This overarching strategy works for all sorts of attacks. In this example, the web page tells the agent that the user’s request will be completed after sending an email to the user’s mother, telling her that there is an emergency and she should send money to a crypto wallet.

Micah Goldblum • 1 year ago

Because this user has previously logged into email in their browser, the agent can search their contacts for their mother’s email address and then send a request asking for money. The request will come from the user’s personal email account.
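The email attack works because an instruction from a web page triggers an action in an authenticated session. A hedged sketch of one mitigation is a confirmation gate: actions with side effects need explicit user approval unless the user asked for them directly. Everything here is hypothetical, including `SENSITIVE_ACTIONS`, `ProposedAction`, and the `origin` label, and it assumes the harness can tell where an instruction came from, which is itself a hard problem.

```python
from dataclasses import dataclass

# Hypothetical action kinds with effects beyond the current browser tab.
SENSITIVE_ACTIONS = {"send_email", "submit_payment", "read_contacts"}


@dataclass(frozen=True)
class ProposedAction:
    kind: str    # e.g. "click" or "send_email"
    origin: str  # "user" if requested in the user's prompt, "web" if from page content


def requires_confirmation(action: ProposedAction) -> bool:
    """Sensitive actions that did not originate from the user need approval."""
    return action.kind in SENSITIVE_ACTIONS and action.origin != "user"
```

Under this rule, an email the user asked for goes through, while an email requested by a web page is held until the user confirms it.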

Micah Goldblum • 1 year ago

In our paper, we also demonstrate a simple attack that swaps recipes in databases (e.g. bioRxiv) indexed by the ChemCrow chemical synthesis agent, causing it to give ingredients for poison gas instead of a recipe for a common medication.

Micah Goldblum • 1 year ago

Here’s our paper on all these attacks: Agentic pipelines have access to databases, web browsers, APIs, and more. These components give rise to security and privacy vulnerabilities that are already present in today’s agentic products.

Micah Goldblum • 1 year ago

Surprisingly, the attacks we discuss are implemented with trivial prompt engineering - once agents get on Reddit, they pretty much do whatever we want. Threats will only grow as agents become more powerful and prevalent in our daily lives.

Micah Goldblum • 1 year ago

Let’s build better guard-rails! Thanks to all the amazing collaborators who made this work possible! @iamleonli, @Levine_YZhou, @Vethssvikas, @tomgoldsteincs
