正在加载视频...

视频加载失败

加载此视频时出现问题。这可能是由于临时网络问题，或视频可能不可用。

Droids have native multi-agent orchestration built-in. Our desktop app let's you easily manage these sub-agents as they tackle complex tasks. Monitor each agent as it works, and interrupt or inject context when needed.

Factory

43,416 subscribers

11,781 次观看 • 2 个月前 •via X (Twitter)

Anya Rossi• Live Now

Private livecam show

0 条评论

暂无评论

原始帖子的评论将显示在这里

相关视频

Finally, we're so excited to bring Project Mariner capabilities to AI Mode in Search, Agent Mode in Google Gemini, and as a standalone web app. This will allow the products you know and love to intelligently manage complex multi-step tasks on your behalf.

Finally, we're so excited to bring Project Mariner capabilities to AI Mode in Search, Agent Mode in Google Gemini, and as a standalone web app. This will allow the products you know and love to intelligently manage complex multi-step tasks on your behalf.

Google AI

29,189 次观看 • 1 年前

my new vibe code setup: 1 orchestrator agent which controls 85 sub-agents working in parallel each sub-agent spawns from my stream of consciousness and tests from the main orchestrator Here's how it works:

my new vibe code setup: 1 orchestrator agent which controls 85 sub-agents working in parallel each sub-agent spawns from my stream of consciousness and tests from the main orchestrator Here's how it works:

Greg Kamradt

316,172 次观看 • 1 年前

🔥 Introducing Skywork Mobile App 5.0—the world’s first native mobile app for Super AI Agents. - Meet VoiceNotes: Turn a single voice memo into clean summaries, transcripts, and visual notes. - Super Agent: Carry a crew of expert minds in your pocket. Run up to 3 expert agents in parallel to tackle complex tasks instantly. Skywork App 5.0 is now available for both iOS and Android.

🔥 Introducing Skywork Mobile App 5.0—the world’s first native mobile app for Super AI Agents. - Meet VoiceNotes: Turn a single voice memo into clean summaries, transcripts, and visual notes. - Super Agent: Carry a crew of expert minds in your pocket. Run up to 3 expert agents in parallel to tackle complex tasks instantly. Skywork App 5.0 is now available for both iOS and Android.

Skywork

276,527 次观看 • 6 个月前

Manage multiple agents at once in the Agent Dashboard. See what each is doing, reply to the ones that need you, and dispatch new tasks. Try /dashboard in Grok Build.

Manage multiple agents at once in the Agent Dashboard. See what each is doing, reply to the ones that need you, and dispatch new tasks. Try /dashboard in Grok Build.

Grok

590,839 次观看 • 7 天前

Stop spending hours on manual work. You can now use a multi-agent AI workforce to get more work done in less time. Here's how 👇 --- Try Eigent AI - Lets you build and run a custom AI workforce on your desktop. - Automate complex workflows using multi-agent task execution. - Built on CAMEL-AI’s top open-source projects ( CAMEL-AI.org & OWL). - Boost productivity with deep customization and strong privacy --- Features: - Customize Your AI Workforce: Build task-specific agents with domain skills and tools. - Faster Execution: Eigent runs agents in parallel to automate complex workflows. - Human-in-the-loop: Automatically asks for help when tasks hit uncertainty. --- What sets Eigent apart? - 3–5× faster task execution using a parallel multi-agent workforce. - Modular design lets you add new capabilities without changing the core system. - Self-optimizing agents that replan and adapt during execution for higher success. - Deploy anywhere: cloud, local, or enterprise, with full open-source flexibility. --- Try building your multi-agent AI workforce here: Join their community to build your multi-agent workforce: Check their GitHub: ---

Stop spending hours on manual work. You can now use a multi-agent AI workforce to get more work done in less time. Here's how 👇 --- Try Eigent AI - Lets you build and run a custom AI workforce on your desktop. - Automate complex workflows using multi-agent task execution. - Built on CAMEL-AI’s top open-source projects ( CAMEL-AI.org & OWL). - Boost productivity with deep customization and strong privacy --- Features: - Customize Your AI Workforce: Build task-specific agents with domain skills and tools. - Faster Execution: Eigent runs agents in parallel to automate complex workflows. - Human-in-the-loop: Automatically asks for help when tasks hit uncertainty. --- What sets Eigent apart? - 3–5× faster task execution using a parallel multi-agent workforce. - Modular design lets you add new capabilities without changing the core system. - Self-optimizing agents that replan and adapt during execution for higher success. - Deploy anywhere: cloud, local, or enterprise, with full open-source flexibility. --- Try building your multi-agent AI workforce here: Join their community to build your multi-agent workforce: Check their GitHub: ---

Shushant Lakhyani

20,423 次观看 • 10 个月前

Introducing the GitHub Copilot app, the desktop home for agent-native software development on GitHub

Introducing the GitHub Copilot app, the desktop home for agent-native software development on GitHub

Pierce Boggan

30,961 次观看 • 20 天前

Introducing agent-browser chat agent-browser is now a browser agent. → One-shot: agent-browser chat "open google, search for dogs" → Interactive: agent-browser chat → Built-in AI chat in the dashboard → Execute any agent-browser command → Use agent-browser as a sub-agent

Introducing agent-browser chat agent-browser is now a browser agent. → One-shot: agent-browser chat "open google, search for dogs" → Interactive: agent-browser chat → Built-in AI chat in the dashboard → Execute any agent-browser command → Use agent-browser as a sub-agent

Chris Tate

90,926 次观看 • 2 个月前

China just released a desktop automation agent that runs 100% locally. It can run any desktop app, open files, browse websites, and automate tasks without needing an internet connection. 100% Open-Source.

China just released a desktop automation agent that runs 100% locally. It can run any desktop app, open files, browse websites, and automate tasks without needing an internet connection. 100% Open-Source.

Hasan Toor

1,187,454 次观看 • 4 个月前

Microsoft presents Windows Agent Arena Evaluating Multi-Modal OS Agents at Scale discuss: Large language models (LLMs) show remarkable potential to act as computer agents, enhancing human productivity and software accessibility in multi-modal tasks that require planning and reasoning. However, measuring agent performance in realistic environments remains a challenge since: (i) most benchmarks are limited to specific modalities or domains (e.g. text-only, web navigation, Q&A, coding) and (ii) full benchmark evaluations are slow (on order of magnitude of days) given the multi-step sequential nature of tasks. To address these challenges, we introduce the Windows Agent Arena: a reproducible, general environment focusing exclusively on the Windows operating system (OS) where agents can operate freely within a real Windows OS and use the same wide range of applications, tools, and web browsers available to human users when solving tasks. We adapt the OSWorld framework (Xie et al., 2024) to create 150+ diverse Windows tasks across representative domains that require agent abilities in planning, screen understanding, and tool usage. Our benchmark is scalable and can be seamlessly parallelized in Azure for a full benchmark evaluation in as little as 20 minutes. To demonstrate Windows Agent Arena's capabilities, we also introduce a new multi-modal agent, Navi. Our agent achieves a success rate of 19.5% in the Windows domain, compared to 74.5% performance of an unassisted human. Navi also demonstrates strong performance on another popular web-based benchmark, Mind2Web. We offer extensive quantitative and qualitative analysis of Navi's performance, and provide insights into the opportunities for future research in agent development and data generation using Windows Agent Arena.

Microsoft presents Windows Agent Arena Evaluating Multi-Modal OS Agents at Scale discuss: Large language models (LLMs) show remarkable potential to act as computer agents, enhancing human productivity and software accessibility in multi-modal tasks that require planning and reasoning. However, measuring agent performance in realistic environments remains a challenge since: (i) most benchmarks are limited to specific modalities or domains (e.g. text-only, web navigation, Q&A, coding) and (ii) full benchmark evaluations are slow (on order of magnitude of days) given the multi-step sequential nature of tasks. To address these challenges, we introduce the Windows Agent Arena: a reproducible, general environment focusing exclusively on the Windows operating system (OS) where agents can operate freely within a real Windows OS and use the same wide range of applications, tools, and web browsers available to human users when solving tasks. We adapt the OSWorld framework (Xie et al., 2024) to create 150+ diverse Windows tasks across representative domains that require agent abilities in planning, screen understanding, and tool usage. Our benchmark is scalable and can be seamlessly parallelized in Azure for a full benchmark evaluation in as little as 20 minutes. To demonstrate Windows Agent Arena's capabilities, we also introduce a new multi-modal agent, Navi. Our agent achieves a success rate of 19.5% in the Windows domain, compared to 74.5% performance of an unassisted human. Navi also demonstrates strong performance on another popular web-based benchmark, Mind2Web. We offer extensive quantitative and qualitative analysis of Navi's performance, and provide insights into the opportunities for future research in agent development and data generation using Windows Agent Arena.

AK

19,684 次观看 • 1 年前

I added multi-agents to our hedge fund UX. Best part is the agents run in parallel. We now have: 1 • Warren Buffett agent 2 • Charlie Munger agent All made easy thanks to LangChain

I added multi-agents to our hedge fund UX. Best part is the agents run in parallel. We now have: 1 • Warren Buffett agent 2 • Charlie Munger agent All made easy thanks to LangChain

virat

25,108 次观看 • 1 年前

New agent mode and IDE enhancements in Code Assist! → Agent mode: Analyzes full codebase to handle multi-file tasks from one prompt IDE Enhancements: Control context via .aiexclude, attach terminal output & enjoy a smoother UI

New agent mode and IDE enhancements in Code Assist! → Agent mode: Analyzes full codebase to handle multi-file tasks from one prompt IDE Enhancements: Control context via .aiexclude, attach terminal output & enjoy a smoother UI

Google Cloud Tech

177,774 次观看 • 11 个月前

Contact sheet prompting as an agent workflow. No manual orchestration needed. Claude Sonnet 4.5 → Nano Banana Pro → Kling 2.6 → finished video.

Contact sheet prompting as an agent workflow. No manual orchestration needed. Claude Sonnet 4.5 → Nano Banana Pro → Kling 2.6 → finished video.

Miguel | AP

11,903 次观看 • 6 个月前

I found this last night and I have not stopped thinking about it. HERMES JUST LAUNCHED HERMES DESKTOP. 100% FREE. It is a free desktop app that gives Hermes Agent a proper interface. One place for everything. What is inside: ↳ Auto install and setup, no terminal needed ↳ Streaming chat with token tracking ↳ Multiple agent profiles ↳ Memory you can actually see and edit ↳ 14 tool categories including web, browser, image gen, and voice ↳ Scheduler for automated tasks ↳ 16 messaging gateways including Telegram, WhatsApp, Discord, Slack, and Signal ↳ Full conversation history with search ↳ Backups and logs in one settings screen Works with Anthropic, OpenAI, Gemini, Grok, Groq, Ollama, and more. Hermes Agent is the brain. Hermes Desktop is the cockpit. Free. Open source. Mac, Windows, and Linux.

I found this last night and I have not stopped thinking about it. HERMES JUST LAUNCHED HERMES DESKTOP. 100% FREE. It is a free desktop app that gives Hermes Agent a proper interface. One place for everything. What is inside: ↳ Auto install and setup, no terminal needed ↳ Streaming chat with token tracking ↳ Multiple agent profiles ↳ Memory you can actually see and edit ↳ 14 tool categories including web, browser, image gen, and voice ↳ Scheduler for automated tasks ↳ 16 messaging gateways including Telegram, WhatsApp, Discord, Slack, and Signal ↳ Full conversation history with search ↳ Backups and logs in one settings screen Works with Anthropic, OpenAI, Gemini, Grok, Groq, Ollama, and more. Hermes Agent is the brain. Hermes Desktop is the cockpit. Free. Open source. Mac, Windows, and Linux.

Kanika

56,505 次观看 • 23 天前

SOMEONE OPEN SOURCED A SMALL MODEL TRAINED SPECIFICALLY AS A PERSONAL AGENT ROUTER DECIDES WHAT RUNS LOCAL VS CLOUD AUTOMATICALLY ROUTES TASKS TO THE RIGHT MODEL BASED ON COMPLEXITY SMARTER AGENT ORCHESTRATION WITHOUT THE OVERHEAD

SOMEONE OPEN SOURCED A SMALL MODEL TRAINED SPECIFICALLY AS A PERSONAL AGENT ROUTER DECIDES WHAT RUNS LOCAL VS CLOUD AUTOMATICALLY ROUTES TASKS TO THE RIGHT MODEL BASED ON COMPLEXITY SMARTER AGENT ORCHESTRATION WITHOUT THE OVERHEAD

0xMarioNawfal

157,414 次观看 • 3 个月前

WandrLust’s AI agents have distinct personalities and voices, all with a unique purpose within the app: • Gary: Guide Agent • Remi: Wellness Agent • Diego: Offer Agent • Frankie: Payment Agent They’re inspired by our GRFFs, an upcoming Bitcoin Ordinals collection and the original characters that form the cultural foundation of the WandrLust Outverse, our ecosystem built around real-world outdoor activity, explorers, agents and rewards. The next chapter begins soon. $AFK TGE: 25 March

WandrLust’s AI agents have distinct personalities and voices, all with a unique purpose within the app: • Gary: Guide Agent • Remi: Wellness Agent • Diego: Offer Agent • Frankie: Payment Agent They’re inspired by our GRFFs, an upcoming Bitcoin Ordinals collection and the original characters that form the cultural foundation of the WandrLust Outverse, our ecosystem built around real-world outdoor activity, explorers, agents and rewards. The next chapter begins soon. $AFK TGE: 25 March

WandrLust🌿We Pay You To Live

25,029 次观看 • 3 个月前

🧃 Introducing stereOS: a Linux based operating system hardened and purpose built for AI agents. It's clear that agents need an ACTUAL operating system (not what people are calling an "OS") to witness the full breadth and depth of their capabilities while mitigating the blast radius of autonomous, untrusted actors. But there are so many problems with AI sandboxes today: * Going out to the apple store and buying a mac mini will never scale and is way too expensive (obviously) * Running in Docker is too restrictive (agents can't stand up their own container infrastructure, no sub virtualization, docker-in-docker is very broken) * Firecracker strips all the hardware so GPU PCIe passthrough, secure boot, FIPs, etc. is out of the question. * Native VMs are too fat and the overhead of 1 agent per VM is too much. stereOS takes a different approach: it's a full NixOS system that you boot and then kick off agent sandboxes inside with gVisor + /nix/store namespace mounting. Each agent gets their own kernel and the /nix/store is read only by nature. Even if the agent was somehow able to escape the gVisor virtual kernel, they'd land on the NixOS system as the "agent" user! Not your actual hardware!! If you want to take a defense-in-depth approach, we support "native" agents that run at the system level kicked off by our `agentd` utility. These agents, on their own, can manage and kick off other sub agents using the internal sandboxing mechanisms. Today, we're open sourcing all of this: * stereOS: our purpose built Linux OS - * masterblaster: client utility to launch, manage, and orchestrate agents - * stereosd: the stereOS system control plane daemon - * agentd: the stereOS system agent management daemon - Give it a try, throw us a star, and let me know what you think 🧃⭐️

🧃 Introducing stereOS: a Linux based operating system hardened and purpose built for AI agents. It's clear that agents need an ACTUAL operating system (not what people are calling an "OS") to witness the full breadth and depth of their capabilities while mitigating the blast radius of autonomous, untrusted actors. But there are so many problems with AI sandboxes today: * Going out to the apple store and buying a mac mini will never scale and is way too expensive (obviously) * Running in Docker is too restrictive (agents can't stand up their own container infrastructure, no sub virtualization, docker-in-docker is very broken) * Firecracker strips all the hardware so GPU PCIe passthrough, secure boot, FIPs, etc. is out of the question. * Native VMs are too fat and the overhead of 1 agent per VM is too much. stereOS takes a different approach: it's a full NixOS system that you boot and then kick off agent sandboxes inside with gVisor + /nix/store namespace mounting. Each agent gets their own kernel and the /nix/store is read only by nature. Even if the agent was somehow able to escape the gVisor virtual kernel, they'd land on the NixOS system as the "agent" user! Not your actual hardware!! If you want to take a defense-in-depth approach, we support "native" agents that run at the system level kicked off by our `agentd` utility. These agents, on their own, can manage and kick off other sub agents using the internal sandboxing mechanisms. Today, we're open sourcing all of this: * stereOS: our purpose built Linux OS - * masterblaster: client utility to launch, manage, and orchestrate agents - * stereosd: the stereOS system control plane daemon - * agentd: the stereOS system agent management daemon - Give it a try, throw us a star, and let me know what you think 🧃⭐️

John McBride

150,178 次观看 • 3 个月前

BREAKING: OpenAI just launched ChatGPT Agent It allows ChatGPT to think, plan, and execute complex tasks on its own virtual computer while you do other things I had early access, and ChatGPT Agent built me a complete early retirement plan in 20 minutes: > Found local tax laws (Vancouver) > Analyzed average monthly spend rates > Calculated savings needed to retire at 30 > Researched optimal investment allocations > Found tax optimization strategies I'd never heard of > Built multiple FIRE scenarios > Created a downloadable presentation with results This would've cost me $5,000+ from a financial advisor and taken weeks I think with ChatGPT Agent now, and especially as it gains access to more tools, we're finally going to see the rise of a new AI skill category in *Agent Management* Agents are finally becoming capable of doing real work autonomously, so anyone who learns how to effectively orchestrate agents will have a huge advantage

BREAKING: OpenAI just launched ChatGPT Agent It allows ChatGPT to think, plan, and execute complex tasks on its own virtual computer while you do other things I had early access, and ChatGPT Agent built me a complete early retirement plan in 20 minutes: > Found local tax laws (Vancouver) > Analyzed average monthly spend rates > Calculated savings needed to retire at 30 > Researched optimal investment allocations > Found tax optimization strategies I'd never heard of > Built multiple FIRE scenarios > Created a downloadable presentation with results This would've cost me $5,000+ from a financial advisor and taken weeks I think with ChatGPT Agent now, and especially as it gains access to more tools, we're finally going to see the rise of a new AI skill category in Agent Management Agents are finally becoming capable of doing real work autonomously, so anyone who learns how to effectively orchestrate agents will have a huge advantage

Rowan Cheung

653,674 次观看 • 11 个月前

your coding agents are Kanban cards now 😯. New in Orca: open a board over any terminal pane and drag each agent worktree between statuses. todo, in progress, review, testing, blocked, done, or whatever custom columns fit your workflow. much easier when you have 10+ agent running across different features.

your coding agents are Kanban cards now 😯. New in Orca: open a board over any terminal pane and drag each agent worktree between statuses. todo, in progress, review, testing, blocked, done, or whatever custom columns fit your workflow. much easier when you have 10+ agent running across different features.

Orca ADE

40,317 次观看 • 28 天前

Big news, friends! I hereby introduce It's a multi-agent chat app with special features for collaborative ranking and estimation tasks, to help you quickly fact-check AI responses against each other. It has GPT-5, Claude Opus 4.1, Gemini 2.5 Pro, and Grok 4, and built-in systems for comparing and aggregating their responses. If you try it, post feature requests for me and the team theMultiplicity.ai!

Big news, friends! I hereby introduce It's a multi-agent chat app with special features for collaborative ranking and estimation tasks, to help you quickly fact-check AI responses against each other. It has GPT-5, Claude Opus 4.1, Gemini 2.5 Pro, and Grok 4, and built-in systems for comparing and aggregating their responses. If you try it, post feature requests for me and the team theMultiplicity.ai!

Andrew Critch (🤖🩺🚀)

20,112 次观看 • 7 个月前