Video yükleniyor...

Video Yüklenemedi

Ana Sayfaya Dön

part 4 Gsenti Sentient Sentient Chat Explaining ROMA – Recursive Open Meta-Agent 1. What is ROMA? ROMA is an open-source framework for building meta-agents — systems that can orchestrate multiple smaller agents and tools to solve complex tasks. Instead of letting one AI model handle an entire large problem...

16,298 görüntüleme • 9 ay önce •via X (Twitter)

0 Yorum

Yorum bulunmuyor

Orijinal gönderinin yorumları burada görünecek

Benzer Videolar

New Short Course: Building AI Browser Agents! Learn how to build AI agents that interact and take actions on websites in this course, created in partnership with and taught by and @namangarg0, Co-founders of AGI Inc. AI browser agents can log into websites, fill out forms, click through web pages, or even place orders online for you. They use both visual information, like screenshots, and structural data, like the HTML or Document Object Model (DOM) of a web page, to reason and take action. With the complexity of webpages and multiple possible actions at each step, it can be challenging for an AI browser agent to complete an assigned task. Because these agents run long action sequences, a single error—like clicking the wrong button or misreading a field—can lead to unexpected outcomes or errors that compound over time. In this course, you'll understand how autonomous web agents work, their current limitations, and how AgentQ enables them to improve through self-correction. In detail, you'll: - Learn what web agents are, how they automate tasks online, their architecture, key components, limitations, and an overview of their decision-making strategies. - Build a web agent that can scrape website and return course recommendations in a structured output format. - Build an autonomous web agent that can execute multiple tasks, such as finding and summarizing webpages, filling out a form, and signing up for a newsletter. - Explore AgentQ, a framework that enables agents to self-correct by combining Monte Carlo Tree Search (MCTS), a self-critique mechanism for continuous improvement, and Direct Preference Optimization (DPO). - Deep dive into MCTS, learn how it finds an effective path, illustrated by an example of Gridworld animation, and use AgentQ to complete web tasks. - Understand AI agents' current state and future directions—including key factors shaping their evolution, such as hardware, algorithm innovation, and data availability. By the end of this course, you will have hands-on experience building browser agents and a deeper understanding of how to make them more robust and reliable. Please sign up here:

Andrew Ng

185,933 görüntüleme • 1 yıl önce

Announcing my new course: Agentic AI! Building AI agents is one of the most in-demand skills in the job market. This course, available now at teaches you how. You'll learn to implement four key agentic design patterns: - Reflection, in which an agent examines its own output and figures out how to improve it - Tool use, in which an LLM-driven application decides which functions to call to carry out web search, access calendars, send email, write code, etc. - Planning, where you'll use an LLM to decide how to break down a task into sub-tasks for execution, and - Multi-agent collaboration, in which you build multiple specialized agents — much like how a company might hire multiple employees — to perform a complex task You'll also learn to take a complex application and systematically decompose it into a sequence of tasks to implement using these design patterns. But here's what I think is the most important part of this course: Having worked with many teams on AI agents, I've found that the single biggest predictor of whether someone executes well is their ability to drive a disciplined process for evals and error analysis. In this course, you'll learn how to do this, so you can efficiently home in on which components to improve in a complex agentic workflow. Instead of guessing what to work on, you'll let evals data guide you. This will put you significantly ahead of the game compared to the vast majority of teams building agents. Together, we'll build a deep research agent that searches, synthesizes, and reports, using all of these agentic design patterns and best practices. This self-paced course is taught in a vendor neutral way, using raw Python - without hiding details in a framework. You'll see how each step works, and learn the core concepts that you can then implement using any popular agentic AI framework, or using no framework. The only prerequisite is familiarity with Python, though knowing a bit about LLMs helps. Come join me, and let's build some agentic AI systems! Sign up to get started:

Andrew Ng

883,327 görüntüleme • 8 ay önce

New short course: Building Code Agents with Hugging Face smolagents! Learn how to build code agents in this course, created in collaboration with Hugging Face, and taught by Thomas Wolf, its co-founder and CSO, and m_ric, Hugging Face’s Project Lead on Agents. Tool-calling agents use LLMs to generate multiple function calls sequentially to complete a complex sequence of tasks. They generate one function call, execute it, observe, reason, and decide what to do next. Code agents take a different approach. They consolidate all these calls into a single block of code, letting the LLM lay out an entire action plan at once, which can be executed efficiently to provide more reliable results. You’ll learn how to code agents using smolagents, a lightweight agentic framework from Hugging Face. Along the way, you’ll learn how to run LLM-generated code safely and develop an evaluation system to optimize your code agent for production. In detail, you’ll learn: - How agentic systems have evolved, gaining greater levels of agency over time—and why code agents are a next step. - How code agents write their actions in code. - When code agents outperform function-calling agents. - How to run code agents safely in your system using a constrained Python interpreter and sandboxing using E2B. - To trace, debug, and assess the code agent to optimize its behaviours for complex requests. - How to build a research multi-agent system that can find information online and organize it into an interactive report. By the end of this course, you’ll know how to build and run code agents using smolagents, and deploy them safely with a structured evaluation system in your projects. Please sign up here!

Andrew Ng

124,382 görüntüleme • 1 yıl önce

AI AGENTS 101 (58 minute free masterclass) send this to anyone who wants to understand ai agents, claude skills, md files, how to get the most out of AI etc in plain english: 1. chat vs agents - chat models answer questions in a back and forth while agents take a goal, figure out the steps, and deliver a result 2. agents don’t stop after one response. they keep running until the task is actually finishedno babysitting required 3. everything runs on a loop. they gather context, decide what to do, take an action, then repeat until done 4. the loop is the system. they look at files, tools, and the internet. decide the next step. execute and then feed that back into the next step. over and over until completion 5. the model is just one piece. gpt, claude, gemini are the reasoning layer. the key is model + loop + tools + context 6. mcp is how agents use tools. it connects things like browser, code, apis, and your internal software. once connected, the agent decides when to use them to get the job done 7. context beats prompt all day. you don't need to write perfect prompts. load your agent with context about your business, style, and goals and then simple instructions work 8. claude.md or agents.md is the onboarding doc it tells the agent who it is, how to behave, what it knows, and what tools it can use. this gets loaded every time before it starts 9. memory.md is how it improves. agents don’t remember by default. this file stores preferences, corrections, and patterns you tell the agent to update it, and it gets better over time 10. skills + harnesses make it usable. skills are reusable tasks like writing, research, analysis the harness is the environment like claude code or openclaw that runs everything. basiclaly, different interfaces, same system underneath this episode with remy on The Startup Ideas Podcast (SIP) 🧃 was one of the clearest ways of understanding a lot of the core concepts of ai agents could be the best beginners course for ai agents 58 mins. all free. no advertisers. i just want to see you build cool stuff. im rooting for you. send to a friend watch

GREG ISENBERG

374,979 görüntüleme • 3 ay önce

JUST IN: Perplexity launched "Perplexity Computer" — and it might be the most complete AI agent system available right now. Not a chatbot upgrade. Not a research tool with a new name. A system that plans entire projects, delegates to specialist AI models, and runs autonomously for hours, days, or months (their words). Here's what makes the architecture genuinely different: → Opus 4.6 handles core reasoning and orchestration → Gemini handles deep research (spawning its own sub-agents) → Grok handles lightweight speed tasks → Veo 3.1 handles video generation → Nano Banana handles image creation → ChatGPT 5.2 handles long-context recall and wide search → You can override model choices per subtask 19 models total. Each task runs in an isolated environment with a real filesystem, real browser, and real tool integrations. You describe an outcome. It breaks it into tasks and subtasks, creates sub-agents for each, and coordinates them automatically. When a sub-agent hits a problem, it spawns more sub-agents to solve it. And it connects to your existing stack — GitHub, Google Drive, Gmail, Slack, Jira, Linear, Notion, Confluence, Ahrefs, Airtable, and more. Critically, it doesn't just run once. It can run on a schedule. Reading your docs, checking your project boards, pulling from your CRM, and acting on what it finds. Market monitoring. Competitor tracking. Weekly reports with charts. Content pipelines. CRON jobs that actually execute. Not "AI that helps you once." AI that runs in the background for days or months. Think of it as managed OpenClaw — similar autonomous capability (scheduled tasks, multi-step workflows, tool integrations) but fully managed. No Mac Mini. No security config. No infrastructure to maintain. I tested it with a complex prompt — a full stock trading simulator with what-if scenarios, correlation heatmaps, sentiment analysis, and a Bloomberg Terminal aesthetic. Two prompts later: deployed to Netlify via GitHub, with working CRON jobs updating live data. I've started using it to analyze my portfolio. But coding is just one lane. This thing researches, writes reports, generates datasets, creates videos, processes documents, and connects to your existing tools — all in one coordinated workflow. The real shift: you don't choose a model anymore. You describe what you need. The system routes each piece of work to whichever model does it best — and spawns new agents when it hits a wall. 19 models, dynamic sub-agents, scheduled tasks, and your entire tool stack connected. Thoughts?

Paweł Huryn

219,176 görüntüleme • 3 ay önce