正在加载视频...

视频加载失败

加载此视频时出现问题。这可能是由于临时网络问题，或视频可能不可用。

There is no better place on the internet to learn about Context Engineering than this repo. It's literally a course with a learning path. It gathers the best resources and covers the theory + code for anything related to context, RAG, memory, and agentic systems and more.

ℏεsam

39,319 subscribers

48,999 次观看 • 10 个月前 •via X (Twitter)

健康养生科学技术教育

Anya Rossi• Live Now

Private livecam show

0 条评论

暂无评论

原始帖子的评论将显示在这里

相关视频

The best way to learn AI is to build with agents. To help with that, we've launched hands-on labs and a new series on Agentic Engineering. First topic: Agent Skills. Next in the pipeline: planning, context engineering, multi-agent systems, long-running agents,.. Go build!

The best way to learn AI is to build with agents. To help with that, we've launched hands-on labs and a new series on Agentic Engineering. First topic: Agent Skills. Next in the pipeline: planning, context engineering, multi-agent systems, long-running agents,.. Go build!

elvis

31,802 次观看 • 1 个月前

Build Multi Agent Systems with Reasoning and Context Thanks to sonnet-4, we have level 4 autonomous multi-agent systems working. Learn how to add: -> Reasoning Tools (think -> analyze) -> Shared Agentic Context -> Agentic Memory Code below 👇

Build Multi Agent Systems with Reasoning and Context Thanks to sonnet-4, we have level 4 autonomous multi-agent systems working. Learn how to add: -> Reasoning Tools (think -> analyze) -> Shared Agentic Context -> Agentic Memory Code below 👇

Ashpreet Bedi

38,576 次观看 • 1 年前

New short course: LLMs as Operating Systems: Agent Memory, created with Letta, and taught by its founders Charles Packer and Sarah Wooders. An LLM's input context window has limited space. Using a longer input context also costs more and results in slower processing. So, managing what's stored in this context window is important. In the innovative paper MemGPT: Towards LLMs as Operating Systems, its authors (which include the instructors) proposed using an LLM agent to manage this context window. Their system uses a large persistent memory that stores everything that could be included in the input context, and an agent decides what is actually included. Take the example of building a chatbot that needs to remember what's been said earlier in a conversation (perhaps over many days of interaction with a user). As the conversation's length grows, the memory management agent will move information from the input context to a persistent searchable database; summarize information to keep relevant facts in the input context; and restore relevant conversation elements from further back in time. This allows a chatbot to keep what's currently most relevant in its input context memory to generate the next response. When I read the original MemGPT paper, I thought it was an innovative technique for handling memory for LLMs. The open-source Letta framework, which we'll use in this course, makes MemGPT easy to implement. It adds memory to your LLM agents and gives them transparent long-term memory. In detail, you’ll learn: - How to build an agent that can edit its own limited input context memory, using tools and multi-step reasoning - What is a memory hierarchy (an idea from computer operating systems, which use a cache to speed up memory access), and how these ideas apply to managing the LLM input context (where the input context window is a "cache" storing the most relevant information; and an agent decides what to move in and out of this to/from a larger persistent storage system) - How to implement multi-agent collaboration by letting different agents share blocks of memory This course will give you a sophisticated understanding of memory management for LLMs, which is important for chatbots having long conversations, and for complex agentic workflows. Please sign up here!

New short course: LLMs as Operating Systems: Agent Memory, created with Letta, and taught by its founders Charles Packer and Sarah Wooders. An LLM's input context window has limited space. Using a longer input context also costs more and results in slower processing. So, managing what's stored in this context window is important. In the innovative paper MemGPT: Towards LLMs as Operating Systems, its authors (which include the instructors) proposed using an LLM agent to manage this context window. Their system uses a large persistent memory that stores everything that could be included in the input context, and an agent decides what is actually included. Take the example of building a chatbot that needs to remember what's been said earlier in a conversation (perhaps over many days of interaction with a user). As the conversation's length grows, the memory management agent will move information from the input context to a persistent searchable database; summarize information to keep relevant facts in the input context; and restore relevant conversation elements from further back in time. This allows a chatbot to keep what's currently most relevant in its input context memory to generate the next response. When I read the original MemGPT paper, I thought it was an innovative technique for handling memory for LLMs. The open-source Letta framework, which we'll use in this course, makes MemGPT easy to implement. It adds memory to your LLM agents and gives them transparent long-term memory. In detail, you’ll learn: - How to build an agent that can edit its own limited input context memory, using tools and multi-step reasoning - What is a memory hierarchy (an idea from computer operating systems, which use a cache to speed up memory access), and how these ideas apply to managing the LLM input context (where the input context window is a "cache" storing the most relevant information; and an agent decides what to move in and out of this to/from a larger persistent storage system) - How to implement multi-agent collaboration by letting different agents share blocks of memory This course will give you a sophisticated understanding of memory management for LLMs, which is important for chatbots having long conversations, and for complex agentic workflows. Please sign up here!

Andrew Ng

200,729 次观看 • 1 年前

A new short course, Claude Code: A Highly Agentic Coding Assistant, is live! Claude Code is currently one of the most capable coding assistants. It can explore your codebase, plan features, write tests, refactor code, and even collaborate across multiple sessions—with surprisingly minimal input. In this course, you’ll learn how to guide Claude Code effectively: from setting up context and memory to integrating with GitHub and MCP servers. You’ll use it to extend a RAG chatbot, refactor a Jupyter notebook for e-commerce data analysis, build a web app from a Figma design, and more. Taught by Elie Schoppik (Elie Schoppik) and built in collaboration with Anthropic, this course is a must for AI builders. 👉 Enroll now:

A new short course, Claude Code: A Highly Agentic Coding Assistant, is live! Claude Code is currently one of the most capable coding assistants. It can explore your codebase, plan features, write tests, refactor code, and even collaborate across multiple sessions—with surprisingly minimal input. In this course, you’ll learn how to guide Claude Code effectively: from setting up context and memory to integrating with GitHub and MCP servers. You’ll use it to extend a RAG chatbot, refactor a Jupyter notebook for e-commerce data analysis, build a web app from a Figma design, and more. Taught by Elie Schoppik (Elie Schoppik) and built in collaboration with Anthropic, this course is a must for AI builders. 👉 Enroll now:

DeepLearning.AI

32,513 次观看 • 10 个月前

Repo Prompt 1.1 just released with the context builder update! This major new update helps you surface only the files necessary for a given task, to avoid needing to zip up way more context than necessary, saving you on cost and getting better model responses as a result

Repo Prompt 1.1 just released with the context builder update! This major new update helps you surface only the files necessary for a given task, to avoid needing to zip up way more context than necessary, saving you on cost and getting better model responses as a result

eric provencher

48,963 次观看 • 1 年前

Introducing Cognee v1.0: a major breakthrough in agentic intelligence. It is 145% better than Opus 4.8 and GPT 5.5 at long context memory retrieval. Cognee allows a 100 BILLION token context window 100,000x more than Claude. It's: - 6.9x cheaper than GPT 5.5 and Opus 4.8 - Cold starts in 350ms & searches in 260ms Why this matters: Today agents forget important context, redo tasks, waste tokens, and slow down as workflows get more complex. Cognee solves this. It’s not a place to build agents. It connects to the agents you’ve already built, across any platform, and makes them significantly cheaper, faster, and more accurate. Here's how it works:

Introducing Cognee v1.0: a major breakthrough in agentic intelligence. It is 145% better than Opus 4.8 and GPT 5.5 at long context memory retrieval. Cognee allows a 100 BILLION token context window 100,000x more than Claude. It's: - 6.9x cheaper than GPT 5.5 and Opus 4.8 - Cold starts in 350ms & searches in 260ms Why this matters: Today agents forget important context, redo tasks, waste tokens, and slow down as workflows get more complex. Cognee solves this. It’s not a place to build agents. It connects to the agents you’ve already built, across any platform, and makes them significantly cheaper, faster, and more accurate. Here's how it works:

Vasilije

756,202 次观看 • 1 天前

Announcing a new Coursera course: Retrieval Augmented Generation (RAG) You'll learn to build high performance, production-ready RAG systems in this hands-on, in-depth course created by and taught by Zain, experienced AI and ML engineer, researcher, and educator. RAG is a critical component today of many LLM-based applications in customer support, internal company Q&A systems, even many of the leading chatbots that use web search to answer your questions. This course teaches you in-depth how to make RAG work well. LLMs can produce generic or outdated responses, especially when asked specialized questions not covered in its training data. RAG is the most widely used technique for addressing this. It brings in data from new data sources, such as internal documents or recent news, to give the LLM the relevant context to private, recent, or specialized information. This lets it generate more grounded and accurate responses. In this course, you’ll learn to design and implement every part of a RAG system, from retrievers to vector databases to generation to evals. You’ll learn about the fundamental principles behind RAG and how to optimize it at both the component and whole-system levels. As AI evolves, RAG is evolving too. New models can handle longer context windows, reason more effectively, and can be parts of complex agentic workflows. One exciting growth area is Agentic RAG, in which an AI agent at runtime (rather than it being hardcoded at development time) autonomously decides what data to retrieve, and when/how to go deeper. Even with this evolution, access to high-quality data at runtime is essential, which is why RAG is a key part of so many applications. You'll learn via hands-on experiences to: - Build a RAG system with retrieval and prompt augmentation - Compare retrieval methods like BM25, semantic search, and Reciprocal Rank Fusion - Chunk, index, and retrieve documents using a Weaviate vector database and a news dataset - Develop a chatbot, using open-source LLMs hosted by Together AI, for a fictional store that answers product and FAQ questions - Use evals to drive improving reliability, and incorporate multi-modal data RAG is an important foundational technique. Become good at it through this course! Please sign up here:

Announcing a new Coursera course: Retrieval Augmented Generation (RAG) You'll learn to build high performance, production-ready RAG systems in this hands-on, in-depth course created by and taught by Zain, experienced AI and ML engineer, researcher, and educator. RAG is a critical component today of many LLM-based applications in customer support, internal company Q&A systems, even many of the leading chatbots that use web search to answer your questions. This course teaches you in-depth how to make RAG work well. LLMs can produce generic or outdated responses, especially when asked specialized questions not covered in its training data. RAG is the most widely used technique for addressing this. It brings in data from new data sources, such as internal documents or recent news, to give the LLM the relevant context to private, recent, or specialized information. This lets it generate more grounded and accurate responses. In this course, you’ll learn to design and implement every part of a RAG system, from retrievers to vector databases to generation to evals. You’ll learn about the fundamental principles behind RAG and how to optimize it at both the component and whole-system levels. As AI evolves, RAG is evolving too. New models can handle longer context windows, reason more effectively, and can be parts of complex agentic workflows. One exciting growth area is Agentic RAG, in which an AI agent at runtime (rather than it being hardcoded at development time) autonomously decides what data to retrieve, and when/how to go deeper. Even with this evolution, access to high-quality data at runtime is essential, which is why RAG is a key part of so many applications. You'll learn via hands-on experiences to: - Build a RAG system with retrieval and prompt augmentation - Compare retrieval methods like BM25, semantic search, and Reciprocal Rank Fusion - Chunk, index, and retrieve documents using a Weaviate vector database and a news dataset - Develop a chatbot, using open-source LLMs hosted by Together AI, for a fictional store that answers product and FAQ questions - Use evals to drive improving reliability, and incorporate multi-modal data RAG is an important foundational technique. Become good at it through this course! Please sign up here:

Andrew Ng

124,458 次观看 • 11 个月前

This is Repo Prompt 2.0 A fully integrated agent, that makes it seamless to use RP's powerful MCP tools, with a built-in oracle and context builder. A first class experience showcasing how much better and efficient your agents can be with good context engineering tools.

This is Repo Prompt 2.0 A fully integrated agent, that makes it seamless to use RP's powerful MCP tools, with a built-in oracle and context builder. A first class experience showcasing how much better and efficient your agents can be with good context engineering tools.

eric provencher

38,921 次观看 • 4 个月前

Our new short course, Knowledge Graphs for RAG, is now available! Knowledge graphs are a data structure that is great at capturing complex relationships between data of multiple types. By enabling more sophisticated retrieval of text than similarity search alone, knowledge graphs can improve the context you pass to the LLM and the performance of your RAG applications. In this course, taught by Andreas Kollegger of Neo4j, you’ll - Explore how knowledge graphs work by building a graph of public financial documents from scratch - Learn to write queries that retrieve text and data from the graph and use it to enhance the context you pass to an LLM chatbot - Combine a knowledge graph with a question-answer chain to build better RAG-powered chat systems Sign up here!

Our new short course, Knowledge Graphs for RAG, is now available! Knowledge graphs are a data structure that is great at capturing complex relationships between data of multiple types. By enabling more sophisticated retrieval of text than similarity search alone, knowledge graphs can improve the context you pass to the LLM and the performance of your RAG applications. In this course, taught by Andreas Kollegger of Neo4j, you’ll - Explore how knowledge graphs work by building a graph of public financial documents from scratch - Learn to write queries that retrieve text and data from the graph and use it to enhance the context you pass to an LLM chatbot - Combine a knowledge graph with a question-answer chain to build better RAG-powered chat systems Sign up here!

Andrew Ng

244,257 次观看 • 2 年前

Claude Tag is the next evolution of agents. It's a proactive, multiplayer agent with memory and identity, built on top of Claude Code. Learn more about how Claude Tag works and best practices for using it in this deep dive.

Claude Tag is the next evolution of agents. It's a proactive, multiplayer agent with memory and identity, built on top of Claude Code. Learn more about how Claude Tag works and best practices for using it in this deep dive.

ClaudeDevs

378,049 次观看 • 1 天前

Introducing Claude Code Hook - Context Timeline (Saving this to try later) Install with: npx claude-code-templates@latest --hook monitoring/context-timeline Managing the context window and the subagents running in Claude Code is hard to keep track of That's why I built this hook... It starts the moment you open a session and shows a timeline with the main agent's context window and how subagents start working in their own separate context Every subagent you have running will show up in real time This way you can manage the context and the subagents you run, and see everything in a much simpler way than in the console

Introducing Claude Code Hook - Context Timeline (Saving this to try later) Install with: npx claude-code-templates@latest --hook monitoring/context-timeline Managing the context window and the subagents running in Claude Code is hard to keep track of That's why I built this hook... It starts the moment you open a session and shows a timeline with the main agent's context window and how subagents start working in their own separate context Every subagent you have running will show up in real time This way you can manage the context and the subagents you run, and see everything in a much simpler way than in the console

Daniel San

51,228 次观看 • 2 个月前

Almost 20 years later, AWS is still the most popular cloud in the world. The reason is simple: it just works! They have four services focused on Generative AI: 1. Amazon Q 2. Amazon Bedrock 3. SageMaker JumpStart 4. PartyRock I've been using AWS for around 15 years (honestly, I don't remember well), and their Developer Center is a gold mine. If you open their Developer Center, you'll find a new learning path, "Generative AI for Developers." I'm linking to it below. This is not just a course. This is a collection of courses, examples, videos, tutorials, and code walkthroughs. They will teach you how to use Generative AI on AWS using the four services above. ↑ That right there is a huge selling point: These classes aren't just theoretical. You'll have a chance to learn using the same professional tools everyone else uses. By the way, there are many more resources in the Developer Center: • Machine Learning • Data Operations • DevOps All of these are free. Click, click, and start learning right away. One more thing before I forget: If you are building anything with AWS, check out Amazon Q, their coding assistant. This is the *best* coding assistant for AWS-related work, and it's not even close. It's a Visual Studio Code extension. Install it, and it works like any other code assistant, except this one knows a lot about AWS. Thanks to AWS for sponsoring a post about their free courses and learning resources. There's a special place in Developer Heaven for you.

Almost 20 years later, AWS is still the most popular cloud in the world. The reason is simple: it just works! They have four services focused on Generative AI: 1. Amazon Q 2. Amazon Bedrock 3. SageMaker JumpStart 4. PartyRock I've been using AWS for around 15 years (honestly, I don't remember well), and their Developer Center is a gold mine. If you open their Developer Center, you'll find a new learning path, "Generative AI for Developers." I'm linking to it below. This is not just a course. This is a collection of courses, examples, videos, tutorials, and code walkthroughs. They will teach you how to use Generative AI on AWS using the four services above. ↑ That right there is a huge selling point: These classes aren't just theoretical. You'll have a chance to learn using the same professional tools everyone else uses. By the way, there are many more resources in the Developer Center: • Machine Learning • Data Operations • DevOps All of these are free. Click, click, and start learning right away. One more thing before I forget: If you are building anything with AWS, check out Amazon Q, their coding assistant. This is the best coding assistant for AWS-related work, and it's not even close. It's a Visual Studio Code extension. Install it, and it works like any other code assistant, except this one knows a lot about AWS. Thanks to AWS for sponsoring a post about their free courses and learning resources. There's a special place in Developer Heaven for you.

Santiago

22,104 次观看 • 1 年前

New short course: Build Long-Context AI Apps with Jamba. Learn about state space models (SSMs), which have emerged as an alternative to transformers! Specifically, Jamba is a hybrid transformer-Mamba architecture that combines strengths of the transformer with ideas from SSMs. This course is built with AI21 Labs and taught by Chen Wang and Chen Almagor. The transformer architecture is computationally expensive when handling very long input contexts. But there's an alternative called Mamba, a selective state space model that can process very long contexts with a much lower computational cost. However, researchers found that the pure Mamba architecture underperforms in understanding the context, and gives lower-quality responses. To overcome this, AI21 developed the Jamba model, which combines Mamba's computational efficiency with the transformer's attention mechanism to help with the output quality. In this course, you’ll learn about how state space models, and Jamba, work. You’ll also learn how to prompt Jamba, use it to process long documents, and build long-context RAG apps. - Learn how Jamba combines transformer and state space model architectures to achieve high performance and quality - Use the AI21 SDK, with an example of prompting over a large 200k-token annual financial report of Nvidia - Use Jamba for tool-calling, with hands-on examples from calling simple arithmetic calculations to a function that returns quarterly company financial reports. - Learn how training for long context is done, and the metrics used for its evaluation - Create a RAG app using the AI21 Conversational RAG tool and build your own RAG pipeline that uses Jamba and LangChain. By the end of this course, you'll learn how to build applications that can handle context as long as an entire book. Please sign up here:

New short course: Build Long-Context AI Apps with Jamba. Learn about state space models (SSMs), which have emerged as an alternative to transformers! Specifically, Jamba is a hybrid transformer-Mamba architecture that combines strengths of the transformer with ideas from SSMs. This course is built with AI21 Labs and taught by Chen Wang and Chen Almagor. The transformer architecture is computationally expensive when handling very long input contexts. But there's an alternative called Mamba, a selective state space model that can process very long contexts with a much lower computational cost. However, researchers found that the pure Mamba architecture underperforms in understanding the context, and gives lower-quality responses. To overcome this, AI21 developed the Jamba model, which combines Mamba's computational efficiency with the transformer's attention mechanism to help with the output quality. In this course, you’ll learn about how state space models, and Jamba, work. You’ll also learn how to prompt Jamba, use it to process long documents, and build long-context RAG apps. - Learn how Jamba combines transformer and state space model architectures to achieve high performance and quality - Use the AI21 SDK, with an example of prompting over a large 200k-token annual financial report of Nvidia - Use Jamba for tool-calling, with hands-on examples from calling simple arithmetic calculations to a function that returns quarterly company financial reports. - Learn how training for long context is done, and the metrics used for its evaluation - Create a RAG app using the AI21 Conversational RAG tool and build your own RAG pipeline that uses Jamba and LangChain. By the end of this course, you'll learn how to build applications that can handle context as long as an entire book. Please sign up here:

Andrew Ng

77,792 次观看 • 1 年前

LLMs can make sense of retrieved context because of how transformers work. In one of the lessons from the Retrieval Augmented Generation (RAG) course, we unpack how LLMs process augmented prompts using token embeddings, positional vectors, and multi-head attention. Understanding these internals helps you design more reliable and efficient RAG systems. Watch the breakdown and keep learning how to build production-ready RAG systems in this course, taught by Zain:

LLMs can make sense of retrieved context because of how transformers work. In one of the lessons from the Retrieval Augmented Generation (RAG) course, we unpack how LLMs process augmented prompts using token embeddings, positional vectors, and multi-head attention. Understanding these internals helps you design more reliable and efficient RAG systems. Watch the breakdown and keep learning how to build production-ready RAG systems in this course, taught by Zain:

DeepLearning.AI

11,500 次观看 • 11 个月前

Custom instructions provide more context about your specific coding preferences and tech stack. Better context = better results from the LLM. And custom instructions are extra helpful for remote development, where you can provide more info about the type of remote environment you're connected to.

Custom instructions provide more context about your specific coding preferences and tech stack. Better context = better results from the LLM. And custom instructions are extra helpful for remote development, where you can provide more info about the type of remote environment you're connected to.

Visual Studio Code

44,868 次观看 • 1 年前

AI engineers at top labs earn $500K+ a year to build agentic AI systems. Stanford just dropped a 90 min lecture that covers the entire playbook. For FREE. Prompting. Chains. RAG. Multi-agent systems. All of it. Worth more than any "AI agent mastery" course. Bookmark it:

AI engineers at top labs earn $500K+ a year to build agentic AI systems. Stanford just dropped a 90 min lecture that covers the entire playbook. For FREE. Prompting. Chains. RAG. Multi-agent systems. All of it. Worth more than any "AI agent mastery" course. Bookmark it:

Jaynit Makwana

44,956 次观看 • 2 个月前

Introducing the context course: a free course on doing ML with agent context. You will learn how to train models, optimize inferences, and build datasets, all by defining harness context with`SKILLS.md`, Plugins, MCP, Subagents, and Hooks. The course includes: - Weekly live AMA on YouTube - Weekly practical projects for ML with context - Instructions in Pi, Codex, Claude, and Opencode - Tutorials and guides on fundamentals - Interactive Quizzes Learn to give AI agents the right knowledge, tools, and structure to actually get work done. Skills, MCP servers, plugins, multi-agent workflows, and building an agent from scratch. Join here:

Introducing the context course: a free course on doing ML with agent context. You will learn how to train models, optimize inferences, and build datasets, all by defining harness context with`SKILLS.md`, Plugins, MCP, Subagents, and Hooks. The course includes: - Weekly live AMA on YouTube - Weekly practical projects for ML with context - Instructions in Pi, Codex, Claude, and Opencode - Tutorials and guides on fundamentals - Interactive Quizzes Learn to give AI agents the right knowledge, tools, and structure to actually get work done. Skills, MCP servers, plugins, multi-agent workflows, and building an agent from scratch. Join here:

Ben Burtenshaw

16,905 次观看 • 1 个月前

This was literally less than a grandma ago and millions of Americans willfully and knowingly voted for a sequel. The context was always there.

This was literally less than a grandma ago and millions of Americans willfully and knowingly voted for a sequel. The context was always there.

✭ 🅑🅤🅑🅑🅐 ✭

16,624 次观看 • 4 个月前

x402 repo a better way to navigate the repo to find what you're looking for, ask questions and learn about x402

x402 repo a better way to navigate the repo to find what you're looking for, ask questions and learn about x402

Ash

10,335 次观看 • 5 个月前