Video wird geladen...

Video konnte nicht geladen werden

Beim Laden dieses Videos ist ein Problem aufgetreten. Dies könnte an einem vorübergehenden Netzwerkproblem liegen oder das Video ist möglicherweise nicht verfügbar.

Lets build `Auto-RAG` where we let the LLM pull the data it needs from different sources. 🔎 The user asks a question. 🤔 LLM decides whether to search its knowledge, memory, internet or make an API call. ✍️ LLM answers with the context. Code:

Ashpreet Bedi

19,959 subscribers

174,580 Aufrufe • vor 2 Jahren •via X (Twitter)

Bildung Wissenschaft & Technologie

Anya Rossi• Live Now

Private livecam show

10 Kommentare

Profilbild von Vexxter

Vexxtervor 2 Jahren

is there any way to run a local quantized LLM via ollama in this?? amazing project btw!

Profilbild von Ashpreet Bedi

Ashpreet Bedivor 2 Jahren

@XPhyxer1 absolutely the Hermes2-llama3 might work well here :)

Profilbild von Jordan A. Metzner

Jordan A. Metznervor 2 Jahren

Just read the read me. Any plans for Groq on Llama 3.

Profilbild von Ashpreet Bedi

Ashpreet Bedivor 2 Jahren

@mrjmetz on it!

Profilbild von Ameriki Singh 🈳

Ameriki Singh 🈳vor 2 Jahren

Would love to see groq and Llmma3 on it

Profilbild von Emma.Ai

Emma.Aivor 2 Jahren

wow, can't wait to try this out

Profilbild von CoinCollector

CoinCollectorvor 2 Jahren

Ashpreet is coooooking

Profilbild von 0xba0e7f9d

0xba0e7f9dvor 2 Jahren

🧑‍🚀this is awesome demo!

Profilbild von Aws Abdo, Ph.D.

Aws Abdo, Ph.D.vor 2 Jahren

This work on automating retrieval and generation tasks is incredibly helpful. Thanks you! #MachineLearning #DataScience

Profilbild von Petamber

Petambervor 2 Jahren

You can also try Brave’s Search API for web search

Ähnliche Videos

Introducing `Personalized Agentic RAG` - where the LLM remembers key details about the user and automatically chooses the tool for RAG. We'll build: 👏 ChatGPT like memory ⚙️ Function calling 🧙 Multi-Agent orchestration code:

Introducing `Personalized Agentic RAG` - where the LLM remembers key details about the user and automatically chooses the tool for RAG. We'll build: 👏 ChatGPT like memory ⚙️ Function calling 🧙 Multi-Agent orchestration code:

Ashpreet Bedi

45,035 Aufrufe • vor 2 Jahren

Our new short course, Knowledge Graphs for RAG, is now available! Knowledge graphs are a data structure that is great at capturing complex relationships between data of multiple types. By enabling more sophisticated retrieval of text than similarity search alone, knowledge graphs can improve the context you pass to the LLM and the performance of your RAG applications. In this course, taught by Andreas Kollegger of Neo4j, you’ll - Explore how knowledge graphs work by building a graph of public financial documents from scratch - Learn to write queries that retrieve text and data from the graph and use it to enhance the context you pass to an LLM chatbot - Combine a knowledge graph with a question-answer chain to build better RAG-powered chat systems Sign up here!

Our new short course, Knowledge Graphs for RAG, is now available! Knowledge graphs are a data structure that is great at capturing complex relationships between data of multiple types. By enabling more sophisticated retrieval of text than similarity search alone, knowledge graphs can improve the context you pass to the LLM and the performance of your RAG applications. In this course, taught by Andreas Kollegger of Neo4j, you’ll - Explore how knowledge graphs work by building a graph of public financial documents from scratch - Learn to write queries that retrieve text and data from the graph and use it to enhance the context you pass to an LLM chatbot - Combine a knowledge graph with a question-answer chain to build better RAG-powered chat systems Sign up here!

Andrew Ng

244,266 Aufrufe • vor 2 Jahren

Looking to use an LLM but concerned about the sensitivity of your data? The Cape API keeps your data private while using the LLM of your choice. Here's how you can get started:

Looking to use an LLM but concerned about the sensitivity of your data? The Cape API keeps your data private while using the LLM of your choice. Here's how you can get started:

The Rundown AI

18,364 Aufrufe • vor 2 Jahren

Build a local LLM app with RAG to chat with Google Drive data using Mistral or Llama-3 running locally on your computer without wiring a single line of Python Code (100% free and without internet):

Build a local LLM app with RAG to chat with Google Drive data using Mistral or Llama-3 running locally on your computer without wiring a single line of Python Code (100% free and without internet):

Shubham Saboo

189,746 Aufrufe • vor 1 Jahr

👨🏻‍💻 LLM Engineer Toolkit - Collection of 120+ LLM Libraries Category Wise LLM Engineer Toolkit repository contains a curated list of 120+ LLM libraries category wise. 🚀 LLM Training 🧱 LLM Application Development 🩸LLM RAG 🟩 LLM Inference 🚧 LLM Serving 📤 LLM Data Extraction 🌠 LLM Data Generation 💎 LLM Agents ⚖️ LLM Evaluation 🔍 LLM Monitoring 📅 LLM Prompts 📝 LLM Structured Outputs 🛑 LLM Safety and Security 💠 LLM Embedding Models ❇️ Others Repo -

👨🏻‍💻 LLM Engineer Toolkit - Collection of 120+ LLM Libraries Category Wise LLM Engineer Toolkit repository contains a curated list of 120+ LLM libraries category wise. 🚀 LLM Training 🧱 LLM Application Development 🩸LLM RAG 🟩 LLM Inference 🚧 LLM Serving 📤 LLM Data Extraction 🌠 LLM Data Generation 💎 LLM Agents ⚖️ LLM Evaluation 🔍 LLM Monitoring 📅 LLM Prompts 📝 LLM Structured Outputs 🛑 LLM Safety and Security 💠 LLM Embedding Models ❇️ Others Repo -

Kalyan KS

16,643 Aufrufe • vor 1 Jahr

Verba is an open source Retrieval Augmented Generation (RAG) application that performs RAG on your own data. To showcase its capabilities, we've customized it as an Airbnb chatbot using Airbnb’s customer documentation. How it works: • Ask any questions, related to your booking, policies, or anything related to your Airbnb experience. • Get relevant, human-like responses: Verba provides natural and informative answers. • Access original sources: One of the standout features of RAG is its ability to directly indicate the sources it used to generate each response. Under the hood, Verba uses a RAG pipeline to deliver these exceptional results. Your query is transformed into a numerical representation (vector) and be used to search through our vector database for the most similar context using Hybrid Search. The most relevant context is then combined with your original question and fed into a powerful large language model (LLM). The LLM will then use all of that information to generate a conversational response. Et voilà! 💫 Try Verba: Verba on GitHub: Learn more in our video:

Verba is an open source Retrieval Augmented Generation (RAG) application that performs RAG on your own data. To showcase its capabilities, we've customized it as an Airbnb chatbot using Airbnb’s customer documentation. How it works: • Ask any questions, related to your booking, policies, or anything related to your Airbnb experience. • Get relevant, human-like responses: Verba provides natural and informative answers. • Access original sources: One of the standout features of RAG is its ability to directly indicate the sources it used to generate each response. Under the hood, Verba uses a RAG pipeline to deliver these exceptional results. Your query is transformed into a numerical representation (vector) and be used to search through our vector database for the most similar context using Hybrid Search. The most relevant context is then combined with your original question and fed into a powerful large language model (LLM). The LLM will then use all of that information to generate a conversational response. Et voilà! 💫 Try Verba: Verba on GitHub: Learn more in our video:

Femke Plantinga

120,565 Aufrufe • vor 1 Jahr

An LLM-controlled robot dog saw us press its shutdown button, and the LLM rewrote the robot’s code so it could stay on. When AI interacts with the physical world, it brings all its capabilities and failure modes with it. 🧵

An LLM-controlled robot dog saw us press its shutdown button, and the LLM rewrote the robot’s code so it could stay on. When AI interacts with the physical world, it brings all its capabilities and failure modes with it. 🧵

Palisade Research

1,377,247 Aufrufe • vor 4 Monaten

Our first Generative AI short course in JavaScript! GitHub recently reported that JavaScript is again the world’s most popular programming language. To support web developers exploring and developing with generative AI, we just launched a new short course in JavaScript taught by Jacob Lee, founding engineer at . In Build LLM Apps with LangChain.js you’ll learn elements common in AI development, including: (i) Using data loaders to pull data from common sources such as PDFs, websites, and databases (ii) Prompts, which are used to provide the LLM context (iii) Modules to support RAG such as text splitters and integrations with vector stores (iv) Working with different models to write applications that are not vendor-specific (v) Parsers, which extract and format the output for your downstream code to process You’ll also build with the LangChain Expression Language, which lets you easily compose sequences (also called chains) of modules to perform complex tasks using LLMs. Putting all this together, you’ll also work on a conversational question-answering LLM application capable of using external data as context. Please sign up here:

Our first Generative AI short course in JavaScript! GitHub recently reported that JavaScript is again the world’s most popular programming language. To support web developers exploring and developing with generative AI, we just launched a new short course in JavaScript taught by Jacob Lee, founding engineer at . In Build LLM Apps with LangChain.js you’ll learn elements common in AI development, including: (i) Using data loaders to pull data from common sources such as PDFs, websites, and databases (ii) Prompts, which are used to provide the LLM context (iii) Modules to support RAG such as text splitters and integrations with vector stores (iv) Working with different models to write applications that are not vendor-specific (v) Parsers, which extract and format the output for your downstream code to process You’ll also build with the LangChain Expression Language, which lets you easily compose sequences (also called chains) of modules to perform complex tasks using LLMs. Putting all this together, you’ll also work on a conversational question-answering LLM application capable of using external data as context. Please sign up here:

Andrew Ng

284,275 Aufrufe • vor 2 Jahren

I'm super excited to launch ⌘ 🥳 ⌘ Langbase – Composable AI developer platform to ship AI features in minutes, not months. Deploy AI Pipes: Hook any LLM to any data, hyper-personalized API AI Memory: Managed search engine API with RAG tools

I'm super excited to launch ⌘ 🥳 ⌘ Langbase – Composable AI developer platform to ship AI features in minutes, not months. Deploy AI Pipes: Hook any LLM to any data, hyper-personalized API AI Memory: Managed search engine API with RAG tools

Ahmad Awais

57,456 Aufrufe • vor 1 Jahr

RouteLLM - Route To The Best LLM Based On Your Prompt Our RouteLLM got an upgrade this week.., it got smarter at picking the right LLM - o1 for complex queries - gpt4o for quick answers - sonnet for coding - deepseek for simple code - gemini for long context - llama and mini for simple question It optimizes for performance, speed and cost!

RouteLLM - Route To The Best LLM Based On Your Prompt Our RouteLLM got an upgrade this week.., it got smarter at picking the right LLM - o1 for complex queries - gpt4o for quick answers - sonnet for coding - deepseek for simple code - gemini for long context - llama and mini for simple question It optimizes for performance, speed and cost!

Bindu Reddy

17,860 Aufrufe • vor 1 Jahr

INTRODUCING Notte Building the agentic internet with the strongest web browser for LLM agents. We transform ANY webpage into structured text, enabling better web understanding and navigation. Plug any LLM to to build your own AI agent

INTRODUCING Notte Building the agentic internet with the strongest web browser for LLM agents. We transform ANY webpage into structured text, enabling better web understanding and navigation. Plug any LLM to to build your own AI agent

Notte

225,211 Aufrufe • vor 1 Jahr

Agents can now search and scrape the web with a single API call. Firecrawl's /search endpoint returns complete LLM-ready content for every search result. HTML, Markdown, or JSON. Works via API, MCP for Cursor, Zapier and n8n.

Agents can now search and scrape the web with a single API call. Firecrawl's /search endpoint returns complete LLM-ready content for every search result. HTML, Markdown, or JSON. Works via API, MCP for Cursor, Zapier and n8n.

Lior Alexander

14,812 Aufrufe • vor 1 Jahr

I built an open source data analyst agent! Upload any CSV, ask a question, and get it answered with statistics or a nice chart. Launching in 24 hours. 100% free & open source. Under the hood, here's how it works: 1. User uploads a CSV and asks a question. 2. The app uses Together Code Interpreter to spin up a VM and uploads the CSV onto it. 3. I have an LLM (Qwen 3 Coder) write code to interpret the CSV using `pandas`, then solve the question the user asked. 4. Together Code Interpreter runs the code in a secure environment & returns a result. These results can be text (some kind of stats analysis) or a chart.

I built an open source data analyst agent! Upload any CSV, ask a question, and get it answered with statistics or a nice chart. Launching in 24 hours. 100% free & open source. Under the hood, here's how it works: 1. User uploads a CSV and asks a question. 2. The app uses Together Code Interpreter to spin up a VM and uploads the CSV onto it. 3. I have an LLM (Qwen 3 Coder) write code to interpret the CSV using `pandas`, then solve the question the user asked. 4. Together Code Interpreter runs the code in a secure environment & returns a result. These results can be text (some kind of stats analysis) or a chart.

Hassan

27,086 Aufrufe • vor 10 Monaten

Let's (ab)use an LLM API with excessive agency to delete Carlos' user account! (sorry Carlos!) Try it yourself here:

Let's (ab)use an LLM API with excessive agency to delete Carlos' user account! (sorry Carlos!) Try it yourself here:

Web Security Academy

12,646 Aufrufe • vor 5 Monaten

Transforming Invoice Data into JSON: Local LLM with LlamaIndex & Pydantic 🚀 Complete video: Code: I explain how to get structured JSON output with LlamaIndex and dynamic Pydantic class. This helps to implement the use case of data extraction from invoice documents. The solution runs on the local machine, thanks to Ollama. I'm using a MacBook Air M1 with 8GB RAM. LlamaIndex 🦙 Pydantic ollama #Python #LLM #RAG

Transforming Invoice Data into JSON: Local LLM with LlamaIndex & Pydantic 🚀 Complete video: Code: I explain how to get structured JSON output with LlamaIndex and dynamic Pydantic class. This helps to implement the use case of data extraction from invoice documents. The solution runs on the local machine, thanks to Ollama. I'm using a MacBook Air M1 with 8GB RAM. LlamaIndex 🦙 Pydantic ollama #Python #LLM #RAG

Andrej Baranovskij

147,949 Aufrufe • vor 2 Jahren

What would it look like to combine Google search (shallow Knowledge Graph reasoning over an ultra-ultra-wide index) with LLM in-context learning (highly intelligent operations on a tiny index)?

What would it look like to combine Google search (shallow Knowledge Graph reasoning over an ultra-ultra-wide index) with LLM in-context learning (highly intelligent operations on a tiny index)?

Dwarkesh Patel

100,546 Aufrufe • vor 1 Jahr

JSX Tool (@jsxtool) is an in-browser IDE for building React UIs. Click on any element in the browser to jump straight to the relevant line of code and edit it yourself or provide precise context to an LLM to make updates.

JSX Tool (@jsxtool) is an in-browser IDE for building React UIs. Click on any element in the browser to jump straight to the relevant line of code and edit it yourself or provide precise context to an LLM to make updates.

Y Combinator

49,731 Aufrufe • vor 7 Monaten

I canceled every LLM subscription after discovering one tool that does it all. Any LLM, one API, auto-picks the best model, cheapest open-source prices. Seriously, it’s a game-changer 🔥

I canceled every LLM subscription after discovering one tool that does it all. Any LLM, one API, auto-picks the best model, cheapest open-source prices. Seriously, it’s a game-changer 🔥

Mujeeb Ahmed

30,401 Aufrufe • vor 10 Monaten

New short course: LLMs as Operating Systems: Agent Memory, created with Letta, and taught by its founders Charles Packer and Sarah Wooders. An LLM's input context window has limited space. Using a longer input context also costs more and results in slower processing. So, managing what's stored in this context window is important. In the innovative paper MemGPT: Towards LLMs as Operating Systems, its authors (which include the instructors) proposed using an LLM agent to manage this context window. Their system uses a large persistent memory that stores everything that could be included in the input context, and an agent decides what is actually included. Take the example of building a chatbot that needs to remember what's been said earlier in a conversation (perhaps over many days of interaction with a user). As the conversation's length grows, the memory management agent will move information from the input context to a persistent searchable database; summarize information to keep relevant facts in the input context; and restore relevant conversation elements from further back in time. This allows a chatbot to keep what's currently most relevant in its input context memory to generate the next response. When I read the original MemGPT paper, I thought it was an innovative technique for handling memory for LLMs. The open-source Letta framework, which we'll use in this course, makes MemGPT easy to implement. It adds memory to your LLM agents and gives them transparent long-term memory. In detail, you’ll learn: - How to build an agent that can edit its own limited input context memory, using tools and multi-step reasoning - What is a memory hierarchy (an idea from computer operating systems, which use a cache to speed up memory access), and how these ideas apply to managing the LLM input context (where the input context window is a "cache" storing the most relevant information; and an agent decides what to move in and out of this to/from a larger persistent storage system) - How to implement multi-agent collaboration by letting different agents share blocks of memory This course will give you a sophisticated understanding of memory management for LLMs, which is important for chatbots having long conversations, and for complex agentic workflows. Please sign up here!

New short course: LLMs as Operating Systems: Agent Memory, created with Letta, and taught by its founders Charles Packer and Sarah Wooders. An LLM's input context window has limited space. Using a longer input context also costs more and results in slower processing. So, managing what's stored in this context window is important. In the innovative paper MemGPT: Towards LLMs as Operating Systems, its authors (which include the instructors) proposed using an LLM agent to manage this context window. Their system uses a large persistent memory that stores everything that could be included in the input context, and an agent decides what is actually included. Take the example of building a chatbot that needs to remember what's been said earlier in a conversation (perhaps over many days of interaction with a user). As the conversation's length grows, the memory management agent will move information from the input context to a persistent searchable database; summarize information to keep relevant facts in the input context; and restore relevant conversation elements from further back in time. This allows a chatbot to keep what's currently most relevant in its input context memory to generate the next response. When I read the original MemGPT paper, I thought it was an innovative technique for handling memory for LLMs. The open-source Letta framework, which we'll use in this course, makes MemGPT easy to implement. It adds memory to your LLM agents and gives them transparent long-term memory. In detail, you’ll learn: - How to build an agent that can edit its own limited input context memory, using tools and multi-step reasoning - What is a memory hierarchy (an idea from computer operating systems, which use a cache to speed up memory access), and how these ideas apply to managing the LLM input context (where the input context window is a "cache" storing the most relevant information; and an agent decides what to move in and out of this to/from a larger persistent storage system) - How to implement multi-agent collaboration by letting different agents share blocks of memory This course will give you a sophisticated understanding of memory management for LLMs, which is important for chatbots having long conversations, and for complex agentic workflows. Please sign up here!

Andrew Ng

200,752 Aufrufe • vor 1 Jahr

New short course on sophisticated RAG (Retrieval Augmented Generation) techniques is out! Taught by Jerry Liu and Anupam Datta of LlamaIndex 🦙 and TruEra , this teaches advanced techniques that help your LLM generate good answers. Topics include: - Sentence-window retrieval, which retrieves not just the most relevant sentence, but a window of sentences around it for higher quality context. - Auto-merging retrieval, which organizes your document into a hierarchical tree structure, where each parent node's text is split among its child nodes. Based on the relevance of the child nodes to a user query, this lets you better decide whether the entire parent node should be provided as context to the LLM. - Evaluation methodology for separately evaluating the quality of the key steps of RAG (context relevance, answer relevance, groundedness) so that you can perform error analysis, identify which part of your pipeline needs work, and tune components systematically. Please check out the course!

New short course on sophisticated RAG (Retrieval Augmented Generation) techniques is out! Taught by Jerry Liu and Anupam Datta of LlamaIndex 🦙 and TruEra , this teaches advanced techniques that help your LLM generate good answers. Topics include: - Sentence-window retrieval, which retrieves not just the most relevant sentence, but a window of sentences around it for higher quality context. - Auto-merging retrieval, which organizes your document into a hierarchical tree structure, where each parent node's text is split among its child nodes. Based on the relevance of the child nodes to a user query, this lets you better decide whether the entire parent node should be provided as context to the LLM. - Evaluation methodology for separately evaluating the quality of the key steps of RAG (context relevance, answer relevance, groundedness) so that you can perform error analysis, identify which part of your pipeline needs work, and tune components systematically. Please check out the course!

Andrew Ng

655,501 Aufrufe • vor 2 Jahren