正在加载视频...

视频加载失败

加载此视频时出现问题。这可能是由于临时网络问题，或视频可能不可用。

I recorded a gentle introduction to building RAG applications using open-source models. Prerequisite: You should be comfortable reading Python. I wanted to record a video to introduce as many people as possible to building RAG applications and using Large Language Models. Here, I tried to stay away... from complex jargon and break everything down as simple as possible. Some of the topics I cover here: • Fundamental components of a RAG application • How to use Llama 3.1 running locally • How to use Lightning AI Studio's dev platform and GPUs • How to orchestrate everything using Langchain It's a long video with lots of detailed explanations. Here is a link to the video (what you see attached is just a trailer): You'll find a link to all of the code in the video description.show more

Santiago

416,778 subscribers

69,223 次观看 • 1 年前 •via X (Twitter)

科学技术健康养生教育

Anya Rossi• Live Now

Private livecam show

10 条评论

Think Pythonic 的头像

Think Pythonic1 年前

RAG (Retrieve, Augment, Generate) combine two AI techniques: **retrieval** of relevant information (ie database or doc) and **generation** of new text (from models like GPT). Essentially, it retrieves data first, then uses it to generate a more informed and accurate response.

Louis Polart 的头像

Louis Polart1 年前

Awsome, thank you. Will watch it !

The Monk Dev 的头像

The Monk Dev1 年前

What's a RAG? Anyone?

CTS Tech 的头像

CTS Tech1 年前

Thnx for all the work u do 👏🙏

Marcelo Russo | Web Design & Webflow Expert 的头像

Marcelo Russo | Web Design & Webflow Expert1 年前

Crisp image and on point background ❤️🔥 fantastic! ❤️🔥🔥

Ben 的头像

Ben1 年前

Have you tried LangGraph yet, or similar tools for RAG evaluation, using secondary LLMs to review the responses? I’d be interested to learn your take on these.

AI Tiger 的头像

AI Tiger1 年前

Thanks so much. You are on the top

zer0nerd 的头像

zer0nerd1 年前

@svpino thanks for the knowledge!

Tim 的头像

Tim1 年前

@Memdotai mem it

Mem 的头像

Mem1 年前

@svpino Saved! Here's the compiled thread:

相关视频

I recorded a step-by-step tutorial about how to test RAG applications. It's 50 minutes, and I tried to make it as simple to understand as possible. For those of you who find these long tutorials helpful, what should I cover next?

I recorded a step-by-step tutorial about how to test RAG applications. It's 50 minutes, and I tried to make it as simple to understand as possible. For those of you who find these long tutorials helpful, what should I cover next?

Santiago

77,204 次观看 • 2 年前

Here is a 50-minute tutorial for those who want to build their first RAG application using open-source models. This is very beginner-friendly. As long as you are comfortable with Python, you should be able to get a ton from this.

Here is a 50-minute tutorial for those who want to build their first RAG application using open-source models. This is very beginner-friendly. As long as you are comfortable with Python, you should be able to get a ton from this.

Santiago

143,563 次观看 • 1 年前

I recorded a full tutorial on how to build a RAG application using open-source models. It's almost 1 hour. Step by step. I explained everything as if you were 5 years old.

I recorded a full tutorial on how to build a RAG application using open-source models. It's almost 1 hour. Step by step. I explained everything as if you were 5 years old.

Santiago

555,413 次观看 • 2 年前

I built an entire RAG application without writing any code. This is one of the easiest ways to start building AI applications. I used Langflow. It's open-source. It's a visual interface for building and deploying AI applications. I built a couple of workflows to show you how it works, and I'm impressed! Check the video I recorded. I built two workflows: 1. The first one loads data into a vector store database (Astra DB). 2. The second asks questions from that data using OpenAI's models. Langchain does the heavy lifting behind the scenes, but having a visual interface makes this 10 times easier. Here is Langflow's GitHub repository: In the video, I used their pre-release version. The development experience for building AI applications is improving tremendously! What a time to be a builder!

I built an entire RAG application without writing any code. This is one of the easiest ways to start building AI applications. I used Langflow. It's open-source. It's a visual interface for building and deploying AI applications. I built a couple of workflows to show you how it works, and I'm impressed! Check the video I recorded. I built two workflows: 1. The first one loads data into a vector store database (Astra DB). 2. The second asks questions from that data using OpenAI's models. Langchain does the heavy lifting behind the scenes, but having a visual interface makes this 10 times easier. Here is Langflow's GitHub repository: In the video, I used their pre-release version. The development experience for building AI applications is improving tremendously! What a time to be a builder!

Santiago

263,053 次观看 • 2 年前

99% of AI applications are cool-looking demos. Impressive, but don't get fooled by the hype. It takes a lot to build enterprise-grade products that deliver real value. I have at least three weekly conversations with companies that want to use a Large Language Model with their data. The demand is huge! Here is one idea about what you can do to help. The use cases that most of these companies want to solve are similar: They have an extensive knowledge base and want to build a simple application that uses that information to answer questions. In other words, they need help building Retrieval Augmented Generation (RAG) applications they can use in many different scenarios: 1. To train new employees 2. To help their support team 3. To search old meetings and documents 4. To help with their research However, building these systems is not straightforward. Yes, there's a lot of information online, but there aren't enough people who know how to create solutions that work. Here is the idea: Today, you can build an enterprise-grade RAG application without writing code. A couple of MIT PhDs with 10+ years of experience building AI applications created . It's a no-code platform for building applications using Large Language Models. They are partnering with me on this post. You can use Stack AI to create, test, and deploy an end-to-end production-ready AI system. It's SOC-2, HIPAA, and GDPR compliant and offers SSO, role management, access control, and on-premise deployments. Of course, you can use the platform with any LLM on the market now. It's the whole nine yards for building AI applications. Check them out here: 2023 was about models. 2024 is about the tools using these models to build production-ready applications. That's where I'd start.

99% of AI applications are cool-looking demos. Impressive, but don't get fooled by the hype. It takes a lot to build enterprise-grade products that deliver real value. I have at least three weekly conversations with companies that want to use a Large Language Model with their data. The demand is huge! Here is one idea about what you can do to help. The use cases that most of these companies want to solve are similar: They have an extensive knowledge base and want to build a simple application that uses that information to answer questions. In other words, they need help building Retrieval Augmented Generation (RAG) applications they can use in many different scenarios: 1. To train new employees 2. To help their support team 3. To search old meetings and documents 4. To help with their research However, building these systems is not straightforward. Yes, there's a lot of information online, but there aren't enough people who know how to create solutions that work. Here is the idea: Today, you can build an enterprise-grade RAG application without writing code. A couple of MIT PhDs with 10+ years of experience building AI applications created . It's a no-code platform for building applications using Large Language Models. They are partnering with me on this post. You can use Stack AI to create, test, and deploy an end-to-end production-ready AI system. It's SOC-2, HIPAA, and GDPR compliant and offers SSO, role management, access control, and on-premise deployments. Of course, you can use the platform with any LLM on the market now. It's the whole nine yards for building AI applications. Check them out here: 2023 was about models. 2024 is about the tools using these models to build production-ready applications. That's where I'd start.

Santiago

197,675 次观看 • 2 年前

The future of AI is open-source. And ollama is the easiest way to build AI applications with open-source LLMs. Here's how to build a free, private RAG app using open-source tools. We'll use: - Ollama for LLMs and embedding models - PostgreSQL for data storage and retrieval - pgai Vectorizer for embedding creation and sync (I use Nomic for embeddings and tinnyllama as my LLM but you can substitute them for any models on Ollama)

The future of AI is open-source. And ollama is the easiest way to build AI applications with open-source LLMs. Here's how to build a free, private RAG app using open-source tools. We'll use: - Ollama for LLMs and embedding models - PostgreSQL for data storage and retrieval - pgai Vectorizer for embedding creation and sync (I use Nomic for embeddings and tinnyllama as my LLM but you can substitute them for any models on Ollama)

Avthar

34,261 次观看 • 1 年前

You can now try Llama 3.1 405B for free (link below)! This is the largest open-source model out there, and for the first time, an open model is competitive with closed models. This time around, Meta did something new: Llama 3.1 has a license that allows developers to use it to enhance other models. For the first time, you can distill Llama 3.1 405B's capabilities into a smaller, more practical model for your use case. First, here is the link where you can play with Llama 3.1 for free: The model is hosted in Tune Studio, an end-to-end platform for developing applications using Large Language Models. They are sponsoring this post. Take a look at the attached video. It will show you how you can fine-tune a simple model using Llama 3.1 without leaving the platform: 1. You can create an empty dataset 2. Use the playground to generate and record interactions with Llama 3.1 3. Modify the dataset directly using the playground 4. Export the data and fine-tune a smaller model Fast and easy! As long as you have a web browser, you can start experimenting with fine-tuning and Llama 3.1. That's all it takes!

You can now try Llama 3.1 405B for free (link below)! This is the largest open-source model out there, and for the first time, an open model is competitive with closed models. This time around, Meta did something new: Llama 3.1 has a license that allows developers to use it to enhance other models. For the first time, you can distill Llama 3.1 405B's capabilities into a smaller, more practical model for your use case. First, here is the link where you can play with Llama 3.1 for free: The model is hosted in Tune Studio, an end-to-end platform for developing applications using Large Language Models. They are sponsoring this post. Take a look at the attached video. It will show you how you can fine-tune a simple model using Llama 3.1 without leaving the platform: 1. You can create an empty dataset 2. Use the playground to generate and record interactions with Llama 3.1 3. Modify the dataset directly using the playground 4. Export the data and fine-tune a smaller model Fast and easy! As long as you have a web browser, you can start experimenting with fine-tuning and Llama 3.1. That's all it takes!

Santiago

55,609 次观看 • 2 年前

I've made $4.7M with AI. Today, I recorded myself building an entire OpenClaw🦞 business in an hour. You'll learn how to: • Find a winning niche using AI • Build an offer people actually want to pay for • To get your first clients • To use AI to productize and fulfill the entire service • The exact prompts I use to orchestrate all of it After building the #1 Lovable agency. This is the entire playbook behind AI native businesses. The full video is live on YouTube. Comment "Playbook" and I'll DM you the link.

I've made $4.7M with AI. Today, I recorded myself building an entire OpenClaw🦞 business in an hour. You'll learn how to: • Find a winning niche using AI • Build an offer people actually want to pay for • To get your first clients • To use AI to productize and fulfill the entire service • The exact prompts I use to orchestrate all of it After building the #1 Lovable agency. This is the entire playbook behind AI native businesses. The full video is live on YouTube. Comment "Playbook" and I'll DM you the link.

Jacob Klug

124,166 次观看 • 4 个月前

Here's how I make rags out of all my data with LOCAL MODELS ONLY GLM-4.7-Flash turns all my anthropic chats to a rag, and makes a skill to use it from anywhere. This video has the whole process. excuse the annoying watermark, I wanted to speed the video up

Here's how I make rags out of all my data with LOCAL MODELS ONLY GLM-4.7-Flash turns all my anthropic chats to a rag, and makes a skill to use it from anywhere. This video has the whole process. excuse the annoying watermark, I wanted to speed the video up

0xSero

20,952 次观看 • 6 个月前

108 workflow templates you can use to build AI applications without writing any code. You can use these templates with n8n. I recorded the attached video to show you how it works. n8n is the workhorse behind an open-source, self-hosted AI starter kit you can install on your computer. They are sponsoring this post. Here is the link to the starter kit repository: And here is the spreadsheet with the 108 templates: Whatever idea you have, search for something similar in the list of templates, and you'll save a ton of time. Lately, I've talked to many non-coders who want to start using AI more seriously to build things. n8n is perfect for that.

108 workflow templates you can use to build AI applications without writing any code. You can use these templates with n8n. I recorded the attached video to show you how it works. n8n is the workhorse behind an open-source, self-hosted AI starter kit you can install on your computer. They are sponsoring this post. Here is the link to the starter kit repository: And here is the spreadsheet with the 108 templates: Whatever idea you have, search for something similar in the list of templates, and you'll save a ton of time. Lately, I've talked to many non-coders who want to start using AI more seriously to build things. n8n is perfect for that.

Santiago

78,133 次观看 • 1 年前

If you could only learn one thing that will be relevant for the next 10-20 years, focus on learning how to deal with data. The future is not about faster hardware, smarter algorithms, or better ideas. The future is about DATA, and those who know how to deal with it will stay relevant much longer than anyone else. I recorded a video to show you how easy it is to get started. In the video, I'm using Kestra. For a long time, I was a fan of AirFlow. Then, I moved to AWS Step Functions. Today, I only use Kestra. Kestra is open-source (repo link below) and kind enough to sponsor my work. The video will show you how easy it is to do the following: 1. Run Kestra locally (literally, one command) 2. Build a simple flow 3. Run Python scripts as part of your flow 4. Connect to HuggingFace models If you have never built a data pipeline, open Kestra's Quick Start Guide and follow their examples. (I think it will take you one weekend to feel comfortable with the application and build the courage you need to get into more serious work.)

If you could only learn one thing that will be relevant for the next 10-20 years, focus on learning how to deal with data. The future is not about faster hardware, smarter algorithms, or better ideas. The future is about DATA, and those who know how to deal with it will stay relevant much longer than anyone else. I recorded a video to show you how easy it is to get started. In the video, I'm using Kestra. For a long time, I was a fan of AirFlow. Then, I moved to AWS Step Functions. Today, I only use Kestra. Kestra is open-source (repo link below) and kind enough to sponsor my work. The video will show you how easy it is to do the following: 1. Run Kestra locally (literally, one command) 2. Build a simple flow 3. Run Python scripts as part of your flow 4. Connect to HuggingFace models If you have never built a data pipeline, open Kestra's Quick Start Guide and follow their examples. (I think it will take you one weekend to feel comfortable with the application and build the courage you need to get into more serious work.)

Santiago

51,012 次观看 • 1 年前

Knowledge graphs for representing information are unbeatable. After this, you will never build a RAG system without knowledge graphs. It will take you five lines of code to build a knowledge graph with your data. I recorded a video to show you how you can do this. I used Cognee, an open-source library that outperforms any basic vector search approach in terms of retrieval relevance. They are collaborating with me on this post. Cognee is: • Easy to use • Reduces hallucinations • Open-source Here is a link to the repository: They also offer a comprehensive platform and UI with Python notebooks you can utilize to manage your data. Here is the link:

Knowledge graphs for representing information are unbeatable. After this, you will never build a RAG system without knowledge graphs. It will take you five lines of code to build a knowledge graph with your data. I recorded a video to show you how you can do this. I used Cognee, an open-source library that outperforms any basic vector search approach in terms of retrieval relevance. They are collaborating with me on this post. Cognee is: • Easy to use • Reduces hallucinations • Open-source Here is a link to the repository: They also offer a comprehensive platform and UI with Python notebooks you can utilize to manage your data. Here is the link:

Santiago

125,928 次观看 • 10 个月前

Run state-of-the-art RAG applications locally on your computer with ollama and use all the fantastic open-source models like llama3, msk's awesome models, or Command R from Cohere With Verba 1.0, we put it all in your hands 🙌 Get on board for a wild open-source ride, we're bridging any moat as open-source is here to win

Run state-of-the-art RAG applications locally on your computer with ollama and use all the fantastic open-source models like llama3, msk's awesome models, or Command R from Cohere With Verba 1.0, we put it all in your hands 🙌 Get on board for a wild open-source ride, we're bridging any moat as open-source is here to win

Philip Vollet

41,450 次观看 • 2 年前

Let's build a dashboard to evaluate and monitor your Agentic and RAG apps! . . In this video, I'll guide you through creating an evaluation and observability pipeline for your AI apps using a 100% open-source tool! Tech Stack: - Comet's Opik to eval and monitor - LlamaIndex to build a RAG pipeline - Ragas for synthetic datagen You'll learn: - Setting up Opik - Building a RAG pipeline - Creating an eval dataset - Evaluating the RAG pipeline - Monitoring all activities during the process It's a hands on demo with code and step-by-step guide to do everything listed above. CometML's Opik is fully open-source, offering the most Pythonic and easiest way to monitor LLM apps. I have shared link to their repo in next tweet!

Let's build a dashboard to evaluate and monitor your Agentic and RAG apps! . . In this video, I'll guide you through creating an evaluation and observability pipeline for your AI apps using a 100% open-source tool! Tech Stack: - Comet's Opik to eval and monitor - LlamaIndex to build a RAG pipeline - Ragas for synthetic datagen You'll learn: - Setting up Opik - Building a RAG pipeline - Creating an eval dataset - Evaluating the RAG pipeline - Monitoring all activities during the process It's a hands on demo with code and step-by-step guide to do everything listed above. CometML's Opik is fully open-source, offering the most Pythonic and easiest way to monitor LLM apps. I have shared link to their repo in next tweet!

Akshay 🚀

20,058 次观看 • 1 年前

Tune Studio is an end-to-end platform for developing applications using Large Language Models. So far, I haven't seen any other platform like this one. You can do everything here: 1. You can curate your data. 2. Use the playground to play with different models and try your ideas. 3. Fine-tune an open-source model on your data. 4. Deploy the model when you are done. This is awesome for anyone building generative AI applications. You can use Tune Studio to work with any of the open-source models out there. They were one of the few companies to host Llama 2 and Llama 3 before anyone else. Here is a link to check it out: One of their main selling points is that Tune Studio scales! You don't have to worry about serving your model to lots of users. They also have built-in user management, authentication, on-prem support, user context management, and pretty much everything you need to build generative AI applications. Thanks to the Tune team for collaborating with me on this post. We are living through the best years of development tools for AI developers. The field is unstoppable.

Tune Studio is an end-to-end platform for developing applications using Large Language Models. So far, I haven't seen any other platform like this one. You can do everything here: 1. You can curate your data. 2. Use the playground to play with different models and try your ideas. 3. Fine-tune an open-source model on your data. 4. Deploy the model when you are done. This is awesome for anyone building generative AI applications. You can use Tune Studio to work with any of the open-source models out there. They were one of the few companies to host Llama 2 and Llama 3 before anyone else. Here is a link to check it out: One of their main selling points is that Tune Studio scales! You don't have to worry about serving your model to lots of users. They also have built-in user management, authentication, on-prem support, user context management, and pretty much everything you need to build generative AI applications. Thanks to the Tune team for collaborating with me on this post. We are living through the best years of development tools for AI developers. The field is unstoppable.

Santiago

39,101 次观看 • 2 年前

I recorded a video tutorial on building an AI Code Assistant. Going through the full stack - from data models to the front end. Using Postgres, pg_vector, LangChain, OpenAI, and NextJS.

I recorded a video tutorial on building an AI Code Assistant. Going through the full stack - from data models to the front end. Using Postgres, pg_vector, LangChain, OpenAI, and NextJS.

Gwen (Chen) Shapira

30,463 次观看 • 1 年前

Announcing a new Coursera course: Retrieval Augmented Generation (RAG) You'll learn to build high performance, production-ready RAG systems in this hands-on, in-depth course created by and taught by , experienced AI and ML engineer, researcher, and educator. RAG is a critical component today of many LLM-based applications in customer support, internal company Q&A systems, even many of the leading chatbots that use web search to answer your questions. This course teaches you in-depth how to make RAG work well. LLMs can produce generic or outdated responses, especially when asked specialized questions not covered in its training data. RAG is the most widely used technique for addressing this. It brings in data from new data sources, such as internal documents or recent news, to give the LLM the relevant context to private, recent, or specialized information. This lets it generate more grounded and accurate responses. In this course, you’ll learn to design and implement every part of a RAG system, from retrievers to vector databases to generation to evals. You’ll learn about the fundamental principles behind RAG and how to optimize it at both the component and whole-system levels. As AI evolves, RAG is evolving too. New models can handle longer context windows, reason more effectively, and can be parts of complex agentic workflows. One exciting growth area is Agentic RAG, in which an AI agent at runtime (rather than it being hardcoded at development time) autonomously decides what data to retrieve, and when/how to go deeper. Even with this evolution, access to high-quality data at runtime is essential, which is why RAG is a key part of so many applications. You'll learn via hands-on experiences to: - Build a RAG system with retrieval and prompt augmentation - Compare retrieval methods like BM25, semantic search, and Reciprocal Rank Fusion - Chunk, index, and retrieve documents using a Weaviate vector database and a news dataset - Develop a chatbot, using open-source LLMs hosted by Together AI, for a fictional store that answers product and FAQ questions - Use evals to drive improving reliability, and incorporate multi-modal data RAG is an important foundational technique. Become good at it through this course! Please sign up here:

Announcing a new Coursera course: Retrieval Augmented Generation (RAG) You'll learn to build high performance, production-ready RAG systems in this hands-on, in-depth course created by and taught by , experienced AI and ML engineer, researcher, and educator. RAG is a critical component today of many LLM-based applications in customer support, internal company Q&A systems, even many of the leading chatbots that use web search to answer your questions. This course teaches you in-depth how to make RAG work well. LLMs can produce generic or outdated responses, especially when asked specialized questions not covered in its training data. RAG is the most widely used technique for addressing this. It brings in data from new data sources, such as internal documents or recent news, to give the LLM the relevant context to private, recent, or specialized information. This lets it generate more grounded and accurate responses. In this course, you’ll learn to design and implement every part of a RAG system, from retrievers to vector databases to generation to evals. You’ll learn about the fundamental principles behind RAG and how to optimize it at both the component and whole-system levels. As AI evolves, RAG is evolving too. New models can handle longer context windows, reason more effectively, and can be parts of complex agentic workflows. One exciting growth area is Agentic RAG, in which an AI agent at runtime (rather than it being hardcoded at development time) autonomously decides what data to retrieve, and when/how to go deeper. Even with this evolution, access to high-quality data at runtime is essential, which is why RAG is a key part of so many applications. You'll learn via hands-on experiences to: - Build a RAG system with retrieval and prompt augmentation - Compare retrieval methods like BM25, semantic search, and Reciprocal Rank Fusion - Chunk, index, and retrieve documents using a Weaviate vector database and a news dataset - Develop a chatbot, using open-source LLMs hosted by Together AI, for a fictional store that answers product and FAQ questions - Use evals to drive improving reliability, and incorporate multi-modal data RAG is an important foundational technique. Become good at it through this course! Please sign up here:

Andrew Ng

124,625 次观看 • 1 年前

Building with AI gets easier every day. Here is an open-source library that makes integrating AI into an application extremely easy: Star the repository! This library alone can make React the best front-end framework out there! There are a bunch of cool things I like about CopilotKit. Here are 3 of them: 1. It allows you to take any -powered agent and bring it into your application. (This is a brand-new feature!) 2. You can build an AI-powered chatbot in your application. The chatbot will have access to your context and can act on the application. 3. You can build a RAG workflow to process and answer questions from a real-time knowledge base. I recorded a video to show you how simple it is to make some of this happen. A few lines of code, and you are in business. Here is a link to the sample application: CopilotKit is open-source. You can self-host it. You can use it with any LLM. Thanks to the team for showing me their tool and collaborating with me on this post!

Santiago

108,835 次观看 • 2 年前