正在加载视频...

视频加载失败

I recorded a gentle introduction to building RAG applications using open-source models. ​ Prerequisite: You should be comfortable reading Python. ​ I wanted to record a video to introduce as many people as possible to building RAG applications and using Large Language Models. Here, I tried to stay away...

69,223 次观看 • 1 年前 •via X (Twitter)

10 条评论

Think Pythonic 的头像
Think Pythonic1 年前

RAG (Retrieve, Augment, Generate) combine two AI techniques: **retrieval** of relevant information (ie database or doc) and **generation** of new text (from models like GPT). Essentially, it retrieves data first, then uses it to generate a more informed and accurate response.

Louis Polart 的头像
Louis Polart1 年前

Awsome, thank you. Will watch it !

The Monk Dev 的头像
The Monk Dev1 年前

What's a RAG? Anyone?

CTS Tech 的头像
CTS Tech1 年前

Thnx for all the work u do 👏🙏

Marcelo Russo | Web Design & Webflow Expert 的头像
Marcelo Russo | Web Design & Webflow Expert1 年前

Crisp image and on point background ❤️🔥 fantastic! ❤️🔥🔥

Ben 的头像
Ben1 年前

Have you tried LangGraph yet, or similar tools for RAG evaluation, using secondary LLMs to review the responses? I’d be interested to learn your take on these.

AI Tiger 的头像
AI Tiger1 年前

Thanks so much. You are on the top

zer0nerd 的头像
zer0nerd1 年前

@svpino thanks for the knowledge!

Tim 的头像
Tim1 年前

@Memdotai mem it

Mem 的头像
Mem1 年前

@svpino Saved! Here's the compiled thread:

相关视频

99% of AI applications are cool-looking demos. Impressive, but don't get fooled by the hype. It takes a lot to build enterprise-grade products that deliver real value. I have at least three weekly conversations with companies that want to use a Large Language Model with their data. The demand is huge! Here is one idea about what you can do to help. The use cases that most of these companies want to solve are similar: They have an extensive knowledge base and want to build a simple application that uses that information to answer questions. In other words, they need help building Retrieval Augmented Generation (RAG) applications they can use in many different scenarios: 1. To train new employees 2. To help their support team 3. To search old meetings and documents 4. To help with their research However, building these systems is not straightforward. Yes, there's a lot of information online, but there aren't enough people who know how to create solutions that work. Here is the idea: Today, you can build an enterprise-grade RAG application without writing code. A couple of MIT PhDs with 10+ years of experience building AI applications created . It's a no-code platform for building applications using Large Language Models. They are partnering with me on this post. You can use Stack AI to create, test, and deploy an end-to-end production-ready AI system. It's SOC-2, HIPAA, and GDPR compliant and offers SSO, role management, access control, and on-premise deployments. Of course, you can use the platform with any LLM on the market now. It's the whole nine yards for building AI applications. Check them out here: 2023 was about models. 2024 is about the tools using these models to build production-ready applications. That's where I'd start.

Santiago

197,675 次观看 • 2 年前

Announcing a new Coursera course: Retrieval Augmented Generation (RAG) You'll learn to build high performance, production-ready RAG systems in this hands-on, in-depth course created by and taught by Zain, experienced AI and ML engineer, researcher, and educator. RAG is a critical component today of many LLM-based applications in customer support, internal company Q&A systems, even many of the leading chatbots that use web search to answer your questions. This course teaches you in-depth how to make RAG work well. LLMs can produce generic or outdated responses, especially when asked specialized questions not covered in its training data. RAG is the most widely used technique for addressing this. It brings in data from new data sources, such as internal documents or recent news, to give the LLM the relevant context to private, recent, or specialized information. This lets it generate more grounded and accurate responses. In this course, you’ll learn to design and implement every part of a RAG system, from retrievers to vector databases to generation to evals. You’ll learn about the fundamental principles behind RAG and how to optimize it at both the component and whole-system levels. As AI evolves, RAG is evolving too. New models can handle longer context windows, reason more effectively, and can be parts of complex agentic workflows. One exciting growth area is Agentic RAG, in which an AI agent at runtime (rather than it being hardcoded at development time) autonomously decides what data to retrieve, and when/how to go deeper. Even with this evolution, access to high-quality data at runtime is essential, which is why RAG is a key part of so many applications. You'll learn via hands-on experiences to: - Build a RAG system with retrieval and prompt augmentation - Compare retrieval methods like BM25, semantic search, and Reciprocal Rank Fusion - Chunk, index, and retrieve documents using a Weaviate vector database and a news dataset - Develop a chatbot, using open-source LLMs hosted by Together AI, for a fictional store that answers product and FAQ questions - Use evals to drive improving reliability, and incorporate multi-modal data RAG is an important foundational technique. Become good at it through this course! Please sign up here:

Andrew Ng

124,314 次观看 • 10 个月前