Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

I recorded a gentle introduction to building RAG applications using open-source models. ​ Prerequisite: You should be comfortable reading Python. ​ I wanted to record a video to introduce as many people as possible to building RAG applications and using Large Language Models. Here, I tried to stay away...

69,223 Aufrufe • vor 1 Jahr •via X (Twitter)

10 Kommentare

Profilbild von Think Pythonic
Think Pythonicvor 1 Jahr

RAG (Retrieve, Augment, Generate) combine two AI techniques: **retrieval** of relevant information (ie database or doc) and **generation** of new text (from models like GPT). Essentially, it retrieves data first, then uses it to generate a more informed and accurate response.

Profilbild von Louis Polart
Louis Polartvor 1 Jahr

Awsome, thank you. Will watch it !

Profilbild von The Monk Dev
The Monk Devvor 1 Jahr

What's a RAG? Anyone?

Profilbild von CTS Tech
CTS Techvor 1 Jahr

Thnx for all the work u do 👏🙏

Profilbild von Marcelo Russo | Web Design & Webflow Expert
Marcelo Russo | Web Design & Webflow Expertvor 1 Jahr

Crisp image and on point background ❤️🔥 fantastic! ❤️🔥🔥

Profilbild von Ben
Benvor 1 Jahr

Have you tried LangGraph yet, or similar tools for RAG evaluation, using secondary LLMs to review the responses? I’d be interested to learn your take on these.

Profilbild von AI Tiger
AI Tigervor 1 Jahr

Thanks so much. You are on the top

Profilbild von zer0nerd
zer0nerdvor 1 Jahr

@svpino thanks for the knowledge!

Profilbild von Tim
Timvor 1 Jahr

@Memdotai mem it

Profilbild von Mem
Memvor 1 Jahr

@svpino Saved! Here's the compiled thread:

Ähnliche Videos

99% of AI applications are cool-looking demos. Impressive, but don't get fooled by the hype. It takes a lot to build enterprise-grade products that deliver real value. I have at least three weekly conversations with companies that want to use a Large Language Model with their data. The demand is huge! Here is one idea about what you can do to help. The use cases that most of these companies want to solve are similar: They have an extensive knowledge base and want to build a simple application that uses that information to answer questions. In other words, they need help building Retrieval Augmented Generation (RAG) applications they can use in many different scenarios: 1. To train new employees 2. To help their support team 3. To search old meetings and documents 4. To help with their research However, building these systems is not straightforward. Yes, there's a lot of information online, but there aren't enough people who know how to create solutions that work. Here is the idea: Today, you can build an enterprise-grade RAG application without writing code. A couple of MIT PhDs with 10+ years of experience building AI applications created . It's a no-code platform for building applications using Large Language Models. They are partnering with me on this post. You can use Stack AI to create, test, and deploy an end-to-end production-ready AI system. It's SOC-2, HIPAA, and GDPR compliant and offers SSO, role management, access control, and on-premise deployments. Of course, you can use the platform with any LLM on the market now. It's the whole nine yards for building AI applications. Check them out here: 2023 was about models. 2024 is about the tools using these models to build production-ready applications. That's where I'd start.

Santiago

197,675 Aufrufe • vor 2 Jahren

Announcing a new Coursera course: Retrieval Augmented Generation (RAG) You'll learn to build high performance, production-ready RAG systems in this hands-on, in-depth course created by and taught by Zain, experienced AI and ML engineer, researcher, and educator. RAG is a critical component today of many LLM-based applications in customer support, internal company Q&A systems, even many of the leading chatbots that use web search to answer your questions. This course teaches you in-depth how to make RAG work well. LLMs can produce generic or outdated responses, especially when asked specialized questions not covered in its training data. RAG is the most widely used technique for addressing this. It brings in data from new data sources, such as internal documents or recent news, to give the LLM the relevant context to private, recent, or specialized information. This lets it generate more grounded and accurate responses. In this course, you’ll learn to design and implement every part of a RAG system, from retrievers to vector databases to generation to evals. You’ll learn about the fundamental principles behind RAG and how to optimize it at both the component and whole-system levels. As AI evolves, RAG is evolving too. New models can handle longer context windows, reason more effectively, and can be parts of complex agentic workflows. One exciting growth area is Agentic RAG, in which an AI agent at runtime (rather than it being hardcoded at development time) autonomously decides what data to retrieve, and when/how to go deeper. Even with this evolution, access to high-quality data at runtime is essential, which is why RAG is a key part of so many applications. You'll learn via hands-on experiences to: - Build a RAG system with retrieval and prompt augmentation - Compare retrieval methods like BM25, semantic search, and Reciprocal Rank Fusion - Chunk, index, and retrieve documents using a Weaviate vector database and a news dataset - Develop a chatbot, using open-source LLMs hosted by Together AI, for a fictional store that answers product and FAQ questions - Use evals to drive improving reliability, and incorporate multi-modal data RAG is an important foundational technique. Become good at it through this course! Please sign up here:

Andrew Ng

124,314 Aufrufe • vor 10 Monaten