Загрузка видео...

Не удалось загрузить видео

Возникла проблема при загрузке этого видео. Это может быть связано с временными проблемами сети или видео может быть недоступно.

На главную

I’ve created a full-stack financial analysis bot that can query both text + embedded tables across multiple SEC 10Ks 📊🤖 It’s made possible with `create-llama` scaffolding + LlamaIndex 🦙 advanced RAG, and I’m sharing the full template below for you to clone! It’s more sophisticated than a basic RAG... setup: 🦾 Unstructured to parse embedded tables into a node graph 🦾 Recursive retriever to retrieve/query embedded tables + text 🦾 An agent to do chain of thought + document comparisons 🦾 Custom callback to stream intermediate function calls to the UI Our goal is to make building full-stack advanced RAG as easy as possible. Created a repo to host this template + future templates here: Want to submit a project? We’d love contributions! 🙌 One caveat: - Right now the index is lazily built/cached during the first query, we’re working to decouple the ingestion process - Check out `create-llama` if you haven’t already:show more

Jerry Liu

79,469 subscribers

131,442 просмотров • 2 лет назад •via X (Twitter)

Anya Rossi• Live Now

Private livecam show

Комментарии: 0

Нет доступных комментариев

Здесь появятся комментарии из оригинального поста

Похожие видео

New JavaScript short course: Build a full-stack web application that uses RAG in JavaScript RAG Web Apps with LlamaIndex, taught by Laurie Voss, VP of Developer Relations at LlamaIndex 🦙 and npm co-founder. - Build a RAG application for querying your own data - Develop tools to interact with multiple data sources using an agent that intelligently selects the right tool for your queries - Create a full-stack web app that can chat with your data - Dig further into production-ready techniques, like how to persist your data so you aren’t constantly reindexing, and try the create-llama command line tool from LlamaIndex You can sign up here:

New JavaScript short course: Build a full-stack web application that uses RAG in JavaScript RAG Web Apps with LlamaIndex, taught by Laurie Voss, VP of Developer Relations at LlamaIndex 🦙 and npm co-founder. - Build a RAG application for querying your own data - Develop tools to interact with multiple data sources using an agent that intelligently selects the right tool for your queries - Create a full-stack web app that can chat with your data - Dig further into production-ready techniques, like how to persist your data so you aren’t constantly reindexing, and try the create-llama command line tool from LlamaIndex You can sign up here:

Andrew Ng

218,284 просмотров • 2 лет назад

An underrated issue with document parsing for RAG / agent use cases is dealing with multi-page tables - sometimes a big table spills over into multiple pages. This breaks chunking algorithms that generally operate at the page-level or smaller, and causes LLMs to lose the full view of the data. With LlamaParse Continuous Mode (in beta), you can now parse a document with multi-page tables and join them into a single table! This means you can now: 💡 Do contiguous chunking for RAG use cases OR 💡 Parse the table for text-to-SQL Check out our blog post highlighting this feature. Huge shoutout to Pierre-Loic Doulcet and Sacha Bron : Signup here: It's in beta, let us know your feedback!

An underrated issue with document parsing for RAG / agent use cases is dealing with multi-page tables - sometimes a big table spills over into multiple pages. This breaks chunking algorithms that generally operate at the page-level or smaller, and causes LLMs to lose the full view of the data. With LlamaParse Continuous Mode (in beta), you can now parse a document with multi-page tables and join them into a single table! This means you can now: 💡 Do contiguous chunking for RAG use cases OR 💡 Parse the table for text-to-SQL Check out our blog post highlighting this feature. Huge shoutout to Pierre-Loic Doulcet and Sacha Bron : Signup here: It's in beta, let us know your feedback!

Jerry Liu

24,245 просмотров • 1 год назад

Our new short course, Knowledge Graphs for RAG, is now available! Knowledge graphs are a data structure that is great at capturing complex relationships between data of multiple types. By enabling more sophisticated retrieval of text than similarity search alone, knowledge graphs can improve the context you pass to the LLM and the performance of your RAG applications. In this course, taught by Andreas Kollegger of Neo4j, you’ll - Explore how knowledge graphs work by building a graph of public financial documents from scratch - Learn to write queries that retrieve text and data from the graph and use it to enhance the context you pass to an LLM chatbot - Combine a knowledge graph with a question-answer chain to build better RAG-powered chat systems Sign up here!

Our new short course, Knowledge Graphs for RAG, is now available! Knowledge graphs are a data structure that is great at capturing complex relationships between data of multiple types. By enabling more sophisticated retrieval of text than similarity search alone, knowledge graphs can improve the context you pass to the LLM and the performance of your RAG applications. In this course, taught by Andreas Kollegger of Neo4j, you’ll - Explore how knowledge graphs work by building a graph of public financial documents from scratch - Learn to write queries that retrieve text and data from the graph and use it to enhance the context you pass to an LLM chatbot - Combine a knowledge graph with a question-answer chain to build better RAG-powered chat systems Sign up here!

Andrew Ng

244,352 просмотров • 2 лет назад

Introducing LlamaCloud 🦙🌤️ Today we’re thrilled to introduce LlamaCloud, a managed service designed to bring production-grade data for your LLM and RAG app. Spend less time data wrangling and more time on application logic. Launching with the following components: 1️⃣ LlamaParse 📑: a proprietary parser designed to be really really good at complex documents with embedded tables. Build advanced RAG over semi-structured PDFs, and ask questions that simply aren’t possible with the naive stack. Available publicly day 1 🔥 2️⃣ Managed Ingestion/Retrieval API ⚙️: An API letting you easily ingest/retrieve data from data sources. Opening up in private beta to select enterprises. We’re excited to be joined by launch users, partners, and collaborators: Mendable @DataStax MongoDB Qdrant NVIDIA + some awesome hackathon projects at the LlamaIndex 🦙 hackathon Check out our FULL blog post on LlamaCloud and LlamaParse: LlamaParse Client Repo: Signup for a LlamaCloud account to use LlamaParse: Interested in the broader LlamaCloud offering? Come talk to us: Also we have a slick new website 🌐:

Introducing LlamaCloud 🦙🌤️ Today we’re thrilled to introduce LlamaCloud, a managed service designed to bring production-grade data for your LLM and RAG app. Spend less time data wrangling and more time on application logic. Launching with the following components: 1️⃣ LlamaParse 📑: a proprietary parser designed to be really really good at complex documents with embedded tables. Build advanced RAG over semi-structured PDFs, and ask questions that simply aren’t possible with the naive stack. Available publicly day 1 🔥 2️⃣ Managed Ingestion/Retrieval API ⚙️: An API letting you easily ingest/retrieve data from data sources. Opening up in private beta to select enterprises. We’re excited to be joined by launch users, partners, and collaborators: Mendable @DataStax MongoDB Qdrant NVIDIA + some awesome hackathon projects at the LlamaIndex 🦙 hackathon Check out our FULL blog post on LlamaCloud and LlamaParse: LlamaParse Client Repo: Signup for a LlamaCloud account to use LlamaParse: Interested in the broader LlamaCloud offering? Come talk to us: Also we have a slick new website 🌐:

LlamaIndex 🦙

141,250 просмотров • 2 лет назад

Traditional data pipelines don't work for RAG applications. There are 3 issues with them: 1. Traditional data engineering solutions are optimized to handle structured data. RAG applications rely primarily on unstructured data. 2. The connector ecosystem to load data from unstructured data sources is very immature. 3. Traditional solutions do not offer any way to transform unstructured data into an optimized vector search index. The goal of a RAG Pipeline is to solve these problems. The number one objective is to create a reliable vector search index using factual knowledge and relevant context. This sounds easy, but it's one of the biggest challenges we face when building RAG applications. At a high level, there are four different stages in the architecture of a RAG pipeline: 1. Ingestion: Here is where the pipeline loads the information from the data source. 2. Extraction: Where the pipeline processes the input data and decides how to retrieve the text contained inside them. 3. Transform: Where the pipeline chunks the data and generates document embeddings. 4. Load: Where the pipeline creates a search index in a vector database and loads the document embeddings. There are different rabbit holes at each one of these stages. Here are three of them: 1. Ingesting data once is simple. The hard part is refreshing the vector database whenever the original data source changes. 2. Extracting the content of a plain text document is simple. The hard part is to extract content from complex documents containing tables, images, or cross-references. 3. A simple continual chunking strategy with an overlap is simple. The hard part is to find the optimal strategy for your specific knowledge base and the way you are planning to query it. In the attached video, I'll show you how you can build an enterprise-grade RAG Pipeline that solves every one of the above problems. I'll use Vectorize. They partnered with me on this post. You can use them to build RAG pipelines optimized for accurate context retrieval. If you have a few documents lying around, set up a free account and give it a try.

Traditional data pipelines don't work for RAG applications. There are 3 issues with them: 1. Traditional data engineering solutions are optimized to handle structured data. RAG applications rely primarily on unstructured data. 2. The connector ecosystem to load data from unstructured data sources is very immature. 3. Traditional solutions do not offer any way to transform unstructured data into an optimized vector search index. The goal of a RAG Pipeline is to solve these problems. The number one objective is to create a reliable vector search index using factual knowledge and relevant context. This sounds easy, but it's one of the biggest challenges we face when building RAG applications. At a high level, there are four different stages in the architecture of a RAG pipeline: 1. Ingestion: Here is where the pipeline loads the information from the data source. 2. Extraction: Where the pipeline processes the input data and decides how to retrieve the text contained inside them. 3. Transform: Where the pipeline chunks the data and generates document embeddings. 4. Load: Where the pipeline creates a search index in a vector database and loads the document embeddings. There are different rabbit holes at each one of these stages. Here are three of them: 1. Ingesting data once is simple. The hard part is refreshing the vector database whenever the original data source changes. 2. Extracting the content of a plain text document is simple. The hard part is to extract content from complex documents containing tables, images, or cross-references. 3. A simple continual chunking strategy with an overlap is simple. The hard part is to find the optimal strategy for your specific knowledge base and the way you are planning to query it. In the attached video, I'll show you how you can build an enterprise-grade RAG Pipeline that solves every one of the above problems. I'll use Vectorize. They partnered with me on this post. You can use them to build RAG pipelines optimized for accurate context retrieval. If you have a few documents lying around, set up a free account and give it a try.

Santiago

40,441 просмотров • 1 год назад

I’m excited to kick off the first of our short courses focused on agents, starting with Building Agentic RAG with LlamaIndex, taught by Jerry Liu, CEO of LlamaIndex 🦙. This covers an important shift in RAG (retrieval augmented generation), in which rather than having the developer write explicit routines to retrieve information to feed into the LLM context, we instead build a RAG agent that that has access to tools for retrieving information. This lets the agent decide what information to fetch, and enables it to answer more complex questions using multi-step reasoning. In detail, you'll learn about: - Routing: Where your agent will use decision-making to route requests to multiple tools. - Tool Use: Where you'll create an interface for agents to select what tool (function call) to use as well as generate the right arguments. - Multi-step reasoning with tool use: Where you'll use an LLM to carry out multiple steps of reasoning, while retaining memory throughout the process. You’ll also learn how to step through what your agent is doing to debug and improve it iteratively. It’s an exciting time to build agents. Sign up and get started here!

I’m excited to kick off the first of our short courses focused on agents, starting with Building Agentic RAG with LlamaIndex, taught by Jerry Liu, CEO of LlamaIndex 🦙. This covers an important shift in RAG (retrieval augmented generation), in which rather than having the developer write explicit routines to retrieve information to feed into the LLM context, we instead build a RAG agent that that has access to tools for retrieving information. This lets the agent decide what information to fetch, and enables it to answer more complex questions using multi-step reasoning. In detail, you'll learn about: - Routing: Where your agent will use decision-making to route requests to multiple tools. - Tool Use: Where you'll create an interface for agents to select what tool (function call) to use as well as generate the right arguments. - Multi-step reasoning with tool use: Where you'll use an LLM to carry out multiple steps of reasoning, while retaining memory throughout the process. You’ll also learn how to step through what your agent is doing to debug and improve it iteratively. It’s an exciting time to build agents. Sign up and get started here!

Andrew Ng

297,131 просмотров • 2 лет назад

We’re excited to officially launch LlamaParse, the first genAI-native document parsing solution. Not only is it better at parsing out images/tables/charts 📊📈 than virtually every other parser, it is now steerable through natural language instructions - output the document in whatever format you desire! It is also the only parsing solution that seamlessly allows you to build accurate RAG over complex documents, free of hallucinations 🔥 We launched it in private preview a few weeks ago and hit 2k users, 1M total PDF pages parsed. And now it’s better than ever. LlamaParse contains the following killer features: ✅ SOTA table/chart extraction ✅ Seamless integration with LlamaIndex 🦙 advanced RAG/agents ✅✨ Natural language Parsing Instructions ✅✨JSON mode and image extraction ✅✨Support for ~10 document types (.pdf, .pptx, .docx, .xml) and more Our pricing is simple: 1k free per day, and additional pages at 0.3c a page, or $3 for 1k pages. If you want advanced document RAG and/or private deployments, come get in touch with us to chat about LlamaCloud. Check out our full blog post here: LlamaParse client repo: Signup at 🦙☁️: Come talk to us:

We’re excited to officially launch LlamaParse, the first genAI-native document parsing solution. Not only is it better at parsing out images/tables/charts 📊📈 than virtually every other parser, it is now steerable through natural language instructions - output the document in whatever format you desire! It is also the only parsing solution that seamlessly allows you to build accurate RAG over complex documents, free of hallucinations 🔥 We launched it in private preview a few weeks ago and hit 2k users, 1M total PDF pages parsed. And now it’s better than ever. LlamaParse contains the following killer features: ✅ SOTA table/chart extraction ✅ Seamless integration with LlamaIndex 🦙 advanced RAG/agents ✅✨ Natural language Parsing Instructions ✅✨JSON mode and image extraction ✅✨Support for ~10 document types (.pdf, .pptx, .docx, .xml) and more Our pricing is simple: 1k free per day, and additional pages at 0.3c a page, or $3 for 1k pages. If you want advanced document RAG and/or private deployments, come get in touch with us to chat about LlamaCloud. Check out our full blog post here: LlamaParse client repo: Signup at 🦙☁️: Come talk to us:

LlamaIndex 🦙

143,136 просмотров • 2 лет назад

Quick demo of Zero’s new background queries. Zero’s sync is query-based. Rather than specifying what data you want using rules or some other separate system, you just use queries. Right inside the client app, you do a query using a full sql-style language. You get filters, subqueries , limits, etc. Zero syncs the data backing these queries to the client. It’s important to realize Zero isn’t really a cache. It’s a replica. It’s eagerly replicating a precise snapshot of a slice of your database - the slice covered by the queries you have open. So there is never stale data in Zero. When you close a query we delete the rows uniquely returned by that query because we can no longer keep them up to date. Of course that kind of sucks for the common case of doing a query, navigating, then pressing “back”. Ideally we want that back nav to be fast. To address this Zero 0.17 adds background queries. You can add a ttl to a query and it will keep running and syncing in the background. This is much different than normal caching because this data stays up to date. If you make the same query again, the results will be *instantly* available *and already up to date with server*. If you make a different query the data from the background query is used client-side to answer the new query instantly if possible. This all happens completely automatically. Just by adding the ttl flag.

Quick demo of Zero’s new background queries. Zero’s sync is query-based. Rather than specifying what data you want using rules or some other separate system, you just use queries. Right inside the client app, you do a query using a full sql-style language. You get filters, subqueries , limits, etc. Zero syncs the data backing these queries to the client. It’s important to realize Zero isn’t really a cache. It’s a replica. It’s eagerly replicating a precise snapshot of a slice of your database - the slice covered by the queries you have open. So there is never stale data in Zero. When you close a query we delete the rows uniquely returned by that query because we can no longer keep them up to date. Of course that kind of sucks for the common case of doing a query, navigating, then pressing “back”. Ideally we want that back nav to be fast. To address this Zero 0.17 adds background queries. You can add a ttl to a query and it will keep running and syncing in the background. This is much different than normal caching because this data stays up to date. If you make the same query again, the results will be instantly available and already up to date with server. If you make a different query the data from the background query is used client-side to answer the new query instantly if possible. This all happens completely automatically. Just by adding the ttl flag.

Aaron Boodman

18,808 просмотров • 1 год назад

ZerePy is now live thanks to Ayoubed (BlormDev) for a walkthrough video For our v1 we wanted to make it seamlessly easy to launch a personalized agent that can post on social platforms. The upcoming updates will be focused to expand agent capabilities, integrations, and start onchain actions. All instructions to use ZerePy are on our repo readme. As we collect feedback we will update instructions for a better experience. Additionally, with the goal to make the experience as seamless as possible, we created a Replit to allow users to be able to launch their agent via a browser instead of managing their dependencies locally. One click ready to deploy: All you need to launch your ZerePy agent is your: - X account api keys - openai/claude keys - Create/configure Agent - Start agent ZerePy is here and we are now excited for the new chapter of the agentic future to begin! Biggest s/o to our cracked devs F🦾 Max Huber @jyu_eth Ayoubed (BlormDev)

ZerePy is now live thanks to Ayoubed (BlormDev) for a walkthrough video For our v1 we wanted to make it seamlessly easy to launch a personalized agent that can post on social platforms. The upcoming updates will be focused to expand agent capabilities, integrations, and start onchain actions. All instructions to use ZerePy are on our repo readme. As we collect feedback we will update instructions for a better experience. Additionally, with the goal to make the experience as seamless as possible, we created a Replit to allow users to be able to launch their agent via a browser instead of managing their dependencies locally. One click ready to deploy: All you need to launch your ZerePy agent is your: - X account api keys - openai/claude keys - Create/configure Agent - Start agent ZerePy is here and we are now excited for the new chapter of the agentic future to begin! Biggest s/o to our cracked devs F🦾 Max Huber @jyu_eth Ayoubed (BlormDev)

Tint Blorm 🫟

61,441 просмотров • 1 год назад

Your assets, arts and collectibles deserve more than just likes. They deserve onchain ownership —-Ownership on Solana. Developers, take a step forward. What’s stopping you from launching your NFTs and collectibles with Metaplex 🦾? Do you know that Metaplex 🦾 is the backbone of Solana digital collectibles, pioneering since its inception in 2021? Do you also know that almost all Fungible and non-fungible tokens you see today was built using metaplex. ==> From building, to minting to marketplace, Metaplex 🦾 is at the forefront of Solana’s NFTs and fungible tokens. —> You’ve got to bring your digital assets onchain for maximum scalability and security. The only way to do it right is with Metaplex 🦾. With its unique features, you as a developer can build secure and scalable Fungible and non-fungible tokens right on Solana. Take a step today, build with Metaplex 🦾. If it’s not with Metaplex 🦾, it is not right! Seeking to learn more about Metaplex 🦾 and how you can start building? —> Learn more with metaplex official links: Twitter: Website: Blog: Guides:

Your assets, arts and collectibles deserve more than just likes. They deserve onchain ownership —-Ownership on Solana. Developers, take a step forward. What’s stopping you from launching your NFTs and collectibles with Metaplex 🦾? Do you know that Metaplex 🦾 is the backbone of Solana digital collectibles, pioneering since its inception in 2021? Do you also know that almost all Fungible and non-fungible tokens you see today was built using metaplex. ==> From building, to minting to marketplace, Metaplex 🦾 is at the forefront of Solana’s NFTs and fungible tokens. —> You’ve got to bring your digital assets onchain for maximum scalability and security. The only way to do it right is with Metaplex 🦾. With its unique features, you as a developer can build secure and scalable Fungible and non-fungible tokens right on Solana. Take a step today, build with Metaplex 🦾. If it’s not with Metaplex 🦾, it is not right! Seeking to learn more about Metaplex 🦾 and how you can start building? —> Learn more with metaplex official links: Twitter: Website: Blog: Guides:

og.blessed | Solana Summit Africa🇳🇬

18,781 просмотров • 1 год назад

3D AI animations are blowing up right now if your still working a job and want to get into that first stage of online money or you already have a brand and need people making content like this for you this is one of the easiest plays right now > it goes super viral > it's exceptionally easy to make > it can be completely automated (we built tools for this) > it converts well i’m looking to connect with as many talented creators as possible, we’re getting more client requests than we can currently handle that's why i created a full step-by-step blueprint on how to create videos like that reply "AI" + RT and i'll send you the full breakdown so you can create those too (must be following so i can DM)

MAX

46,956 просмотров • 3 месяцев назад

Building “RAG 2.0” is just making Claude Code running over your filesystem 🤖🗂️ To make this work well, you need to solve three things 1️⃣ Virtualize your filesystem to prevent the agent from messing stuff up. AgentFS by Turso is a nice example of how you can give the agent access to a copy of all your files without messing up your raw data. 2️⃣ Parse unstructured documents like PDFs, pptx, Word into an LLM-ready format. Agentic OCR solutions like LlamaParse can help here 3️⃣ Creating an agentic loop with human-in-the-loop. If you want to control the agent implementation instead of using Claude Code out of the box, you can use LlamaIndex 🦙 workflows to help orchestrate these long-running agent tasks. Shoutout Clelia Bertelli (🦙/acc), check it out! Blog: Repo:

Building “RAG 2.0” is just making Claude Code running over your filesystem 🤖🗂️ To make this work well, you need to solve three things 1️⃣ Virtualize your filesystem to prevent the agent from messing stuff up. AgentFS by Turso is a nice example of how you can give the agent access to a copy of all your files without messing up your raw data. 2️⃣ Parse unstructured documents like PDFs, pptx, Word into an LLM-ready format. Agentic OCR solutions like LlamaParse can help here 3️⃣ Creating an agentic loop with human-in-the-loop. If you want to control the agent implementation instead of using Claude Code out of the box, you can use LlamaIndex 🦙 workflows to help orchestrate these long-running agent tasks. Shoutout Clelia Bertelli (🦙/acc), check it out! Blog: Repo:

Jerry Liu

55,620 просмотров • 7 месяцев назад

I vibe-coded an AI agent for making presentations 🤖🖼️ - accessible to everyone and open-source! You can upload both your context files, as well as a style template (a file that captures the style of the presentation you want to create). Then you launch a prompt and watch the agent one-shot your presentation. After the presentation is generated, you can edit the text inline, chat with the agent to add/modify/delete slides, and export to Powerpoint or PDF. Built with Claude Code, powered by Claude Agent SDK + LlamaParse. Hosted on Vercel/Render. To use it, you do need to specify your LlamaCloud API key and Anthropic API key (see below). App: Repo: Sign up to LlamaCloud:

I vibe-coded an AI agent for making presentations 🤖🖼️ - accessible to everyone and open-source! You can upload both your context files, as well as a style template (a file that captures the style of the presentation you want to create). Then you launch a prompt and watch the agent one-shot your presentation. After the presentation is generated, you can edit the text inline, chat with the agent to add/modify/delete slides, and export to Powerpoint or PDF. Built with Claude Code, powered by Claude Agent SDK + LlamaParse. Hosted on Vercel/Render. To use it, you do need to specify your LlamaCloud API key and Anthropic API key (see below). App: Repo: Sign up to LlamaCloud:

Jerry Liu

21,073 просмотров • 6 месяцев назад

Introducing FlowMaker 🌊🤖 A fully open-source, low-code way of building custom agent workflows. Build agents via a drag and drop interface, run it directly in the app, and also directly export it into a deeply custom workflow backed by LlamaIndex 🦙.TS. It’s a fantastic visual tool to help you get started with agents but also gives you full customizability to build the deepest, most advanced workflows, since everything is backed by code. There’s a lot of great low-code agent tools already, but this is a fully-open source template that you can use or clone/customize for your own needs. It was a fun project by Laurie Voss and is intended to give users an easy-on ramp onto our powerful agent orchestration tooling. Also integrates with LlamaCloud in case you want to integrate your own knowledge base into your agent. FlowMaker app: FlowMaker Repo: LlamaCloud:

Introducing FlowMaker 🌊🤖 A fully open-source, low-code way of building custom agent workflows. Build agents via a drag and drop interface, run it directly in the app, and also directly export it into a deeply custom workflow backed by LlamaIndex 🦙.TS. It’s a fantastic visual tool to help you get started with agents but also gives you full customizability to build the deepest, most advanced workflows, since everything is backed by code. There’s a lot of great low-code agent tools already, but this is a fully-open source template that you can use or clone/customize for your own needs. It was a fun project by Laurie Voss and is intended to give users an easy-on ramp onto our powerful agent orchestration tooling. Also integrates with LlamaCloud in case you want to integrate your own knowledge base into your agent. FlowMaker app: FlowMaker Repo: LlamaCloud:

Jerry Liu

20,413 просмотров • 1 год назад

Introducing RAGApp 💫 A no-code interface to configure a RAG chatbot, as dead-simple as GPTs by OpenAI. It’s a docker container that’s easily deployable in any cloud infrastructure. Best of all, it’s fully open-source 🔥 1️⃣ Setup the LLM: Configure the model provider (OpenAI, Gemini) 2️⃣ Setup the data: Define the system prompt and upload your knowledge base. 3️⃣ Launch the chatbot both via the UI or API 4️⃣ If via the UI, stream intermediate events and also sources! This is fantastic work by Marcus Schiesser and is built upon the same DNA as our create-llama project. Check out RAGApp today:

Introducing RAGApp 💫 A no-code interface to configure a RAG chatbot, as dead-simple as GPTs by OpenAI. It’s a docker container that’s easily deployable in any cloud infrastructure. Best of all, it’s fully open-source 🔥 1️⃣ Setup the LLM: Configure the model provider (OpenAI, Gemini) 2️⃣ Setup the data: Define the system prompt and upload your knowledge base. 3️⃣ Launch the chatbot both via the UI or API 4️⃣ If via the UI, stream intermediate events and also sources! This is fantastic work by Marcus Schiesser and is built upon the same DNA as our create-llama project. Check out RAGApp today:

LlamaIndex 🦙

123,977 просмотров • 2 лет назад

Build and customize complex AI applications with a flexible framework in this new short course, Building AI Applications with Haystack. Created in collaboration with deepset, makers of Haystack, and taught by Tuana, who is the developer relations lead for Haystack at deepset. Generative AI technology is changing rapidly and it can be challenging to integrate APIs from different LLMs, vector databases, and various tools such as web search. In this course, you will learn how to use the Haystack framework to make your development process more modular, allowing you to manage complexity and focus more on building your application. In detail, you’ll: - Build a RAG pipeline using Haystack’s main building blocks – components, pipelines, and document stores. - Create custom components in your pipeline by building a Hacker News summarizer that extends your app’s ability to access APIs. - Use conditional routing to create a branching pipeline with a fallback to web search mechanism when the LLM does not have the necessary context to respond to the user's query. - Build a self-reflecting agent for named entity recognition that loops using an output validator custom component. - Create a chat agent using OpenAI's function-calling capabilities which allow you to provide Haystack pipelines as tools to the LLM, enhancing that agent's capabilities. By the end of this course, you will learn a high-level orchestration framework that can help make your applications flexible, extendible, and maintainable, even as the technology stack changes, new user needs arise, and you add new features to your application. Please sign up here:

Build and customize complex AI applications with a flexible framework in this new short course, Building AI Applications with Haystack. Created in collaboration with deepset, makers of Haystack, and taught by Tuana, who is the developer relations lead for Haystack at deepset. Generative AI technology is changing rapidly and it can be challenging to integrate APIs from different LLMs, vector databases, and various tools such as web search. In this course, you will learn how to use the Haystack framework to make your development process more modular, allowing you to manage complexity and focus more on building your application. In detail, you’ll: - Build a RAG pipeline using Haystack’s main building blocks – components, pipelines, and document stores. - Create custom components in your pipeline by building a Hacker News summarizer that extends your app’s ability to access APIs. - Use conditional routing to create a branching pipeline with a fallback to web search mechanism when the LLM does not have the necessary context to respond to the user's query. - Build a self-reflecting agent for named entity recognition that loops using an output validator custom component. - Create a chat agent using OpenAI's function-calling capabilities which allow you to provide Haystack pipelines as tools to the LLM, enhancing that agent's capabilities. By the end of this course, you will learn a high-level orchestration framework that can help make your applications flexible, extendible, and maintainable, even as the technology stack changes, new user needs arise, and you add new features to your application. Please sign up here:

Andrew Ng

53,788 просмотров • 1 год назад

New short course on sophisticated RAG (Retrieval Augmented Generation) techniques is out! Taught by Jerry Liu and Anupam Datta of LlamaIndex 🦙 and TruEra , this teaches advanced techniques that help your LLM generate good answers. Topics include: - Sentence-window retrieval, which retrieves not just the most relevant sentence, but a window of sentences around it for higher quality context. - Auto-merging retrieval, which organizes your document into a hierarchical tree structure, where each parent node's text is split among its child nodes. Based on the relevance of the child nodes to a user query, this lets you better decide whether the entire parent node should be provided as context to the LLM. - Evaluation methodology for separately evaluating the quality of the key steps of RAG (context relevance, answer relevance, groundedness) so that you can perform error analysis, identify which part of your pipeline needs work, and tune components systematically. Please check out the course!

New short course on sophisticated RAG (Retrieval Augmented Generation) techniques is out! Taught by Jerry Liu and Anupam Datta of LlamaIndex 🦙 and TruEra , this teaches advanced techniques that help your LLM generate good answers. Topics include: - Sentence-window retrieval, which retrieves not just the most relevant sentence, but a window of sentences around it for higher quality context. - Auto-merging retrieval, which organizes your document into a hierarchical tree structure, where each parent node's text is split among its child nodes. Based on the relevance of the child nodes to a user query, this lets you better decide whether the entire parent node should be provided as context to the LLM. - Evaluation methodology for separately evaluating the quality of the key steps of RAG (context relevance, answer relevance, groundedness) so that you can perform error analysis, identify which part of your pipeline needs work, and tune components systematically. Please check out the course!

Andrew Ng

655,580 просмотров • 2 лет назад

We’re on a mission to parse the world’s hardest PDFs, and we’d love your help There are so many document types that introduce a million edge cases for current VLMs / OCR: handwritten forms, badly scanned/rotated pages, charts, diagrams, and more. We are running a contest right now for you to try to extract the hardest PDFs you can find. Come sign up on our agent builder, describe what you want to extract through natural language, upload your document, and show the results. If our platform doesn’t work, even better; this is great feedback for us to improve our service. Either way submit your project and we’d love to get your feedback! Check out LlamaCloud here:

We’re on a mission to parse the world’s hardest PDFs, and we’d love your help There are so many document types that introduce a million edge cases for current VLMs / OCR: handwritten forms, badly scanned/rotated pages, charts, diagrams, and more. We are running a contest right now for you to try to extract the hardest PDFs you can find. Come sign up on our agent builder, describe what you want to extract through natural language, upload your document, and show the results. If our platform doesn’t work, even better; this is great feedback for us to improve our service. Either way submit your project and we’d love to get your feedback! Check out LlamaCloud here:

Jerry Liu

21,005 просмотров • 5 месяцев назад