Загрузка видео...
Не удалось загрузить видео
Chunking your text data is a crucial step when building a RAG app ✂️ 1. You avoid hitting the token limit 2. Smaller chunks make the retriever more accurate I cover a few chunking methods and suggest a few frameworks that offer this (LlamaIndex 🦙, , deepset, makers of Haystack)
28,451 просмотров • 2 лет назад •via X (Twitter)
Комментарии: 10

@llama_index @LangChainAI @deepset_ai @tuanacelik 🫶💙🫶💙

@llama_index @LangChainAI @deepset_ai Ur content has been so good, really informative. Thanks!!

@llama_index @LangChainAI @deepset_ai Thank you Joshua! 🙂

@llama_index @LangChainAI @deepset_ai Super cool! Great overview! 🔥

@llama_index @LangChainAI @deepset_ai Why thank you 😄

@llama_index @LangChainAI @deepset_ai For years, I have told people to chunk data based on what they can remember verbatim on the spot. Think about chunking as your mind understanding? Too aggressive.

@llama_index @LangChainAI @deepset_ai Interesting point 🤔 my chunk would be 4 sentences max 😄

@llama_index @LangChainAI @deepset_ai @AFokianos @NicolaS22898

@deepset_ai @llama_index @LangChainAI When your content you want to use with RAG is short form, maybe a sentence to 2-4, would you always want to do super small chunks?

@deepset_ai @llama_index @LangChainAI If you only have 2-4 sentences you don’t really need to worry about chunking. This is really only for long documents!
