
LlamaIndex 🦙
@llama_index • 115,303 subscribers
The world's best AI Document OCR LlamaParse: https://t.co/yQGTiRSfFL Docs: https://t.co/us6GCS14vD
Videos

We've spent years building LlamaParse into the most accurate document parser for production AI. Along the way, we learned a lot about what fast, lightweight parsing actually looks like under the hood. Today, we're open-sourcing a light-weight core of that tech as LiteParse 🦙 It's a CLI + TS-native library for layout-aware text parsing from PDFs, Office docs, and images. Local, zero Python dependencies, and built specifically for agents and LLM pipelines. Think of it as our way of giving the community a solid starting point for document parsing: npm i -g @llamaindex/liteparse lit parse anything.pdf - preserves spatial layout (columns, tables, alignment) - built-in local OCR, or bring your own server - screenshots for multimodal LLMs - handles PDFs, office docs, images Blog: Repo:
LlamaIndex 🦙580,651 görüntüleme • 2 ay önce

LiteParse hit 4.3K+ GitHub stars in a few weeks. Today it officially joins the LlamaIndex ecosystem, with its own page at ~500 pages in 2 sec. 50+ formats. Zero cloud dependency. Already powering agents in Claude Code, Cursor, and production pipelines. In a few days out head of OSS, Logan Markewich, is hosting a live workshop: build a fintech due diligence agent with LiteParse →
LlamaIndex 🦙85,279 görüntüleme • 1 ay önce

🚀 The team at Google just released the Agents API, a service for building and running custom agents inside a sandboxed Linux environment, and we built a template that gives these agents access to LlamaParse / LiteParse, enabling them to process unstructured documents automatically 📄⚡ Here’s how it works: 🔹 Configure a Git repository where data and outputs will be stored 🔹 Clone the repository into the agent sandbox 🔹 Install the LiteParse CLI, the LlamaParse SDK, and agent skills to use both 🔹 Prompt the agent with a task and watch it process documents autonomously 🤖 The result? An agent that can work directly with messy, real-world documents using LlamaParse and LiteParse within Google’s new agent runtime. Check out the GitHub repository: Get started with LlamaParse:
LlamaIndex 🦙20,619 görüntüleme • 15 gün önce

Introducing RAGs, a Streamlit app that allows you to create and customize your own RAG agent and then use it over your own data, all with natural language 🔥 Directly inspired by OpenAI GPTs, you can converse with an agent to help you do search/retrieval over any data you specify. The app contains three main pages: 🏠 Home Page : Have a “builder agent” build your RAG agent through natural language (you specify the data). ⚙️ RAG Config: Look at configured parameters 🤖 Use your RAG agent! Check out details below 👇 Blog: Repo:
LlamaIndex 🦙475,732 görüntüleme • 2 yıl önce

LlamaParse now has an official Agent Skill you can use across 40+ agents. With built-in instructions for parsing complex documents, including different formats, tables, charts, and images, your agents gain access to deeper document understanding, not just raw text extraction. 👇 Watch the demo 📖 Read the docs: 🚀 Get started with LlamaCloud:
LlamaIndex 🦙51,845 görüntüleme • 2 ay önce

Let's talk parsing tables. Two days ago we launched ParseBench,the first document OCR benchmark built for AI agents. This deep dive breaks down TableRecordMatch (GTRM), our metric for evaluating complex tables the way your pipeline actually consumes them: as records keyed by column headers.
LlamaIndex 🦙25,999 görüntüleme • 1 ay önce

Parsing documents with AI agents just got a lot more seamless🚀 We've rebuilt the LlamaParse MCP server to handle your document processing workflows, and you can connect it today to any MCP-compatible client at 🌐 Once connected, you'll be able to: 📁 Parse documents into clean markdown 🔍 Classify files against your own categories ✂️ Split long documents into labelled sections ⬆️ Upload files via URL or a browser-based upload flow Building a production MCP server surfaced some non-obvious challenges: getting auth to align with an existing platform identity system using WorkOS, working around MCP's lack of built-in file upload support, and making deployments, rate limiting and observability feel native with Vercel and Axiom. We wrote up all of it, from the OAuth flow, to the token-based upload design, to the tradeoffs we hit along the way📝 📚 Read the full blog: 👩💻 GitHub repository:
LlamaIndex 🦙19,601 görüntüleme • 1 ay önce

Built a vibe-coded presentation app generator that turns natural language into polished slides ✨ This project by Jerry Liu combines the Claude Agent SDK with LlamaParse to create an AI-powered presentation tool that handles everything from content creation to PDF export: 🎯 Chat-based slide creation - just describe what you want and watch slides appear ✏️ Real-time editing through natural conversation - refine slides by chatting with the AI 📄 Smart document parsing with LlamaParse for incorporating reference materials 📊 Full export functionality to PowerPoint and PDF formats Perfect example of how LlamaParse makes document processing seamless while Claude's conversational abilities create an intuitive slide editing experience. Check out the full project:
LlamaIndex 🦙50,184 görüntüleme • 4 ay önce

Check out the form-filling agent that automates PDF forms using AI by Jerry Liu 📄🤖 Use any fillable PDF with an agent that fills it out based on your prompts and context files. Our new experiment creates a multi-turn chat experience for form completion. 🔍 Upload fillable PDFs and automatically detect form fields using PyMuPDF 📝 Add custom prompts and context files (parsed via LlamaParse) to guide the AI 🤖 Multi-turn conversations let you refine and correct form entries after initial completion 💾 Download your completed forms when done The agent uses simple tools to list, set, get, and validate form fields. You can chat with it to make corrections and adjustments until your form is perfect. Check out the code on GitHub: Or the deployed app here:
LlamaIndex 🦙55,131 görüntüleme • 4 ay önce

🚀 The team at Google DeepMind just released Gemini Embedding 2, a frontier embeddings model with 3072 dimensions and state-of-the-art semantic quality. 👩💻 We built a demo showing how to integrate it across the LlamaIndex ecosystem, from LlamaParse to LlamaAgents: 𝗮𝘂𝗱𝗶𝗼-𝗸𝗯, a knowledge base for your audio notes. With audio-kb, you can: 🔹 Upload an MP3 or record directly from your terminal 🔹 LlamaParse extracts the transcript from the audio 🔹 Gemini Embedding 2 generates embeddings 🔹 Metadata + vectors are stored in SurrealDB and indexed with HNSW 🔍 Once ingested, you can search all your audio notes directly from the terminal. 🎙️ Perfect for turning voice memos, meetings, or lectures into a searchable knowledge base. 📖 Full blog: 💻 GitHub: ⚡ Try LlamaParse:
LlamaIndex 🦙34,765 görüntüleme • 2 ay önce

We’re excited to officially launch LlamaParse, the first genAI-native document parsing solution. Not only is it better at parsing out images/tables/charts 📊📈 than virtually every other parser, it is now steerable through natural language instructions - output the document in whatever format you desire! It is also the only parsing solution that seamlessly allows you to build accurate RAG over complex documents, free of hallucinations 🔥 We launched it in private preview a few weeks ago and hit 2k users, 1M total PDF pages parsed. And now it’s better than ever. LlamaParse contains the following killer features: ✅ SOTA table/chart extraction ✅ Seamless integration with LlamaIndex 🦙 advanced RAG/agents ✅✨ Natural language Parsing Instructions ✅✨JSON mode and image extraction ✅✨Support for ~10 document types (.pdf, .pptx, .docx, .xml) and more Our pricing is simple: 1k free per day, and additional pages at 0.3c a page, or $3 for 1k pages. If you want advanced document RAG and/or private deployments, come get in touch with us to chat about LlamaCloud. Check out our full blog post here: LlamaParse client repo: Signup at 🦙☁️: Come talk to us:
LlamaIndex 🦙143,087 görüntüleme • 2 yıl önce

Let's talk parsing charts 📊📈. Last week we released ParseBench, the first document OCR benchmark for AI agents. New in ParseBench: ChartDataPointMatch. Most document look at a chart and OCR the caption. Agents need the actual numbers. That's the gap between "OCR'd the text around the chart" and "actually read the chart." More about ParseBench, the GitHub code, Hugging Face dataset, and scientific paper→
LlamaIndex 🦙13,987 görüntüleme • 1 ay önce

Introducing LlamaCloud 🦙🌤️ Today we’re thrilled to introduce LlamaCloud, a managed service designed to bring production-grade data for your LLM and RAG app. Spend less time data wrangling and more time on application logic. Launching with the following components: 1️⃣ LlamaParse 📑: a proprietary parser designed to be really really good at complex documents with embedded tables. Build advanced RAG over semi-structured PDFs, and ask questions that simply aren’t possible with the naive stack. Available publicly day 1 🔥 2️⃣ Managed Ingestion/Retrieval API ⚙️: An API letting you easily ingest/retrieve data from data sources. Opening up in private beta to select enterprises. We’re excited to be joined by launch users, partners, and collaborators: Mendable POC MongoDB Qdrant NVIDIA + some awesome hackathon projects at the LlamaIndex 🦙 hackathon Check out our FULL blog post on LlamaCloud and LlamaParse: LlamaParse Client Repo: Signup for a LlamaCloud account to use LlamaParse: Interested in the broader LlamaCloud offering? Come talk to us: Also we have a slick new website 🌐:
LlamaIndex 🦙141,230 görüntüleme • 2 yıl önce

We're listening 👂LlamaSheets is in beta and we want your feedback Spreadsheets in the wild are messy—merged cells, broken layouts, headers spanning multiple rows. LlamaSheets (now in beta) extracts regions and tables from these files and outputs clean Parquet files you can actually use. What it does: · Identifies and isolates regions in your spreadsheet · Extracts them as Parquet files (load directly into pandas/polars/DuckDB) · Generates cell-level metadata (40+ features: formatting, position, data types) · Creates titles and descriptions for sheets and regions Built for the spreadsheets nobody wants to deal with manually. We need your feedback. While in beta and actively improving based on real-world use cases. Try it out and let us know what works, what doesn't, and what you need. Get started here:
LlamaIndex 🦙35,405 görüntüleme • 5 ay önce

Today we’re excited to feature RAGApp v0.1 - which lets any user construct a multi-agent application 🎨🤖 without writing a single line of code 💫 Add any number of agents that you wish, and assign each agent a role, system prompt, and a set of tools. In this example, use a researcher, analyst, and report generation agent to write a news article. This directly generates a full chat interface where you can ask questions and get back answers with full streaming and sources. Huge shoutout to Marcus Schiesser for working on this! RAGApp: create-llama: If you want finer control, you can define your own agentic workflows through code:
LlamaIndex 🦙92,941 görüntüleme • 1 yıl önce

We’re excited to introduce RAGs v2 - build, customize, and use multiple ChatGPTs over your data, all with natural language 💬 A huge upgrade vs. the initial launch: 💫 Easily create multiple RAG pipelines and save them 💫 Easily swap between and customize each one (e.g. over different data, or w/ different system prompts) 💫 Delete unused RAG pipelines 💫 (dev quality) added much-needed linting/CI Check out the video 🎥 for details. It’s super easy to setup and use. Some additional features: 🧠 Supports a lot of LLMs both for building RAG and within each RAG pipeline 🌐 Supports loading load files or web pages. Check out our repo here:
LlamaIndex 🦙124,174 görüntüleme • 2 yıl önce

After the release of Parse v2, Extract is also getting an upgrade — 𝗶𝗻𝘁𝗿𝗼𝗱𝘂𝗰𝗶𝗻𝗴 𝗘𝘅𝘁𝗿𝗮𝗰𝘁 𝘃𝟮! 🎉 We've been reworking the experience from the ground up to make document extraction more powerful and easier to use than ever. Here's what's new: ✦ 𝗦𝗶𝗺𝗽𝗹𝗶𝗳𝗶𝗲𝗱 𝘁𝗶𝗲𝗿𝘀: we've replaced modes with cleaner, more intuitive tiers. (And stay tuned: agentic plus is coming to Extract too, very soon.) ✦ 𝗣𝗿𝗲-𝘀𝗮𝘃𝗲𝗱 𝗲𝘅𝘁𝗿𝗮𝗰𝘁 𝗰𝗼𝗻𝗳𝗶𝗴𝘂𝗿𝗮𝘁𝗶𝗼𝗻𝘀: load your saved extraction configs directly, so you can skip the setup and get straight to extracting. ✦ 𝗖𝗼𝗻𝗳𝗶𝗴𝘂𝗿𝗮𝗯𝗹𝗲 𝗱𝗼𝗰𝘂𝗺𝗲𝗻𝘁 𝗽𝗮𝗿𝘀𝗶𝗻𝗴: now you can control how your documents get parsed before extraction, giving you more flexibility and better results end to end. And for those who need a transition period: Extract v1 will remain accessible via the UI under 'Settings → General' for a limited time. Try Extract v2 today →
LlamaIndex 🦙15,041 görüntüleme • 2 ay önce

Our OSS engineer Clelia Bertelli (🦙/acc) recently built 𝗹𝗶𝘁𝗲𝘀𝗲𝗮𝗿𝗰𝗵, a fully local document ingestion and retrieval CLI/TUI application powered by LiteParse ⚡ litesearch demonstrates how developers can assemble a high-performance, local-first retrieval pipeline using open tools from across the ecosystem: • Parsing: LiteParse, the fast and accurate document parser we recently open sourced • Chunking: Chonkie • Embeddings: A local Nomic model via Hugging Face transformers.js • Vector storage: A local Qdrant edge shard (custom-built in Rust and compiled as a native add-on) • Retrieval: Query stored files with optional path-based filtering and configurable relevance thresholds • Runtime: Bun for speed and versatility 💻 Check out the repository and try it yourself: 📚 LiteParse docs:
LlamaIndex 🦙14,806 görüntüleme • 2 ay önce

🚀 The Google DeepMind team just added Gemini 3.1 to the Live API, so we built a small demo showing how Gemini voice agents can plug directly into the document processing ecosystem powered by LlamaIndex. 🔥 In this example, we integrate LiteParse to enable fast, fully-local document parsing. With our TUI-based voice assistant, you can literally talk to your terminal: - Speak commands - Trigger live document parsing via tool calls - Hear the agent read back results in real time 🔊 The assistant can extract content from single files or entire folders, leveraging the lightning-fast local parsing that LiteParse provides ⚡ Take a look at the demo👇 👩💻 GitHub repo 📚 LiteParse docs
LlamaIndex 🦙14,766 görüntüleme • 2 ay önce