Загрузка видео...

Не удалось загрузить видео

На главную

Today, we're introducing our document parser built specifically for RAG. The parser combines the best vision, OCR, and vision language models to deliver unmatched accuracy. Try it for free today—the first 500+ pages are on us! 🧵 1/

1,308,495 просмотров • 1 год назад •via X (Twitter)

Комментарии: 10

Фото профиля Douwe Kiela
Douwe Kiela1 год назад

For many enterprise AI use cases, document parsing is a major bottleneck to achieving sufficient RAG performance. Existing parsers treat documents as disconnected pages, hallucinate critical information, and struggle with complex modalities. These fundamental failures cascade through AI systems, putting a ceiling on end-to-end accuracy. Here's what makes our approach different: 🧵 2/

Фото профиля Douwe Kiela
Douwe Kiela1 год назад

1️⃣ Holistic document understanding – Our parser automatically infers a document’s hierarchy, which enables teams to add metadata to each chunk that describes its position in the document. This allows agents to understand how different sections relate to each other across hundreds of pages. 🧵 3/

Фото профиля Douwe Kiela
Douwe Kiela1 год назад

2️⃣ Minimized hallucinations – Our multi-stage pipeline minimizes severe hallucinations while providing bounding boxes and confidence levels for table extraction to audit its output. 🧵 4/

Фото профиля Douwe Kiela
Douwe Kiela1 год назад

3️⃣ Superior handling of complex modalities – Technical diagrams, complex figures, and nested tables are efficiently processed to support all of your enterprise data. 🧵 5/

Фото профиля Douwe Kiela
Douwe Kiela1 год назад

Read more in our blog: See code examples: Tagging a few folks who may find this interesting: @deedydas @_avichawla @akshay_pachaar @rajhans_samdani @NirDiamantAI @AndrewYNg @sh_reya @soldni @simonw @jxnlco @dorialexander

Фото профиля Mingtian
Mingtian1 год назад

Checkout our open-sourced version of getting the hierarchy structure of the long documents:

Фото профиля elvis
elvis1 год назад

Great work, @douwekiela!

Фото профиля Lingxi Li
Lingxi Li1 год назад

🔥 This is truly the best document parser for LLM I had ever seen. The quality of text structures are insanely accurate, hierarchies are included, and the user experience of the playground is next level. Working with AI + documents? You gonna love this!!

Фото профиля Nina Lopatina
Nina Lopatina1 год назад

I love the user experience for the document parser!

Фото профиля Soumitr
Soumitr1 год назад

Amazing to see this out in the wild!

Похожие видео