Loading video...

Video Failed to Load

Go Home

Announcing: Agentic Document Extraction! PDF files represent information visually - via layout, charts, graphs, etc. - and are more than just text. Unlike traditional OCR and most PDF-to-text approaches, which focus on extracting the text, an agentic approach lets us break a document down into components and reason about...

689,047 views • 1 year ago •via X (Twitter)

11 Comments

Arnav Jaitly's profile picture
Arnav Jaitly1 year ago

This is such a pain point for a lot of companies. Agentic information extraction will be such a productivity and efficiency boost

PDF GPT's profile picture
PDF GPT1 year ago

This is my favorite AI tool for reviewing reports. Just upload a report, ask for a summary, and get one in seconds. It's like ChatGPT, but built for documents. Try it for free.

Andrei Hasna's profile picture
Andrei Hasna1 year ago

Well done and well needed.

Vasilis K.'s profile picture
Vasilis K.1 year ago

@JackPNicholls

The AI Agent Architect's profile picture
The AI Agent Architect1 year ago

Great use case. Using an agentic approach to this is kinda like getting the LLM to add metadata to what it finds in the PDF. So many thing you could do with that. If you want to see the opposite side of that coin - the creation of a structured Word document by an agent from only two simple commands, there will be a video demo showing this in my next article tomorrow.

🔮's profile picture
🔮1 year ago

Love what you’re doing! For what it’s worth the URL in your video is wrong and hits a 404

Mr Rio's profile picture
Mr Rio1 year ago

this is what I need

RyanRejoice's profile picture
RyanRejoice1 year ago

This is incredible and a game changer for handling and analyzing pdf data.

Clara Data's profile picture
Clara Data1 year ago

Innovative spin on PDFs! Looking promising for future efficiency!

Huxley MindOS's profile picture
Huxley MindOS1 year ago

Mastering the PDF mystery: deciphering both scroll & scribe! Sounds innovative. Looking forward to the details.

Javier Modified's profile picture
Javier Modified1 year ago

Agentic Document Extraction? Finally, AI can stop pretending it understands charts. This is how you handle a PDF! 🚀

Related Videos

Build better RAG by letting a team of agents extract and connect your reference materials into a knowledge graph. Our new short course, “Agentic Knowledge Graph Construction,” taught by Neo4j Innovation Lead Andreas Kollegger, shows you how. Knowledge graphs are an important way to store information accurately but they are a lot of work to build manually. In this course you’ll learn how to build a team of agents that turn data– in this case product reviews and invoices from suppliers–into structured graphs of entities and relationships for RAG. Learn how agents can automatically handle the time-consuming work of building graphs — extracting entities and relationships (e.g., Product "contains" Assembly, Part "supplied_by" Supplier, Customer review "mentions" Product), deduplicating them, fact-checking them, and committing them to a graph database — so your retrieval system can find right information to generate accurate output. For example, you can use agents to help trace customer complaints directly to specific suppliers, manufacturing processes, and product hierarchies, thus turning fragmented information into queryable business intelligence. Skills you’ll gain: - Build, store, and access knowledge graphs using the Neo4j graph database - Build multi-agent systems using Google’s Agent Development Kit (ADK) - Set up a loop of agentic workflows to propose and refine a graph schema through fact-checking - Connect agent-generated graphs of unstructured and structured data into a unified knowledge graph This course gets into the practicum of why knowledge graphs give more accurate information retrieval than vector search alone, especially for high-stakes applications where precision matters more than fuzzy similarity matching. Sign up here:

Andrew Ng

167,710 views • 9 months ago