launching our open source OCR tool today! try it out with some terrible pdfs and let me know how it goes:
342,540 views • 1 year ago • via X (Twitter)
10 comments

how are you dealing with hallucinations in the text if you're using GPT to OCR the text? while great for some use cases, i've noticed hallucinations when it comes to dense text at scale. are there any benchmarks you're comparing against?

hey Karthik. we're working pretty hard on a benchmark right now. Noticed the same issues with dense text (sideways text too). It's one of the big steps we need to take as we start on a fine-tuning dataset. I'll be sharing as we make progress!

Waited a few minutes and nothing happened.

hey Bryan! sorry got more traffic than we expected and we had to up our limits. should be good to go now!

You should check out General OCR Theory, a specialized LLM for OCR 2.0. Unfortunately, they don't have a GPT-4o benchmark for comparison, but the results are pretty impressive.

@ycombinator I’m literally just about to implement structured document OCR for accounts, tax computations and other similar docs, for my biz’s internal CRM. I thought “oh great I’ll use this instead of Textract”. But I can’t because of the “book a demo” thing. And no pricing. 😕

@ycombinator Make it open source

@ycombinator ... it is open source 👆

trying now with some different ones but doesn't seem to be working? Using Arc if that helps at all.

we got a lot of traffic and had to up our limits! should be good to go now!
