
Ashraff Hathibelagal
@Hathibel • 2,083 subscribers
Programmer | Effective accelerationist (e/acc) | History and language aficionado | Proponent of transhumanism 🐷

Inspired by Brian Roemmele, I set up DeepSeek-OCR on Colab. Even with a T4 GPU and 4-bit quantization, it scans a page in about 45 seconds. In this video, you can see that it compressed 527 text tokens into 249 image tokens. According to the paper, DeepSeek-OCR is trained on nearly 100 languages, so I'm going to try it on some old manuscripts written in Indic languages soon.
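For a rough sense of what those numbers mean, here is a minimal sketch that works out the compression ratio from the two token counts quoted above. The function name `compression_ratio` is just an illustrative helper, not part of DeepSeek-OCR.

```python
def compression_ratio(text_tokens: int, image_tokens: int) -> float:
    """Return how many text tokens each image token stands in for."""
    if image_tokens <= 0:
        raise ValueError("image_tokens must be positive")
    return text_tokens / image_tokens


# Token counts quoted in the post: 527 text tokens, 249 image tokens.
text_tokens, image_tokens = 527, 249

ratio = compression_ratio(text_tokens, image_tokens)
saved = 1 - image_tokens / text_tokens  # fraction of tokens saved

print(f"{ratio:.2f}x compression, {saved:.1%} fewer tokens")
# prints "2.12x compression, 52.8% fewer tokens"
```

So for this one page, the image representation needs a bit less than half as many tokens as the plain text.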
Ashraff Hathibelagal • 226,767 views • 7 months ago