
And what lib is ocr.space using for OCR?



I suspect they're using Tesseract, as they've written a GUI for it ( https://ocr.space/blog/p/free-ocr-windows.html ), but there's no way to find out more.


https://github.com/A9T9/Free-OCR-Software

Based on this GitHub repo, they might be using the Microsoft OCR library.


I guess it defaults to English then? In my experience, running Tesseract on Scandinavian texts gives AAO instead of ÅÄÖ if you don't supply the correct language training set. That's quite the chicken-and-egg problem: you can't identify the language without the text, and you can't get the text without identifying the language first.
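One way around the chicken-and-egg problem, as a rough sketch rather than anything ocr.space is known to do: either load several language models at once (Tesseract's `-l` flag accepts a `+`-joined list such as `swe+eng`), or run OCR once per candidate language and keep the result the engine is most confident about. The `ocr` callable and its `(text, mean_confidence)` return shape are assumptions made up for this example, and picking by confidence is only a heuristic.

```python
from typing import Callable, Iterable, List, Tuple


def tesseract_cmd(image_path: str, langs: Iterable[str]) -> List[str]:
    """Build a Tesseract CLI call that loads several language models at once.

    The matching language data files (e.g. swe, eng) must be installed.
    """
    return ["tesseract", image_path, "stdout", "-l", "+".join(langs)]


def best_language(
    image_path: str,
    candidates: Iterable[str],
    ocr: Callable[[str, str], Tuple[str, float]],
) -> Tuple[str, str]:
    """Run OCR once per candidate language and keep the most confident result.

    `ocr(image_path, lang)` is a hypothetical callable returning
    (text, mean_confidence); plug in whatever OCR wrapper you actually use.
    """
    results = {lang: ocr(image_path, lang) for lang in candidates}
    lang = max(results, key=lambda name: results[name][1])
    return lang, results[lang][0]
```

The first helper only builds the argument list, so you can hand it to `subprocess.run` yourself. The second costs one OCR pass per candidate language, which is slow but avoids having to know the language up front.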



