I worked on this for the library of Uni Würzburg when I was a student. Pretty cool to see it on HN.
We worked on high-quality scans, processing terabytes of raw image material using Akka (this was around 2012, I think), and also created pipelines for performant OCR at scale on these scans. Doing this on Fraktur and medieval minuscule scripts was tricky, and we didn't get really good results during my time there.