Could isolate the text by compositing neighbor images and getting the pixels tha...

KerrickStaley · on May 29, 2017

An approach I really want to try is taking a stream of the video without subs (can easily be found online) and subtracting the two. You'd have to deal with differences in resolution and compression between the two, and also handle cases where the background is either white or black, but in theory it should work very well. I haven't had time to dig into this.

0xfeba · on May 29, 2017

Seems like you could get gstreamer and some subtractive elements working pretty quick...

gbaygon · on May 30, 2017

Legitimate question: why would you want the first video (hardcoded subs) if you have a second stream without them, better resolution maybe?

kpozin · on May 30, 2017

In order to have access to vocabulary words. From the article: > I wanted to get a transcript of the episode’s dialog so I could study the unfamiliar vocabulary. Unfortunately, the video files I have only have hard subtitles

gbaygon · on May 30, 2017

Makes sense, thanks. I went straight to the technical details and missed that part.

stuaxo · on May 29, 2017

This has to be worth a try.

wolfgang42 · on May 29, 2017

This only works if the camera isn't fixed, though. In the frame from the post it might erase the dashboard, the car roof, and so on.

killin_dan · on May 29, 2017

Just define a small area of the screen to run on then. Subtitles are typically within a very small portion of the screen

thebooktocome · on May 30, 2017

Nope, every channel of CCTV seems to have its own subtitle convention.

killin_dan · on May 30, 2017

Which is irrelevant to stripping subs from one movie at a time.