Yeah I was toying around with that, too… but folks often mess around with metada...

cyberax · 2024-12-14T01:08:27 1734138507

I was going to suggest to use the same approach as the old CD tagging systems. Count the number of words in each chapter to create a "book fingerprint".

It's highly likely to be globally unique, and it can also help with the missing forewords/afterwords/bonus content sections.

In addition, you can also add fuzzy matching for the title.

smoores · 2024-12-14T02:24:42 1734143082

I think that the thing we need to account for (which, number of words per chapter would capture this, I think) is different publications of the same book, which would need different overlays if they have different chapter filepaths, etc.