Ask HN: Any AI/ML approaches to “date tag” text documents?
2 points by bobosha on June 20, 2019 | 1 comment
News articles, press releases, etc. may not carry an explicit date, but the body might state "In the Spring of 2018...." => 2018/03/??

Any tools in this space?




There are two parts to this: (1) labeling something as a date or time and (2) normalizing it to a time stamp. The first part is the tagging. The second part is temporal normalization.
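To make the two steps concrete, here is a rough sketch using only the Python standard library. It is not how any of the libraries mentioned in this thread work internally. The function names (tag_dates, normalize), the season-to-month mapping (Northern Hemisphere, season start month), and the "YYYY/MM/??" output format borrowed from the question are all assumptions for illustration.

```python
import re
from typing import List, Optional, Tuple

# Assumption: map each season to its first month (Northern Hemisphere).
# "Winter of 2018" is ambiguous in practice; this sketch picks December.
PERIOD_TO_MONTH = {"spring": 3, "summer": 6, "fall": 9, "autumn": 9, "winter": 12}
PERIOD_TO_MONTH.update({
    name: number
    for number, name in enumerate(
        ["january", "february", "march", "april", "may", "june", "july",
         "august", "september", "october", "november", "december"],
        start=1,
    )
})

# Matches "<season or month> [of] <4-digit year>", e.g. "Spring of 2018".
PATTERN = re.compile(
    r"\b(?P<period>" + "|".join(PERIOD_TO_MONTH) + r")"
    r"(?:\s+of)?\s+(?P<year>(?:19|20)\d{2})\b",
    re.IGNORECASE,
)


def tag_dates(text: str) -> List[Tuple[int, int, str]]:
    """Step 1 (tagging): return (start, end, matched text) for each date-like span."""
    return [(m.start(), m.end(), m.group(0)) for m in PATTERN.finditer(text)]


def normalize(expression: str) -> Optional[str]:
    """Step 2 (normalization): turn a tagged span into 'YYYY/MM/??', or None."""
    match = PATTERN.search(expression)
    if match is None:
        return None
    month = PERIOD_TO_MONTH[match.group("period").lower()]
    return f"{match.group('year')}/{month:02d}/??"


if __name__ == "__main__":
    body = "In the Spring of 2018, the company expanded. Sales rose again in October 2019."
    for start, end, span in tag_dates(body):
        print(span, "=>", normalize(span))
```

A hand-written pattern like this only covers the phrasings it was written for, which is why the libraries below are usually a better starting point.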

There are several libraries for temporal normalization:

- Duckling: https://github.com/facebook/duckling
- JChronic: https://github.com/samtingleff/jchronic
- There's also Chronic (the Ruby library that JChronic was ported from).

Stanford NLP and SpaCy also do tagging:

- https://github.com/stanfordnlp/stanfordnlp
- https://spacy.io/usage/linguistic-features#named-entities

Edit: Stanford NLP does not do temporal normalization. Added SpaCy



