
Interesting! One quick question: how did you validate your data and ensure its correctness, given that the ground truth is unstructured?



Not OP, but based on their writeup it sounds like you do need to provide at least a target schema, i.e. the data you need or expect to extract from the unstructured input.

I assume that if the validation step doesn't get all of those data points, the record is routed to an error state for further review or something similar.
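
To make that guess concrete, here is a minimal sketch of the kind of check I have in mind. It is not the OP's implementation: the schema, the field names (invoice_id, total, currency), the Result type, and the "needs_review" status are all made up for illustration.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Optional

# Hypothetical target schema: field name -> expected Python type.
TARGET_SCHEMA: dict[str, type] = {
    "invoice_id": str,
    "total": float,
    "currency": str,
}


@dataclass
class Result:
    status: str                      # "ok" or "needs_review" (assumed names)
    record: dict[str, Any]
    problems: list[str] = field(default_factory=list)


def validate(
    record: dict[str, Any],
    schema: dict[str, type] = TARGET_SCHEMA,
    rule: Optional[Callable[[dict[str, Any]], bool]] = None,
) -> Result:
    """Check an extracted record against the schema and an optional rule."""
    problems: list[str] = []
    for name, expected in schema.items():
        if record.get(name) is None:
            problems.append(f"missing field: {name}")
        elif not isinstance(record[name], expected):
            problems.append(f"wrong type for {name}: expected {expected.__name__}")
    # Only run the user-supplied rule once the record has the right shape.
    if not problems and rule is not None and not rule(record):
        problems.append("validation rule failed")
    # Any problem sends the record to review instead of the normal output.
    return Result("needs_review" if problems else "ok", record, problems)


complete = validate({"invoice_id": "A1", "total": 10.0, "currency": "USD"})
partial = validate({"invoice_id": "A1"})
negative = validate(
    {"invoice_id": "A2", "total": -1.0, "currency": "USD"},
    rule=lambda r: r["total"] >= 0,
)
```

Here complete comes back as "ok", while partial and negative both end up as "needs_review" with the reasons listed in problems. Note this only checks that the output has the right shape; it says nothing about whether the extracted values match the source text.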


Users specify the schema, the output format, and a validation rule, and we make sure the system adheres to all three.



