Hey folks. Congrats on the launch. Everyone here knows that it's a really big pr...

macklinkachorn · 2024-08-13T17:19:27 1723569567

Appreciate the note.

1. I completely agree. Last-mile accuracy is crucial for enterprise buyers, and the challenge isn't just the AI. It's about mapping their business logic and workflows to the product in a way that demonstrates fast time to value.

2. Thanks for the feedback. We're still refining the messaging and don't want to be overly focused on just the extraction aspect. Do you think positioning it as ETL for unstructured data or high-accuracy extraction for enterprises might work better?"

artembugara · 2024-08-13T17:31:52 1723570312

2. I think that "AI" and "unstructured data" sounded "cool" 5 years ago :)

I'd be mindblown if you said, "We turn PDFs into structured data with 99.99% accuracy. Here is how:"

And then tell me about fine-tuning human-in-the-loop stuff.

EarlyOom · 2024-08-13T18:21:42 1723573302

We've been building something similar with https://vlm.run/: we're starting out with documents, but feel like the real killer app will involve agentic workflows grounded in visual inputs like websites. The challenge is that even the best foundation models still struggle a lot with hallucination and rate limits, which means that you have to chain together both OCR and LLMs to get a good result. Platforms like Tesseract work fine for simple, dense documents, but don't help with more complex visual media like charts and graphs. LLMs are great, but even the release of JSON schemas by OpenAI hasn't really fixed 'making things up' or 'giving up halfway through'.