I have been keeping track of a few products like these including some that are Y...

refset · 2024-01-14T22:36:38 1705271798

It's not a public facing product, but there was a talk from a team at Alibaba a couple of months ago during CMU's "ML⇄DB Seminar Series" [0] on how they augmented their NL2SQL transformer model with "Semantics Correction [...] a post-processing routine, which checks the initially generated SQL queries by applying rules to identify and correct semantic errors" [1]. It will be interesting to see whether VC-backed teams can keep up with the state of the art coming out of BigCorps.

[0] "Alibaba: Domain Knowledge Augmented AI for Databases (Jian Tan)" - https://www.youtube.com/watch?v=dsgHthzROj4&list=PLSE8ODhjZX...

[1] "CatSQL: Towards Real World Natural Language to SQL Applications" - https://www.vldb.org/pvldb/vol16/p1534-fu.pdf

ignoramous · 2024-01-15T01:47:32 1705283252

See also SQLCoder by defog.ai: https://github.com/defog-ai/sqlcoder

lmeyerov · 2024-01-15T01:16:15 1705281375

We have been piloting louie.ai with some fairly heavy orgs that may be relevant: Cybersecurity incident responders, natural disaster management, insurance fraud, and starting more regular commercial analytics (click streams, ...)

A bit unusual compared to the above, we find operational teams need more than just SQL, but also Python and more operational DBs (Splunk, OpenSearch, graph DBs, Databricks, ...). Likewise, due to our existing community there, we invest a lot more in data viz (GPU, ..) and AI + graph workflows. These have been through direct use, like Python notebooks & interactive dashboards except where code is more opt-in where desired or for checking the AI's work, and new, embedded use for building custom apps and dashboards that embed conversational analytics.

kszucs · 2024-01-14T22:47:26 1705272446

Please add Ibis Birdbrain https://ibis-project.github.io/ibis-birdbrain/ to the list. Birdbrain is an AI-powered data bot, built on Ibis and Marvin, supporting more than 18 database backends.

See https://github.com/ibis-project/ibis and https://ibis-project.org for more details.

codyvoda · 2024-01-14T22:50:17 1705272617

note that Ibis Birdbrain is very much work-in-progress, but should provide an open-source solution to do this w/ 20+ backends

old demo here: https://gist.github.com/lostmygithubaccount/08ddf29898732101...

planning to finish it...soon...

hatsix · 2024-01-14T23:08:07 1705273687

soon like "check back in a month", or "Soon™"?

codyvoda · 2024-01-14T23:16:11 1705274171

the "check back in a month" soon. I have versions of it that work but I just haven't been satisfied with. also, the major underlying dependency (Marvin) is going through a large refactor for v2. once that stabilizes a bit, I'm going to upgrade to it and that might simplify the code I need a lot

zurfer · 2024-01-15T07:29:59 1705303799

I would love to bring your attention also to getdot.ai We launched it on Hackernews with an analysis of HN post data. https://news.ycombinator.com/item?id=38709172

the main problems we see in the space: 1) good interface design: nobody wants another webapp if they can use Slack or Teams 2) learning enough about the business and usually messy data model to always give correct answers or say I don't know.

bredren · 2024-01-14T21:54:10 1705269250

Have you written up any results of your experience with each?

I’m interested in a survey of this field so far and would read it.

codegeek · 2024-01-14T23:12:26 1705273946

Not yet but not a bad idea if I can get to test them all soon :)

pylua · 2024-01-14T22:15:13 1705270513

I don’t fully understand the use of business case after reading the documentation. Is it really a time save?

MattGaiser · 2024-01-14T22:29:59 1705271399

It would be for people who are not that fluent in SQL. Even as a dev, I find ChatGPT to be easier for writing queries than hand coding them as I do it so infrequently.

pylua · 2024-01-14T22:31:52 1705271512

Yeah, same here. Seems like that approach is much simpler than this.

I guess the real benefit here is that you don’t need to understand the schemas so the knowledge is not lost when someone leaves a company.

Sort of an abstraction layer for the schemas

totalhack · 2024-01-15T03:10:47 1705288247

Sounds like you are describing a semantic layer. You don't need AI to achieve that, though it is fun when it works. Example of a semantic layer I made below, but there are others out there with more support behind them.

https://github.com/totalhack/zillion

EmilStenstrom · 2024-01-14T22:18:12 1705270692

Allow people that don't know SQL to query a database.

realanswe91 · 2024-01-14T22:54:45 1705272885

[flagged]

pylua · 2024-01-14T23:09:23 1705273763

The initial part about sql already being that bridge was my first reaction, too.

sherlock_h · 2024-01-15T13:41:30 1705326090

Here is one more: https://www.ycombinator.com/companies/patterns

markding00 · 2024-01-19T17:48:46 1705686526

also Brewit.ai (YC W23) https://brewit.ai/