Hacker News new | past | comments | ask | show | jobs | submit login

I have been keeping track of a few products like these including some that are YC backed. Interesting space as I am looking for a solution myself:

- Minds DB (YC W20) https://github.com/mindsdb/mindsdb

- Buster (YC W24) https://buster.so

- DB Pilot https://dbpilot.io

and now this one




It's not a public facing product, but there was a talk from a team at Alibaba a couple of months ago during CMU's "ML⇄DB Seminar Series" [0] on how they augmented their NL2SQL transformer model with "Semantics Correction [...] a post-processing routine, which checks the initially generated SQL queries by applying rules to identify and correct semantic errors" [1]. It will be interesting to see whether VC-backed teams can keep up with the state of the art coming out of BigCorps.

[0] "Alibaba: Domain Knowledge Augmented AI for Databases (Jian Tan)" - https://www.youtube.com/watch?v=dsgHthzROj4&list=PLSE8ODhjZX...

[1] "CatSQL: Towards Real World Natural Language to SQL Applications" - https://www.vldb.org/pvldb/vol16/p1534-fu.pdf


See also SQLCoder by defog.ai: https://github.com/defog-ai/sqlcoder


We have been piloting louie.ai with some fairly heavy orgs that may be relevant: Cybersecurity incident responders, natural disaster management, insurance fraud, and starting more regular commercial analytics (click streams, ...)

A bit unusual compared to the above, we find operational teams need more than just SQL, but also Python and more operational DBs (Splunk, OpenSearch, graph DBs, Databricks, ...). Likewise, due to our existing community there, we invest a lot more in data viz (GPU, ..) and AI + graph workflows. These have been through direct use, like Python notebooks & interactive dashboards except where code is more opt-in where desired or for checking the AI's work, and new, embedded use for building custom apps and dashboards that embed conversational analytics.


Please add Ibis Birdbrain https://ibis-project.github.io/ibis-birdbrain/ to the list. Birdbrain is an AI-powered data bot, built on Ibis and Marvin, supporting more than 18 database backends.

See https://github.com/ibis-project/ibis and https://ibis-project.org for more details.


note that Ibis Birdbrain is very much work-in-progress, but should provide an open-source solution to do this w/ 20+ backends

old demo here: https://gist.github.com/lostmygithubaccount/08ddf29898732101...

planning to finish it...soon...


soon like "check back in a month", or "Soon™"?


the "check back in a month" soon. I have versions of it that work but I just haven't been satisfied with. also, the major underlying dependency (Marvin) is going through a large refactor for v2. once that stabilizes a bit, I'm going to upgrade to it and that might simplify the code I need a lot


I would love to bring your attention also to getdot.ai We launched it on Hackernews with an analysis of HN post data. https://news.ycombinator.com/item?id=38709172

the main problems we see in the space: 1) good interface design: nobody wants another webapp if they can use Slack or Teams 2) learning enough about the business and usually messy data model to always give correct answers or say I don't know.


Have you written up any results of your experience with each?

I’m interested in a survey of this field so far and would read it.


Not yet but not a bad idea if I can get to test them all soon :)


I don’t fully understand the use of business case after reading the documentation. Is it really a time save?


It would be for people who are not that fluent in SQL. Even as a dev, I find ChatGPT to be easier for writing queries than hand coding them as I do it so infrequently.


Yeah, same here. Seems like that approach is much simpler than this.

I guess the real benefit here is that you don’t need to understand the schemas so the knowledge is not lost when someone leaves a company.

Sort of an abstraction layer for the schemas


Sounds like you are describing a semantic layer. You don't need AI to achieve that, though it is fun when it works. Example of a semantic layer I made below, but there are others out there with more support behind them.

https://github.com/totalhack/zillion


Allow people that don't know SQL to query a database.


[flagged]


The initial part about sql already being that bridge was my first reaction, too.



also Brewit.ai (YC W23) https://brewit.ai/




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: