Vanna.ai: Chat with your SQL database

sagaro · 2024-01-15T07:20:09 1705303209

All these products that pitch about using AI to find insights from your data always end up looking pretty in demos and fall short in reality. This is not because the product is bad, but because there is enormous amount of nuance in DB/Tables that becomes difficult to manage. Most startups evolve too quickly and product teams generally tries to deliver by hacking some existing feature. Columns are added, some columns get new meaning, some feature is identified by looking at a combination of 2 columns etc. All this needs to be documented properly and fed to the AI and there is no incentive for anyone to do it. If the AI gives the right answer, everyone is like wow AI is so good, we don't need the BAs. If the AI gives terrible answers they are like "this is useless". No one goes "wow, the data engineering team did a great job keeping the AI relevant".

joshstrange · 2024-01-15T11:08:11 1705316891

I couldn’t agree more. I’ve hooked up things to my DB with AI in an attempt to “talk” to it but the results have been lackluster. Sure it’s impressive when it does get things right but I found myself spending a bunch of time adding to the prompt to explain how the data is organized.

I’m not expecting any LLM to just understand it, heck another human would need the same rundown from me. Maybe it’s worth keeping this “documentation” up to date but my take away was that I couldn’t release access to the AI because it got things wrong too often and I could anticipate every question a user might ask. I didn’t want it to give out wrong answers (this DB is used for sales) since spitting out wrong numbers would be just as bad as my dashboards “lying”.

Demo DBs aren’t representative of shipping applications and so the demos using AI are able to have an extremely high success rate. My DB, with deprecated columns, possibly confusing (to other people) naming, etc had a much higher error rate.

eurekin · 2024-01-15T12:35:41 1705322141

Speculating

How about a chat interface, where you correct the result and provide more contextual information about those columns?

Those chats could be later fed back to the model and ran a DPO optimisation on top

lmeyerov · 2024-01-15T12:44:43 1705322683

Agreed.

Agent reasoning systems should learn based on past and future use, and both end users and maintainers should have power in how they work. So projects naturally progress on adding guard rails, heuristics, policies, customization, etc. Likewise, they first do it with simple hardcoding and then swapping in learning.

As we have built out Louie.ai with these kinds of things, I've appreciated ChatGPT as its own innovation separate from the underlying LLM. There is a lot going on behind the scenes. They do it in a very consumer/prosumer setting where they hide almost everything. Technical and business users need more in our experience, and even that is a coarse brush...

eek2121 · 2024-01-15T21:42:13 1705354933

Welcome to AI in general.

Billions wasted on a pointless endeavor.

10 years from now folks are going to be laughing at how billions of dollars and productivity was flushed down the drain to support Microsoft Word 2.0.

AI is a bubble. Do yourself a favor and short (or buy put options) the companies that only have "AI" for a business model.

Also short Intel, because Intel.

lmeyerov · 2024-01-15T08:34:29 1705307669

Our theory is we are having simultaneously a bit of a Google moment and a Tableau moment. There is à lot more discovery & work to pull it off, but the dam has been broken. It's been am exciting time to work through with our customers:

* Google moment: AI can now watch and learn how you and your team do data. Around the time Google pagerank came around, the Yahoo-style search engines were highly curated, and the semantic web people were writing xml/rdf schema and manually mapping all data to it. Google replaced slow and expensive work with something easier, higher quality, and more scalable + robust. We are making Louie.ai learn both ahead of time and as the system gets used, so data people can also get their Google moment. Having a tool that works with you & your team here is amazing.

* Tableau moment: A project or data owner can now guide a lot more without much work. Dashboarding used to require a lot of low-level custom web dev etc, while Tableau streamlined it so that a BI lead good at SQL and who understood the data & design can go much further without a big team and in way less time. Understanding the user personas, and adding abstractions for facilitating them, were a big deal for delivery speed, cost, and achieved quality. Arguably the same happened as Looker in introduced LookML and foreshadowed the whole semantic layer movement happening today. To help owners ensure quality and security, we have been investing a lot in the equivalent abstractions in Louie.ai for making data and more conversational. Luckily, while the AI part is new, there is a lot more precedent on the data ops side. Getting this right is a big deal in team settings and basically any time the stakes are high.

scoot · 2024-01-15T10:15:32 1705313732

> Around the time Google pagerank came around, the Yahoo-style search engines were highly curated

Hmmm, no. Altavista was the go-to search engine at the time (launched 1995), and was a crawler (i.e. not a curated catalog/directory) based search. Lycos predates that but had keyword rather than natural language search.

Google didn't launch until 1998.

eek2121 · 2024-01-15T21:44:56 1705355096

Hotbot was amazing back in the day. Google scored me post high school schoolin', however, so I won't complain...Shoot Google helped me make the local news back in the day!

tucnak · 2024-01-15T08:52:20 1705308740

Is that right? You do all that at Louie.ai?

lmeyerov · 2024-01-15T09:14:42 1705310082

Yep. A lot more on our roadmap, but a lot already in place!

It's been cool seeing how different pieces add up together and how gov/enterprise teams push us. While there are some surprising implementation details, a lot has been following up on what they need with foundational implementations and reusing them. The result is a lot is obvious in retrospect and well-done pieces carry it far.

Ex: We added a secure python sandbox last quarter so analysts can drive richer data wrangling on query results. Except now we are launching a GPU version, both so the wrangling can be ML/AI (ex: auto feature engineering), users can wrangle bigger results (GPU dataframes), and we will move our own built-in agents to it as well (ex: GPU-accelerated dashboard panels). Most individual PRs here are surprisingly small, but opens a lot!

zurfer · 2024-01-15T09:33:25 1705311205

as someone building in this space, I am a bit surprised how many concepts you managed to combine in your last sentence. :'D

I will bookmark: ... and we will move our own built-in agents to it as well (ex: GPU-accelerated dashboard panels).

lmeyerov · 2024-01-15T09:47:42 1705312062

Those are pretty normal needs for us and our users. A big reason louie.ai exists is to make easier all the years of Graphistry helping enterprise & gov teams use python notebooks, streamlit/databricks/plotly python dashboards, and overall python GPU+graph data science. Think pandas, pytorch, huggingface, Nvidia RAPIDS, our own pygraphistry, etc.

While we can't those years of our lives back, we can make the next ones a lot better!

j-a-a-p · 2024-01-15T15:15:25 1705331725

Mostly agree. I suggest to keep using ETL and create a data warehouse that irons outs most of these nuances that are needed for a production database. On a data warehouse with good meta data I can imagine this will work great.

sagaro · 2024-01-15T15:28:45 1705332525

I think getting clean tables/ETLs is a big blocker for move fast and break things. I would be more interested in actually github copilot style sql IDE (like datagrip etc.), which has access to all the queries written by all the people within a company. Which runs on a local server or something for security reasons and to get the nod from the IT/Sec department.

And basically when you next write queries, it just auto completes for you. This would improve the productivity of the analysts a lot. With the flexibility of them being able to tweak the query. Here if something is not right, the analyst updates. The Copilot AI keeps learning and giving weights to recent queries more than older queries.

Unlike the previous solution where if something breaks, you can do nothing till you clean up the ETL and redeploy it.

zurfer · 2024-01-15T07:42:54 1705304574

that is correct. GPT-4 is good on well-modelled data out of the box, but struggles with a messy and incomplete data model.

Documenting data definitely helps to close that gap.

However the last part you describe is nothing new (BI teams taking credit, and pushing on problems to data engineers). In fact there is a chance that tools like vanna.ai or getdot.ai bring engineers closer to business folks. So more honest conversations, more impact, more budget.

Disclaimer: I am a co-founder at getdot.ai :)

lmeyerov · 2024-01-15T12:24:32 1705321472

Agreed, maybe I wasn't clear enough. I don't view it as BI team vs platform team vs whoever. Maybe a decrease in the need for PhD AI consultants for small projects, or to wait for some privileged IT team for basic tasks, so they can focus on bigger things.

Instead of Herculean data infra projects, this is a good time for figuring out new policy abstractions, and finding more productive divisions of labor between different days stakeholders and systems. Machine-friendly abstractions and structure are tools for predictable collaboration and automation. More doing, less waiting.

More practically, an increasing part of the Louie.ai stack is helping get the time-consuming quality, guardrails, security, etc parts under easier control of small teams building things. As-is, it takes a lot to give a great experience.

sagaro · 2024-01-15T15:19:51 1705331991

There used to be a company/product called Business Objects aka BO (SAP bought them), which had folks meticulously map every relationship. When done correctly, it was pretty good. You could just drag drop and get answers immediately.

So yes, I can understand if there is incentive for the startups to invest in Data Engineers to make well maintained data models.

But I do think, the most important value here is not the chatgpt interface, it is getting DEs to maintain the data model in a company where product/biz is moving fast and breaking things. If that is done, then existing tools (Power BI for instance has "ask in natural language" feature) will be able to get the job done.

The google moment, the other person talks about in another comment, is where google or 1998 didn't require a webpage owner to do anything. They didn't need him/her to make something in a different format. Use specific tags. Use some tags around key words etc. It was just "you do what you do, and magically we will crawl and make sense of it".

Here unfortunately that is not the case. Say in a ecom business which always delivers in 2 days for free, a new product is launched (same day delivery for $5 dollars), the sales table is going to get two extra columns "is_same_day_delivery_flag" and "same_day_delivery_fee". The revenue definition will change to include this shipping charges. A new filter will be there, if someone wants to see the opt in rate for how many are going for same day delivery or how fast it is growing. Current table probably has revenue. But now revenue = revenue + same_day_delivery_fee and someone needs to make the BO connection to this. And after launch, you notice you don't have enough capacity to do same day shipping, so sometimes you just have to return the fee and send it as normal delivery. Here the is_same_day_delivery_flag is true, but the same_day_delivery_fee is 0. And so on and on...

Getting DE to keep everything up to date in a wiki is tough, let alone a BO type solution. But I do hope getdot.ai etc. someone incentivizes them to change this way of doing things.

anon291 · 2024-01-15T16:05:47 1705334747

The AI needs to truly be 'listening' in in a passive way to all Slack messages, virtual meetings, code commits, etc and really be present whenever the 'team' is in order to get anything done.

XCSme · 2024-01-20T11:49:40 1705751380

Or maybe the database documentation has to be very comprehensive and the AI should have access to it.

bob1029 · 2024-01-14T19:47:57 1705261677

The most success I had with AI+SQL was when I started feeding errors from the sql provider back to the LLM after each iteration.

I also had a formatted error message wrapper that would strongly suggest querying system tables to discover schema information.

These little tweaks made it scary good at finding queries, even ones requiring 4+ table joins. Even without any examples or fine tuning data.

echelon · 2024-01-14T20:24:38 1705263878

Please turn this into a product. There's enormous demand for that.

bob1029 · 2024-01-14T20:46:35 1705265195

I feel like by the time I could turn it into a product, Microsoft & friends will release something that makes it look like a joke. If there is no one on the SQL Server team working on this right now, I don't know what the hell their leadership is thinking.

I am not chasing this rabbit. Someone else will almost certainly catch it first. For now, this is a fun toy I enjoy in my free time. The moment I try to make money with it the fun begins to disappear.

Broadly speaking, I do think this is approximately the only thing that matters once you realize you can put pretty much anything in a big SQL database. What happens when 100% of the domain is in-scope of an LLM that has iteratively optimized itself against the schema?

benreesman · 2024-01-15T02:58:32 1705287512

There’s daylight between personal toolsmithing and a VC-backed startup (both are fun sometimes and a grind sometimes).

I’m getting together a bunch of related-sounding stuff in terms of integrating modern models into my workflow to polish up a bit and release MIT.

If you’d like to have a hand tidying it up a little and integrating it with e.g. editors and stuff, I think the bundle would be a lot cooler for it!

panarky · 2024-01-15T06:50:44 1705301444

Microsoft may well catch the rabbit that queries schemas and generates valid SQL.

But that rabbit can't understand the meaning of the data just by looking at column names and table relationships.

Let's say you want to know how sales and inventory are doing compared to last year at your chain of retail stores.

Will Microsoft's rabbit be smart enough to know that the retail business is seasonal, so it must compare the last x weeks this year with the same weeks last year? And account for differences in timing of holidays? And exclude stores that weren't open last year?

Will it know that inventory is a stock and sales is a flow, so while it can sum daily sales, it's nonsensical to sum daily inventory?

The real AI magic isn't generating SQL with four joins, it's understanding the mechanics of each industry and the quirks of your organization to extract the intent from ambiguous and incomplete natural language.

benreesman · 2024-01-15T08:25:49 1705307149

If I can TLDR your comment, which I agree with: the real value is in doing real work.

“Hustlers” burn countless hours trying to “optimize” work out of the picture.

Historically, there’s a lot of money in just sitting down with a to-do list of customer problems and solving them at acceptable cost, come hell or high water.

andy_ppp · 2024-01-14T21:09:57 1705266597

I will be extremely surprised if Microsoft build this for open source databases, however someone else will definitely build it if you don't, that is completely true :-)

JelteF · 2024-01-14T21:58:53 1705269533

Disclaimer: I work at Microsoft on Postgres related open source tools (Citus & PgBouncer mostly)

Microsoft is heavily investing in Postgres and its ecosystem, so I wouldn't be extremely surprised if we would do this. We're definitely building things to combine AI with Postgres[1]. Although afaik no-one is working actively on query generation using AI.

But I actually did a very basic POC of "natural language queries" in Postgres myself last year:

Conference talk about it: https://youtu.be/g8lzx0BABf0?si=LM0c6zTt8_P1urYC Repo (unmaintained): https://github.com/JelteF/pg_human

1: https://techcommunity.microsoft.com/t5/azure-database-for-po...

hot_gril · 2024-01-15T05:16:35 1705295795

Postgres is dear to me. Met its founders when I was in college at Berkeley, worked heavily with it at a previous company around 2015, used it for all my own projects. I'm glad to see it getting more attention lately (seemingly).

jzig · 2024-01-15T01:28:15 1705282095

Supabase already has an AI feature which queries your database for you [0]

[0]: https://supabase.com/blog/studio-introducing-assistant

hot_gril · 2024-01-15T04:23:22 1705292602

Microsoft owns Citus, a very major Postgres plugin.

andy_ppp · 2024-01-15T04:30:24 1705293024

I didn’t know this, it seems they love open source even thought they have competing commercial products. Maybe there is just more money is selling cloud than there is in selling commercial databases?

dcreater · 2024-01-14T21:32:59 1705267979

You can just make a GitHub repo with what you have. It'd still be valuable to the community

rattray · 2024-01-15T03:37:12 1705289832

Would you be willing to share your prompts? I bet a lot of people would find them useful!

giancarlostoro · 2024-01-15T02:58:31 1705287511

> I feel like by the time I could turn it into a product, Microsoft & friends will release something that makes it look like a joke. If there is no one on the SQL Server team working on this right now, I don't know what the hell their leadership is thinking.

If Cortana for Azure isn't a thing in the works, I *really* don't know what the hell their leadership is working on. I could see insane value in "why is my website slow?" and getting actionable responses.

whoiscroberts · 2024-01-14T21:47:07 1705268827

If the do release it , they will only release it for enterprise. Many many sql server installs are sql server standard. There is an entire ecosystem of companies built on selling packages that support sql server standard, wee DevArt, RedGate.

victor106 · 2024-01-15T00:09:34 1705277374

> Microsoft & friends will release something that makes it look like a joke

True, Microsoft & Friends have gotten greedy every passing year. Before they used to develop the platform (OS,DB etc.,) and let others develop and sell apps on it that would benefit them as well as the whole ecosystem.

Now they want every last dollar they can squeeze out of the ecosystem. So they don't leave any stone unturned and they have big pockets to do that.

personjerry · 2024-01-14T22:01:13 1705269673

Wouldn't it be pretty fast to make it as a chatgpt?

SOLAR_FIELDS · 2024-01-14T21:27:10 1705267630

There are already several products out there with varying success.

Some findings after I played with it awhile:

- Langchain already does something like this - a lot of the challenge is not with the query itself but efficiently summarizing data to fit in the context window. In other words if you give me 1-4 tables I can give you a product that will work well pretty easy. But when your data warehouse has tens or hundreds of tables with columns and meta types now we need to chain together a string of queries to arrive at the answer and we are basically building a state machine of sorts that has to do fun and creative RAG stuff - the single biggest thing that made a difference in effectiveness was not what op mentioned at all, but instead having a good summary of what every column in the db was stored in the db. This can be AI generated itself, but the way Langchain attempts to do it on the fly is slow and rather ineffective (or at least was the case when I played with it last summer, it might be better now).

Not affiliated, but after reviewing the products out there the data team I was working with ended up selecting getdot.ai as it had the right mix of price, ease of use, and effectiveness.

petters · 2024-01-14T20:59:04 1705265944

It sounds like pretty standard constructions with OpenAI's API. I have a couple of such iterative scripts myself for bash commands, SQL etc.

But sure, why not!

l5870uoo9y · 2024-01-14T22:55:23 1705272923

You can check this out https://www.sqlai.ai. It has AI-powered generators for:

- Generate SQL

- Generate optimized SQL

- Fix query

- Optimize query

- Explain query

Disclaimer: I am the solo developer behind it.

vopi · 2024-01-15T01:39:22 1705282762

Are all those Twitter testimonials fake? None seem to be actual accounts.

richardw · 2024-01-15T06:18:28 1705299508

- Generate testimonial

(I kid. Hope you do well with the app, just get some real testimonials in there if they aren't already.)

l5870uoo9y · 2024-01-15T08:14:22 1705306462

Thanks.

thdespou · 2024-01-15T10:12:37 1705313557

I bet even their Site design is AI generated...

l5870uoo9y · 2024-01-15T12:33:17 1705321997

It is based on Flowbite[1], ShadUI[2] and Landwind[3].

[1]: https://flowbite.com/

[2]: https://ui.shadcn.com/

[3]: https://demo.themesberg.com/landwind/

datashaman · 2024-01-17T01:22:03 1705454523

s/GTP/GPT/g

chrisjh · 2024-01-15T03:55:42 1705290942

Shameless plug – we're working on this at Velvet (https://usevelvet.com) and would love feedback. Our tool can connect and query across disparate data sources (databases and event-based systems) and allows you to write natural language questions that are turned automatically into SQL queries (and even make those queries into API endpoints you can call directly!). My email is in my HN profile if anyone wants to try it out or has feedback.

Kiro · 2024-01-15T07:01:29 1705302089

What made you say this? How is this different from the hundreds of AI startups already focusing on this, or even the submission that we're having this conversation on?

BenderV · 2024-01-15T10:56:12 1705316172

Shameless plug - https://github.com/BenderV/ada

It's an open source BI tool that does just that.

bob1029 · 2024-01-15T11:28:46 1705318126

Yep this is pretty much what I was going for.

This is why I don't chase rabbits. Y'all already got a whole box of em sitting here.

mcapodici · 2024-01-14T20:52:48 1705265568

I would be tempted to pivot to that! I am working on similar for CSS (see bio) but if that doesn’t work out my plan was to pivot to other languages.

teaearlgraycold · 2024-01-14T20:33:05 1705264385

Someone get YC on the phone

quickthrower2 · 2024-01-14T20:50:50 1705265450

Or open source? You could get 10k stars :-)

vivzkestrel · 2024-01-15T04:01:16 1705291276

who are the most recent signed up users and what is their hashed password? what is stopping me from running this query on your database?

hot_gril · 2024-01-15T04:22:07 1705292527

Same thing stopping you from executing arbitrary SQL on the DB.

bongodongobob · 2024-01-15T04:39:38 1705293578

I'm really curious as to the reasoning behind your question and why you think an LLM generated query somehow would have unfettered access and permissions.

sigmoid10 · 2024-01-15T08:30:35 1705307435

What's stopping anyone who can run ordinary SQL queries? The LLM just simplifies interaction, it is neither the right tool nor the right place to enforce user rights.

aussieguy1234 · 2024-01-14T22:24:59 1705271099

I've already done this with GPT-4.

It goes something like this:

Here's the table structure from MySQL cli `SHOW TABLE` statements for my tables I want to query.

Now given those tables, give me a query to show me my cart abandonment rate (or, some other business metric I want to know).

Seems to work pretty well.

zainhoda · 2024-01-14T23:23:35 1705274615

Author of the package here. That's pretty much what this package does just with optimization around what context gets sent to the LLM about the database:

https://github.com/vanna-ai/vanna/blob/main/src/vanna/base/b...

And then of course once you have the SQL, you can put it in an interface where you the SQL can be run automatically and then get a chart etc.

lopatin · 2024-01-15T01:40:21 1705282821

What surprises me is the amount of programmers and analysts that I meet the don’t do this yet. Writing complex, useful SQL queries is probably the most valuable thing that ChatGPT does for me. It makes you look like a god to stakeholders, and “I’m not strong in SQL” is no longer an excuse for analysis tasks.

EZ-E · 2024-01-15T03:44:34 1705290274

Same for excel - I suck at it but ChatGPT is really good at writing these formulas. I feel this is one area LLMs are pretty good at.

zainhoda · 2024-01-15T02:59:00 1705287540

I've noticed this as well. Do you have any theories as to why that is?

bongodongobob · 2024-01-15T04:59:34 1705294774

I think a lot of people tried just asking GPT-3.5 to "Write me full stack web app no bugs please." when it first came out. When it failed to do that they threw up their hands and said "It's just a parrot."

Then GPT4 came out, they tried the same thing and got the same results.

I keep seeing comments regarding it not being helpful because "super big codebase, doesn't work, it doesn't know the functions and what they do."

...so tell it? I've had it write programs to help it understand.

For example: Write me a Python program that scans a folder for source code. Have it output a YAML-like text file of the functions/methods with their expected arguments and return types.

Now plug that file into GPT and ask it about the code or use that when it needs to reference things.

I've spent the last year playing with how to use prompts effectively and just generally working with it. I think those that haven't are definitely going to be left behind in some sense.

It's like they aren't understanding the meta and the crazy implications of that. In the last year, I've written more code than I have in the last 5. I can focus on the big picture and not have to write the boilerplate and obvious parts. I can work on the interesting stuff.

For those still not getting it, try something like this.

Come up with a toy program.

Tell it it's a software project manager and explain what you want to do. Tell it to ask questions when it needs clarification.

Have it iterate through the requirements and write a spec/proposal.

Take that and then tell it it's a senior software architect. Have it analyze the plan (and ask questions etc) but tell it not to write any code.

Have it come up with the file structure and necessary libraries for your language.

Have it output than in JSON or YAML or whatever you like.

Now take that and the spec and tell it it's a software engineer. Ask it which file to work on first.

Have it mock up the functions in psuedo code with expected arguments and output type etc.

Tell it to write the code.

And iterate as necessary.

Do this a few times with different ideas and you'll start to get the hang of how to feed it information to get good results.

upmostly · 2024-01-15T06:06:13 1705298773

Nail. Head.

The amount of code I now "write" (I've started calling it directing) and features I've put into my side projects has been more than the last 5-10 years combined this last year.

I successfully created a sold a product within 3 months. Start to finish, because of the productivity power I received.

People are misusing it.

aussieguy1234 · 2024-01-16T01:54:51 1705370091

I'm also using GPT-4 for my side projects. For the database specifically, I can write a short sentence describing my table. Then get GPT-4 to generate the SQL to create it, plus a TypeScript interface and data repository module for access with the methods and SQL for basic CRUD. If I need more than basic CRUD, it can usually create methods for that also.

Its also pretty good at generating basic tests, amongst other things.

vincnetas · 2024-01-15T06:33:36 1705300416

Can you elaborate more on the product that you created and sold?

lysecret · 2024-01-15T10:08:58 1705313338

I think the main reason people don't do something like this more often is that this turns you from a "coder" to a "manager". Your task now is to serialize the issue and to ask the right questions / keep everything moving along.

I don't particularly mind because I cre more about building something than doing my craft but I can totally see how this will be different for many people.

ukuina · 2024-01-15T20:05:42 1705349142

Here's the philosophical conundrum: Do we prioritize learning new things (improve craft) or create value (build stuff)? The first used to lead to the second, but not as much anymore.

bongodongobob · 2024-01-15T16:34:31 1705336471

Yeah that's exactly it. You have to build up the proper context first. Get all the documentation to be nice and coherent and it will happily write beautiful code.

hot_gril · 2024-01-15T04:25:29 1705292729

I already know SQL, and yes it can take time to form exceptionally complex queries, but ChatGPT doesn't seem to do those accurately (yet). CGPT is more useful for general programming languages where there's a lot more boilerplate.

aussieguy1234 · 2024-01-16T01:57:39 1705370259

I've found GPT-4 to be alot better than the free version of ChatGPT (which uses the earlier GPT 3.5).

hot_gril · 2024-01-16T23:07:00 1705446420

It is. I should've mentioned that I don't have access to that. But I can still hammer out correct SQL faster than I can prompt ChatGPT.

kulikalov · 2024-01-14T22:43:06 1705272186

While I recognize the efforts in developing natural language to SQL translation systems, I remain skeptical. The core of my concern lies in the inherent nature of natural language and these models, which are approximative and lack precision. SQL databases, on the other hand, are built to handle precise, accurate information in most cases. Introducing an approximative layer, such as a language model, into a system that relies on precision could potentially create more problems than it solves, leading me to question the productivity of these endeavors in effectively addressing real-world needs.

thom · 2024-01-15T00:27:50 1705278470

The key here is to always have some structured intermediate layer that can allow people to evaluate what it is the system thinks they're asking, short of the full complexity of SQL. You'll need something like this for disambiguation anyway - are "Jon Favreau movies" movies starring or directed by Jon Favreau? Or do you mean the other Jon Favreau? Don't one-shot anything important end to end from language. Use language to guide users towards unambiguous and easy to reason about compositions of smaller abstractions.

totalhack · 2024-01-15T02:51:40 1705287100

I think AI to direct SQL will have use cases, but I personally have found it more of a fun toy than anything close to the main way I would interact with my data, at least in the course of running a business.

I am a fan of a semantic layer (I made an open source one but there are others out there), and think having AI talk to a semantic layer has potential. Though TBH a good UI over a semantic layer is so easy it makes the natural language approach moot.

https://github.com/totalhack/zillion

pietz · 2024-01-14T23:01:18 1705273278

I think you have a point. I also think people who share this mindset and keep sticking with it will have a tough future.

The technical benefits and potentials clearly outshine the problems and challenges. That's also true in this example. You just have to let go of some principles that were helpful in the past but aren't anymore.

kulikalov · 2024-01-14T23:04:29 1705273469

What technical benefits?

pietz · 2024-01-14T23:40:32 1705275632

A translation engine between natural language and SQL means that everyone can communicate with a SQL database now. That's huge. Soon also DB people will use it to get the response for complex questions and queries. It's just more natural and way faster.

With technology like this, there is little reason to even know SQL anymore as the average developer. Just like today, the average developer doesn't know how databases work because the cloud takes care of it. We're moving up on the abstraction ladder and tomorrow all you need to know for SQL is to ask the right question.

bamboozled · 2024-01-14T23:44:56 1705275896

What’s funny is SQL was supposed to be the natural language everyone uses to communicate with. Few bothered to learn it.

spennant · 2024-01-15T00:29:06 1705278546

"Few"? I guess I must be one of the olds.

bamboozled · 2024-01-15T02:40:48 1705286448

Are you a “business person” or analyst ?

totalhack · 2024-01-15T03:01:48 1705287708

This somewhat already exists in the form of semantic layers, which if done well can handle many of the queries necessary in the course of business without business users needing to know any SQL. There will still be cases where you need direct SQL, and AI tools can help the development process there at a minimum.

pietz · 2024-01-15T08:30:04 1705307404

Yes, there will be cases where you need SQL knowledge. There will also always be cases where knowing exactly how a database works under the hood is necessary. I think this is somewhat of a weak argument because you can always construct an example of how something may be helpful for a small group of people.

The relevant question is: How many people who work with databases need to have a lot of experience with SQL? My argument is that while the answer today is "most," the answer in a couple of years might be "very few."

totalhack · 2024-01-15T11:57:44 1705319864

Sure, AI will assist more and more in the cases where people must write SQL or manage a database, perhaps to the point you suggest.

But my point was actually that more people think they need to know SQL today than is actually the case. Excluding people that manage databases or cases that go direct to SQL for things like complex ETL, your average business user / marketer / etc should not be asked to write SQL or have to ask someone else to write SQL for them. Use a semantic layer instead with a UI on top and it's almost as easy as natural language.

Here is a example of one I made below, but there are others out there with more support. At my company, and the last few I've worked for, we use this approach for ~all day to day querying and a chunk of backend SQL replacement.

https://github.com/totalhack/zillion

chrisjh · 2024-01-15T04:06:49 1705291609

I have similar thoughts on this - so few members of a team actually know what the underlying data model looks like. Gets even harder when you start trying to query across your database(s) + external sources like analytics/event systems. Natural language lets the whole team peel away at the black box of data and helps build common ground on which to improve products. I already mentioned it in another part of this thread so I won't spam my project but would love to get your feedback on my natural language to SQL tool if you're interested.

macNchz · 2024-01-15T02:34:31 1705286071

Lots of average developers these days do not (or just barely) know SQL, and it shows when the ORM generates some nonsense and nobody can figure out why the app is suddenly two orders of magnitude slower.

bamboozled · 2024-01-14T23:19:32 1705274372

Your project manager can be an expert with SQL now too…and the novelty factor is high because using makes it feel like you’re in a “sci-fi” movie?

samstave · 2024-01-14T22:57:02 1705273022

Reverse idea:

Use this to POPULATE sql based on captured NLP "surveillance" -- for example, build a DB of things I say as my thing listens to me, and categorize things, topics, place, people etc mentioned.

Keep count of experiencing the same things....

When I say I need to "buy thing" build table of frequency for "buy thing" etc...

Effectively - query anything you've said to Alexa and be able to map behaviors/habits/people/things...

If I say - Bob's phone number is BLAH. It add's bob+# to my "random people I met today table" with a note of "we met at the dog park"

Narrate yourself into something journaled.

Makes it easy to name a trip "Hike Mount Tam" then log all that - then have it create a link in the table to the pics folder on your SpaceDrive9000.ai and then you have a full narration with links to the pics that you take.

kulikalov · 2024-01-14T23:06:12 1705273572

You are describing use case for a vector database

samstave · 2024-01-14T23:11:40 1705273900

So you wouldnt be able to pull out specific phrases and store them in a typical RDBMS?

codegeek · 2024-01-14T21:50:23 1705269023

I have been keeping track of a few products like these including some that are YC backed. Interesting space as I am looking for a solution myself:

- Minds DB (YC W20) https://github.com/mindsdb/mindsdb

- Buster (YC W24) https://buster.so

- DB Pilot https://dbpilot.io

and now this one

refset · 2024-01-14T22:36:38 1705271798

It's not a public facing product, but there was a talk from a team at Alibaba a couple of months ago during CMU's "ML⇄DB Seminar Series" [0] on how they augmented their NL2SQL transformer model with "Semantics Correction [...] a post-processing routine, which checks the initially generated SQL queries by applying rules to identify and correct semantic errors" [1]. It will be interesting to see whether VC-backed teams can keep up with the state of the art coming out of BigCorps.

[0] "Alibaba: Domain Knowledge Augmented AI for Databases (Jian Tan)" - https://www.youtube.com/watch?v=dsgHthzROj4&list=PLSE8ODhjZX...

[1] "CatSQL: Towards Real World Natural Language to SQL Applications" - https://www.vldb.org/pvldb/vol16/p1534-fu.pdf

ignoramous · 2024-01-15T01:47:32 1705283252

See also SQLCoder by defog.ai: https://github.com/defog-ai/sqlcoder

lmeyerov · 2024-01-15T01:16:15 1705281375

We have been piloting louie.ai with some fairly heavy orgs that may be relevant: Cybersecurity incident responders, natural disaster management, insurance fraud, and starting more regular commercial analytics (click streams, ...)

A bit unusual compared to the above, we find operational teams need more than just SQL, but also Python and more operational DBs (Splunk, OpenSearch, graph DBs, Databricks, ...). Likewise, due to our existing community there, we invest a lot more in data viz (GPU, ..) and AI + graph workflows. These have been through direct use, like Python notebooks & interactive dashboards except where code is more opt-in where desired or for checking the AI's work, and new, embedded use for building custom apps and dashboards that embed conversational analytics.

kszucs · 2024-01-14T22:47:26 1705272446

Please add Ibis Birdbrain https://ibis-project.github.io/ibis-birdbrain/ to the list. Birdbrain is an AI-powered data bot, built on Ibis and Marvin, supporting more than 18 database backends.

See https://github.com/ibis-project/ibis and https://ibis-project.org for more details.

codyvoda · 2024-01-14T22:50:17 1705272617

note that Ibis Birdbrain is very much work-in-progress, but should provide an open-source solution to do this w/ 20+ backends

old demo here: https://gist.github.com/lostmygithubaccount/08ddf29898732101...

planning to finish it...soon...

hatsix · 2024-01-14T23:08:07 1705273687

soon like "check back in a month", or "Soon™"?

codyvoda · 2024-01-14T23:16:11 1705274171

the "check back in a month" soon. I have versions of it that work but I just haven't been satisfied with. also, the major underlying dependency (Marvin) is going through a large refactor for v2. once that stabilizes a bit, I'm going to upgrade to it and that might simplify the code I need a lot

zurfer · 2024-01-15T07:29:59 1705303799

I would love to bring your attention also to getdot.ai We launched it on Hackernews with an analysis of HN post data. https://news.ycombinator.com/item?id=38709172

the main problems we see in the space: 1) good interface design: nobody wants another webapp if they can use Slack or Teams 2) learning enough about the business and usually messy data model to always give correct answers or say I don't know.

bredren · 2024-01-14T21:54:10 1705269250

Have you written up any results of your experience with each?

I’m interested in a survey of this field so far and would read it.

codegeek · 2024-01-14T23:12:26 1705273946

Not yet but not a bad idea if I can get to test them all soon :)

pylua · 2024-01-14T22:15:13 1705270513

I don’t fully understand the use of business case after reading the documentation. Is it really a time save?

MattGaiser · 2024-01-14T22:29:59 1705271399

It would be for people who are not that fluent in SQL. Even as a dev, I find ChatGPT to be easier for writing queries than hand coding them as I do it so infrequently.

pylua · 2024-01-14T22:31:52 1705271512

Yeah, same here. Seems like that approach is much simpler than this.

I guess the real benefit here is that you don’t need to understand the schemas so the knowledge is not lost when someone leaves a company.

Sort of an abstraction layer for the schemas

totalhack · 2024-01-15T03:10:47 1705288247

Sounds like you are describing a semantic layer. You don't need AI to achieve that, though it is fun when it works. Example of a semantic layer I made below, but there are others out there with more support behind them.

https://github.com/totalhack/zillion

EmilStenstrom · 2024-01-14T22:18:12 1705270692

Allow people that don't know SQL to query a database.

realanswe91 · 2024-01-14T22:54:45 1705272885

[flagged]

pylua · 2024-01-14T23:09:23 1705273763

The initial part about sql already being that bridge was my first reaction, too.

sherlock_h · 2024-01-15T13:41:30 1705326090

Here is one more: https://www.ycombinator.com/companies/patterns

markding00 · 2024-01-19T17:48:46 1705686526

also Brewit.ai (YC W23) https://brewit.ai/

pamelafox · 2024-01-14T20:22:30 1705263750

I love that this exists but I worry how it uses the term “train”, even in quotes, as I spend a lot of time explaining how RAG works and I try to emphasize that there is no training/fine-tuning involved. Just data preparation, chunking and vectorization as needed.

zainhoda · 2024-01-14T23:27:18 1705274838

Author of the package here. I would be open to alternative suggestions on terminology! The problem is that our typical user has never encountered RAG before.

pamelafox · 2024-01-20T19:17:11 1705778231

Hm we typically talk about that phase as “data ingestion”. We also use verbs extract, chunk, index, but those are parts of ingestion. Another phrase we use is “data preparation”. The tricky thing is that your train() method is on the vanna object, so if you called it “ingest”, it wouldn’t be clear where the data was headed. But I think it’d be clearer than “train” given that has such a specific meaning in this space. You could also look to LlamaIndex API for naming inspiration.

zacmps · 2024-01-15T03:36:09 1705289769

I've been using 'build' as in, 'build a dataset'. I think it gets the same idea across without being as easy to confuse with fine-tuning.

hrpnk · 2024-01-14T20:56:41 1705265801

Prompts are quite straightforward.

- OpenAI: https://github.com/vanna-ai/vanna/blob/a4cdf7593ac0c584f7d74...

- Mistral: https://github.com/vanna-ai/vanna/blob/a4cdf7593ac0c584f7d74...

peheje · 2024-01-14T21:19:53 1705267193

Many of these AI "products" - Is it just feeding text into LLMs in a structured manner?

wruza · 2024-01-15T05:19:26 1705295966

Why quotes. Your file manager is just feeding readdir into tableViewDataSource, and your video player is just feeding video files into ffmpeg and then connects its output to a frameBuffer control. Even your restaurant is just feeding vegetables into a dish. Most products "just" do something with existing technologies to remove the tedious parts.

okwhateverdude · 2024-01-14T22:02:20 1705269740

Basically, yeah. It is shockingly trivial to do, and yet like playing with alchemy when it comes to the prompting, especially if doing inference on the cheap like running smaller models. They can get distracted in your formatting, ordering, CAPITALIZATION, etc.

jedberg · 2024-01-14T19:18:46 1705259926

This is awesome. It's a quick turnkey way to get started with RAG using your own existing SQL database. Which to be honest is what most people really want when they say they "want ChatGPT for their business".

They just want a way to ask questions in prose and get an answer back, and this gets them a long way there.

Very cool!

zainhoda · 2024-01-15T00:04:16 1705277056

Author of the package here. Thank you! That's exactly what we've seen. Businesses spent millions of dollars getting all their structured data into a data warehouse and then it just sits there because the people who need the information don't know how to query the database.

qiller · 2024-01-14T23:02:01 1705273321

We did something similar for our reporting service which is based duckdb. Overall it works great, though we've ran into a few things:

* Even with low temperature, GPT-4 sometimes deviates from examples or schema. For example, sometimes it forgets to check one or another field...

* Our service hosts generic data, but customers ask to generate reports using their domain language (give me top 10 colors... what's a color?). So we need to teach customers to nudge the report generator a bit towards generic terms

* Debugging LLM prompts is just tricky... Customers can confuse the model pretty easily. We ended up exposing the "explained" generated query back to give some visibility of what's been used for the report

ignoramous · 2024-01-15T00:23:20 1705278200

Curious, as we're looking to build / use a similar setup.

> Debugging LLM prompts is just tricky... Customers can confuse the model pretty easily.

Would a RAG like how Vanna.ai uses, help?

> For example, sometimes it forgets to check one or another field

Do prompting techniques like CoT improve the outcome?

> So we need to teach customers to nudge the report generator a bit towards generic terms.

Did you folks experiment with building an Agent-like interface that asks more questions before the LLM finally answers?

qiller · 2024-01-15T02:05:53 1705284353

Our primary issue is that our DB is a dynamic Entity-Attribute-Value schema, even quite a bit denormalized at that. The model has to remember to do subqueries to retrieve "attributes" based on what's needed for the query and then combine them correctly.

NLQ is a somewhat new feature for us, so we don't have a great library to pull from for RAG. Experimenting, I found that having a few-shot examples with some CoT (showing examples of chaining attributes retrieval) sprinkled around did help a lot.

Even still, some queries come out quite ugly, but still functional. I'm thankful that DuckDB is a beast when tackling those :D

> Did you folks experiment with building an Agent-like interface that asks more questions before the LLM finally answers?

That's something I want to figure out next:

1) try to check if a generated query would work but would generate absolutely junk results (cause the model forgot to check something) and ask to rephrase

2) or show results (which may look "real" enough), but give an ability to tweak the prompt. A good example is something like "top 5 products on Cyber Monday" <- which returns 0 products, cause 2024 didn't happen yet, and should trigger a follow up.

totalhack · 2024-01-15T03:14:11 1705288451

Maybe you could utilize views to make your EAV schema more friendly for the LLM? Whether that's realistic depends on the specifics of your situation of course.

atomicvibe · 2024-01-15T20:03:03 1705348983

>Did you folks experiment with building an Agent-like interface that asks more questions before the LLM finally answers?

We built something similar to query DB. Created two versions, one of which was agent based that had a maker-checker style of generation. Basically one generates and the other checks its correctness and if objective has been acheived.

The accuracy improves in the agent driven framework, at the cost of latency.

swimwiththebeat · 2024-01-14T22:22:03 1705270923

I'm curious to see if people have tried this out with their datasets and seen success? I've been using similar techniques at work to build a bot that allows employees internally to talk to our structured datasets (a couple MySQL tables). It works kind of ok in practice, but there are a few challenges:

1. We have many enums and data types specific to our business that will never be in these foundation models. Those have to be manually defined and fed into the prompt as context also (i.e. the equivalent of adding documentation in Vanna.ai).

2. People can ask many kinds of questions that are time-related like 'how much demand was there in the past year?'. If you store your data in quarters, how would you prompt engineer the model to take into account the current time AND recognize it's the last 4 quarters? This has typically broken for me.

3. It took a LOT of sample and diverse example SQL queries in order for it to generate the right SQL queries for a set of plausible user questions (15-20 SQL queries for a single MySQL table). Given that users can ask anything, it has to be extremely robust. Requiring this much context for just a single table means it's difficult to scale to tens or hundreds of tables. I'm wondering if there's a more efficient way of doing this?

4. I've been using the Llama2 70B Gen model, but curious to know if other models work significantly better than this one in generating SQL queries?

qiller · 2024-01-14T23:07:41 1705273661

For 2. we ended up stuffing the prompt with examples for common date ranges "this month", "last year", "this year to date" and some date math, and examples of date fields (we have timestamp, and extracted Year, Month, Day, etc)

  Current date: `current_date()`
  3 days ago: `current_date() - INTERVAL 3 DAY`
  Beginning of this month: `date_trunc('month', current_date())`
  ...

4. I get best results with GPT-4, haven't tried Llama yet. 3.5 and 4-turbo tend to "forget" stuff for complex queries, but may be we need more tuning yet.

totalhack · 2024-01-15T03:15:33 1705288533

Have you considered a semantic layer instead, or does it have to be a natural language interface?

jonahx · 2024-01-14T20:28:38 1705264118

Is the architecture they use in this diagram currently the best way to train LLMs in general on custom data sets?

https://raw.githubusercontent.com/vanna-ai/vanna/main/img/va...

That is, store your trained custom data in vector db and then use RAG to retrieve relevant content and inject that into the prompt of the LLM the user is querying with?

As opposed to fine tuning or other methods?

firejake308 · 2024-01-14T20:34:01 1705264441

All the podcasts I've been listening to recommend RAG over fine-tuning. My intuition is that having the relevant knowledge in the context rather than the weights brings it closer to the outputs, thereby making it much more likely to provide accurate information and avoid hallucinations/confabulations.

zmmmmm · 2024-01-14T22:11:06 1705270266

> All the podcasts I've been listening to recommend RAG over fine-tuning

I'm always suspicious that is just because RAG is so much more accessible (both compute wise and in terms of expertise required). There's far more profit in selling something accessible to the masses to a lot of people than something only a niche group of users can do.

I think most people who do actual fine tuning would still probably then use RAG afterwards ...

zainhoda · 2024-01-15T03:14:04 1705288444

One added benefit of RAG is that it's more "pluggable." It's a lot easier to plug into newer LLMs that come out. If and when GPT-5 comes out, it'll be a one character change in your code to start using it and still maintain the same reference corpus.

benjaminwootton · 2024-01-14T21:27:23 1705267643

Do you have any podcasts you would reccomend with this type of content?

firejake308 · 2024-01-16T02:02:59 1705370579

I listen to Data Skeptic and Practical AI. It's interesting content, and I find Practical AI has guests who are particularly relevant to what I want to learn, but you should keep in mind that the guests are usually trying to promote their own products, so everything they say should be taken with a grain of salt

ajhai · 2024-01-14T20:42:57 1705264977

We can get a lot done with vector db + RAG before having to finetune or custom models. There are a lot of techniques to improve RAG performance. Captured a few of them a while back at https://llmstack.ai/blog/retrieval-augmented-generation.

331c8c71 · 2024-01-14T20:45:54 1705265154

Yes from what I gather. And just to emphasize there's no LLM (re)training involved at all.

kleiba · 2024-01-14T20:34:57 1705264497

Sorry, maybe I'm just too tired to see it, but how much control do you have over the SQL query that is generated by the AI? Is there a risk that it could access unwanted portions or, worse, delete parts of your data? (the AI equivalent of Bobby Tables, so to speak)

thih9 · 2024-01-14T20:40:33 1705264833

Why not give it access to relevant parts of the database only? And read only access too?

8organicbits · 2024-01-15T15:31:39 1705332699

Access management can be harder than building queries if you start getting fine grained.

thih9 · 2024-01-15T23:46:48 1705362408

Yes. Then again, in a project where queries are simpler than granting relevant access to a third party, I guess the AI chat product is just not a good fit.

htk · 2024-01-14T20:38:52 1705264732

I guess you could limit that with the correct user permissions.

bob1029 · 2024-01-14T21:20:03 1705267203

In some SQL providers, your can define rules that dynamically mask fields, suppress rows, etc. based upon connection-specific details (e.g. user or tenant ID).

So, you could have all connections from the LLM-enabled systems enforce masking of PII, whereas any back-office connections get to see unmasked data. Doing things at this level makes it very difficult to break out of the intended policy framework.

iuvcaw · 2024-01-14T22:22:57 1705270977

Guessing its intended use case is business analytic queries without write permissions —- particularly for non-programmers. I don’t think it’d be advisable to use something like this for app logic

zainhoda · 2024-01-15T00:02:17 1705276937

100% -- in fact originally the package used to parse out SELECT statements and only execute SELECT statements. After some feedback, we decided that the permissions on the user should handle that level of detail.

e12e · 2024-01-15T03:00:30 1705287630

You can select from a procedure that change data, though?

osigurdson · 2024-01-14T19:32:35 1705260755

I wish we had landed on a better acronym than RAG.

spencerchubb · 2024-01-14T22:16:08 1705270568

I'm pretty sure whoever coined the term just wanted to sound smart. Retrieval Augmented Generation is a fancy way to say "put data in the prompt"

hliyan · 2024-01-15T04:11:38 1705291898

Or rather, run a cosine similarity search on a large data set that won't fit in the prompt, find only the bits that are relevant to the query, and put that in the prompt.

arbot360 · 2024-01-14T21:16:45 1705267005

REALM (REtrieval Augmented Language Model) is a better acronym.

bdcravens · 2024-01-14T20:20:30 1705263630

Rags are used for cleaning, and this gives you a cleaner interface into your data :-)

nightski · 2024-01-14T19:57:39 1705262259

It doesn't matter, RAG is very temporary and will not be around long imho.

mediaman · 2024-01-14T21:47:23 1705268843

RAG, at its core, is a very human way of doing research, because RAG is essentially just building a search mechanism for a reasoning engine. Much like human research.

Your boss asks you to look into something, and you do it through a combination of structured and semantic research. Perhaps you get some books that look relevant, you use search tools to find information, you use structured databases to find data. Then you synthesize it into a response that's useful to answer the question.

People say RAG is temporary, that it's just a patch until "something else" is achieved.

I don't understand what technically is being proposed.

That the weights will just learn everything it needs to know? That is an awful way of knowing things, because it is difficult to update, difficult to cite, difficult to ground, and difficult to precisely manage weights.

That the context windows will get huge so retrieval will be unnecessary? That's an argument about chunking, not retrieval. Perhaps people could put 30,000 pages of documents into the context for every question. But there will always be tradeoffs between size and quality: you could run a smarter model with smaller contexts for the same money, so why, for a given budget, would you choose to stuff a dumber model with enormous quantities of unnecessary information, when you could get a better answer from a higher intelligence using more reasonably sized retrievals at the same cost?

Likewise, RAG is not just vector DBs, but (as in this case) the use of structured queries to analyze information, the use of search mechanisms to find information in giant unstructured corpuses (i.e., the Internet, corporate intranets, etc).

Because RAG is relatively similar to the way organic intelligence conducts research, I believe RAG is here for the long haul, but its methods will advance significantly and the way it gets information will change over time. Ultimately, achieving AGI is not about developing a system that "knows everything," but a system that can reason about anything, and dismissing RAG is to confuse the two objectives.

nightski · 2024-01-15T05:17:12 1705295832

That's the problem, it's just search. If search was the answer, Google would of achieved AGI long ago. The problem is there's no intelligence. In some situations it can find semantically similar content, but that's it. The intelligence is completely missing from the RAG mechanism, because it's not even part of the model itself.

zainhoda · 2024-01-15T03:19:15 1705288755

Yes, 100%! Can you turn this comment into a blog post so that I can send it to people who make this claim?

ren_engineer · 2024-01-14T20:21:29 1705263689

how else would you get private or recent data into an LLM without some form of RAG? The only aspect that might not be needed is the vector database

sroecker · 2024-01-14T20:00:31 1705262431

Care to enlighten us why?

nkozyra · 2024-01-14T20:05:02 1705262702

Most of this stuff is replaced within a calendar year and that will probably accelerate.

osigurdson · 2024-01-14T20:22:37 1705263757

It sounds dumb to me.

vinnymac · 2024-01-14T19:40:41 1705261241

Every single time I see it, I immediately think of Red Amber Green.

brnbrdg · 2024-01-22T19:02:01 1705950121

I think some of the doubters here are conflating a few things.

Any analyst will struggle if data quality is a mess, and there is a lack of clear semantic layer/useful documentation around how to query metrics/join tables from your warehouse. These are problems that affect human analysts as well as LLM based 'virtual' ones... However, these are separate problems being addressed by other players.

An llm powered chatbot that can consume adequate context from company systems should be able to perform on par with a junior analyst with a similar level of context. All else equal, the LLM analyst will be orders of magnitude cheaper and open up data analysis to non-sql-literate people (huge unlock).

teaearlgraycold · 2024-01-14T19:40:40 1705261240

I haven't loaded this up so maybe this has been accounted for, but I think a critical feature is tying the original SQL query to all artifacts generated by Vanna.

Vanna would be helpful for someone that knows SQL when they don't know the existing schema and business logic and also just to save time as a co-pilot. But the users that get the most value out of this are the ones without the ability to validate the generated SQL. Issues will occur - people will give incomplete definitions to the AI, the AI will reproduce some rookie mistake it saw 1,000,000 times in its training data (like failing to realize that by default a UNIQUE INDEX will consider NULL != NULL), etc. At least if all distributed assets can tie back to the query people will be able to retroactively verify the query.

zainhoda · 2024-01-14T23:45:32 1705275932

This is a good idea. I think what you'd want to do is override the generate_sql function and store the question with the "related" metadata and the generated sql somewhere:

https://github.com/vanna-ai/vanna/blob/main/src/vanna/base/b...

We're going to be adding a generic logging function soon and fairly soon what you're talking about could just be a custom logger.

adam_gyroscope · 2024-01-15T00:22:36 1705278156

We did this at bit.io and people loved it - there a bunch of articles we wrote on what we found during our work: https://innerjoin.bit.io/

We’ve since shut down (acquired by databricks) but happy to answer what I can.

zainhoda · 2024-01-15T00:32:21 1705278741

Super interesting! Why did you decide to shut down?

adam_gyroscope · 2024-01-15T02:13:41 1705284821

Acquired by databricks - I can’t really speak about what we’re working on.

benjaminwootton · 2024-01-14T21:22:27 1705267347

I built a demo of something similar, using LlamaIndex to query data as it streamed into ClickHouse.

I think this has a lot of real world potential, particularly when you move between the query and a GenAI task:

https://youtu.be/F3Eup8yQiQQ?si=pa_JrUbBNyvPXlV0

https://youtu.be/7G-VwZ_fC5M?si=TxDQgi-w5f41xRJL

I generally found this worked quite well. It was good at identifying which fields to query and how to build where clauses and aggregations. It could pull off simple joins but started to break down much past there.

I agree with the peer comment that being able to process and respond to error logs would make it more robust.

breadwinner · 2024-01-14T21:24:45 1705267485

I have seen good results from just describing the schema to ChatGPT-4 and then asking it to translate English to SQL. Does this work significantly better?

SOLAR_FIELDS · 2024-01-14T21:30:20 1705267820

That’s mostly what the products and libraries around this like llamaindex or Langchain are doing. If you look at the Langchain sql agent all it’s doing is chaining together a series of prompts that take the users initial query, attempt to take in a db and discover its schema on the fly and then execute queries against it based on that discovered schema, ensuring the result makes sense.

The tough part is doing this at scale as part of a fully automated solution (picture a slack bot hooked up to your data warehouse that just does all of that for you that you converse with). When you have tens or hundreds of tables with relationships and metadata in that schema and you want your AI to be able to unprompted walk all of them, you’re then basically doing some context window shenanigans and building complex state machines to walk that schema

Unfortunately that’s kind of what you need if you want to achieve the dream of just having a db that you can ask arbitrary questions to with no other knowledge of sql or how it works. Else the end user has to have some prior knowledge of the schema and db’s to get value from the LLM. Which somewhat reduces the audience for said chatbot if you have to do that

zainhoda · 2024-01-15T00:36:23 1705278983

That's basically what happens but the power is that in Python, you can do this:

sql = vn.generate_sql(question=...)

Which means that now the SQL can be executed and you can get the table, chart, etc in any interface.

Flask: https://github.com/vanna-ai/vanna-flask

Streamlit: https://github.com/vanna-ai/vanna-streamlit

Chainlit: https://github.com/vanna-ai/vanna-chainlit

Slack: https://github.com/vanna-ai/vanna-slack

esafak · 2024-01-14T20:03:15 1705262595

The nitty gritty: https://vanna.ai/blog/ai-sql-accuracy.html

kgdiem · 2024-01-15T00:45:32 1705279532

Thanks so much for making this and making it under MIT to boot.

I’ve been thinking about how to do this for about 6 months now and just started working on a demo for querying just one table // JSON column today.

I would feel a lot more comfortable with putting this (and/or my demo) into production if the database had been set up with a schema+db user per account rather than every tenant sharing just the one set of tables.

zainhoda · 2024-01-15T03:23:37 1705289017

Author of the package here. Apologies if this wasn't clear in the documentation, but what you're talking about is absolutely possible. Feel free to join our Discord -- we have other users who have this multi-tenant setup.

kgdiem · 2024-01-15T12:49:16 1705322956

I’m in! Thanks.

miohtama · 2024-01-14T22:25:56 1705271156

How about instead of making AI wrappers to over 50 years old SQL, we’d make a database query language that’s easier to read an write?

marginalia_nu · 2024-01-14T22:30:13 1705271413

In general, if something has been around for a very long time and nobody apparently seems to have thought to improve it, then odds are the reason is it's pretty good and genuinely hard to improve on.

runlaszlorun · 2024-01-15T00:26:30 1705278390

> pretty good and genuinely hard to improve on

I think that might be a bit more positive than I would be. Broadly speaking, I think you could say that the downsides of the legacy technology in question aren’t larger than the collective switching costs.

But I’d definitely agree that when something has been around that long, it’s prob not all bad.

aae42 · 2024-01-14T22:40:12 1705272012

in other words, SQL is a shark, not a dinosaur

oblio · 2024-01-15T15:12:26 1705331546

Well, if you think about it, a huge percentage of dinosaurs survived (birds), and for the rest, can you really fault them for going extinct when a humongous boulder hit the planet?

realanswe91 · 2024-01-14T22:58:09 1705273089

The Shark Query Language

fbdab103 · 2024-01-15T06:28:18 1705300098

Codd himself had loads of complaints about SQL.

SoftTalker · 2024-01-15T03:18:35 1705288715

Because SQL is a good query language, if you bother to really learn it.

drittich · 2024-01-15T02:37:48 1705286268

https://prql-lang.org/ might be an answer for this. As a cross-database pipelined language, it would allow RAG to be intermixed with the query, and the syntax may(?) be more reliable to generate

aitchnyu · 2024-01-15T04:46:54 1705294014

I'm watching Edgeql, I feel it could end the career of AI query generators, ORMs and low end BI.

https://www.edgedb.com/

neodymiumphish · 2024-01-14T22:39:50 1705271990

My fear with this approach is that the first implementation would be severely handicapped compared to SQL, amd it'd take years to support some one-off need for any organizational user, so it'd never be fully utilized.

holoduke · 2024-01-14T20:01:34 1705262494

It would be fun if you could actually train your raw sql and the llm output is the actual answer and not sql commands. In this way its just another language layer on top/in between of sql. Probably hurts efficiency and performance in the long run.

FrostKiwi · 2024-01-15T07:32:54 1705303974

A bit ironic, considering SQL statements were designed to read like English sentences.

crimbles · 2024-01-14T22:20:13 1705270813

I can't wait until it naively does a table scan on one of our several TB tables...

ajhai · 2024-01-14T20:38:51 1705264731

We have recently added support to query data from SingleStore to our agent framework, LLMStack (https://github.com/trypromptly/LLMStack). Out of the box performance performance when prompting with just the table schemas is pretty good with GPT-4.

The more domain specific knowledge needed for queries, the harder it has gotten in general. We've had good success `teaching` the model different concepts in relation to the dataset and giving it example questions and queries greatly improved performance.

hyuuu · 2024-01-15T01:10:30 1705281030

im curious to know how you get around the hallucinations? for example, for the query: "give me <products / sales / rows> that were created yesterday"

the llm hallucinates as to what "yesterday" means, there are other instances as well where the generated SQL is valid syntax-wise but not in intent. This is especially dangerous for aggregation queries such as MAX, COUNT, etc because it will spit out a number but is it the right number? and the only way to check is to read the SQL itself and verify, which defeats the whole purpose of it.

zainhoda · 2024-01-15T03:32:49 1705289569

That's a fair concern. In actual usage, however, the vast majority of hallucination (>90%) tends to be:

- phantom tables and columns, in which case the query will fail

- incorrect syntax for date functions (i.e. the wrong flavor of SQL)

And we tend to see less of this type of hallucination when there are lots of example SQL queries that have been "trained" into the RAG system.

jug · 2024-01-14T22:09:23 1705270163

I wonder if this supports spatial queries as in PostGIS, SpatiaLite, SQL Server Spatial as per the OGC standard?

I'm interested in integrating a user friendly natural language query tool for our GIS application.

I've looked at LangChain and the SQL chain before but I didn't feel it was robust enough for professional use. You needed to run an expensive GPT-4 backend to begin with and even then, it wasn't perfect. I think a major part of this is that it wasn't actually trained on the data like Vanna apparently does.

zainhoda · 2024-01-15T00:00:05 1705276805

Author of the package here. I think that it probably could handle that but may need more example SQL queries.

dcreater · 2024-01-15T02:35:17 1705286117

Perhaps an uninformed question, but why do we need an llm rather than a simpler natural language to SQL NLP translator? Wouldn't that be much more efficient and reliable?

zainhoda · 2024-01-15T02:50:45 1705287045

What would be missing from that is the ability to look up what tables are in the database, how they join together, what terminology the business uses, preferences around whether to exclude nulls from certain columns, etc.

This allows you to ask more of a “business question” like “who are the top 10 customers by sales?” and the LLM will be able to construct a query that joins the customers table to an orders table and get the results that you’re looking for.

With a simple NL to SQL, you’d have to say “join the customer table with the sales table, aggregate on the sales column from the orders table and limit the results to 10 rows” or something along those lines.

bobbylarrybobby · 2024-01-15T02:39:52 1705286392

People have been working on simple natural language processing algorithms since the 50‘s. If it were easy we'd have one by now.

l5870uoo9y · 2024-01-14T22:41:33 1705272093

Is there a list of SQL generations to see how it performs? This is a list of SQL examples using GTP-4 and the DVDrental database sample.

[1]: https://www.sqlai.ai/sql-examples

[2]: https://www.postgresqltutorial.com/postgresql-getting-starte...

altdataseller · 2024-01-14T19:44:42 1705261482

What's the origin behind the name Vanna?

zainhoda · 2024-01-14T23:36:57 1705275417

Author of the package here. It was originally because it's both a name and a finance term because I was originally using this to query a financial database.

https://www.thebalancemoney.com/vanna-explanation-of-the-opt...

booleandilemma · 2024-01-14T20:05:47 1705262747

Vanna White? It's the only Vanna I know.

https://en.wikipedia.org/wiki/Vanna_White

cpt100 · 2024-01-15T04:58:40 1705294720

I have tried things like this, they are good for debugging and asking simple questions but is really hard to train them to be good enough for production. You'll get enough frustrating results, that you'll abandon them soon.