It's interesting, and a bit concerning, that it's so hard to control LLMs from d...

sanxiyn · 2024-03-02T11:37:15 1709379435

It is concerning, but I am not sure whether it is more concerning than that it's so hard to write a web browser that doesn't execute arbitrary code. Security is like that, and security is especially hard when the system is featureful like web browsers and LLMs.

layer8 · 2024-03-02T12:04:51 1709381091

The issue is that with LLMs it's fundamentally impossible to have a "prepared statement" (the database query concept), whereas a web browser has no problem in principle being a safe sandbox. With LLMs, we have no idea how to make them safe even in principle. This has nothing to do with "security is hard" hand-waving.

Emiledel · 2024-03-10T02:15:48 1710036948

I'm excited to share that this is already supported, and I highly recommend leveraging it for safer application deployments. https://platform.openai.com/docs/guides/function-calling

teddyh · 2024-03-02T19:19:33 1709407173

> hard to write a web browser that doesn't execute arbitrary code

It would be easy if only we could define what “code” and “execute” means. The problem is, we can’t. Data is code and code is data. Doing things depending on data is fundamentally the same as executing code.

spacebanana7 · 2024-03-02T13:25:50 1709385950

I reckon this might push app developers to use LLMs locally in the client.

So that even a maliciously behaving LLM can’t cause much damage.

snowfield · 2024-03-02T10:43:28 1709376208

I mean in my mind, the partial point of llm is that you don't control the output. You control the input.

Wanting an generative AI and wanting to cover what it says is like having your cake and eating it too

layer8 · 2024-03-02T12:08:03 1709381283

You want to control certain aspects of the output, and only leave the rest up to the GAI. The issue is that AI models don’t have a reliable mechanism for doing so.

ben_w · 2024-03-03T09:32:38 1709458358

That's not a fundamental limitation of the models, even if it's present in the products running on those models — if you want to populate a database from an LLM, you can constrain the output at each step to be only from the subset of tokens which would be valid at that point.

Emiledel · 2024-03-10T02:29:03 1710037743

functions work fairly well for that https://platform.openai.com/docs/guides/function-calling

29athrowaway · 2024-03-02T18:13:16 1709403196

You control the output during training so no.

And even for humans, we have mechanisms to control their output when they get confused.

andreasmetsala · 2024-03-03T09:31:19 1709458279

> And even for humans, we have mechanisms to control their output when they get confused.

What mechanisms do you mean? I don’t think it’s feasible to use hunger and fear of dismissal to control an instance of an LLM.