
Pretty sure we address this issue in the paper/repository? Some of our demos rely on letting the LLM copy the injection into the final response, which gets around the issue of content in subprompts not being visible later on, depending on the chain-of-thought method used. I'm not sure if that is what you mean. There are ways of using these models safely; we're just saying that connecting them to anything at all can easily be unsafe. Even if your setup is not affected, almost all proposed use cases for LLMs are, since they rely on integration and context to provide the utility they promise.



It’s more like this: subprompts never inject the full context from a remote query back into the primary prompt. Subprompt completions are structured (via few-shot examples or a fine-tuned model), e.g. as JSON, which is then parsed. The main prompt orchestrates the subprompts and never even needs to process the results itself if there’s a Python or JS interpreter involved.
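
Concretely, a single subprompt call ends up looking roughly like this, as a sketch in TypeScript. The helper name and prompt wording are made up for illustration, not taken from the repo linked below:

    // Hypothetical LLM helper: returns the raw completion text.
    declare function callLLM(prompt: string): Promise<string>;

    // The subprompt is constrained (via few-shot examples or a
    // fine-tuned model) to answer only with JSON.
    async function citySubprompt(city: string): Promise<{ population: number }> {
      const completion = await callLLM(
        'Respond only with JSON of the form {"population": <number>}.\n' +
        `What is the population of ${city}?`
      );
      // The orchestrating code parses the structured completion; the raw
      // completion text is never concatenated back into the main prompt.
      return JSON.parse(completion);
    }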

Here’s the kind of approach I’ve been using:

https://github.com/williamcotton/empirical-philosophy/blob/m...

The initial call to the LLM returns a completion that includes JavaScript. There is no third-party data at this point. The generated JavaScript can include further calls to the LLM that return structured JSON, but once remote data has been fetched, no further calls are made to the LLM. This means that responses from remote queries are never sent to an LLM. The text presented to the user could still contain injected instructions to talk like a pirate, but all the user suffers is a surprisingly incorrect result.
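
A minimal sketch of that flow, assuming hypothetical helpers (callLLM and runSandboxed are stand-ins, and a real setup would need a properly isolated interpreter):

    declare function callLLM(prompt: string): Promise<string>;
    declare function runSandboxed(code: string): Promise<unknown>;

    async function answer(userQuestion: string): Promise<string> {
      // 1. The only thing in the first prompt is the user's question;
      //    the completion is JavaScript source, not prose.
      const generatedJs = await callLLM(
        `Write JavaScript that answers: ${userQuestion}`
      );

      // 2. The generated code runs in an interpreter. It may call the
      //    LLM again for structured JSON and may fetch remote data, but
      //    fetched data stays in ordinary variables and is never placed
      //    into another LLM prompt.
      const result = await runSandboxed(generatedJs);

      // 3. The result is shown to the user as-is. An injection hiding in
      //    the remote data can at worst make this answer wrong.
      return String(result);
    }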

Even with LangChain, the issue is really the chatbot UX. LangChain can also be used in ways that aren’t vulnerable to this problem.

Orthogonally, I don’t think chatbots are a very good UX in general, and there are much better ways to interact with an LLM. If anything, your work should accelerate that shift!



Sounds interesting, I'll be sure to have a look!



