I’ve played around with this idea a bit. It’s a very interesting experiment. You can have a “supervisor” that looks at the input and the output and judges how well the question has been answered. You can put this in a loop with the supervisor giving hints on how to improve the answers.
This is very similar to how things like auto GPT work.
This is very similar to how things like auto GPT work.