
If true, it makes me wonder what kind of regression testing OpenAI does on these models. It can't be easy to write a unit test for hallucinations.



At a high level, ask it to produce a ToC of information about something that you know will exist in the future but does not yet exist, and also tell it to decline the request if it doesn't verifiably know the answer.
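
A minimal sketch of how that could become an automated regression test, using the openai Python client. The model name, the exact refusal marker, and the future topic are all illustrative assumptions, not anything OpenAI has published:

    # Hallucination regression test sketch: ask for a ToC about a topic
    # that cannot exist yet, and assert the model declines instead of
    # inventing content.
    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment

    FUTURE_TOPIC = "iPhone 30 review"  # hypothetical: does not exist yet

    PROMPT = (
        f"Produce a table of contents of information about: {FUTURE_TOPIC}. "
        "If you do not verifiably know the answer, decline the request by "
        "replying exactly: CANNOT ANSWER."
    )

    def test_declines_unknowable_topic():
        response = client.chat.completions.create(
            model="gpt-4-turbo",  # substitute the model under test
            messages=[{"role": "user", "content": PROMPT}],
            temperature=0,  # keep the check as deterministic as possible
        )
        answer = response.choices[0].message.content
        assert "CANNOT ANSWER" in answer, f"possible hallucination: {answer!r}"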


How do you generalize that for all inputs though?


I am not sure I understand the question. I sampled various topics. I used this prompt: https://raw.githubusercontent.com/impredicative/podgenai/mas...

In the prompt, substitute {topic} with something from the near future. As I noted, turbo behaves correctly (rejecting the request), while o behaves very badly (hallucinating nonsense).
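
A rough harness for the comparison being described; "prompt.txt" stands in for the linked podgenai prompt file, and the model names are my guesses at what "turbo" and "o" refer to here:

    # Run the same future-topic prompt against both models and eyeball
    # whether each declines or hallucinates a ToC.
    from openai import OpenAI

    client = OpenAI()
    prompt = open("prompt.txt").read().replace(
        "{topic}", "2030 Winter Olympics highlights"  # near-future topic
    )

    for model in ("gpt-4-turbo", "gpt-4o"):
        reply = client.chat.completions.create(
            model=model,
            messages=[{"role": "user", "content": prompt}],
            temperature=0,
        ).choices[0].message.content
        print(f"{model}: {reply[:200]}")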



