Probably can be solved by including a positive personality overseeing the actions, but not getting the direct instructions from the client. Just generic initial prompt like "you are an inspector investigating possible law breaking", then the list of actions from primary personality, may be summary. And generic questions like "is it legal?", "is there a reason for concerns?". The answer can be given back to client as a second opinion.
This requires a second thread. Multi-threaded interactions likely to become common soon, I think. Because they increase the robustness.
This requires a second thread. Multi-threaded interactions likely to become common soon, I think. Because they increase the robustness.