Okay. I think you might be yelling at the wrong guy; the conclusion you seem to ...

zer00eyz · on Dec 16, 2023

> So, if you want to elicit specific behavior from them, one approach would be to ask yourself how you'd elicit that behavior from a human, and try that.

This doesn't seem that human: https://www.theregister.com/2023/12/01/chatgpt_poetry_ai/

How far removed is that from: Did you really name your son "Robert'); DROP TABLE Students;--" ?

I think that these issues probalisticly look like "human behavior", but they are leftover software bugs that have no been resolved by the alignment process.

> unless the operator holds unambiguous rights to all the data used for training...

So on the opposite end of the spectrum is this: https://www.techdirt.com/2007/10/16/once-again-with-feeling-...

Turning a lot of works into a vector space might transform them from "copyrightable work" to "facts about the connectivity of words". Does extracting the statistical value of a copyright work transform it? Is the statistical value intrinsic to the work or to language in general (the function of LLM's implies the latter).

monitron · on Dec 16, 2023

> This doesn't seem that human: https://www.theregister.com/2023/12/01/chatgpt_poetry_ai/

Agreed; that’s why I was very careful to say “one approach.” I suspect that technique exploits a feature of the LLM’s sampler that penalizes repetition. This simple rule is effective at stopping the model from going into linguistic loops, but appears to go wrong in the edge case where the only “correct” output is a loop.

There are certainly other approaches that work on an LLM that wouldn’t work on a human. Similar to how you might be able to get an autonomous car’s vision network to detect “stop sign” by showing it a field of what looks to us like random noise. This can be exploited for productive reasons too; I’ve seen LLM prompts that look like densely packed nonsense to me but have very helpful results.