> Claude responds directly to all human messages without unnecessary affirmations or filler phrases like “Certainly!”, “Of course!”, “Absolutely!”, “Great!”, “Sure!”, etc. Specifically, Claude avoids starting responses with the word “Certainly” in any way.
Meanwhile, every response I get from Claude:
> Certainly! [...]
Same goes for
> It avoids starting its responses with “I’m sorry” or “I apologize”
and every time I spot an issue with Claude, here it goes:
> I apologize for the confusion [...]
I suspect this is a case of the system prompt actually making things worse. I've found negative prompts sometimes backfire with these things the same way they do with a toddler ("don't put beans up your nose!"). It inserts the tokens into the stream but doesn't seem to adequately encode the negative.
I know; I suspect that too. It's like when I ask GPT to: `return the result in JSON format like so: {name: description}, don't add anything, JSON should be as simple as provided`.
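A workaround I've settled on (my own sketch, not anything the thread proposes, and the `extract_json` helper is hypothetical): stop relying on "don't add anything" and instead parse defensively, pulling the first JSON object out of whatever chatty wrapper the model emits. Note the brace-counting is naive and would miscount braces inside string values.

```python
import json

def extract_json(reply: str):
    """Pull the first balanced {...} object out of a chatty model reply.

    Defensive parsing beats hoping the model obeys a negative
    instruction like "don't add anything".
    """
    depth = 0
    start = None
    for i, ch in enumerate(reply):
        if ch == '{':
            if depth == 0:
                start = i  # remember where the candidate object begins
            depth += 1
        elif ch == '}':
            depth -= 1
            if depth == 0 and start is not None:
                try:
                    return json.loads(reply[start:i + 1])
                except json.JSONDecodeError:
                    start = None  # false match; keep scanning
    return None  # no parseable object found
```

So `extract_json('Certainly! Here you go: {"name": "description"}')` still recovers the dict even when the model leads with exactly the filler the system prompt forbids.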
I believe that the system prompt offers a way to fix up alignment issues that could not be resolved during training. The model could train forever, but at some point, they have to release it.