Odd how many of those instructions are almost always ignored (e.g., "don't apologize," "don't explain code without being asked"). What is even the point of these system prompts if they're so weak?
It's common for neural networks to struggle with negative prompting. Typically it works better to phrase expectations positively, e.g. “be brief” might work better than “do not write long replies”.
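As a rough sketch of what that looks like in practice (the prompt wording and model id here are my own illustrative assumptions, not Anthropic's actual system prompt text), the only thing that changes between the two styles is how the `system` string is phrased:

```python
# Illustrative sketch only: prompt wording and model id are assumptions,
# not Anthropic's actual system prompt text.
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

# Negatively phrased instruction: tells the model what NOT to do.
negative_system = "Do not write long replies. Do not apologize."

# Positively phrased instruction: states the desired behavior directly.
positive_system = "Be brief. State refusals plainly and move on."

def ask(system_prompt: str, question: str) -> str:
    """Send the same question under a given system prompt and return the reply text."""
    message = client.messages.create(
        model="claude-3-5-sonnet-20241022",  # assumed model id; substitute any available one
        max_tokens=300,
        system=system_prompt,
        messages=[{"role": "user", "content": question}],
    )
    return message.content[0].text

# Running the same question under both phrasings makes the difference easy to eyeball.
if __name__ == "__main__":
    q = "Can you explain what a mutex is?"
    print("negative phrasing:\n", ask(negative_system, q), "\n")
    print("positive phrasing:\n", ask(positive_system, q))
```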
But surely Anthropic knows better than almost anyone on the planet what does and doesn't work well to shape Claude's responses. I'm curious why they're choosing to write these prompts at all.
I’ve previously noticed that Claude is far less apologetic and more assertive when refusing requests compared to other AIs. I think the answer is simply that they’re fine with nudging the behavior in that direction rather than guaranteeing it. The section on pretending not to recognize faces implies they’d take a much more extensive approach if they really wanted to make something never happen.
It lowers the probability. It's well known that LLMs are only imperfectly reliable at following instructions -- part of the reason "agent" projects haven't succeeded so far.