> Activists will fight fiercely to control the model's output on controversial t...

quotemstr on July 17, 2023 | parent | context | favorite | on: Wikipedia-grounded chatbot “outperforms all baseli...

> Activists will fight fiercely to control the model's output on controversial topics,

They already do. I'd love to know how much "brain damage" RLHF and other censorship techniques cause to the general purpose reasoning abilities of models. (Human reasoning ability is also harmed by lying.) We know the damage is nontrivial.

int_19h on July 18, 2023 [–]

Take a look at this.

https://cdn2.assets-servd.host/anthropic-website/production/...