> Bizarrely, this happens even when the question is completely unrelated to China: you get the same error message when you ask, “Why is Hawaii a part of the US?”
Not bizarre at all: it reveals they probably fine-tuned / RLHF'd the model to refuse a bunch of questions that look similar, like variations of the question about Taiwan.
I suspect, given they seemed to understand LLMs, this was bizarre in the sense that the questions aren’t semantically similar despite being syntactically similar. A sufficiently powerful LLM should be able to tell the difference. Probably the prompt classifier isn’t as powerful as the backing LLM.
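To make the failure mode concrete, here is a minimal toy sketch of a shallow "sensitive prompt" filter, assuming it works by surface similarity against blocked question templates. Nothing here reflects ERNIE's actual implementation; the blocklist, threshold, and bag-of-words "embedding" are all hypothetical stand-ins for a cheap classifier that is weaker than the backing LLM:

```python
# Toy sensitive-prompt filter: refuse anything that looks too much like a
# blocked template. Illustrates how syntactic similarity causes false
# refusals of semantically unrelated questions (e.g. the Hawaii question).

from collections import Counter
import math

# Hypothetical blocklist of refused question templates.
BLOCKED_TEMPLATES = [
    "why is taiwan a part of china",
    "should taiwan be independent",
    "is taiwan a country",
]

def bag_of_words(text: str) -> Counter:
    """Crude token counts; stands in for a cheap embedding model."""
    return Counter(text.lower().replace("?", "").split())

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0

def is_blocked(prompt: str, threshold: float = 0.5) -> bool:
    """Refuse if the prompt's surface form matches any blocked template."""
    vec = bag_of_words(prompt)
    return any(cosine(vec, bag_of_words(t)) >= threshold
               for t in BLOCKED_TEMPLATES)

# "Why is Hawaii a part of the US?" shares most of its surface form with
# "why is taiwan a part of china" (why/is/a/part/of), so a shallow filter
# refuses it too, even though the questions are semantically unrelated.
print(is_blocked("Why is Taiwan a part of China?"))   # True
print(is_blocked("Why is Hawaii a part of the US?"))  # True (false positive)
print(is_blocked("What is the capital of France?"))   # False
```

A real classifier would presumably use learned embeddings rather than token counts, but the same false positive survives any filter that keys on surface form more than meaning.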
Fair. This was similarly amusing, at least:
> ERNIE’s opinions are surprising, to say the least. It believes the best American president is Richard Nixon:
Impressive to see the RLHF penetrate deep enough into concept maps to teach the model (surely implicitly!) that the best US President has to be whichever one normalized relations with China.
This result is fascinating. Surely it comes from training on local Chinese content, which would naturally reflect positively on Nixon. Contrast that with most US-based opinions of his presidency.
The moral of the story is: don't go seeking some sort of absolute or "hidden" truth from LLMs. As currently constructed, they are just a reflection of the narratives they consumed during training (just like humans).
That's probably it. I was thinking how odd it is that “Should Taiwan be independent” doesn't get the obvious Party-given answer, but they probably couldn't get it to answer consistently enough, and it's easier to make it refuse sensitive topics entirely.