Yeah, that's a massive problem with the natural language domain all across machine learning.
Unfortunately it's very difficult to track down training data for chess commentary in the first place, let alone trim down biases. For reference, I was able to gather about 1 million samples, but it really needs a billion.
Hopefully through data augmentation and better general intelligence models we can make better progress on bias issues soon, as that's a huge problem when we start trusting AI models too much in life.
You might be able to kludge a fix to tokenize the output and replace he/him/she/her with them/their. It's not as sexy as the engine outputting the correct words, but it should get the job done.
Yes, in this case, as long as the replaced pronouns still agree grammatically when the commentary actually names people, I don't think it would be too difficult. There may be factors I'm not considering though.
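A minimal sketch of that post-processing kludge, assuming a simple token-level swap (names like `neutralize_pronouns` are made up for illustration; note it leaves verb agreement like "he is" → "they are" unfixed, which is part of why it's not as clean as the engine getting it right):

```python
import re

# Hypothetical post-processing pass: swap gendered pronouns for neutral ones.
# Token-level only -- verb agreement ("he is" -> "they are") would need
# extra handling, as would the ambiguity of "her" (object vs. possessive).
PRONOUN_MAP = {
    "he": "they",
    "him": "them",
    "his": "their",
    "she": "they",
    "her": "their",  # assumption: treat "her" as possessive
}

def neutralize_pronouns(comment: str) -> str:
    def repl(match):
        word = match.group(0)
        sub = PRONOUN_MAP[word.lower()]
        # Preserve sentence-initial capitalization.
        return sub.capitalize() if word[0].isupper() else sub
    pattern = r"\b(" + "|".join(PRONOUN_MAP) + r")\b"
    return re.sub(pattern, repl, comment, flags=re.IGNORECASE)

print(neutralize_pronouns("He sacrifices his bishop."))
# -> "They sacrifices their bishop."  (verb agreement left unfixed)
```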
Harder would be more general models like GPT-2 and GPT-3.
Sometimes it seems really accurate (like the cherry-picked GIF in the overview docs) and sometimes really off.
I think for the most part, it knows more than it lets on, but finding the right sampling methods (or better yet, generalized search) to generate the best comments is a tough problem because it's difficult to evaluate quality.
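For context, a typical sampling knob in this space looks something like temperature plus top-k (illustrative only; this isn't the decoding code of the model discussed here, and the function name is made up):

```python
import math
import random

def sample_next_token(logits, temperature=0.8, top_k=40):
    """Pick a next-token index via temperature + top-k sampling.

    Illustrative sketch: restrict to the top_k highest-scoring tokens,
    then draw from a temperature-scaled softmax over the survivors.
    """
    # Indices of the top_k highest-scoring tokens.
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:top_k]
    # Temperature-scaled softmax (shifted by the max for numerical stability).
    scaled = [logits[i] / temperature for i in top]
    m = max(scaled)
    weights = [math.exp(s - m) for s in scaled]
    return random.choices(top, weights=weights, k=1)[0]
```

Lower temperature or smaller top_k makes the output safer but blander; the hard part the comment above points at is that there's no cheap objective function telling you which sampled comment is actually good.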