An excellent article, although: > “Ü” is a single grapheme cluster, even though ...

colejohnson66 · on March 26, 2021

Does anyone know the history behind why there’s two ways to “encode” things like that? What’s the rationale for having both combining and precombined codepoints?

bombcar · on March 26, 2021

I believe a lot of the "combined" characters are (basically) from importing old codepages directly into Unicode, and they did that so it would be a simple formula to convert from the various codepages in use.

I may be wrong however.