We need to hold the line somewhere. Preferably before corporate logos get into U...

WalterBright · on Oct 4, 2016

Unicode is turning into a few useful characters amid a sea of junk. This will continue as long as people acquire status by getting "their" symbol(s) into Unicode. I don't see any way this can change.

Animats · on Oct 4, 2016

How are Windows and Java, which are somewhat tied to 16-bit Unicode, handling this? It used to be that the astral planes didn't matter much, but now they do.

j4_james · on Oct 4, 2016

That's what surrogate pairs are for. [1] You're no longer working with one code point per character, but even with 32-bit Unicode there's no real guarantee of that (consider things like combining characters, accents, emoji skin tones, etc.)

[1] https://msdn.microsoft.com/en-us/library/windows/desktop/dd3...

WalterBright · on Oct 5, 2016

Soon 20 bits won't be enough, either, and every Unicode program out there will break :-(

ygra · on Oct 5, 2016

Unicode is 21 bits wide. And there's lots of space left. Heck, Emoji still make up very little of the total encoded characters, compared to “normal” human writing systems. (And I'd argue that emoji are by now a normal addition to writing, considering how many people use them daily and can be glad to have them interoperable across different platforms, carriers, and devices. Something that hasn't been that way previously.)

ygra · on Oct 4, 2016

Logos can never be encoded because of trademark concerns. So you're safe, there won't ever be a Facebook or Twitter code point.

Skin tone modifiers work pretty much like diacritics already do. It's not complicated and most of the support relies on the font anyway.