
How are Windows and Java, which are somewhat tied to 16-bit Unicode, handling this? It used to be that the astral planes didn't matter much, but now they do.



That's what surrogate pairs are for. [1] You're no longer working with one code point per character, but even with 32-bit Unicode there's no real guarantee of that: consider combining characters, accents, emoji skin-tone modifiers, and so on.

[1] https://msdn.microsoft.com/en-us/library/windows/desktop/dd3...
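Since the thread mentions Java's 16-bit strings, here is a minimal sketch of how a surrogate pair looks from Java's `String` API, using U+1F600 (an astral-plane emoji) as the example character:

```java
public class SurrogateDemo {
    public static void main(String[] args) {
        // U+1F600 lies above the BMP, so UTF-16 stores it as a surrogate pair.
        String s = new String(Character.toChars(0x1F600));

        // length() counts UTF-16 code units, not code points.
        System.out.println(s.length());                      // 2
        System.out.println(s.codePointCount(0, s.length())); // 1

        // The two char values are the high and low surrogates.
        System.out.println(Integer.toHexString(s.charAt(0))); // d83d
        System.out.println(Integer.toHexString(s.charAt(1))); // de00
    }
}
```

This is why code that indexes strings by `charAt` can silently split a character in half once astral-plane text shows up; `codePointAt` and `codePoints()` are the pair-aware alternatives.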


Soon 20 bits won't be enough, either, and every Unicode program out there will break :-(


Unicode is 21 bits wide, and there's lots of space left. Emoji still make up very little of the total encoded characters compared to “normal” human writing systems. (And I'd argue that emoji are by now a normal addition to writing, considering how many people use them daily and should be glad to have them interoperable across different platforms, carriers, and devices — something that wasn't the case before.)
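The arithmetic behind that 21-bit figure can be checked directly: surrogate pairs contribute 10 + 10 = 20 bits above the Basic Multilingual Plane, which together with the BMP gives the full code space ending at U+10FFFF. A small sketch:

```java
public class CodeSpaceDemo {
    public static void main(String[] args) {
        // Each surrogate half covers a 0x400-wide range, so pairs
        // address 0x400 * 0x400 supplementary code points (planes 1-16).
        int supplementary = 0x400 * 0x400; // 1,048,576
        int bmp = 0x10000;                 // 65,536 code points in plane 0

        System.out.println(supplementary + bmp);                           // 1114112
        System.out.println(Integer.toHexString(Character.MAX_CODE_POINT)); // 10ffff
    }
}
```

So the code space is fixed by the UTF-16 encoding scheme itself: it can never grow past U+10FFFF without breaking surrogate-pair encoding, which is why "soon 20 bits won't be enough" misreads the design.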



