> "will *kinda sorta* work" That sound more like an anti-feature resulting in un...

marcosdumay · on May 30, 2021

Most of the times, it works flawlessly, other times it fails on some inputs. And that defines how well your old code will behave is what you are using it for, so this property is known at design time and if you go for reliability (you certainly should, but most people don't) you can know before writing the program if you'll need to care about text encoding or not.

Either way, a complete rewrite of the text handling functionality should give you flawless functionality. At this point in time, all the important ecosystems that use UTF-8 are almost there.

This is very different from the other encodings where a complete rewrite of the text handling functionality is needed just not to fail every time. That made all the important ecosystems that used other encodings to get almost there much sooner, but there was an important period when everything was broken, and the improvements are much slower nowadays, because when you need to fix every aspect of something, iterations take much more labor.

masklinn · on May 30, 2021

> That sound more like an anti-feature resulting in unstable programs that almost work.

It's both. It will generally ignore non-ascii data, but that is very commonly something you don't care about, in which case it's a net advantage over plain not working at all.

kbumsik · on May 30, 2021

No, it definitely works without errors, as long as the UTF-8 text is in ASCII space.