Hacker News new | past | comments | ask | show | jobs | submit login

Ever since I started listening to emma essex's music I have found just how half-baked Unicode handling is, even in current year. Some of my favorite examples are:

    "โŽ†", by "HHSU ๐“ƒš ๐•ฎ๐–†๐–’๐–‡๐–Ž๐–š๐–’, ๐•๐•ช๐•๐•–๐•ž,  ๐“—๐“ฎ๐“ช๐“ป๐“ฝ๐”€๐“ธ๐“ธ๐“ญ", from the album "๐…™๐…™" (U+1D159);

    "โ™ซโ™ซโ™ฉโ™ซโ€ฟโ™ฉ but it's ๆ€’้ฆ–้ ˜่œ‚ ๅคงๅพ€็”Ÿ";

    "๏ฝ’๏ฝ”๏ฝ’๏ฝƒ{๏ผˆ''ยป''๏ผ‰ยฒ๏ผ’}๏ผšโ‰ž๏ผˆ''ยป''๏ผ๏ผ‘๏ผ‰๏ผ›";



Apple music dislikes the song "โŽ†" so much that it's entirely missing from the album in my account.


This got me to look up a UTF-8 to unicode code point command line tool, which it turns out is "uconv -x hex/unicode". The first one looks like mostly mathematical alphanumeric symbols:

https://en.wikipedia.org/wiki/Mathematical_Alphanumeric_Symb...

I wonder if anyone has used characters in the unicode private use area. That could have interesting results.


I'm sure some of the characters got stripped out by hn.


I was surprised HN let ๐“ƒš through. Apparently not banning Egyptian hieroglyphs yet.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: