Ever since I started listening to emma essex's music, I have been finding out just how half-baked Unicode handling still is, even in current year.
Some of my favorite examples are:
"โ", by "HHSU ๐ ๐ฎ๐๐๐๐๐๐, ๐๐ช๐๐๐, ๐๐ฎ๐ช๐ป๐ฝ๐๐ธ๐ธ๐ญ", from the album "๐ ๐ " (U+1D159);
"โซโซโฉโซโฟโฉ but it's ๆ้ฆ้ ่ ๅคงๅพ็";
"๏ฝ๏ฝ๏ฝ๏ฝ{๏ผ''ยป''๏ผยฒ๏ผ}๏ผโ๏ผ''ยป''๏ผ๏ผ๏ผ๏ผ";
This got me to look up a command-line tool for converting UTF-8 text to Unicode code points, which, it turns out, is "uconv -x hex/unicode" (uconv ships with ICU). The first title looks like it is mostly mathematical alphanumeric symbols: