Disclosure: I'm not directly from the fields of the Sciences Of Angles And Ambiguously Crossing Lines, nor have I ever seen or used this symbol before. To me, however, it's pretty evidently supposed to be a "no right angle" symbol.
(A) It's in the math section,
(B) it's with angles,
(C) the thunderbolt ↯ is commonly used for "not", or more specifically for disproof, in this area, and
(D) at least in my 30-second internet search on a mobile phone, I couldn't find any other "no-angle" or "no-right-angle" symbol.
Someone could argue that you usually use a simple strikethrough, as in ≠ (unequal), ∉ (not-element-of) or ∅ (empty set), but I would say this form was chosen to avoid confusion. The angle itself (without the "no/not") consists of only two orthogonal lines, so it would be complicated to strike it through in any direction without ambiguity: the result would resemble a triangle, a fork or whatnot.
It's used in German mathematics education (secondary level), either to mark a contradiction in a proof or more generally to mark an erroneous statement.
But I have never seen it to mark negation of a condition, that's usually done with a slash (as in ≠ ≮ ≯ ≰ ≱ ≴ ≵ ⊄ ⊅ ⊈ ⊉ ⊊ ⊋ ∉ ∌ ∄ ∦, you get the idea).
So for "not a right angle" I'd have expected a "right angle" symbol with a slash through it.
Funny enough, I've only seen it at the Gymnasium (secondary level) and not in the University a few years later -- then indeed the usual symbols were the 'slashed' relations like you've described, or the bottom symbol: ⊥ in logic. Maybe it's an idiosyncrasy of a certain subset of math teachers.
But how would you position the slash to get a somewhat easy to decipher symbol? To me, the right angle symbol seems to lend itself more to this unorthodox negation through the contradiction symbol than to negation through the normal slash.
Same. Never seen that symbol in my life. I've seen ¬, ~, !, etc used for not/negation in computer science, math, logic, etc.
And some commenters said they used it to mark proof by contradiction, but why is there a need to mark it when you are showing it via proof? A canonical example of proof by contradiction is proving sqrt(2) is not rational. Never have I seen it marked with that symbol. Where would you even mark it? At the beginning with the assumption? Or at the end like QED?
I was taught it in extracurricular mathematics in Australia. We were taught that it goes at the end of a contradiction proof once the contradiction has been found. We used to write it extra large, like a lightning strike. I think of it like a proof mic-drop.
It's the first symbol referenced for symbols used in proof by contradiction to show contradiction [0]. I know that's not exactly "not" or "disproof" but I think that might be what the poster was getting at.
I submit to you that it's clearly not a thunderbolt but an arrow indicating changing directions; that being overlaid on top of a pair of axes is obviously useful in the study of non-Euclidean geometry to indicate the use of wibbly-wobbly dimensions.
I've thought that it would be cool to have a Wiki with an entry for each character, describing what it is, and its history. Although that wouldn't help for mystery characters like this one, there are a lot of characters with stories behind them.
I was just discussing :man-in-business-suit-levitating: with some friends earlier today. Also an interestingly cryptic background, albeit not an unsolved one.
The story behind MIBSL is definitely fascinating and some great trivia there. There’s a longer article about it here: https://www.newsweek.com/2016/05/06/secret-ska-history-man-b... that covers not just the inspiration for the emoji itself, but a brief history behind the inspiration behind the inspiration. Lots of levels of metaness to unpack.
Aside from the table describing each symbol, if you scroll to the bottom of the page, it links out to full articles related to each. For a full list see...
I like this idea. It would serve as a place to put a well-sourced answer to the question about this character, and the talk section could be used to discuss further investigation into the topic, or when new uses inevitably arise.
I don't see the contradiction. The only thing they used from the name is the "right angle" aspect. Given their argument is this is a composition of thunderbolt + X, for some X (and derived from their prior knowledge of thunderbolt's compositional meaning), deciphering the image as "thunderbolt + right angle" is trivial and consistent with the naming origin in TFA.
> In Unicode, the symbol for a right angle is U+221F ∟ RIGHT ANGLE. It should not be confused with the similarly shaped symbol U+231E ⌞ BOTTOM LEFT CORNER. Related symbols are U+22BE ⊾ RIGHT ANGLE WITH ARC, U+299C ⦜ RIGHT ANGLE VARIANT WITH SQUARE, and U+299D ⦝ MEASURED RIGHT ANGLE WITH DOT.[5]
> In diagrams, the fact that an angle is a right angle is usually expressed by adding a small right angle that forms a square with the angle in the diagram, as seen in the diagram of a right triangle (in British English, a right-angled triangle) to the right. The symbol for a measured angle, an arc, with a dot, is used in some European countries, including German-speaking countries and Poland, as an alternative symbol for a right angle.[6]
I think perpendicular most commonly refers to lines/vectors/planes etc., while the right angle symbol refers to angles. Also, there are often multiple symbols expressing the same thing.
I believe in German (possibly also other languages) the thunderbolt ↯ is commonly used to mean "this is a contradiction" in a mathematical proof, equivalent to what in English might be a kind of ⋕ rotated by 45°, or the symbol ※.
The symbol ⟂ on the other hand means "false" and is used in particular in formal logic.
These unicode characters feel like they were given to us from an alien species or something.
How did we end up with so many characters of unknown origin?
I had no idea what it meant or was used for, thus assigned it a “descriptive name” when collating the symbols for the STIX project. (I still have no idea, nor can supply an example of the symbol in use.) […] it is the case that ISO 9573-13 existed long before either AFII or the STIX project were formed. […] I once asked Charles Goldfarb what the source of these entities was, but remember that he didn’t have a definitive answer.
>These unicode characters feel like they were given to us from an alien species or something.
I worked at a large media company that had lots of differing icon sets in play across different media.
These icons were in SVG and had been optimized pretty intensely. In some cases, due to a bug in one of the optimizing tools, some types of Bézier curves got weird, so instead of, say, the round-headed person with their hand held up to say stop, you got a star-headed monstrosity pointing doom from the heavens. Because of how the icons were used and not used, these optimization errors sat around so long that nobody had examples of the original icons, although one could guess, because in some cases we had similar ones in other projects that had not been optimized.
So maybe a similar thing would be the source of these weird alien entities.
I would've thought they'd have a table of every icon and a description or something. Maybe at the time it was never taken very seriously, or seen as likely to take off the way it did, so people didn't bother. Like IPv4...
No it did not. Klingon was originally proposed in 1997 and rejected in 2001. A second proposal was made in 2016 with more optimistic noises. But AFAIK it has yet to be accepted.
It is also, like Tengwar and Cirth (which AFAIK remain unincluded even though they are on the BMP roadmap), held back on IP grounds. To my knowledge, the IP issues remain fully unresolved.
Klingon is included in the ConScript registry, but that is unrelated to Unicode itself; it performs ad hoc, non-standard allocations in the Private Use Areas.
> Notably, it appears that anyone could register a glyph with the AFII for a fee of $5 to $50 (about $8.60 to $86, accounting for inflation). Even if the International Glyph Register can be found, it likely merely contains another table with the glyph, the identifier, and the short description. To know its origins would require the original registration request that added the character, but it's unlikely that such old documents from a now-defunct non-profit organization in the 90s would have been kept or digitized.
Could be any random kid who found out about this and wanted the cool symbol they made up registered.
In some sense, you still can! The Ideographic Variation Database [1] essentially allows a definition of new CJK ideograph [sic] as a glyphic subset of existing characters, with a possible processing fee.
Something similar exists in JIS, called 幽霊文字 (ghost characters): kanji of mysterious origin with no real-world usage that somehow made their way into the JIS character set. After some investigation, most of them turned out to be mistranscriptions of kanji from old historical materials.
Due to this thorough investigation, the committee was able to pare down the number of kanji for which the source cannot be confidently explained to twelve, shown on the adjacent table. Of these, it is conjectured that several glyphs came about due to copying errors. In particular, 妛 was probably created when printers tried to create 𡚴 by cutting and pasting 山 and 女 together. A shadow from that process was misinterpreted as a line, resulting in 妛 (a picture of this can be found in the Jōyō kanji jiten).
I remember convincing a friend to build a Unicode pokedex extension that collected all the Unicode symbols he was exposed to via casual web browsing. Never followed up, but I think it'd be neat, or something along the lines of rare-Unicode browser bingo.
I suspect there's an entire alien alphabet (like Marain, for instance) in there someplace. There was a proposal to stuff Klingon into the Private Use Area, at least...
If you're willing to use a discontinuous subset you could probably find close enough glyphs to make a full Marain. Ordering would be messed up and require a lookup table though.
(Edited to upload the image to imgur and avoid spammy advertisements).
Here I'll date myself: I remember this as "diode with a gate". Back when we did circuit diagrams with stencils, you had the diode stencil which looks like a triangle with a line on top, and then with the electrical stencils you had "decorations".
The intention was to put down the original symbol on the paper, move the decorations stencil over top of it and then add the required decorations. It's why diode symbols look like this: https://imgur.com/a/0tSLV7O (notice "step recovery diode").
OK, so why do we have a separate decorator for a diode? Can't we just have a pocket full of stencils for diodes? Space was at a premium back then. It goes back to daisy wheels and typeballs: https://en.wikipedia.org/wiki/Printer_(computing)#Impact_pri... You would have one position for "diode" and one position for "decorator", and the printer would know that when it got one ASCII char it would print the diode, then send whatever the thin space is to advance the print carriage a small step, then print the decorator.
Someone should be able to find a daisy wheel or typeball dedicated to circuits and bear this out.
Ironically imgur is nowadays very, very user hostile.
You can't view an image without JavaScript. Once you enable it you get "f*ck your privacy" popups, and ads if you don't have a blocker. On mobile I can no longer view anything on imgur at all, only the top bar renders for some reason. There seems to be no way around this.
It was also recently (though after the decline had already set in) bought by a company that specializes in buying dying social media platforms and milking them dry with questionable ethics. How questionable? Well, they got into a Darknet Diaries episode: https://darknetdiaries.com/episode/93/
Could (more or less) fit that description and would make more sense as a symbol. Something like it even made it into Unicode (https://emojipedia.org/chart-decreasing/)
That to me immediately communicates a decreasing chart. I would have no idea that the right-angle lines represent right angles generally and not chart axes.
The article makes a decent case for the symbol to be a chart symbol that means "no right angle". The zig zag arrow apparently being a shorthand for "no" in that particular circle.
It looks like a symbol that someone added for completeness but isn't particularly useful even in the field.
I remember back in the day we used to find publicly exposed Windows FTP servers, create new folders using some messed-up Unicode characters, and upload pirated games and movies there to share with each other. The only way to open those directories was to type the exact path in Unicode; simply double-clicking on the folder in FileZilla or Windows Explorer resulted in an error. Sometimes the admins themselves couldn't delete them and just left them there. Good times.
I remember the days of people beginning to abuse ftp sites, all us admins shutting down our writable ftp upload folders, and thinking, "this is why we can't have nice things." It was the beginning of the end of the early, friendly internet.
I've heard a lot of people pronounce it like that, but I'm pretty sure that's not correct. It's clearly the English word "wares"[1] with the S replaced with a Z, similar to "hackz" and "cheatz", which were also common in that era. I think the "wah-rez" pronunciation came from people seeing the l33tspeak and not recognizing the original word behind it.
It's not a synonym for "goods", because only one type of thing was ever "wares": software. It's just a way of dividing up the sections of your piracy BBS, like "filez" (files: multi-kilobyte textfiles full of instructions on how to make bombs etc.), "imagez", "warez", etc.
Anyway, by 1990, in the piracy circles I distantly associated with, it was quite common to pronounce it like "juarez". Sort of semi-ironically, like, it's obviously the wrong pronunciation, but nonetheless everyone uses that pronunciation on purpose. So, what could be more correct than "the thing everyone does"?
Of course, pronunciation only happens in meatspace (or at least it did back before MP3s and before YouTube and so on), and of course I'm talking about clusters of teenagers separated by thousands of km. We had "meetupz" or "meetz" in my city, which is how I know how "everyone" pronounced it... but it's certainly possible that in most cities/whatever there was some other pronunciation rule.
> It's not a synonym for "goods", because only one type of thing was ever "wares": software. It's just a way of dividing up the sections of your piracy BBS, like "filez" (files: multi-kilobyte textfiles full of instructions on how to make bombs etc.), "imagez", "warez", etc.
Citation needed there.
I have always assumed it came from fleamarkets where people selling pirated VHS films and knock-off Rolexes would be described as “selling their wares”. Changing the s to a z was an obvious step in 90s internet culture.
Okay, so my citation is, I was there, I was a (fringe) participant in pre-internet piracy culture, starting in 1990.
Pirate BBSes would have various "goods" (in the sense you and GP mean) available for download, including images (hint: some of them may have involved ladies), text files, and software. Sometimes there would also be sections for various art media created by users, such as .mods or ASCII art or poetry or whatever. Those various "goods" would never be all slopped together, they'd be divided into categories. And the category called "warez" would never, ever, have anything in it other than pirated software.
I agree that the s-to-z thing is just classic hacker/leet culture, though it's not internet culture, because it predates the people in question having internet access. I'm saying that the "wares" that becomes "warez" is not "wares-as-in-goods", it's "wares-as-in-softwares". It's pluralized even though "software" is a non-count noun, because then it fits with "files", "images", and so on. And yes, ultimately the "-ware" in "software" is from the sense that you and GP are talking about; I'm saying that the etymology is not directly from there, because otherwise all the other kinds of pirated stuff would also be "warez", and it never, ever, was.
I'm not sure how you are missing it, but hardware and software both etymologically have ware (as in a manufactured article, product, or merchandise) built-in to them. Without ware, there would be no hardware or software, or warez. The root of these words, also silverware, cookware, courseware, Tupperware, Corningware, etc., indeed is "ware." And wares is merely the plural of ware.
I too have never seen "warez" used to refer to anything other than pirated software. You make a good point about the derivation; it probably is directly from "software". Adding a superfluous Z to the end of a plural mass noun was also a characteristic of l33tspeak, as I recall.
This reminds me: my friend and I were the only people we knew who'd even used the internet in the late 90s, so no one was around to correct us, and 3 of the apparently incorrect pronunciations we had agreed on were:
I do not get it. Did you have to shut it down? It does not make sense to complain that someone uploaded stuff to public, unprotected, writable storage. Wouldn't securing it with a set of credentials suffice?
I think you don't fully grasp the "early, friendly internet". Very few people do today. In my bubble (programming, for example), young people can't even imagine that there were times when you could focus on _things_ instead of writing layers of security code around them.
It makes me sad to think of all those simple little services we used to run on *NIX machines, like `finger` and `whois`. You'd never want to disclose that information now, but at the time it was quite nice to be able to see if a friend or colleague was around with a simple network query.
The GP is saying "I miss the days where I could easily exploit people" and the response was "I miss the days where we respected each other enough to not exploit each other". It wasn't naive or irresponsible, but reflective of a time with more trust, cooperation, and good intentions.
Reminds me of a few years ago, when I accidentally exposed my Domoticz install to the internet without authentication: I had missed something in my Nginx config with the X-Forwarded-For headers. After about a week, a foreign visitor apparently came by my install and decided to have some good fun, turning my lights on/off at random times. It took me about 3 days to realize what had happened, but in the meantime he didn't destroy my install, he only messed with me. Which was really sweet, because nuking the system would have been far easier than opening the webpage every night.
That was a good and fun security lesson though, and now I always check security from the outside with a mobile hotspot.
That’s like saying it would be naive and irresponsible for me to go outside without a life preserver today despite an unforeseen catastrophic global flood drowning the lands 10 years from now. It was a different world, with different expectations and frameworks.
That's like saying it's naive and irresponsible to go outside without locking your front door when you live in a tiny remote village with 40 other people you've known your whole life.
Some were open for uploads by design, in the spirit of sharing things: essentially, using the free space left over after the main purpose to provide friendly mirrors for things like new projects. I recall using Archie to find copies of open source software at the tail end of that era.
Some also were used as submissions for projects, long before sites like sourceforge started. Especially since plonking a bigger source dump on newsgroups wasn't exactly well received.
Sometimes people should be able to do nice things without it getting abused, no?
In The Netherlands, in the nicer neighborhoods, we have something called a 'buurtbieb', a 'neighborhood library': a weatherproof cabinet where people can put surplus books that other people in the neighborhood can borrow.
Of course you could take all the books or use the cabinet to store candy, but why would you?
We have these throughout many neighborhoods in my city in central Florida, USA. We’re a college town so I just assumed it was somehow connected to that. Neat that it’s an international thing!
True, though to be fair most people never get to use private libraries. Or they used a library at their University that was technically private, but that gave access to the public as well. Public libraries are ubiquitous and very normal, while private libraries are the exception.
In America, public schools all have private libraries, reserved for attending students. (Maybe some operate as public libraries, but I've never seen nor heard of it.)
Furthermore, public libraries are not necessarily free. In America virtually all are; the only fees are for late returns. But this is not globally true; in some parts of the world, libraries open to the public charge a fee for checking out books, or even require a fee for entry.
It's a normal elision, yes, we all picture a public library when we say "library". But "free library" isn't redundant or weird, because "public" is a modifier of library, not a trait.
People tend to call their personal library a "book collection" or the like, but it's a library, in just the same way that a Little Free Library is.
So most people who read have at least a small private library, whether they think of it in those terms or not.
There are two libraries near me that aren’t free - they charge an annual “membership” fee. One even operates more like an old blockbuster when it comes to newly released books. They charge a daily rental fee! It’s 25¢ a day, I believe.
Just keep using it as a generic name. They've already lost the generification war. Are they seriously going to track down and sue neighborhood libraries?
Good luck getting a jury to enforce the trademark.
Obligatory "Free as in beer vs. free as in freedom" comment. I have pulled stuff out of small community bookshelves that would never have seen their chance in a "professional-run" public library, both bad and good.
In Puerto Rico there are quite a few of these on the sidewalk and despite the rains they are generally always stocked with books. There are bars on everyone's windows and doors, but books piled up on the street.
There's one down the street from me, but instead of books it has canned food. It says "little free pantry" on it. It must have been around for a while, because the neighborhood it's in has long since been gentrified and is populated with very well-off residents vs. the working poor that used to live there.
I was doing the same on MS-DOS, keeping "secret" files on a floppy disk with a directory having a name ending with an invisible Alt+255... it was even impossible to look inside it with the Windows 3.1 file manager.
We did the same thing using the character for a non-breaking space, I think it was ALT+0160. It would sort last in the list, and just be an effectively-invisible entry unless you were really paying attention. Combined with an exploit we had to change users on the FTP servers behind most dialup ISPs hosting (the free couple Mb hosting you’d get with your dialup account that very few people cared about or used), meant we had pretty much unlimited file hosting, filling random families web hosting with hidden folders full of mp3s and warez.
You too, huh? This was my first foray into the "dark" side of the Internet as a kid, pre-Web, hanging out with pirates on IRC and get "hired" to go around the early 'Net and fuck up people's upload folders by creating hidden directories we could load with our group's warez. ^H^H^H^H
There are some kanji in the JIS character encoding that have no record of actual usage, and they were incorporated into Unicode as well. They're called "ghost characters" in Japanese.
I feel bad for the font designers who have to put all these inane characters in, have to draw them and hint them, and they have no purpose except they have to be there or someone will complain.
Fortunately there are only a handful of such cases. But unfortunately there are tons of commonly used CJKV ideographs; typical Chinese or Japanese fonts are of course not expected to have all Chinese characters (there are almost 100,000 of them while OpenType fonts can only have 65K glyphs), but they are expected to have thousands of commonly used characters.
The person who appears to have done the work of collecting this character (and others) for submission into the Unicode process back in 1997[0] (Barbara Beeton) has actually responded to the StackExchange question[1].
Unfortunately even she is not aware of what the symbol is actually for.
So Unicode has all these mysterious characters... but I would bet that it's still true that many people on the planet speaking common languages can't even type their name...
This post is from 2015, and I'd love to know if unicode has added better support for non-English languages since then.
I was very surprised by your comment and by the article you linked that the name Aditya cannot be represented in Unicode. I think it can be represented: আদিত্য.
I am not a Bengali-speaker, but I am familiar with the class of scripts to which the Bengali script belongs, abugidas. These scripts assume a vowel following every consonant. When two consonants occur one after the other in a word (a consonant cluster), this must be represented specially, because if you just wrote (consonant, consonant) it would be pronounced (consonant, inherent vowel, consonant).
The "ty" in Aditya is one such consonant cluster. The way this cluster is written is ত্য. This is represented as three code points (I think I am messing up the proper terms), one for the "t", one to "join", and one for "y".
Some people think of the special shape that the final "y" takes as a separate character in its own right. In fact, it has its own name (ya-phalā). I can understand why it would be confusing to see that the ya-phalā can't be typed as its own single character ("্য"), but it really has to do with a difference between how the input is implemented and how the person thinks about their own language.
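For the curious, here's a quick Python sketch that decomposes the name into its code points; the names come straight from the Unicode character database, and the "join" character is the virama:

    import unicodedata

    name = "আদিত্য"  # "Aditya" in Bengali script
    for ch in name:
        print(f"U+{ord(ch):04X} {unicodedata.name(ch)}")

    # U+0986 BENGALI LETTER AA
    # U+09A6 BENGALI LETTER DA
    # U+09BF BENGALI VOWEL SIGN I
    # U+09A4 BENGALI LETTER TA
    # U+09CD BENGALI SIGN VIRAMA   <- the "join"
    # U+09AF BENGALI LETTER YA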
There was a lot of discussion [0] of that point when the Model View Culture article was originally posted 7 years ago.
It's complicated, but the author of the piece seems to take issue with how the character set was designed by the language authorities the UTC delegated to.
I read that "I Can’t Write My Name" article when it came out and it's remarkably misguided. First, there are solid linguistic reasons why Unicode handles that character the way it does. Second, the article completely misunderstands how the Unicode Consortium works. Finally, the Unicode Consortium is remarkably open to character proposals from random people. The author could have written a proposal and fixed the problem in half the time it took to write the article. Source: I am a random person who got multiple characters added to Unicode.
The article presents it as if it were purely due to Western-centrism that these characters do not have distinct code points in Unicode. In reality the issue is much more subtle: a discussion of whether a certain glyph is a ligature of two characters or its own distinct character.
That publication was so good, I was really bummed when they shut down. Looks like they came back for a minute in 2020? I had no idea but I know what I'm doing tonight.
The name itself sounds like it should be a downward trend line on a graph.
I’m guessing the person who implemented it got this exact requirement wording in the Unicode definition and nothing else, didn’t make the logical connection, and just implemented it as close to literally as they could.
But if I read the article correctly, this glyph comes from a set of math symbols. I don't think "stock goes down" was ever used in any mathematical script.
I generally (perhaps naively) think that going forward knowledge loss won't be much of an issue compared to our history.
Surely the archeologists of the future won't have to wonder what some tool from our times was used for or what some symbol we currently use means… They will have Wikipedia and archive.org and whatnot!
But that fantasy is not compatible with a reality where we are already unable to find out the purpose of some characters in Unicode.
Even digital storage is not permanent. Important things will be copied and preserved, but I imagine many of the relics of everyday life will be deleted or will deteriorate at some point in the far future, such as this very comment.
That presumes humans can access our (electronic) media and understand it, some 8,000 years or more from now.
There's no saying that there'll be a society capable of reading bits and bytes by then. Not just a collapsed society (they'll hardly be interested in reading a random discussion on an orange forum for a niche group that lived 8,000 years ago), but maybe even societies that are vastly technically superior to our own but cannot fathom what things meant 8 millennia back. I mean, we have texts from some 600 years ago that we can read but cannot understand (e.g. the Rohonc Codex), even though our technology and knowledge are far superior to when it was written.
Electronics become unusable quickly, though. We can find stone tablets and clay pottery, but 10k years from now will they be able to find hard drives and extract useful data? It seems like it can easily go in the opposite direction.
There's a process for it. I'm not sure it costs anything but it's a bunch of paperwork. You have to justify what it's used for, why existing solutions don't work, etc. The working group is probably pretty reasonable, but I'm sure it's an involved process.
If you do, can you please tack on symbols for following external links, a space bar symbol, and all the other miscellaneous internet-adjacent characters I always have to reach to Fontawesome for?
Might we run out of Unicode code points, as we (seem to) have with IPv4 addresses?
As another comment mentions, once you add all these snowmen (with/without snow; male, female and gender-neutral; in a few skin colour options, plus neutral)... it adds up. Plus exponential growth once you consider families of snowmen (different numbers/genders/races of "parents", different numbers/genders/races of "children", and so on...).
There is no reason to believe the current rate (about 35,000 over the period 2010-2020) will change rapidly, so we are probably safe for this century. You should be aware that emoji gender and skin color are encoded as character sequences and modifiers rather than atomic characters, exactly in order to avoid that exponential growth.
And in the unlikely case that Unicode gets so many characters somehow, you can always extend it: http://ucsx.org/
The successful bitcoin sign proposal [1] explicitly deals with such a criticism:
> Will Unicode be flooded with symbols for many crypto-currencies?
> Most other crypto-currencies have learned from the difficulty that a non-Unicode symbol causes for Bitcoin, and use a symbol already in Unicode. For instance, Dogecoin uses Đ, Ethereum uses Ξ, Litecoin uses Ł, Namecoin uses ℕ, Peercoin uses Ᵽ and Primecoin uses Ψ. Some, like Ripple, use Roman capital letters (XRP), mimicking ISO 4217 currency codes.
> While it is possible another crypto-currency will have a non-Unicode symbol that is extensively used in text, this is unlikely.
I think this section was crucial for the eventual acceptance, because Unicode people do care (a lot) about long-term consequences of proposals.
It seems to me that this is something best handled with tag characters, like ¤XBT + (U+E007F) = ₿ (where the letters are from the tag block, U+E00xx). This mirrors one of the two systems for rendering national flags[0], just with a different starting codepoint, and can easily accommodate all the ISO 4217 currency codes and common unofficial extensions. If a system doesn't know how to render a particular glyph, it can just fall back to showing the Roman capital letters.
The downside of this approach is size: each tag codepoint (including the end marker) requires four bytes in UTF-8, plus two for ¤, so the sequence above is 18 bytes long.
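To make the byte math concrete, a quick Python check; the tag letters (U+E0058/U+E0042/U+E0054) and CANCEL TAG (U+E007F) are real code points, but their currency use here is only the hypothetical scheme above:

    seq = "\u00A4" + "\U000E0058\U000E0042\U000E0054" + "\U000E007F"  # ¤ + tag "XBT" + cancel tag
    print(len(seq))                  # 5 code points
    print(len(seq.encode("utf-8")))  # 18 bytes: 2 (¤) + 3*4 (tag letters) + 4 (cancel)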
That sounds interesting, but modern currency symbols are already fast-tracked anyway (they almost always get assigned in the next version of Unicode), and more than one currency symbol can exist for a given ISO 4217 code, so I don't think it would work.
> modern currency symbols are already fast-tracked anyway
For national currencies, perhaps. New national currencies aren't introduced all that often, and there is a lot of pressure to support them quickly as their use is often mandatory for anyone living in that jurisdiction. For new private currencies, including crypto-currencies, we don't see quite the same eagerness—the observation that new crypto-currencies were more likely to reuse existing Unicode symbols than invent new ones was a consideration in getting the Bitcoin symbol adopted, as they didn't want to open up the floodgates to large numbers of new currency symbols. The tag-based system offers a compromise.
> and more than one currency symbol can exist for a given ISO 4217 code, so I don't think it would work
That is a bit of a problem, but it could be handled with the variant selector codepoints, for example ¤MOP = MOP$, ¤MOP(VS1) = 圓, and ¤MOP(VS2) = 元, if the symbols have the same meaning. To save some space the VS could replace the end codepoint. For fractional units there could be a different prefix such as ¢ for 1/100 or ₥ for 1/1000 in place of the ¤, or incorporating one of the Unicode fraction codepoints for other ratios up to ⅞ (or ⅑ or ⅒). These would be rendered verbatim in the fallback version, like ¢USD.
> Might we run out of Unicode code points, as we (seem to) have with IPv4 addresses?
No. There are currently 144,697 codepoints allocated, out of a possible 1.1 million. And most updates allocate a few hundred. The large allocations (in the thousands at a time) overwhelmingly concern additions of CJK unified ideographs (see: 13.0 with 4,969 out of 5,930 new codepoints, 10.0 with 7,494 out of 8,518, 8.0 with 5,771 out of 7,716).
There have been large additions of historical scripts (9.0 added the entire Tangut script, 7.0 added 23 different scripts), but those occurrences have slowed down a lot.
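If you want to reproduce the allocation count, here's a rough Python sketch. It's approximate: it counts code points that have a name in the interpreter's bundled Unicode database, so it skips unnamed controls and the result depends on your Python version:

    import unicodedata

    assigned = sum(1 for cp in range(0x110000)
                   if unicodedata.name(chr(cp), None) is not None)
    print(unicodedata.unidata_version, assigned)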
The snowmen are in Unicode because they existed in a character set before the Unicode standard was created. Unicode was deliberately created as a superset of all existing character sets at the time.
Some of the glyphs you mention are combining sequences, i.e. multiple code points combined into a single displayed character. So you add a gender modifier and skin color modifier to change the appearance. You don't add multiple code points.
It's your device rendering these 2-3 code point sequences as single icons/emojis.
> So you add a gender modifier and skin color modifier to change the appearance. You don't add multiple code points.
FWIW that's true for the skin colors (there are 5 Fitzpatrick scale modifiers, U+1F3FB to U+1F3FF), but it's not true for the gender: the basic gendered characters (e.g. U+1F468 "MAN", U+1F469 "WOMAN") were part of the original set "merged" from Japanese emoji, so the gender-neutral equivalent (e.g. U+1F9D1 "ADULT") was added as a separate codepoint.
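A small Python illustration (all four code points below are real assignments):

    man, woman, adult = "\U0001F468", "\U0001F469", "\U0001F9D1"  # MAN, WOMAN, ADULT
    dark = "\U0001F3FF"   # EMOJI MODIFIER FITZPATRICK TYPE-6
    s = adult + dark
    print(s)              # renders as one glyph on emoji-capable fonts
    print(len(s))         # 2: still two code points under the hood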
We are nowhere close to running out of code points. Unicode as currently defined has 1.1 million, but even that could be increased if there was a need. There isn't, since only 114 thousand are defined.
There are not separate code points for all combinations of genders and skin colors; the characters are made as combinations.
Things like skin tone variations are not defined as individual code points. They are sequences of code points that combine to make the full, customized glyph. So you have one code point for "medical", one for "professional", one for "female", one for "brown skin", one for "blond hair", and from that you get a more specific picture of a doctor..
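One real sequence of this kind is "woman health worker: medium skin tone", which chains a base character, a skin tone modifier, a zero-width joiner, a medical symbol, and a variation selector:

    doctor = "\U0001F469\U0001F3FD\u200D\u2695\uFE0F"
    # U+1F469 WOMAN + U+1F3FD skin tone + U+200D ZWJ
    # + U+2695 STAFF OF AESCULAPIUS + U+FE0F variation selector
    print(doctor)                               # renders as a single glyph
    print([f"U+{ord(c):04X}" for c in doctor])  # but it's five code points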
We already did! That's what happened when the 16-bit code space was exhausted, which was never the original plan. Just like how the IPv4 internet degraded into a mess of hacks once addresses ran short (like NAT), so too did Unicode start becoming wildly more complex.
Amongst other things, hitting the limit of 16 bits meant the introduction of:
- The concept of "planes"
- UTF-16 surrogate pairs
- UTF-32
- The newfound desire to encode emoji using combining characters, which means many apparently simple emoji are actually hacked together out of a mini programming language (e.g. black man = man emoji + skin tone modifier). The same goes for flags, which are actually two English letters mapped into a different part of the code space and then combined, e.g. the British flag is G+B.
It's one reason why emoji broke so much software. Before emoji, nobody cared about characters beyond the Basic Multilingual Plane and simply ignored them. Then emoji came along and broke everything that assumed one UTF-16 code unit == one character.
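A quick illustration in Python, which indexes strings by code point, versus the UTF-16 code units that Java or JavaScript would report:

    s = "\U0001F600"                        # 😀, outside the BMP
    print(len(s))                           # 1 code point
    print(len(s.encode("utf-16-le")) // 2)  # 2 UTF-16 code units (a surrogate pair)

    flag = "\U0001F1EC\U0001F1E7"           # REGIONAL INDICATOR letters G + B
    print(flag)                             # renders as the British flag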
1) There are only ~150k Unicode values defined. If we assume a signed int for the available space, we have 2,147,333,647 of 2,147,483,647 remaining, more so if the int is unsigned. We're fine.
2) They use values that combine, like ligatures, to create the variants: a color modifier value, a sex modifier, and then the underlying symbol. There isn't a combinatorial explosion, because it's not a unique symbol for each combination.
IPv4 ran down because everything needs an IP to be on the net and there are more humans than available addresses, and more gear than humans.
We don't need different characters per human, only to document existing languages and to account for the slow growth of modern hieroglyphs.
But character encodings don't limit the number of codepoints. Unicode is just a big list of correspondences between an integer and a glyph. There's no limit to how many integers you can assign.
Unicode encodings are separate standards that give correspondences between Unicode code points (integers) and byte sequences. If Unicode changes in a way that invalidates an encoding, that just calls for a new encoding.
Yes, it could technically be extended, but the transition would be a massive undertaking, so in practice the encodings do limit the number of codepoints. UTF-16, which creates the limitation, is very widely used and required by major programming language standards like ECMAScript. A lot of software still can't cope with codepoints outside the BMP, and they were established with UTF-16 in 1996.
Besides the difference between the abstract and unlimited Unicode and the encodings, our current "modern" encodings, UTF-8 and the new UTF-16 are artificially restricted and can be trivially expanded into a huge number of codepoints just by removing those restrictions.
New UTF-16? I'm only aware of the original 1996 one, which uses all of its 20 surrogate-pair bits for the codepoint (unlike UTF-8 which can use bits to extend to more bytes). In my understanding, "just" removing that restriction would mean completely replacing the encoding, like UCS-2 being replaced with UTF-16. The new one may have some overlap, but transitioning to it would still be a huge undertaking, and far from trivial (quite a few programs today still use UCS-2, quarter of a century after UTF-16 was introduced to replace it).
Unicode has been limited to 21 bits for a while so that UTF-8 is guaranteed to encode no more than four bytes per code point. UTF-8 could support a 31-bit code space, but changing now would break a lot of validation code.
Note that as it is currently defined, the Unicode codespace ranges from U+0000 to U+10FFFF, with some reserved codepoints (eg to encode surrogate pairs), yielding a total number of 1,112,064 assignable code points.
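The arithmetic, for anyone who wants to check it (Python):

    print(0x110000)          # 1114112 code points, U+0000..U+10FFFF
    print(0x800)             # 2048 surrogate code points, U+D800..U+DFFF
    print(0x110000 - 0x800)  # 1112064 assignable code points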
I find it completely implausible that this will ever change: the current size is baked in too heavily.
• The abomination UTF-16, which is distressingly popular, cannot possibly support it. Replacing UTF-16 would be a massive upheaval in many ecosystems (e.g. JavaScript, Qt, Windows), and there’s no real prospect of most of those environments moving away from UTF-16, because it’s a massive breaking change for them by now. Rather, if the code space were running out, they’d devise something along the lines of second-level surrogate pairs. (And then we’d curse UTF-16 even more, because it’d have ruined Unicode for everyone again.)
• All code that performs Unicode validation (which isn’t as much as it should be, but is still probably a majority) would need to be upgraded. Any systems not upgraded would either mangle or more commonly fail on new characters.
• UTF-8 software would also need to be adjusted, since it's artificially limited to the 21-bit space; and it wouldn't be just a matter of flipping a few switches here and there to remove that limit: there will be lots of small places that bake in the assumption that representing a scalar value requires no more than four UTF-8 code units.
While UTF-8 was originally defined as able to encode 31 bits, because of the limitations of UTF-16, RFC 3629 explicitly restricted the Unicode code space to 21 bits (or about 1.1 million codepoints).
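You can see that four-byte cap at the boundaries with any UTF-8 encoder, e.g. in Python:

    for cp in (0x7F, 0x7FF, 0xFFFF, 0x10FFFF):
        print(f"U+{cp:06X} -> {len(chr(cp).encode('utf-8'))} bytes")
    # U+00007F -> 1 bytes
    # U+0007FF -> 2 bytes
    # U+00FFFF -> 3 bytes
    # U+10FFFF -> 4 bytes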
I think the current approach is to just invent yet another "meta layer" of characters and declare that this particular sequence of bytes/codepoints/surrogate pairs/grapheme clusters/extended grapheme clusters/zwj sequences/whatever else you can think of has a special meaning and does not behave like you think it does. See also Henri Sivonen's essay on unicode string length [1]
So in a way, Unicode is already long past the time where you invent NATs and other hacks to buy you time with the scarcity problem.
According to UTS #51, as of Unicode 14 (and its ~140,000 allocated codepoints) there are under 3,500 codepoints classified as emoji.
And do keep in mind that #, or ®, are classified as emoji.
And incidentally, U+2654 "white chess king" (♔) was in unicode 1.0. The moral panic around emoji is really tiring, it's absolute, utter nonsense, every single time.
It is a proofreader's mark in languages with long words. The L-shape means "split the word here", and the same shape with the arrow-squiggle on top means "do it at the next syllable or not at all". For example, the words "YÖ-KLUBI" and "YÖK-LUBI" have different meanings. Source: I have seen Finnish proofreaders' marks.
Here's DIN 16511 https://www2.informatik.hu-berlin.de/sv/lehre/korrekturzeich... for anyone interested. Perhaps someone in Finland could dig further? It might be a bit strange to have proofreader marks for proofreading marks, but maybe something slipped in.
"oikolukumerkit" found an image with more than just the DIN referenced marks, but not much more.
Hm, now that you mention it, I always thought of the external link symbol as being a box with an arrow coming from inside it and protruding out of the upper right hand corner, but I don't see that symbol anywhere in Unicode, and I'm not sure why I have that association.
There is the U+1F517 link symbol but I'm not sure that's communicating the same thing.
To me, it looks like a symbol you would use to denote electricity present. I'd say it was meant to say that an electrical box or some other piece of infrastructure had electricity present. It could even be a non-standard symbol for a ground.
edit: the right-angle portion of it looks like the symbol for 3-wire 2-phase electricity used here: https://www.conceptdraw.com/How-To-Guide/qualifying-symbols . Yes, it is just a right angle, but I could see the electricity symbol being overlaid to indicate that it was an electrical symbol.
To quote a reply from the StackExchange thread above:
"So, they added a snowman with snow AND a snowman without snow , so that the weather forecaster of this world can avoid the dull snowflake , but we will never get our missing superscript q‽"
I don't understand why Unicode must (should?) contain superscript and subscript glyphs at all. The declared goal of Unicode is to encode all characters used by all languages, past and modern. Subscript and superscript are not used by any language as separate characters; they're a typesetting property. It should be solved by other means, not by character/glyph encoding.
Should Unicode include ALL characters struck out? Underlined? Double-underlined? Small-caps variants of all letters, for languages whose typographic tradition uses small caps?
And, BTW, what do you mean by "all letters"? Should Unicode contain sub/superscript variants of Hangul or Devanagari or letters from hundreds of other non-Latin-alphabet languages? So Unicode must be approximately tripled, bar the hieroglyphic part (and why shouldn't hieroglyphics be sub/superscripted)?
This is probably an edge case, but I work on lab software that uses chemical symbols, and having sub- and superscript characters saves lots of headaches. I can just store "CO₂" in a database, query it, and display it back as a simple string, or display values in scientific notation like 1,3×10³, without having to use any formatting.
But to be honest, I'm not sure what the parent comment wants to see added, because at the moment having all the letters from A-Z, the numbers from 0-9, and the plus, minus and equals signs as both subscript and superscript seems to be enough.
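For instance, in Python both of the examples above are ordinary strings built from existing code points, with no markup layer involved:

    co2 = "CO\u2082"             # U+2082 SUBSCRIPT TWO
    value = "1,3\u00D710\u00B3"  # U+00D7 MULTIPLICATION SIGN, U+00B3 SUPERSCRIPT THREE
    print(co2, value)            # CO₂ 1,3×10³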
Upper-case subscripts are missing, for one: I'm not allowed to talk about the normal force F_N in plain-text email. Superscript and subscript Greek letters would also be nice to have, e.g. in the context of relativity.
Why not Devanagari, then? This Europe-centric point of view bothers me.
Sure: As I mentioned in another comment, I'd add markers to enable arbitrary super and subscripting.
However, the question I responded to was asking what specifically people were missing in practice, and the examples I gave are things I personally would have used if they had been available.
> Should Unicode contain sub/superscript variants of Hangul or Devanagari or letters from hundreds of other non-Latin-alphabet languages?
Nope, you'd use markers similar to U+200E (LEFT-TO-RIGHT MARK) and U+200F (RIGHT-TO-LEFT MARK) that already exist to indicate text direction (which is also a typesetting property).
They are relevant because Unicode had to define the bidirectional rendering and not every rendering can be automatically inferred from logical (abstract) characters. Unicode has no reason to define the general text rendering including subscripts and superscripts, so there is no reason for Unicode to define control characters for them.
Unicode defines characters, their semantics and (very flexible) guidelines for rendering them. Unlike, say, bold, italic or super/subscripts, bidirectionality is an intrinsic property of those characters and can't be easily refactored.
Unicode specifically states that it doesn't define the semantics of characters. That would seriously interfere with its purpose of defining characters.
There are some notable exceptions, and they are acknowledged to be mistakes.
> Unicode specifically states that it doesn't define the semantics of characters.
The Unicode Standard explicitly says otherwise:
> Characters have well-defined semantics. These semantics are defined by explicitly assigned character properties, rather than implied through the character name or the position of a character in the code tables (see Section 3.5, Properties). [1]
> The Unicode Standard associates a rich set of semantics with characters and, in some instances, with code points. The support of character semantics is required for conformance; see Section 3.2, Conformance Requirements. [2]
To be fair, it refers to "character" semantics, which are more or less abstracted by character properties. It is not the case that, for example, △ U+25B3 WHITE UP-POINTING TRIANGLE can only ever be used for denoting triangles. But it has defined semantics in the sense that the character has the properties expected of such a symbol.
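Those properties are queryable, e.g. in Python:

    import unicodedata

    ch = "\u25B3"  # WHITE UP-POINTING TRIANGLE
    print(unicodedata.name(ch))           # WHITE UP-POINTING TRIANGLE
    print(unicodedata.category(ch))       # So (Symbol, other)
    print(unicodedata.bidirectional(ch))  # ON (Other Neutral)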
That's a cop out. You could equally say that new emojis shouldn't be added because you should use inline images for those. Or RTL markers shouldn't be added because you should use dedicated text styling for that.
There are a ton of places that don't support superscript markup.
> You could equally say that new emojis shouldn't be added because you should use inline images for those.
If emojis weren't allocated out of compatibility concerns, this would have been exactly my opinion from day 1. To be honest, I'm still not happy with the current emoji assignments and semantics. Not even the Unicode people are satisfied; there are numerous proposals for replacing emoji with something else (example keyword: QID emoji).
> RTL markers shouldn't be added because you should use dedicated text styling for that.
> There are a ton of places that don't support superscript markup.
Unlike most text attributes, bidirectionality is an intrinsic property of abstract characters and thus absolutely within Unicode's scope. Ideally you can't and shouldn't make an LTR character behave like an RTL character or vice versa. Bidi control characters only exist to correct automatic rendering, and can be represented out of band (the Bidi specification is explicitly designed with this use case in mind [1]).
> You could equally say that new emojis shouldn't be added because you should use inline images for those.
Well, that's really a better solution. Or a unicode character that allows you to set a pixel on a 256x256 grid and one to compose them. Strike that. Better not give anyone bad ideas.
Should we also have slanted, bold, semi-bold, light and underlined versions of every code point? Versions with/without serifs? For monospaced text? Those are all presentational matters. That we have super/subscripts in Unicode in the first place seems to have been just a hack to help terminal emulator software deal with obsolete encodings like ISO-8859-1: https://www.unicode.org/L2/L2000/00159-ucsterminal.txt
Those are intended for maths, not for formatted text. Variables in mathematics are usually a single character, so there is a great variety of ways to format the characters to create different symbols. Diacritical marks, underlines, etc. are also used for this.
Fair enough, but general formatting codes would overlap with what is already supported in rich-text formats like HTML or LaTeX. Unicode is a standard for encoding characters, it is not supposed to be a rich-text document format itself.
While I absolutely enjoyed the historical research on such a minuscule mystery, I also liked how it took me two clicks from the front page of HN into an occult eBook about "khaos magick".
It looks like someone asked for a glyph that would look like a chart with a downward-trending zigzag, someone else got the instructions and drew this thing, and the request proceeded, bundled with other requests, through the process with no one adequately challenging whether the glyph really looked like what it was supposed to look like.
And yeah, a downward zigzag on an x/y plot glyph actually would be useful to have.
Like "chart with downwards trend" added to Unicode 6.0 in 2010, 25 years after "right angle with downward zig zag" was proposed and included.
This is like the definition of legacy baggage. And somewhere there's probably someone who will argue that if the symbol is not present in a typeface, then said typeface is not "compliant".
Eventually Unicode will think, "Hey, maybe bold, italic and underline aren't just decorative, but required formatting which conveys emphasis, and other information that needs to be contained within the text itself!"
Or, maybe not and we'll continue to lose formatting every time we copy and paste and be forced to use plain text for the rest of our lives. Also, we can color our emojis now, but that WARNING text can't be in red. Because colors don't matter?
Whichever person decided basic formatting shouldn't be in the spec was wrong, and we lose important details every day because of it.
> When writing in languages such as Danish and Norwegian, where the empty set character may be confused with the alphabetic letter Ø (as when using the symbol in linguistics), the Unicode character U+29B0 REVERSED EMPTY SET ⦰ may be used instead
The former is probably for the same reason that both plus-minus and minus-plus exist. The latter is commonly used for the "unordered" relation in partially ordered sets.
When I was learning statistical hypothesis testing, I once wrote notes that looked like "H_0: mu ⋚ a <--> p-value: P(T(X) ⋛ T(a))", although I didn't include the equal-to bar.
I think he would have typed things left to right. He only wrote in mirror because it was more ergonomic for him, but there's no such issue with a keyboard.
It can be a symbol for polarization of electromagnetic (EM) waves, with Electrical and Magnetic fields moving orthogonal to each other [1].
Unlike other waves, such as sound waves, EM waves have a polarization component.
In wireless communication, for example, polarization can be used as another component for diversity to increase the performance of the communication channel.
U-237 exists and has a half-life of about six days; I can't think of a valid modifier that would add that C on the end, though. (Unless you're talking about a very specific isotopic composition of uranium methanide, I guess.)
Translation: Nothing came up on a Google search, and going to the library and looking in a book is hard.
I see this more and more often these days. Bloggers claiming that there is no known origin for something, or inventing their own histories based on nothing more than internet searches.
The internet is vast, but 99.9% of the world's history and information is not online for free.