Also, Kiwix has a hotspot project that lets you host ZIM files (dumps of Wikipedia and other CC-licensed content, like TED Talks and Stack Overflow) on a Raspberry Pi and share them with others. Setup info here: https://www.kiwix.org/en/downloads/kiwix-hotspot/
ipfs.io is just a web-based way to access IPFS, called a gateway. There are a bunch of different gateways. In addition, you can run an IPFS node locally, and then as long as just one node holds the content you're looking for, you're good. There are also browser extensions that rewrite gateway URIs to localhost URIs.
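To make the gateway-vs-local-node distinction concrete, here's a minimal sketch (Python, standard library only) that tries a local node's HTTP gateway first and falls back to the public ipfs.io gateway. The CID is a placeholder, and the local port assumes the default gateway address of a stock IPFS daemon.

```python
import urllib.request

# Placeholder CID; substitute the content you actually want.
CID = "QmExampleCidGoesHere"

# Try the local node's gateway first (a stock IPFS daemon serves one on
# 127.0.0.1:8080 by default), then fall back to a public web gateway.
GATEWAYS = [
    "http://127.0.0.1:8080/ipfs/",
    "https://ipfs.io/ipfs/",
]

def fetch(cid: str) -> bytes:
    """Return the content from the first gateway that can serve it."""
    last_error = None
    for base in GATEWAYS:
        try:
            with urllib.request.urlopen(base + cid, timeout=10) as resp:
                return resp.read()
        except OSError as err:   # connection refused, timeout, HTTP error...
            last_error = err
    raise RuntimeError(f"no gateway could serve {cid}") from last_error

if __name__ == "__main__":
    print(f"fetched {len(fetch(CID))} bytes")
```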
A popular IPFS file might be available on thousands of nodes, similar to how popular torrents have thousands of seeders. A DDoS attack against thousands of servers across multiple countries and networks would be nearly impossible to pull off.
Caveat: the last full Kiwix English Wikipedia archive was made in 2018. They could use some help with automating their build process if anyone here has the time.
From a cursory glance at the site and source code, it's really hard to see who/what is involved with building an archive. There are automated builds set up for the Pi image itself.
Assuming that all formats contain the exact same data, i.e. they were generated at the exact same time, which is (1) the most useful for offline viewing and (2) the most future-proof for archival and backup? Is there another, more viable/useful format?
ZIM might survive longer (centuries?), as the future will probably still have HTML parsers, while wikitext parsers or PHP might be long dead. Who knows.
Thank you for the answer. So for personal use I am better off hoarding the ZIM version, especially considering there is a dedicated Kiwix reader, while I am not aware of a similar tool for the XML dumps.
Someone claimed responsibility for the attack on Twitter with some details (DDoS), and proved it later by stopping the attack for x minutes and then restarting it at a specific time. https://twitter.com/fs0c131y/status/1170093562878472194?s=20 - the attacker also went on to DDoS the Twitch ingest servers (not twitch.tv itself), knocking some big streamers offline.
It looks like a volumetric attack from this tweet. Wikipedia needs to use Verisign BGP mitigation. They create GRE tunnels to your routers and are capable of handling 2Tbps. During an attack, you make a BGP announcement and the traffic goes via Verisign scrubbing/tunnels. No application changes are required, no Matthew Prince selectively and benevolently enforcing CF neutrality. It's used by large banks.
After working with a few large corporations and their DDoS protection solutions, I did not have a good experience with Verisign, and they were not able to handle attacks or get things working.
However, I have had great experiences with Akamai and Cloudflare. I trust the people at Wikimedia will choose wisely.
I have learned that Verisign has one of the worst BGP mitigation/scrubbing solutions out there.
There are a few alternatives that have more experience and provide much better uptime, including solutions from Cloudflare and Akamai.
Any serious mitigation solution must be BGP based, not proxy. Besides its technical merits and convenience, it also minimizes the risk of a benevolent controller (e.g. Matthew Prince of Cloudflare) ruining your company, because it becomes your upstream provider only during the attacks. Otherwise the GRE tunnels are not in use. The IP addresses are still yours always.
We used Verisign for mitigation of a 44Gbps volumetric attack and it worked very well. We also evaluated Neustar, but Verisign's infrastructure seemed to be more robust.
That's your requirement, but it might not be Wikipedia's requirement. Ownership of IPs is really a technical detail invisible to most people; ownership of eyeballs by way of the domain name and top Google result is probably more important. Cloudflare doesn't impact that ownership other than being able to temporarily take you offline if they choose to terminate your site.
Still, large proxy-based CDNs do have the ability to completely bypass all the same-origin protections in the browser. Even if they are angels and don't abuse this trust for identity theft and surveillance, it makes them a juicy target for bad actors, state sponsored and otherwise.
A proxy is a perfectly acceptable “serious” solution for this type of problem, as well as nearly all of the rest. Wikipedia is not the kind of website that would warrant being removed from Cloudflare. What’s wrong with having an upstream provider for caching close to the user and other features when you’re not under attack?
That’s not what MITM means. I get that you don’t like Cloudflare but voluntary use of a CDN isn’t a MITM any more than, say, Amazon is a MITM because you host on EC2.
Cloudflare is in between the client and the server, decrypting, rewriting and (if set up right) re-encrypting the request/response. It masquerades as the server by presenting a proper certificate for the domain even though it is not the entity that is actually controlling the domain.
That to me sounds very much like MITM, although it is not a MITM attack since the entity controlling the domain opted into it, so basically it is voluntary MITM.
Using a VPS like EC2 is a different story, since the decryption happens within a layer that you control. Of course you need to make sure you choose a vendor for that layer that you trust, but on EC2 the traffic that Amazon sees is encrypted with keys they don't have and decrypted with keys stored on a layer that I control. Amazon could read out the memory of my EC2 instance to get the keys, but their business depends on not doing so. So in this case I either have a vendor that will always decrypt and read traffic (Cloudflare), or a vendor whose business depends on hypothetically being able to but not doing it. There is a clear difference to me.
That is the same for most CDNs (including CloudFront and all the other major offerings), so I'm not trying to single out Cloudflare.
If you don’t trust Cloudflare, don’t use them but there’s no meaningful security distinction between what they do and what AWS does: in both cases you have a vendor with the capability of violating your security and a promise that they won’t abuse that access.
This is why having a threat model is so important: it keeps you from wasting effort on things which sound like security but aren’t actually changing anything meaningful.
There is a security distinction, and this has been shown by, for example, Cloudbleed. Every step that has access to plaintext data is a potential attack vector and might be logging/leaking information.
Cloudflare’s business also depends on not messing with your traffic, right? It would certainly be easier for them to get your users’ content than for Amazon to do the same, but I think you still have to accept that risk with either. “Hypothetically being able to but not doing it” isn’t a whole lot of confidence if I were hosting some kind of shady website.
Sure, but since Cloudflare's business is actively "messing" with all your traffic, all the time, it's a smaller technical step to do it some more, and it can also lead to accidents like Cloudbleed. Every step that has access to unencrypted data is a potential attack vector or might be logging/leaking data.
You upload your private SSL key to Cloudflare, for example. And I was talking about hosting on your own hardware/colos like most large sites do (7x cheaper than AWS list prices on average).
Please specify in detail how you believe that’s an MITM using the standard industry definition. In particular, consider whether “attack” and “voluntary business agreement” are synonyms.
Breaking open encryption to monitor activity between users and other sites is a completely different thing than having a provider handle hosting for your site.
A better comparison would be CloudFront and Application Load Balancers, since you can expose your own EC2 server or load balancer and be e2e encrypted (unless AWS wanted to run commands on your instance, which they could do, but that's a different threat vector entirely).
That was the model I had in mind but it’s not really a meaningful distinction since the host could almost certainly compromise those servers as well. In any case, you’re trusting a third party rather than having their involvement maliciously imposed.
The original content was posted on IG. 8ch took the reposts down when it became known that it was connected to the real shooting. Watch the video with the 8ch founder explaining (unless YouTube took it down too). Matt was preparing for the IPO.
You appear to be extremely mad that anyone questions the power of political pressure and an angry mob.
Look, you can feel however you like about whether the high-profile takedowns are right or wrong, whether the CEO's promises after the Daily Stormer are hypocritical — but let's be clear-eyed about placing a site in a position where one outside person can do it real harm. The question you should look at is whether the risk is actually acceptable for your organization.
By your statement, then, Reddit was complicit with the Russian trolls during election season, because the bitcoin trolls who evolved into Trump trolls were not punished in the slightest (I have a list of 300+ usernames that are still active today).
The point is that Reddit tries to moderate, which is good enough for their providers (AWS/Fastly).
The 8ch takedown wasn't actually due to issues with moderation, since (at least based on the owner's video) 8ch removed the post, actively responds to real law enforcement requests, and the original post was actually posted to IG. The issue was that CF was getting enough bad press, and more importantly enough calls/concerns from real Enterprise clients (this is speculation on my part), to take down the website.
That's a valid stance but they didn't host the website; they only provided DDOS protection for the actual host (which proceeded to drop 8ch once CF stopped providing the protection).
> It looks like a volumetric attack from this tweet. Wikipedia needs to use Verisign BGP mitigation. They create GRE tunnels to your routers and are capable of handling 2Tbps.
Great way for a state actor to intercept your traffic. A little bit of volumetric DoS and the target itself responds by tunneling through your partner(s).
>no Matthew Prince selectively and benevolently enforcing CF neutrality.
What's the logic behind this? It's still a single point of failure and relying on a corporation. If the Daily Stormer or 8chan tried to use them, they would probably be kicked off as well.
Cloudflare has a strategic business partnership with Baidu [1]. They are very likely to cooperate with the Chinese government to implement the Great Firewall of China.
Additionally, helping to block Wikipedia because China says so is much easier to excuse than blocking 4chan - they would just be complying with local regulations after all.
The Cloudflare 8chan action was based on a direct link with multiple actual mass shootings. Moreover, when they took the decision they went to great pains to explain that this was an exceptional case.
Going from that to 'undesired political speech will be censored' requires more of a slippery cliff than a slippery slope.
> What is this "direct link" you speak of? Did the shooters plan/recruit/organize their attacks on 8chan?
Legally, a "direct link" is irrelevant, you can rarely find a "direct link" between two of anything. What matters legally is whether 8chan was a "proximate cause" in creating the mass shootings. Whether one thing is the "proximate cause" of another is often pretty difficult to discern.
However, as a helpful guide towards determining proximate cause, lawyers ask whether one thing was the "but for" cause of another, i.e., would the mass shootings occur "but for" 8Chan? Put another way, if 8Chan did not exist, would these shootings occur?
Unfortunately, we do not have an alternative reality to play out events without 8Chan, so we cannot know for certain, but we can use evidence (e.g., 8Chan chats, how the shooter interacted with 8Chan and others on the service, etc) to try to simulate that alternative reality. All of this analysis also needs to consider related issues like freedom of speech on public forums and any commercial interests.
I'm not saying 8Chan is guilty or innocent, just that the existence (or lack thereof) of a "direct link" is pretty meaningless.
So FB's internet peers should then depeer Facebook in their routers, since the original material (the stream) was on FB? Or do you prefer your justice selective?
You're not really engaging with his point. Effectively banning 8chan by removing network protection does not just restrict extremists; it restricts anyone who used that forum.
Ultimately, such matters should be prosecuted by courts. It is inappropriate for organisations like cloudflare to leverage their position within essential network infrastructure to start editorialising what passes through their network.
> It is inappropriate for organisations like cloudflare to leverage their position within essential network infrastructure to start editorialising what passes through their network.
No, I think it's entirely appropriate.
"Don't troll" and methods for dealing with trolls has been a thing all sites have done since the internet was invented. I don't see any difference here at all.
Cloudflare blocking people that abuse the network is legitimate (e.g. spam, denial-of-service), just like it is legitimate for forum admins to block people that abuse the forum (trolling, explicit posts).
But cloudflare, or any other network infrastructure provider, shouldn't be determining permissible content for websites because they are not hosts/administrators for that content.
It is like a postal service reading your letters and then saying "we don't like what is being said, so you can't send letters anymore." They can and should stop people sending dangerous materials by post, but they should not be determining permissible content of letters.
See, I think 8-chan itself is a troll, and it is entirely reasonable to deal with it by refusing to provide service.
> It is like a postal service reading your letters and then saying "we don't like what is being said, so you can't send letters anymore." They can and should stop people sending dangerous materials by post, but they should not be determining permissible content of letters.
No it's not. It's like FedEx declining to deliver for a company which continues to cause it problems, or refusing to service Amazon[1]. Or like Visa refusing to service businesses which have lots of charge-backs.
If 8chan was cut off because they were subject to extensive network attacks and Cloudflare did not see any profit or value in serving them, then I am OK with that. I just don't think that's the reason.
I expect that a different site with the same contract and payment terms, subject to the same attacks, would have continued to be protected. Maybe I'm wrong, but it looked like a political decision, not a business decision.
It's not just supporting. Taking a neutral stance on censoring these things, or not being adequately proactive on hate speech, is now seen as condoning. You either censor your user base, or upstream will censor you. Gone are the days of "The net interprets censorship as damage and routes around it." The new policy is "The net interprets wrongthink as noise and filters it out."
It’s not censorship: they are not suppressing information, they just aren’t allowing their resources to be used to spread it.
It would be “censorship” if they actively antagonized any attempt to spread the information, such as by lawsuit or DMCA notice. They are just refusing to participate.
And given that the “information” is definitively known to be child pornography and violent white supremacy propaganda presented as news, I would personally say refusing to participate is the only responsible action.
> Gone are the days of "The net interprets censorship as damage and routes around it."
But it's clear that it matters just what's being censored. Surely you wouldn't say the same trite clever-sounding hackerspeak if we're talking about censorship of threats, assault and child pornography, would you?
They are beyond a certain line; some very-very far past it, some just crossed it. It makes them unsupportable by any corporation that aims to look decent.
Genocide has been and still is a political tool. It is extreme, but ultimately something that people consider and carry out as part of political processes, not a special category of its own. And realpolitik is to continue dealing with countries that practice genocide. Consider Burma or China.
Cloudflare simply has the luxury of choosing which politically disagreeable parties they do not want to associate with because they are insignificant customers.
Pretending that this is not due to differences in politics and moral judgment is semantic smoke and mirrors.
Anyway, the point is that they are not a neutral carrier/provider, unlike banks or telecoms, which are required by regulation to accept any legal business. CF styles itself as neutral infrastructure, until they decide they are not.
The risk of getting deplatformed due to someone's moral judgment is quite real, even for an entity such as Wikipedia. For example, they were blocked in the UK because the Virgin Killer album cover landed them on a block list used by major ISPs.
I didn’t say it wasn’t political, but it’s not just undesirable for immediate political reasons — it’s undesirable for nearly universally-agreed moral and ethical reasons. So implying it’s only inconvenient for politics is, in my opinion, misleading.
The political tends to encompass or at least subsume the moral and ethical aspects, as I tried to allude to with the realpolitik aspect.
But again, this is just a tangent. The core argument is that it is best not to rely on providers that have the freedom to make political/moral decisions who they deal with because that freedom makes them susceptible to moral denial of service attacks. You are one moral outrage away from being deplatformed.
The argument made here is that there is a chance (however minute) that the same can happen to something like Wikipedia because of some misplaced sense of morality, like, say, "we don't agree with Wikipedia's edits and editing process, which we see as offending certain sections of X population." It does not matter how right their reason is. The fact that providers like Cloudflare are in such a position to take a moral high stance is not right ...
There are plenty of specialized providers offering this service; Verisign is one of many.
The issue with on-demand BGP mitigation is that an attacker can do short attacks on and off over a long period of time. Each time the mitigation kicks in, BGP propagation takes at least ~1 minute and will cause some downtime. Proper protection is always-on without requiring redirection.
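To give a flavour of what "on-demand" means mechanically, here is a minimal sketch assuming an ExaBGP-style setup, where a helper process prints announce/withdraw commands to stdout and ExaBGP relays them upstream toward the scrubbing provider. The prefix, next-hop, and attack-detection check are all placeholders, not anyone's real configuration.

```python
#!/usr/bin/env python3
"""Toy on-demand mitigation trigger, written as an ExaBGP 'process' hook (sketch only).

ExaBGP can run a helper process and treat lines printed to stdout as API
commands such as 'announce route ...' / 'withdraw route ...'. The attack
detection below is a placeholder; a real setup would watch NetFlow/sFlow
counters or an alerting API, and the addresses are documentation ranges.
"""
import time

PREFIX = "203.0.113.0/24"     # placeholder: the prefix to divert to scrubbing
NEXT_HOP = "192.0.2.1"        # placeholder: tunnel endpoint toward the scrubber

def under_attack() -> bool:
    # Placeholder detection logic.
    return False

def main() -> None:
    diverted = False
    while True:
        attack = under_attack()
        if attack and not diverted:
            # Announce the prefix so traffic gets pulled through the scrubber.
            print(f"announce route {PREFIX} next-hop {NEXT_HOP}", flush=True)
            diverted = True
        elif not attack and diverted:
            # Withdraw once the attack subsides; note the point above that
            # each flip costs roughly a minute of BGP propagation.
            print(f"withdraw route {PREFIX} next-hop {NEXT_HOP}", flush=True)
            diverted = False
        time.sleep(30)

if __name__ == "__main__":
    main()
```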
What a world to live in if you are among the 1.x million in some northern part of China, or the 1.x billion living inside the Great Firewall, or the 7m in Hong Kong, or the 2x million in Taiwan.
You live in a world where a totalitarian communist state is welcomed and controls a significant portion of the world economy, and even speaks at internet summits.
Welcome to the brave new internet and international world of china.
Just want to mention, WMF has a very small but elite team of engineers. I'm amazed they maintain an Alexa top-5 site with orders of magnitude fewer engineering staff than Facebook or Reddit. I think they must count ~100 engineers?
I can't imagine what such a small team must be going through with a major DDOS - wish them well in their efforts!
It's because they're just serving a big site, not running the world's most sophisticated surveillance and ad serving machine. Serving giant websites isn't all that hard if you're just spewing out SQL queries into html templates. It all scales in all directions with a properly thought through architecture.
> Serving giant websites isn't all that hard if you're just spewing out SQL queries into html templates. It all scales in all directions with a properly thought through architecture.
No.
1. Your comment makes it sound like Wikipedia is just, or mostly, serving read-only content, which is far from true. Yes, static read-only content is significantly easier to serve than dynamic, editable content, but Wikipedia is the latter.
2. Claiming that building something at this scale "isn't all that hard" just makes me think you've never done anything similar. It reminds me of devs saying they could re-build MS Office over a weekend. It's just ignorant of the software's actual complexity.
I'm not associated with Wikimedia in any way, but have worked on large-scale software projects before, and things are quite different from, say, websites only serving 100k monthly active users.
I have, actually, worked on very large and interactive websites at the very core. Notably: betfair.com which has a very busy API and website and used to be something like a 1:10 write:read ratio with multiple clusters and layers of fancy caching to keep it all coherent down to millisecond scales.
Wikipedia does not need to be globally consistent like Betfair does and the ratio of writes to reads is nothing like 10%, I'd guess at one write per million reads or less. There are several pretty obvious ways to architect a site like Wikipedia for effectively unlimited scalability. The main trick is that it doesn't matter if a page is slightly stale and you can queue edits in the backend for quite some time (many seconds) without severely harming end users. Given those constraints it really isn't rocket science given the plethora of amazing tools we have to hand.
What I'm NOT saying is that I could build it in a weekend. It would clearly require a few teams of skilled engineers to put it all together and, crucially, operate it. My initial comment was in the context of Wikipedia having 100 engineers, and I think it's reasonable to say that a team that size is easily capable of such a feat.
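To make the "slightly stale reads are fine, queue the edits" idea above concrete, here's a minimal sketch in Python; the in-memory store, queue, and renderer are stand-ins for illustration, not anything Wikipedia actually runs.

```python
import queue
import threading
import time

CACHE_TTL = 30          # seconds a cached render may be served while stale
cache = {}              # title -> (rendered_html, timestamp)
store = {}              # title -> wikitext (stand-in for the real database)
edits = queue.Queue()   # pending edits, applied asynchronously

def render(wikitext: str) -> str:
    return f"<html><body>{wikitext}</body></html>"   # stand-in renderer

def read_page(title: str) -> str:
    """Serve a possibly slightly stale render; regenerate only on expiry."""
    cached = cache.get(title)
    if cached and time.time() - cached[1] < CACHE_TTL:
        return cached[0]
    html = render(store.get(title, ""))
    cache[title] = (html, time.time())
    return html

def submit_edit(title: str, wikitext: str) -> None:
    """Writes are queued; the editor gets an immediate ack."""
    edits.put((title, wikitext))

def apply_edits() -> None:
    """Background worker: drain the queue, update storage, drop stale cache."""
    while True:
        title, wikitext = edits.get()
        store[title] = wikitext
        cache.pop(title, None)   # next read re-renders the fresh version
        edits.task_done()

threading.Thread(target=apply_edits, daemon=True).start()

if __name__ == "__main__":
    submit_edit("Example", "Hello, world")
    edits.join()                 # wait for the background apply in this demo
    print(read_page("Example"))
```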
Hi, I am interested in this project. Could you please provide some minor detail about the architecture, like what framework was used for serving that many requests?
I've never heard anyone in my life say they could rebuild MS office in a weekend.
What, in your opinion, would be the work needed to go from a 100k monthly active user site to a wikipedia scale site - that would be comparable to rebuilding MS office?
The core parts of Office could be done on a weekend, but in order to get the same complexity and incompatibility it would take several "codemonkeys" several years to achieve.
Software projects are usually 90% done in 1% of the total time taken. If you just solve the problem with duct tape, e.g. a shell script piping stdin to a file, or contenteditable=true in HTML, you would have a very basic word processor, and if you take that route you will probably have the essential features done over a weekend. But going from that to a full Office clone would take years. The real challenge in development, though, is to solve real problems, i.e. not to build solutions looking for a problem, and to implement only features that solve real problems.
You're countering a point they didn't make. SQL to HTML templates indicates a dynamic site, not a static one. From there, they describe surveillance and ad networks that both increase the browser workload and make it rely on 3rd-party dependencies.
I thought it was a good, but snarky, point. Especially given my browsing sped up after I installed extensions that turn all that crap off.
"It all scales in all directions with a properly thought through architecture" sounds dangerously like, "Programming isn't that hard if you just do it right."
> Programming isn't that hard if you just do it right.
That's not a tautology. In fact, it's actually worth pointing out, especially to junior engineers who get frustrated by how hard everything is, that it actually doesn't need to be that hard if you, well, do it right. Obviously that's not productive feedback without actually helping them be better, but it's far from a tautology.
For anyone wondering, a tautology is a statement that is logically true by construction, rather than contingently true because of the way the world is. For example, "Programming isn't that hard if it's easy" would be a tautology. Constructing a counterexample by changing programming to something else shows that this was not a tautology to begin with: "Sending a man to the moon isn't that hard if you just do it right," which is obviously false, because even if you do it right that's objectively difficult.
Programming is hard, but we make it much harder than it has to be by doing it spectacularly wrong in many ways, both individually and collectively.
Because we express logical statements in English, these two kinds of tautologies overlap. (If we were using a formal language we could express tautologies like !(A && !A) without using English.)
If you say "All bachelors are unmarried", this is true both because of the meaning of the words, and because of the logical structure implied by the words.
In either case, if you state a tautology, you state something which is true in all possible worlds, given the definitions of the words, at least. Someone can then call you out for stating a tautology, which is to state something that is vacuously true, that is, you've made a statement about the nature of reasoning itself, in any possible world, but you haven't said anything at all about the world we're actually in. So you're wasting your breath even though what you say is unassailably true.
(Note that in mathematics, the tautologies are precisely the theorems with their premises or axioms! So this is by no means always useless.)
The problem with calling out tautologies in common life, however, is that the danger of identifying a false tautology is very high. When someone says "Either X happened, or it didn't." you may be tempted to say "tautology!" but in fact they are probably making some kind of oblique point or highlighting a flaw in someone else's argument, etc. In other words, tautologies may be vacuous as statements about the world within a logical framework, but as speech acts in the real world, they always come with a motivation and that can usually be expressed in a non-tautologous way. For example, "A or not A" can be expanded charitably to "A or not A, and this is relevant to the topic at hand", which is not a tautology anymore.
In this case, if you do something right, it's not extra hard, which is kind of a tautology. But there's a point in saying it, which is that it doesn't have to be that hard... if you do it right. And that's not true of everything or in every possible world, hence not a vacuous statement.
> That's not a tautology. In fact, it's actually worth pointing out, especially to junior engineers who get frustrated by how hard everything is, that it actually doesn't need to be that hard if you, well, do it right
But this boils down to "If you build systems using a high level of skill and foresight, it's easy to do."
This is of course not a tautology, but a contradiction. I agree that inexperienced developers can, as it were, 'make life hard for themselves', but that's (trivially) due to their inexperience. I don't think there's a silver bullet for inexperience.
Over-engineering is bad, as is under-engineering. Fuzzy principles like 'YAGNI' can't be applied without skilled discernment, which means experience.
> Programming is hard, but we make it much harder than it has to be by doing it spectacularly wrong in many ways, both individually and collectively.
I think I agree with this, but it depends on specifics. What sorts of things are you thinking of?
> If you build systems using a high level of skill and foresight, it's easy to do.
The point being made here is not necessarily a flippant 'git gud'. Instead, it is a statement that problems are tractable, and that getting some things right up-front can have good pay-offs down the road.
In other words, don't give up and try to figure out what is good and bad practice.
Yes, "don't give up" is the main thrust. However, if an individual exists in an environment where bad practice is rewarded and good practice is scorned, the advice needs to go beyond individual practice. We need to not give up on the environment, and that in turn requires hope that a better environment is possible and within our reach, as an individual, a team, and a discipline/craft/practice. This is a hard problem.
> But this boils down to "If you build systems using a high level of skill and foresight, it's easy to do."
Yes.
> This is of course not a tautology, but a contradiction.
It's not quite a contradiction! If you bill $1 for changing the bolt, and $9,999 for knowing which bolt to replace, this shows that the work is easy, but the experience required to make the work easy is not easy. If the master can draw it in seven strokes, but you don't see the seventy thousand strokes they did before, it looks easy, and in fact it is easy, for the master but not for the novice.
> I agree that inexperienced developers can, as it were, 'make life hard for themselves', but that's (trivially) due to their inexperience. I don't think there's a silver bullet for inexperience.
That's right. However, we can also make life hard for each other, and there are some solutions for that that are better than doing nothing.
> Over-engineering is bad, as is under-engineering. Fuzzy principles like 'YAGNI' can't be applied without skilled discernment, which means experience.
Yes. This is why we have code reviews, design reviews, pair programming, and so on, but these aren't silver bullets either and there is no silver bullet, but if these things lead to increased awareness of why and not just what and how, then we can accelerate the process of acquiring that discernment. As Dijkstra said, if it goes to the grave with you and you didn't pass it on, you didn't really do your job as a senior engineer (paraphrasing).
>> Programming is hard, but we make it much harder than it has to be by doing it spectacularly wrong in many ways, both individually and collectively.
> I think I agree with this, but it depends on specifics. What sorts of things are you thinking of?
Using the wrong tool for the job. Using too many tools for the job. Using tools that do not afford mastery, because they are too complex for anything built on top of them to be comprehensible.
This is all quite abstract. A specific example: we (the JavaScript community) had a good thing in that JS was a small, human-scale, useful and commercially valuable language with applications beyond its initial environment on the web. We got excited and built the npm package archive, and filled it up, and now we have unmaintainable, incomprehensible piles of trash piled upon trash that can't possibly be used as a foundation for anything reliable, performant, or maintainable. This is unfortunate. What's even more unfortunate is that this piled-up-trash approach still has momentum, still lets people get useful work done, and still has some value to the community. So we keep using it, and even keep piling more on. It takes considerable effort to step back from all this, take a collective mulligan, and start over with a principle of taking things away to make things better, rather than adding more hacks to hide existing hacks.
This is one example of many, but I mention this one because I was there when JS was simpler and better and I watched as we made it markedly worse. I would have recommended JS as a first language to beginners when node.js and npm were new, and I did, but I cannot now recommend them in good faith, because they have become antagonistic to quality and to mastery of the craft.
> If you bill $1 for changing the bolt, and $9,999 for knowing which bolt to replace, this shows that the work is easy, but the experience required to make the work easy is not easy. If the master can draw it in seven strokes, but you don't see the seventy thousand strokes they did before, it looks easy, and in fact it is easy, for the master but not for the novice.
If it takes years to be able to do it well, it's not easy.
> So we keep using it, and even keep piling more on. It takes considerable effort to step back from all this, take a collective mulligan, and start over with a principle of taking things away to make things better, rather than adding more hacks to hide existing hacks.
True, but it can be done. The community moved away from Bower, for instance.
Watch the master attack an intermediate problem. They "make it look easy" because it is easy for them because they have been doing it for so many years they have forgotten that it was ever not easy. This is the real curse of knowledge.
> Bower
Yes, finally! In another 40 years most of the trash we're creating now will also be gone, probably replaced by more unless we find some discipline.
I appreciate what you're saying, but I don't think it quite applies. What I meant was that it's easy to create an architecture for an application that doesn't scale well at all, e.g. poorly sharded data, lots of cross dependencies, etc. However, if you properly think through your data model, data flows and use cases, it's generally possible to create a system that is extremely scalable in all directions. This is certainly not easy, but it's a hell of a lot easier than creating some huge AI-driven data-slurping ad empire.
>>"Programming isn't that hard if you just do it right."
Is this like saying, programming isn't hard if you choose easy enough problems to solve? Or should we ask for a link to see a demo of an AGI implementation?
I guess math is not hard either if you're "doing it right", as long as it's all arithmetic...
>>That's not a tautology.
I would agree tautology is not the best description, probably fallacy would do fine.
> Is this like saying, programming isn't hard if you choose easy enough problems to solve?
No, this is saying that things don't have to be as hard as we make them. You don't need more than a hundred people to run a top-ten website, and that shouldn't be surprising. It is surprising only because we are so good at making things overcomplicated.
But also perhaps it's because they didn't allow a team to endlessly iterate on tech minutiae until it required many teams just to keep it all running.
https://wikimediafoundation.org/role/staff-contractors/ has the names of 379 employees. I believe (perhaps astonishingly) that is all - engineers and non-engineers combined. Their engineers spread across departments, but judging by the 141 instances of the string 'engineer' in that page, I'd be surprised if the number exceeds 200.
Speaking as someone listed on that page, and having attended all-hands meetings, yeah, we're not huge.
Though it is worth bearing in mind that everything's open source, and there's a hefty community component. So there's a more vaguely specified number of people who might provide patches, and individual wikis are mainly run by volunteers.
That's what happens, I guess, when you're running a charity: you can recruit top talent (I assume many 10x folks wouldn't mind working for Wikimedia!) and every dollar counts. Pretty incredible.
They seem to be at the leading edge of hiring remotely and they don't pay anywhere near facebook salaries. The culture must be attracting some strong developers.
Wikipedia has a huge impact in people's lives, particularly in non-English languages, and there's so much work to do, and so much of it feels urgent and necessary. I really responded to that, and I wasn't careful, and burnt myself out. (This was not the fault of the org; Wikimedia is largely a do-ocracy, and if you're intent on working through the small hours of the night, there is very little anyone can do to stop you. Co-workers who saw what I was doing did urge me to pace myself and exercise self-care.) By the time I realized what I was doing, I was in a pretty bad way, and felt like I needed a complete change of scenery to get back on my feet.
Interesting! I see that happen to people in the non-tech non-profit world a lot. Well, thank you for your service, and hope you are starting to enjoy a rest well deserved!
The proportion of read/write may skew towards reads, but Wikipedia still is an application where any user can create state visible to all other users. It's not as simple as this comment makes it out to be.
But how quickly must those writes be reflected in the reads of others? If you can accept a few minutes of latency there, I imagine things would get easier
In order for Wikipedia's "anyone can edit" model to work, it's really important that when someone makes a bad edit to a popular article, it can be removed immediately. This is important both to get things fixed quickly and to make vandalism less of a juicy target, so fewer people vandalize (no fun to vandalize if it doesn't stay up).
I suspect latency in the minutes for cache updates would be unacceptable to Wikipedia users.
Power users use very different workflows than read-only users. You can serve pages from a 30-minute-old cache to the 99.9% of passive readers and it doesn't hurt that much. Editors use "Recent Changes" to monitor edits, and that's much easier to render in real time because the audience is comparatively minuscule.
Yes, but if someone replaces the picture on the Trump article with goatse, and non-power users get this version for 30 minutes until the cache clears, they are going to be pretty pissed and start yelling at power users and generally cause a PR disaster.
Additionally, if vandals know their vandalism will stay up for 30 minutes, they are much more likely to do it, which is a vicious cycle.
Aren't articles like Trump's write-protected? You shouldn't be able to edit them unless you have an account that's not brand new. And you will be banned very quickly as soon as you start putting goatse on the most-visited pages.
Also, the White House PR team is actively watching and editing political figures' articles. They will sort it out too.
First off, keep in mind that even scaling a broadcast publication can be complex. Sure, one can bolt on Fastly or S3, but cache invalidation is never a simple problem.
Next "power users" as others put it are not a single set of editors. It's more of a social network with multiple levels of trust. The idea of a wiki is that all users have write access, even if those changes are moderated to have different levels of latency.
Of course there are ways to engineer the system, but at that point one is, well, engineering a system. And WMF is doing so on a shoestring compared to other comparable levels of traffic.
Is WMF creating new paradigms of computing? Probably not. But they are doing a good job, IMHO.
It must be immediate for logged-in users, but not others. It's an excellent thing that Wikipedia doesn't nudge people to log in all the time, and I suspect 95% of users are not logged in.
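One way to reconcile "serve anonymous readers from cache" with "vandalism reverts must show up immediately" is to purge the cache explicitly on every saved edit instead of waiting for a TTL, while logged-in users bypass the cache layer entirely. Here's a minimal sketch assuming a Varnish-style front end that accepts the non-standard PURGE method; the host names are placeholders, and this is only my rough understanding of the general pattern, not a description of Wikipedia's actual setup.

```python
import http.client

# Hypothetical front-end cache nodes that accept the non-standard PURGE
# method (as Varnish can be configured to do). Host names are placeholders.
CACHE_NODES = [("cache1.example.internal", 80), ("cache2.example.internal", 80)]

def purge(path: str) -> None:
    """Ask every cache node to drop its copy of `path` right now."""
    for host, port in CACHE_NODES:
        try:
            conn = http.client.HTTPConnection(host, port, timeout=5)
            conn.request("PURGE", path, headers={"Host": "en.example.org"})
            conn.getresponse().read()
            conn.close()
        except OSError:
            # A real setup would retry or queue the purge instead of dropping it.
            pass

def on_edit_saved(title: str) -> None:
    # Called after an edit (including a revert of vandalism) is committed,
    # so anonymous readers see the new version immediately, not after a TTL.
    # Logged-in users would skip the cache layer entirely.
    purge(f"/wiki/{title}")

if __name__ == "__main__":
    on_edit_saved("Example_article")
```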
Their expenses have doubled in less than 5 years...
Even if their expenses/assets ratio has decreased compared to 3-5 years ago (though it has stalled now), it means that their goal of financial independence is still very far away and they still rely heavily on a huge amount of donations.
I buy that yes it takes time to be financially independent of donations while offering a global information service for free.
But that essay is clearly pure hyperbole. The expenses aren’t exponential, they’ve been roughly linear for a decade. Notice how the word exponential was removed in the second version. The graph is showing increasing savings along with increasing growth, and the expenses appear to have slowed slightly in the last five years compared to the five prior years. It’s completely failing to demonstrate the stated claim of runaway spending, the numbers practically prove the opposite.
Plus it’s not outlining what the money is used for, so there’s no concept of efficiency here, no reason to doubt that increased service came with increased expenses. There’s zero meat in this argument.
Whatever; last year's total expenses seem very small to me compared to web sites of similar size; there are startups smaller than Wikipedia's team that have raised more money than Wikipedia's yearly expenses without managing to deliver anything. Wikipedia's value to the world is currently larger than its expenses, IMO, and I think it's impressive what this non-profit has done.
It’s sorta interesting how the replies to this digressed into linguistics about systems architecture, and nobody called out the “elite engineers” statement.
I’ve worked with a couple of engineers who are now on Wikipedia’s SRE team. They’re good engineers, but not elite by any means. Not “10x” developers or wizards in castles or whatever. Good solid engineers who I would work with again and fight to hire. But they’re not savants or even the top 10% of folks I’ve worked with. Solid mid to sr level engineers I’d be happy to hand a project off to with ambiguous goals and little oversight, and I’d expect them to get a team of 4 or so other engineers to be more productive.
These are the engineers who meet the job requirements for SRE positions.
I used to work at FB and now work at Reddit. The engineering staff count at Reddit is within the same order of magnitude as the number you cite above. :)
No offense taken, I don’t work on the product side of things there.
Also, with the caveat that I don't know enough about the implementation details of the product at Reddit: I'd argue that Reddit's workload is more write-heavy than Wikipedia's, which makes caching and scaling a bit harder for Reddit, relatively speaking.
To add the Reddit figures to the discussion. There are something like ~2,100-2,200 comments posted to Reddit per minute on average across a year at this point (around or slightly over three million comments per day). That's not the peak minute figure of course, which is no doubt several times higher.
So it's actually comparable, I'd say, in terms of frequency. In terms of how much "writing" this actually means, it varies: an edit to a large article, even if it's just a comma, requires several seconds of parsing the wikitext, a lot of events propagated in various places, etc.
Certainly not trivially - the subreddits are not wholly independent entities. User accounts are shared between them, for instance, and every post a user makes is (searchably) linked to their account, regardless of which subreddit they posted it in. Users can also send each other private messages, and this does not take place in the context of any particular subreddit.
Wasn't Instagram famous for having a very small team of engineers responsible for the availability of the entire platform before Facebook acquired them?
Business Insider ran a story about them at the point of acquisition.
April 9, 2012 "Instagram was acquired by Facebook today for $1 billion in cash and stock. It only has 13 employees and a handful of investors. ... Meet 11 of the lucky employees and 9 investors behind Instagram. ... Two other employees were hired during South by Southwest last month and their information wasn't available for this story. "
> I can't imagine what such a small team must be going through with a major DDOS - wish them well in their efforts!
Not only that. They do all this with amazing openness. Their records of incidents and deployments, who's in charge of what, rotation schedules are all public and shared in MediaWiki (although they're not that well organized). I can trace this back to circa 2005. Maybe this could be the largest knowledge base of devops that is public.
Except that mental states are perhaps less stable than physical ones, ultimately, and using a 'web site' is largely a mental model on the part of the user, while it's a technical model on the part of the provider. Dysfunctional mental drivers + lots of access + lots of time... versus a door that locks each night and an alert attendant or three.
This is dismaying but not shocking. The first time I saw a newly planted tree on an otherwise bleak urban block vandalized and broken, I realized that a drive towards "better" is not to be taken for granted, and needs protection.
There was a string of arson attacks on little free libraries in Metro Vancouver; eventually a pair of teenage boys were arrested.
I suspect that the sharing of knowledge and encouragement of developing wisdom is, to some, a threatening prospect. Perhaps they have experienced learning difficulties and are struggling with shame and frustration, or perhaps they disagree strongly with the concept of an intellectually liberated population. Libraries are, after all, a pillar of liberalism.
That's sad to hear, I always love coming across the little free libraries. I would guess they were just bored and angry teens, likely not making any deeper statement but just expressing their anger and frustration and willingness to break the rules. Also, it's fun to watch things burn, and they come with built-in kindling. Hopefully some judge will make them rebuild what they burned, that would be fair and give them more appreciation for the work of others that they destroyed thoughtlessly.
> or perhaps they disagree strongly with the concept of an intellectually liberated population. Libraries are, after all, a pillar of liberalism.
Thoughtful comment. I would agree with you if the attack were somehow organized on 4chan /b/, Kiwi Farms, or some underground IRC channel for mysterious or unexplained reasons. If that happened, its philosophical implications would be deep. And I won't be surprised if it occurs one day.
But so far there's no evidence to suggest the attack has any ideological motivation beyond making the attacker famous.
> Just like trying to set your local public library on fire.
No, more like superglueing the doors shut for a few hours.
I don't really see the damage, especially since they moved on to Twitch and WoW servers. I'd say more work, and a few early nights of sleep, got done in the end.
Russian fake news is widespread and easy to find. How can this be controversial, and even downvoted on HN of all places? Russian military shot down a plane full of politicians over Ukraine, and boldly lied about it in the international press. This is just one of literally hundreds of such instances. Russia has been caught numerous times trying to sway elections across the world. How on earth is “Russia whitewashes its name on wikipedia” needing substantiation? It seems obvious, even.
Part of the liability should be shared with the people owning the compromised machines these crazies are using for their attacks, otherwise attacks like these will never stop as long as enough free “ammunition” is being left around by incompetent people who can’t be bothered to secure & monitor their systems properly.
Edit: in reply to some of the (valid) counter-arguments, I'd like to say that there are indeed many issues that will need to be considered before passing such a law - this is just an overall idea. In addition, my intent isn't to punish the occasional kid doing something stupid and leaving a misconfigured device, it's to punish companies selling/deploying obviously insecure devices at a large scale, like ISPs deploying cheap shitty outdated network hardware or the countless resellers white-labelling insecure network cameras. Currently there is no penalty for manufacturing insecure hardware and this situation is the consequence of that - I'd like to fix this problem. We have regulations that (mostly successfully) prevent companies from selling hardware that blows up and destroys your house, why can't we have the same for networked hardware?
I don't see why you'd come up with something that so misaligns the interests of everyone except lawyers.
Instead, imagine the DDoS landscape if we had to pay a small price for bandwidth. There would be a natural disincentive to having a toaster saturating your bandwidth as part of a botnet, because it would quickly show up on your bill. And something as simple as shipping an IoT product or Raspberry Pi with a bad default username/password might earn bad reviews like "1/5 stars, this product immediately raised my internet bill."
I know it's not perfect, and most of us have a bad taste in our mouth from paying out the wazoo when bandwidth is priced per GB, but it can be a fair system if priced well, one that fights against our botnet reality where we basically have zero insight into when our networked devices are compromised.
I can also imagine better tooling provided by our ISPs in this world where they help us track down and itemize our bandwidth costs. "Honey, why is SmartToaster89 costing us $24 in network fees?"
It's impressive how poorly our current system equips everyone except malicious actors. How many ISPs don't even filter spoofed outbound packets?
It's hard to complain about everyone centralizing around Cloudflare with the state of cheap DDoS muscle.
I don't see why you'd come up with something that so misaligns the interests of everyone except Comcast :)
We don't need to be priced by the bandwidth, we just need better accessibility to metering. Something my mother could look at and say "huh, the toaster's sent 8gb of data today..."
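As a toy illustration of the kind of per-device visibility being asked for, here's a minimal sketch that aggregates hypothetical per-device byte counts (say, exported by a home router) and flags outliers. The record format, device names, and threshold are all made up for the example.

```python
from collections import defaultdict

# Hypothetical daily export from a home router: (device name, bytes sent upstream).
FLOW_RECORDS = [
    ("laptop", 120_000_000),
    ("phone", 45_000_000),
    ("SmartToaster89", 8_000_000_000),   # the suspicious one
]

ALERT_THRESHOLD = 1_000_000_000   # flag anything over ~1 GB/day upstream

def daily_report(records):
    totals = defaultdict(int)
    for device, sent in records:
        totals[device] += sent
    for device, sent in sorted(totals.items(), key=lambda kv: -kv[1]):
        flag = "  <-- check this device" if sent > ALERT_THRESHOLD else ""
        print(f"{device:>16}: {sent / 1e9:6.2f} GB sent{flag}")

if __name__ == "__main__":
    daily_report(FLOW_RECORDS)
```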
It would consume more electricity, though let's assume by a marginal amount.
I suspect many people are uncomfortable holding a compromised device like that. The unpredictability of a toaster helping to take down Wikipedia is wild and potentially seen as a sign of chaos, especially for less technical users.
Who knows what else this crazy toaster will do next? Will it do the same thing again?
Big box retailers seem to be able to comply with regulations mandating physical safety. Digital security requirements could be enforced by a similar system.
Because "physical safety regulations" is something that the majority understands, so it's hard to argue against that in public. With digital security, most people lack the mental models to follow the discussion, so it's really easy for lobbyists to tell them flatout lies about how those damn dems are out to take their smart lightbulbs away from them.
I explicitly formulated very carefully that this is not an issue of "people are dumb", but an issue of lack of understanding. I wish I could downvote your strawman.
Are you claiming that most people do understand computer security? My experience is that even many computer-savvy people (already a small fraction of overall population) are completely baffled by its intricacies.
Is liable? AFAIK strict liability only applies in specific cases; in general cases, negligence applies. The claimant would have to prove that there was a breach of duty and that a reasonable person would have done something to prevent the damage.
You can't expect everyone, kids and elderly included, to be able to identify when their machine is running a rootkit from the result of exploiting a 0-day, for example.
People also have a very limited view of what's happening on their phones, too. What if the rights to the source and distribution of a free closed-source app are purchased by someone who's going to modify it to include all users in their botnet? It's not like you can monitor what kind of traffic your phone apps send out.
You can't expect everyone to be able to identify when their car is not running as expected. Wait, you can and you must under the law. Also there are liabilities.
Very much not the same thing. Your car can't interact with the cars of others without physical contact, and when it does interact, via a crash or otherwise, it would be very obvious to anyone. Even if you were constantly watching the traffic of your phone and other devices, you'll probably miss malicious packets that are sent among the thousands of packets each device sends per minute. It's also not as obvious to recognize what constitutes malicious behavior in your internet device compared to your car.
Also, the operation of the car is simple enough that you can take it to a mechanic for an inspection and they can reliably inspect everything that the car does. There are no hidden behaviors under complex conditions, like crashing into others when there is a full moon or the sky is cloudy. Your devices can do that. If you bring me your phone/laptop/etc and ask me if it's going to send malicious packets to someone somewhen, I can't reliably tell you that it won't. I'm not sure that even if you gathered all software and electronics engineers that supposedly were involved in the construction of your device, they'd be able to provide a reliable answer. I can tell you that it seems like it wouldn't based on initialization files and services, but I can't tell if the function is hidden somehow, like obfuscated in the machine code of the kernel or something. Finding that would require auditing all assembly code running on the machine, which would not be a task for mortals.
What I'm saying is that it's very simple to inspect a car and make sure it's not going to spontaneously catch fire while parked. You can get a reliable answer in a day from a mechanic.
You can't get a reliable answer on whether a computing device is programmed to send malicious packets. There's too much code, most is compiled, there's too many ways to hide it. You can probably gather the smartest people in the world and leave them to die of old age before they can arrive at a reliable answer.
We are in the area of probability in both cases. Oftentimes it is obvious if a PC/device is infected. Sometimes it is really hard to find out: https://en.wikipedia.org/wiki/Stuxnet
The question is whether you feel good enough about that probability to be held liable if the malicious code was hidden good enough. Also, some cases might be obvious on Windows PCs, but I don't think that's necessarily the case with phones. Take note that websites can also send malicious packets. When you load a webpage, the code is downloaded and immediately executed. Are you OK with being held liable for visiting a webpage that decided to send malicious packets?
The liability boundary is an important question. There is no simple answer. A couple of years back, the owner of an abandoned building was declared liable for the death of a kid who entered the fenced building despite all the no-trespassing signs and so on. Neither your example nor mine negates that there should be liability for various voluntary and involuntary acts, one's own or a third party's.
> A couple of years back, the owner of an abandoned building was declared liable for the death of a kid who entered the fenced building despite all the no-trespassing signs and so on.
Honestly, that doesn't sound right. I hope I'm misjudging because of lack of details.
I feel like you try very hard to establish a strawman. No one expects 100% perfection. Establishing and following industry standards is a good first step.
Like no unencrypted local passwords. Individual default passwords for every individual device. Not using outdated versions, especially once vulnerabilities are known. Including an update mechanism and providing updates for at least X years.
And yes, trained specialists will be able to work through such checklists for many commonly used software, just like your car mechanic.
And by the way, no one expects your car mechanic to [a] be perfect (you really never heard a story of a car breaking again just after leaving the shop?) or [b] be able to handle any kind of vehicle unknown to him.
The goal of rules like that is to punish the worst tier, thereby raising the bar. But this will probably be harder to implement in the US with their everyone-sues-everyone mindset. Reminds me a lot of the great GDPR scare, but now, IMO, quite reasonable actual cases are happening.
> I feel like you try very hard to establish a strawman.
This is the second time someone's told me that. Looking into what a strawman is again and reviewing my comments, I'm not sure I'm doing that. The examples I see on Wikipedia[1], at least, don't seem to have a strong relationship of implication. That is, the strawmen aren't directly implied from the proposals.
In this case, I do think that making one liable for damages their machine is causing to other people's machines does directly mean what I said, that one would be liable for behavior they cannot control as well as they can control the behavior of their car.
My intentions are to provide not strawmen, but counterexamples where the proposal fails.
> No one expects 100% perfection.
I do. I'm not really OK with laws where I don't have reasonable control of whether I break them or not. In this case, the only effective control I'd have is to not have an internet device, and that seems unreasonable.
I think we'd all like to think otherwise, but the traffic sent by our phones is very much out of our control because of the reasons I stated, and nobody reviews the javascript code received from an HTTP server before executing it. It seems crazy to be liable for whatever it does.
> And by the way, no one expects your car mechanic to [a] be perfect (you really never heard a story of a car breaking again just after leaving the shop?) or [b] be able to handle any kind of vehicle unknown to him.
I think the analogy isn't that strong. Visiting webpages is like changing car parts every second as the car is running. Malicious behavior of these car parts is not noticeable at all and they're not easy to spot from inspection either.
> The goal of rules like that is to punish the worst tier, thereby raising the bar. But this will probably be harder to implement in the US with its everyone-sues-everyone mindset. Reminds me a lot of the great GDPR scare, but now, imo, quite reasonable actual cases are happening.
Well, there were a lot of things that scared people about GDPR, but I take your point to be that a law can be broad and technically applicable to many people unfairly, yet only applied to just cases in practice. I'm not sure I like that kind of law, though. Even if it works well in practice for the majority of cases, it seems like the kind of thing that lends itself well to abuse: the kind of law that everybody is guilty of breaking, even if they're not all actively prosecuted.
Brakes squeal when they are wearing down. If I put a penny in the tread of my tires and see Abe Lincoln’s head I know they are bald. If my head lamps go out I’ll notice; if it’s a brake light I get a red indicator lamp on my dash that says I have a problem.
Let’s talk about liability when home routers make a revving engine sound when they push too many packets per second, or start playing a “buckle up” warning chime every 6 seconds if they see packets heading to a C2 server.
I'm liable if the water/sewage line breaks in my condo, and there won't be any 'squeal'. Analogies work, but they are never exact. My point is that there should be liability for malfunctioning internet equipment, definitely so for businesses.
Maybe. Or maybe liability exists for outside parties in that case: the plumber who was drunk when they put the pipes in, the architect who designed the wall in such a way to force a ton of joints in one weak spot, the building inspector who signed off on it... heck, maybe the water company is causing a water hammer to form because their pumps are busted.
Now if my home owners insurance finds that I flooded the downstairs condo because I fell asleep with the bath running, you bet I’ll pay.
But no matter what, both of your examples have a robust regulatory structure around them in terms of licensing and inspections. That is why liability works: without those structures you can’t say “you fucked up, therefore you pay”.
I’m all for adding liability into the system but if we do we must do it in a way that spreads the burden to the right places (IoT manufacturers, negligent ISPs) and doesn’t push it straight to the consumer.
>> I’m all for adding liability into the system but if we do we must do it in a way that spreads the burden to the right places (IoT manufacturers, negligent ISPs) and doesn’t push it straight to the consumer.
Now, having said that, when the limb of my tree knocks the power line off my house I have to pay to fix it, but the electric company is on the hook to send someone to turn the line off so my electrician can work on it.
ISPs have to be in the liability chain too: if one of their customers is talking to a C&C server and participating in a DDoS they have to switch off the customer until repairs can be made.
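To make that concrete, here is a minimal sketch (in Python, with a made-up flow-record shape and placeholder blocklist addresses; it is not any real ISP's tooling) of how an ISP could flag customers whose traffic touches known C&C addresses before deciding to notify or quarantine them:

    # Sketch only: flag customers whose flow records show traffic to addresses
    # on a C&C blocklist so they can be notified or quarantined until fixed.
    # The (customer_id, dst_ip, packets) record shape is an assumption.
    from collections import defaultdict

    C2_BLOCKLIST = {"203.0.113.7", "198.51.100.42"}  # placeholder documentation addresses

    def flag_customers(flow_records):
        hits = defaultdict(int)
        for customer_id, dst_ip, packets in flow_records:
            if dst_ip in C2_BLOCKLIST:
                hits[customer_id] += packets
        return {cid: n for cid, n in hits.items() if n > 0}

    sample = [("cust-1", "203.0.113.7", 120), ("cust-2", "192.0.2.10", 5)]
    print(flag_customers(sample))  # -> {'cust-1': 120}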
Gee, lemme just call up my grandma and teach her how to set up kismet and snort.
Seriously though, this is like holding someone liable if his car is stolen and used as a getaway car in a crime. It's also not really possible to get a shell on most of these devices without serious effort, so apart from turning one off, I'm not sure how anyone is supposed to mitigate this. They're too locked down to do any kind of disinfection, in most cases. I guess now I have to teach granny to use a UART cable, too.
I didn't say it was easy or a solution to the problem. I was adding an additional point of context to the discussion in order to clarify the parent comment, which was insinuating the impossibility of said action.
It's not easy, but it's not impossible. I could envision a software-based solution that approaches simplicity of installation.
What I'm not arguing is that someone should be held liable for their devices being used in a botnet.
I believe the manufacturers and retailers have the liability. But I am definitely not a lawyer.
> Nothing stopping you from capturing packets at the router or using tcpdump on your phone. Not convenient but theoretically possible.
You don't interact with people outside your bubble much, do you? It's time to start writing better code, not blaming users for the programmer's incapability.
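For what it's worth, here is roughly what that "theoretically possible" capture looks like as a minimal sketch; it assumes the scapy library, root privileges, and a placeholder blocklist, which rather underlines why it's unrealistic to expect ordinary users to do this:

    # Sketch only: watch captured packets and report any sent to a suspect
    # address. Requires root and the scapy package (pip install scapy).
    from scapy.all import sniff, IP

    SUSPECT_DESTS = {"203.0.113.7"}  # placeholder address for illustration

    def report(pkt):
        if IP in pkt and pkt[IP].dst in SUSPECT_DESTS:
            print(f"suspicious packet: {pkt[IP].src} -> {pkt[IP].dst}")

    # Capture 100 packets on the default interface and check each destination.
    sniff(count=100, prn=report, store=False)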
If these systems are owned by private people, then the company that designed/deployed them is liable. If I have root on the device, then it's my fault if I screw up; if I don't have root and it's just a plug-and-play appliance, then whoever designed/sold it can be liable. This solves the issue of "grandma buying an IoT washing machine" mentioned in another comment, as the manufacturer of the machines can be sued directly without bothering grandma (besides a recall program and/or a firmware update to patch the vulnerabilities).
You seem familiar with hardware and software hacking, but not the creativity of bad-faith legal hacking ;-)
If you pass that law on Day Zero, I claim that on Day One, manufacturers provide some horribly arcane command-line interface for rooting lightbulbs and washing machines, and add some boilerplate to their shrink-wrap licenses forcing customers to acknowledge that they have admin privileges on their devices.
Problem solved for them, Granny is liable again according to your system.
Does the license auto-root the device? If yes, then it's an obviously dishonest circumvention of the law and judges will see right through it. If not, then the manufacturer has to prove the device was rooted if they want to pass liability to someone else.
If that still doesn't solve the problem, the media will take care of it. "Buying this smart lightbulb puts you at risk of being sued for thousands of $$$" can't be good for manufacturers and they'd want to avoid the bad press.
It seems simpler and more direct for the media to say,
"Selling this insecure IoT-device/phone/router/tv that requires every consumer to become a security expert, and taking no responsibility for OTA patches and so forth, puts you at risk for paying hundreds of millions of dollars in fines and/or damages."
The negative externalities of owning vulnerable devices need to come home to roost at some point, yeah.
I've chosen to own a "dumb" (read: "reliable") washing machine, and it cannot be used in such an attack. I have to endure the indignity of peeking downstairs to see if I left clothes in it, which is a cost of sorts, but it's nowhere near the cost I'd expect to bear if I bought a vulnerable washing machine and it provided resources to knock Wikipedia off the internet.
What other disincentive to putting vulnerable devices on the internet do you propose?
Are we talking about zero days here or obvious vulnerabilities known for decades but nothing gets done because there’s no cost associated with leaving it insecure? I strongly suspect the latter.
This stands out to me as a problem that mandatory liability insurance would be well suited to. Make everyone liable for damages to people and organizations harmed by illegal use of their unsecured computers. Then everyone who has an internet-connected device has to purchase liability insurance. They would be able to choose between expensive policies that impose no surveillance or restrictions, or purchase cheap policies that require running surveillance software, or submitting to regular security audits. Most individual consumers would probably end up with cheap policies that just require them to use devices that receive timely security updates from vetted manufacturers and run behind typical home internet firewall rules.
I proposed regulation as an interim step in the past. Here's a reprint of it:
A combo of per-customer authentication at the packet level, DDoS monitoring, and rate limiting (or termination) of specific connections upon DDoS or malicious activity. That by itself would stop a lot of these right at the Tier 3 ISP level. Trickle those suckers down to dialup speeds with a notice telling them their computer is being used in a crime, with a link to helpful ways of dealing with it (or a support number).
As far as design goes, they could put a cheap knockoff of an INFOSEC guard in their modems, with CPUs resistant to code injection. Include accelerators for networking functions and/or some DDoS detection (especially low-layer flooding) right at that device.
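A toy illustration of the "trickle them down to dialup speeds" part, using a plain token bucket; the 56 kbit/s figure and the idea of keying one bucket per flagged customer are assumptions for the sketch, not a description of any real ISP implementation:

    # Sketch only: a token bucket that lets a flagged customer's traffic
    # through at roughly dialup speed and drops (or queues) the rest.
    import time

    class TokenBucket:
        def __init__(self, rate_bytes_per_sec, burst_bytes):
            self.rate = rate_bytes_per_sec
            self.capacity = burst_bytes
            self.tokens = burst_bytes
            self.last = time.monotonic()

        def allow(self, nbytes):
            now = time.monotonic()
            self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
            self.last = now
            if self.tokens >= nbytes:
                self.tokens -= nbytes
                return True
            return False

    # Swap in a dialup-sized bucket once a customer is flagged (~56 kbit/s).
    throttle = TokenBucket(rate_bytes_per_sec=56_000 // 8, burst_bytes=4_000)
    print(throttle.allow(1_500))  # first full-size packet still fits in the burst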
> Part of the liability should be shared with the people owning the compromised machines
You’re voluntarily signing up to get fined when someone hacks the computer in your house, because you connected it to the internet, right?
> Currently there is no penalty for manufacturing insecure hardware
I’m glad you see some validity in the counter-points, but doubling down on this idea of punishing manufacturers for things people do with their hardware seems misguided at best.
You can’t prove any hardware is secure; if there were such penalties there would be no hardware, so this is a total and complete non-starter. Moreover, there are lots of other bad things you can do with hardware, and this would open the door to holding manufacturers accountable for everything. Do you think Intel or Dell will accept fines for every successful hack into machines they made?
This isn’t unlike suggesting that ISPs should be held liable for people doing illegal things on the internet, or suggesting that it should be illegal to pay ransoms. It’s hurting the wrong people, and failing to punish the people doing wrong.
> this situation is a consequence of that
That’s a purely subjective opinion that ignores multiple causes, and ignores the single most direct cause: people who wish to do bad things. It would be just as valid to blame this on a failure of the education system & social civics as to blame hardware manufacturers. Maybe we should fine teachers who have students that later do bad things?
> We have regulations that (mostly successfully) prevent companies from selling hardware that blows up and destroys your house, why can’t we have the same for networked hardware?
First, the analogy is bad because there are zero good uses for consumer bombs in houses, while there are plenty of non-harmful uses for IoT devices.
Second, because there is a market for simple hardware that can be deployed inside of secure networks, and doesn’t require a team of security experts to run. Secure hardware is more expensive to produce than simply-connected hardware.
On a side note, I was surprised to learn that Wikipedia does not have a proper status page.
status.wikimedia.org redirects to a Grafana dashboard, and it too was down yesterday.
HTTP status codes tell you that something is broken, but likely not the details. Or nothing at all, if all you get is a timeout because the service just got DDoSed.
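As a small illustration of that point (the requests library and the URL are just assumptions for the sketch): an HTTP error code at least gives you something to reason about, while a timeout gives you nothing at all.

    # Sketch only: probe a URL and report either the status code or the
    # fact that the request never completed.
    import requests

    def probe(url, timeout=5):
        try:
            resp = requests.get(url, timeout=timeout)
            return f"{url}: HTTP {resp.status_code}"      # broken, but at least a hint
        except requests.Timeout:
            return f"{url}: timed out, no status code at all"
        except requests.ConnectionError as exc:
            return f"{url}: connection failed ({exc.__class__.__name__})"

    print(probe("https://status.example.org"))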
Many services and sites have them, a few random examples:
Does anyone know the nature of the attack? If specific countries were targeted, or if it was widespread but only crashed some smaller sites with less resources/redundancy?
"Attack" is a technical term and doesn't imply malicious intent. Many attacks are accidental or the result of negligence. Some are performed by researchers in purpose but without malicious intent (e.g. the attacker wants to prove that something is possible without actually doing any damage).
> Cloudfront or similar should offer DDoS protection for free as a gesture of goodwill, it's good bragging rights for CF so everyone wins.
Well, it is still a lot of wasted resources (bandwidth, energy, compute) for everyone involved (ISP, CF, attacker, defender, compromised machines), so I wouldn't be so quick to say that "everyone wins".
The interesting thing to me was that whoever this person was, he knew WoW Twitch personalities at least decently. Before his Twitter went down, he was using some of the phrases that people say to try and make fun of Asmongold. He was also posting screenshots from Asmon and Sodapoppin about the attack.
I appreciate them making this statement. I had issues accessing Wikipedia from Turkey a week ago and I assumed censorship, but this week I have also had issues accessing it from Poland and I started suspecting something was amiss.
Huh, so it wasn’t my network having issues. The weird thing was that, at the same time, google.com (though not google.de) stopped working. Weird coincidence?
My guess is they tried Google and Facebook without any luck, so they moved on. The choices made would tell me that they are younger, mid-to-late 20s, and probably not from an English-speaking country. Motivation: a sense of power.
I think one might conclude they're not English-speaking because it wasn't the English Wikipedia sites that went down. Then again, they may easily have tried those too; the English site is larger and built to handle more traffic.