That was a pretty weak explanation. Only a single sentence addresses the actual technical issue, along with a mention of it causing knock-on outages elsewhere:
> We now understand the cause of the outage was stress on our infrastructure—which struggled with unprecedented load. That in turn led to a “thundering herd” effect—triggering a failure of our DNS system.
I was planning to wait for the post-mortem, since I dislike speculating and I'm not personally invested here. I'm curious when they'll release the full one, because it's going to be one of the most interesting in a while, given the scale of the failure and its potential implications... (ignoring today's outage)