not sure, it looks like its a different system that manages the DNS for DynamoDB. it got out of sync and deleted all the DynamoDB DNS records, that then broke all the internal AWS stuff
Yes, DNS started failing to properly fill name lookups. So, DynamoDB started failing. That started making security and other AWS services fail. Which in turn made higher level services fail.
Unexpected behavior, leading to unexpected inputs to systems, often leads to failures. Put simply, nobody thought X would happen on Y service or at Z level, so nobody wrote code to handle that scenario. May sound crazy at first, but it’s quite hard to cover all possible scenarios when writing code… Or with life in general.
not sure, it looks like its a different system that manages the DNS for DynamoDB. it got out of sync and deleted all the DynamoDB DNS records, that then broke all the internal AWS stuff
AWS is mostly built on AWS.
Yes, DNS started failing to properly fill name lookups. So, DynamoDB started failing. That started making security and other AWS services fail. Which in turn made higher level services fail.
It truly was a house of cards kind of moment.
Reading that article as a layman, I got the impression there were a lot of falling dominoes.
Unexpected behavior, leading to unexpected inputs to systems, often leads to failures. Put simply, nobody thought X would happen on Y service or at Z level, so nobody wrote code to handle that scenario. May sound crazy at first, but it’s quite hard to cover all possible scenarios when writing code… Or with life in general.