Amazon Outage: Automation bug caused global disruptions

Seattle: The Amazon outage this week was caused by a bug in its automation software, leaving services from Signal to smart beds offline for several hours.

In a detailed post, AWS outlined a cascading series of events that triggered the outage, taking down thousands of websites and applications relying on its cloud infrastructure.

The Amazon outage primarily affected DynamoDB, AWS’s database system where customers store data, due to “a latent defect within the service’s automated DNS [domain name system] management system.”

DynamoDB manages hundreds of thousands of DNS records and uses automation to monitor the system, ensuring updates, capacity adjustments, hardware failures, and efficient traffic distribution.

Amazon Outage-Image from-AWS — Image Credits: Amazon Web Services | Cropped by BH

AWS traced the root cause of the Amazon outage to an empty DNS record in its Virginia-based US-East-1 data center. The automation system failed to repair the record automatically, requiring manual operator intervention to resolve the issue.

To prevent further problems, AWS disabled the DynamoDB DNS planner and DNS enactor automation worldwide while addressing the underlying conditions and adding extra protections. Other AWS tools were also impacted during the outage.

Platforms and services affected by the Amazon outage included Signal, Snapchat, Roblox, Duolingo, banking sites, and Ring, with Downdetector reporting over 8.1 million problem reports globally from users across more than 2,000 companies. While services were restored in a matter of hours, the outage caused widespread disruption.

Even connected devices such as Eight Sleep smart beds were impacted. Users were unable to adjust bed temperature or incline via the app during the Amazon outage. CEO Matteo Franceschetti apologized and rolled out an update enabling users to control critical bed functions via Bluetooth in the event of future outages.

Amazon Outage-Image Via-Matteo — Image Via: X@Matteo Franceschetti | Cropped by BH

Experts noted that the Amazon outage highlighted the world’s dependence on single points of failure in the cloud.

Dr Suelette Dreyfus, a computing and information systems lecturer at the University of Melbourne, said that the outages showed how dependent the world was on single points of failure on the internet.

“That single point isn’t just AWS – they’re the biggest cloud provider with 30 percent or so of the market – but rather the cloud as a whole, which is basically just three companies,” Dr. Dreyfus added.

The Amazon outage serves as a reminder of the vulnerability of cloud infrastructure and the far-reaching effects that automation failures can have on businesses, apps, and connected devices worldwide.

Editor's Pick

Apple shelves self-driving electric car project Titan

India-based Tata Group to build $5bn EV battery plant in UK

Cineworld Abandons UK, US, & Ireland Business Sale Plan

Amazon Outage: Automation bug caused global disruptions

Amazon challenges Perplexity over ‘agentic’ shopping bot

Zohran Mamdani wins New York mayor race in major shift

World Tsunami Awareness Day 2025 calls for global preparedness

Amazon challenges Perplexity over ‘agentic’ shopping bot

Shein faces French backlash; Bans all sex dolls globally

Starbucks sells majority stake in China business in $4bn deal

Meta reports record revenue; Profit hit by $15.9bn tax charge

Zodiacal Light: How to spot the subtle pre-dawn sky glow

Rare ‘blood moon’ lunar eclipse to light up UK skies

Massive ice calving at Perito Moreno Glacier sparks concern

Glowing Spiral appears in night sky; Linked to SpaceX Falcon 9 Rocket

World Tsunami Awareness Day 2025 calls for global preparedness

Work Anywhere: How remote work is redefining offices

Voices of tomorrow: How Gen Z is rewriting the global narrative

World Mental Health Day 2025 spotlights psychological care in crises

We Have

Amazon challenges Perplexity over ‘agentic’ shopping bot

Zohran Mamdani wins New York mayor race in major shift

World Tsunami Awareness Day 2025 calls for global preparedness

UPS plane crash in Kentucky leaves multiple dead

Editor's Pick

Amazon Outage: Automation bug caused global disruptions

RELATED POST | AWS resolves massive disruption affecting thousands

Newly Updated