An AWS us-east-1 outage exposed how automation can backfire. Learn why autoscaling failed, how pinning ASGs saved uptime, and what to do in future outages.
5 Ways to Prevent an Outage
In today’s always-on, ever-connected world, we all expect 100% availability. What gets in the way of this? The devil is in the details. Over time, everything breaks: Disks, nodes, containers, networks, DNS servers and configuration issues can all lead to major outages. Amazon’s 13 minutes of downtime in August 2021 translated to almost $5 million […]
AWS Outage and App Resiliency: Did a Roomba Replace the Canary?
Canaries were once sent into coal mines as an early warning sign against danger—for me, it was my Roomba failing to automatically search out dog hair and clean the floor under my sons’ dining room chairs. Cloud-enabled apps also were acting “weird”. Some were down, while others were just slow—although, in my view, slow may […]
NetApp Survey Shows Hybrid Cloud is Maturing
A small survey of 79 midsized to large enterprises conducted by NetApp suggests enterprise IT organizations might finally be embracing hybrid cloud computing beyond use cases involving backup and recovery. The survey finds 20% of respondents are currently employing both on-premises and cloud resources to support the same workload in a production environment, with another […]




