In Part One of this series, I provided a high-level overview of what an efficient incident response process entails from start to finish, including a general framework for an incident response team. Now let’s delve a bit deeper into the role each person plays. In my experience, there are several core people who make up an […]
When IT Disaster Strikes, Part 1: Resolving Incidents
As a developer or operations team member, there is nothing quite like the dread you feel when you hear the familiar ringtone of your on-call page at 3 a.m. Being on call means that you may be contacted at any time to investigate and fix issues that arise for the system, but that doesn’t mean […]
Is Your Company’s IT a Disaster Waiting to Happen?
Recent IT disasters suggest gaps in hardware testing and backup system testing, as well as inadequate disaster recovery plans. Although summer is often the most profitable season for most airlines, this past summer wasn’t great for several of the largest carriers. In July, Southwest Airlines suffered a 12-hour IT outage, triggered by a router malfunction that quickly […]



