A severe cloud infrastructure outage often feels less like a service disruption and more like an earthquake. One incident can barrel its way across an entire region, indiscriminately disrupting commerce, travel, medical care, and communication. Reverberations are felt far, far from the site of the event. There's nothing most of us can do to prevent it from happening or stop it once it starts. We just have to wait for the shaking to end and hope that none of our most valuable stuff got smashed.
The cloud is quickly becoming as foundational to life on earth as the ground beneath our feet. AWS, Microsoft Azure, IBM Cloud, Google Cloud, Oracle Cloud, the cloud delivery networks like Fastly are an essential part of more businesses and industries every day, whether those businesses know it or not.
And that's not such a bad thing. Cloud computing does enable extraordinary innovations.
Yet the companies running cloud services and maintaining cloud infrastructure have the same challenges all IT teams do: People are fallible. Market pressures rush release cycles. Legacy systems hold on with an iron fist. Networks are insufficiently segmented. Credential management is lackluster. Misconfigurations happen. Updates are risky. Regulatory compliance is a headache. Automation can't solve every problem. And 99.99% uptime sounds good until the .01% event happens.
But the consequences when something goes wrong, the impact of those .01% incidents, are far far worse. When shipping, finance, medical care, government all rely on the same infrastructure to conduct basic operations, and that infrastructure is disrupted for hours...
Well, it's worth devoting a whole week of coverage to. So, that's what we're digging into at InformationWeek and Network Computing this week. Here's what we've covered so far, and what's coming up next:
Lessons Learned from Recent Major Outages, by Sal Salamone
Today’s more interconnected business world makes infrastructure and cloud outages all the more impactful. Here’s a recap of recent outages and their root causes.
Cyber Resiliency: How CIOs Can Prepare for a Cloud Outage, by John Edwards
The dangers posed by a cloud outage are clear and omnipresent. Here's how to prepare your organization for the inevitable worst-case cloud scenario.
From the outermost edge of extreme networks to the center of the energy vortex, emerging technologies are set to permanently end cloud outages. Will they eventually eradicate the clouds, too?
Your IT organization may view failing over from one hyperscaler's cloud to another as the ultimate security when it comes to cloud resiliency. Here's why that's not going to happen, plus a look at alternatives.
Legislators Gear Up to Regulate Cloud Providers for Resilience, by Carlo Massimo
The US, UK, and EU are all weighing regulations that would consider cloud companies "critical infrastructure" and require they meet resiliency standards.
Are Cloud Outages the Result of Choosing Price Over Reliability?, by Joao-Pierre Ruth
Market pressures and risk tradeoffs made by cloud providers might be the key to understanding why outages happen and what lies ahead for resolving cloud outages that can bring down regional, if not national commerce.
You Get What You Pay For: Cloud Edition, by Jessica Davis
An 8-hour cloud outage during holiday shopping season could cost a retailer millions of dollars. Here's how CIOs are weighing the risks and costs of outages against the complexities and costs of building resilience.
Can You Recover Losses Sustained During a Cloud Outage? By Richard Pallardy and Carrie Pallardy
The cloud comes with tantalizing promises of greater efficiency, improved data security, and boosted profits. But the cloud is not infallible, and outages are inevitable. Here’s what IT leaders need to know.
When (and if) to Sue Your Cloud Provider, by Richard Pallardy and Carrie Pallardy
Taking legal action is a slippery slope: Cloud providers are known to have covered their bases well. And besides, is it worth ruining your relationship with the cloud provider? It depends.
[Sign up for the InformationWeek Cloud biweekly newsletter, your source of insight on the fast-changing world of cloud technology, and how to use it to transform IT and business.]
What Can Network Managers Do About Cloud Outages? (Not Much), by Sal Salamone
Better observability tools can help net managers maintain some cyber resilience to cloud service outages, but misconfigs and DNS infrastructure is down to the providers.
How Climate Change is Impacting Cloud Resilience, by Samuel Greengard
Datacenter cooling issues are already causing problems for cloud providers. What does a future full of droughts, heat waves, and severe weather events mean for cloud resiliency?
Remember 2008, when "Low" by Flo Rida, featuring T-Pain, was the top single, and Gmail went down for more than 24 hours? We do.
Podcast: Workspot CEO on Coping with Cloud Outages, by Joao-Pierre Ruth