Richi Jennings is an independent analyst/consultant, specializing in content marketing, email, spam, and other security topics. He’s won awards from the American Society of Business Publication Editors, ABM/Jesse H. Neal, and B2B Magazine.
He was previously CTO for Samsung Contact. Prior to this, he was at Hewlett-Packard for 14 years, working in a wide variety of engineering, marketing and architect roles, mainly on OpenMail and its predecessors.
Richi lives in England, is an un-professional DJ, rusty scuba diver, and was voted “Most likely to get up first to sing at karaoke” for 16 years in succession. You can follow him as @RiCHi on Twitter, pretend to be Richi’s friend on Facebook, Plus him at +richijennings, or just use boring old email: [email protected]
On quotations:
When I quote other websites or writers, I will often edit for the busy reader, using a style based on accepted journalistic norms (e.g., AP; Chicago). The intention is never to change the meaning of the original. Omitted text is marked with an ellipsis (…); altered text is marked in [brackets]; typos may be corrected; on rare occasions, I may reorder text. I also aim to stay within the letter and spirit of copyright law: specifically, U.S. Fair Use and UK Fair Dealing codes.
If an organization or individual so quoted objects to being quoted in this way, they can request an edit or complete removal by tweeting @RiCHi; the request will then be considered by editorial staff.
Disclosures:
For a list of disclosures and potential conflicts, please see richi.uk/bio

Microsoft Azure Outage -- Was it a DNS DDoS?

Azure down
Azure DNS not found: (A)bort, (R)etry, (F)ail?

Azure recently experienced a nasty outage, lasting several hours. It seems to have been centered around DNS, but had wide side effects.

Microsoft’s cloud platform took something of a beating. By the sound of it, a DDoS could have been to blame. Or not.

Redmond isn’t saying. In today’s IT Newspro, we avoid the potholes in the cloudy superhighway.

Your humble newswatcher curated these news nuggets for your entertainment. Not to mention: Meta-clickbait and the power of storytelling

What’s the craic? Mary Jo Foley speaks to a Global DNS outage:

The status page said the outage started at 7:48 am ET. [By] 11 am ET [the] page is reporting that most of the downed services are back, if not coming back.

Microsoft is identifying the preliminary root cause as a “spike in networking traffic.” The DNS issues were “self-healed by the Azure platform.”

[But by] 4 pm ET, Microsoft still seems to be having OneDrive connectivity issues.


That doesn’t sound so good. Alexander J Martin dreams up a colorful metaphor—Azure is on fire:

Customers using Azure DNS in multiple regions experienced difficulties due to the mysterious issues affecting Microsoft’s cloud computing and infrastructure platform. Azure proudly advertises itself as a global network using Anycast routing to provide “outstanding performance and availability.”


Where did the problem start? Peter Gothard has a hint—Azure borkage in central US leads to global woes:

The fault affected API management, web apps, Service Bus and SQL database services in the central US region. Customers noticed confusion with Microsoft’s messages, as Azure Twitter feeds and status pages seemed to disagree on the speed of recovery.


Is this part of a pattern? rufflow wishes Microsoft would quit fiddling with it:

Microsoft and reliability these days. Login issues with Visual Studio team services, part of Skype offline, then SharePoint issues, then OneDrive issues…

Every time I logon to my Office 365 admin account there are new features, layouts or things are moved. It would be nice if they rolled out new stuff fully instead of living in a SharePoint house that is in constant renovation.

Worrisome, no? Especially as hobblegum says this shouldn’t happen:

The “datacenter pair” design is supposed to prevent this. They are not supposed to update the code or configs in both centers at the same time, to reduce the risk of both locations of a geo-redundant solution going down at the same time. These multi-region outages show that the regions are still too tightly coupled.


Thankfully for Microsoft, Google also had issues this week. Here’s Caroline Donnelly, with Microsoft and Google cloud users suffer service outages:

Rival service providers both experience technical difficulties. Google Apps for Work users [were] unable to use the service for 90 minutes.


I bet this will make great fodder for the anti-cloud Luddites. Take ma1010, for example:

Someday perhaps I’ll understand why people and businesses want to put their own data on computers that belong to some corporation in some distant location which depend on the Internet to work at all.

If something breaks on your in-house IT and you’re the IT guy, you can do something about fixing it. If something breaks in the cloud, all you can do is whine about it. And wait.


Meanwhile, life goes on. Last word goes to juhunter:

Well this throws a wrench in trying to convince my boss how awesome Azure is.

Main image credit: Federal Highway Administration (public domain)