Microsoft Azure disruption affects Windows VM users worldwide

representational image depicting network service outage
(Image credit: Shutterstock)

Microsoft’s cloud computing platform Azure suffered an outage lasting more than six hours, which reportedly prevented users from spinning up new Windows-based virtual machines (VMs).

Microsoft Azure’s status page said Wednesday’s VM outage began around 5 am UTC, lasted until around noon UTC, and affected services across all regions, including Europe, the Americas, the Middle East, Africa, and Asia Pacific.

“Between 05:12 UTC and 11:45 UTC on 13 Oct 2021, a subset of customers using Windows Virtual Machines may have received failure notifications when performing service management operations - such as start, create, update, delete. Deployments of new VMs and any updates to extensions may have failed,” read the notification on Azure’s status page.
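
For reference, the length of the window quoted in the notification can be worked out directly. The snippet below is a minimal Python sketch; only the two timestamps come from the status page, and the variable names are illustrative.

```python
from datetime import datetime, timezone

# Start and end of the impact window as quoted on Azure's status page
# (13 Oct 2021, times in UTC).
start = datetime(2021, 10, 13, 5, 12, tzinfo=timezone.utc)
end = datetime(2021, 10, 13, 11, 45, tzinfo=timezone.utc)

# Subtracting two datetimes yields a timedelta; split it into hours and minutes.
duration = end - start
hours, remainder = divmod(int(duration.total_seconds()), 3600)
minutes = remainder // 60

print(f"Outage window: {hours} h {minutes} min")
```

By this arithmetic the quoted window spans 6 hours and 33 minutes.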

The notification added that the outage would also have affected services that depend on Windows VMs, though non-Windows VMs and Windows VMs that were already running weren’t affected.

Migration issues

Reporting on the development during the outage, The Register noted that while Azure’s Twitter support page didn’t mention the incident, it did confirm the outage to a customer, saying that it was aware of the issue and that its "engineering teams are actively collaborating to resolve this."

In a later update, posted after the issue had been resolved, Azure said its preliminary investigation suggests the root cause was the planned migration of the VM Guest Agent Extension publishing architecture to a new platform, which inadvertently caused service management operations to fail.

“We identified that calls made during service management operations were failing as a required artifact version data could not be queried. Our investigation focused on the backend compute resource provider (CRP) to determine why the calls were failing, and identified that a required VMGuestAgent could not be queried from the repository,” notes Azure in the update.

Azure is continuing to investigate the incident as it works to ensure that it doesn’t recur, and promises to publish a full root cause analysis within the next three days.

Via The Register

Mayank Sharma
