Overview of the operational excellence pillar

This pillar covers the operations processes that keep an application running in production. Deployments must be reliable and predictable. They should be automated to reduce the chance of human error. They should be a fast and routine process, so they don't slow down the release of new features or bug fixes. Equally important, you must be able to quickly roll back or roll forward if an update has problems.

To assess your workload using the tenets found in the Microsoft Azure Well-Architected Framework, see the Microsoft Azure Well-Architected Review.

These are the disciplines we group in the operational excellence pillar:

Operational excellence disciplines Description
Application design Provides guidance on how to design, build, and orchestrate workloads with DevOps principles in mind
Monitoring Something that enterprises have been doing for years, enriched with some specifics for applications running in the cloud
Application performance management The monitoring and management of performance and availability of software applications through DevOps
Code deployment How you deploy your application code is going to be one of the key factors that will determine your application stability
Infrastructure provisioning Frequently known as "Automation" or "Infrastructure as code", this discipline refers to best practices for deploying the platform where your application will run on
Testing Testing is fundamental to be prepared for the unexpected and to catch mistakes before they impact users