Design Thinking and Service Management
Design thinking principles always point us to focus on the humans. Service Management has many more stakeholders, your users/clients, marketing, sales, and product management, and engineering teams. Service Management is however central to the job of a of Site Reliability Engineer, so I will focus for now on them. Service management is important to this wide variety of stakeholders, so we want to really focus on providing common, usable, shareable language to facilitate alignment.
The term SLA or Service Level Agreement is often used without any meaningful context. Marketing or sales material might brag about a “four-nines SLA”, but that statement leaves the engineering team with more questions than answers. What does Availability mean? How do we measure it to know if you are meeting four-nines? How do we monitor to know when services are not available? Are there allowances for scheduled down time? What is the penalty paid if you fall short of four-nines? Let’s start by creating some clarity around the language we use to describe the business decisions that drive design decisions made by the engineering team.
If you want to read more, the Cloud Adoption Playbook has some really good coverage for this aspect of IT Service Management. If you would like some help defining the service management practices of your engineering team, reach out and schedule some time to discuss your needs.