Working with SRM services
A service represents a functional outcome, such as networking, payments, or HR services, that is owned by a team. To deliver that outcome, a service can contain one or more technical components, such as a user authentication service or a piece of shared infrastructure like a database.
You might want multiple tool integrations to monitor each technical service and receive events from those tools. Add an integration to SRM using the Services module. See Working with SRM integrations.
In addition, you can create reliability metrics for the service. See Working with Reliability metrics.
Tying a team and policies to that service makes it easier to divide responsibilities and track technical outcomes. It also makes it easier to automate response routines and focus on who you notify and when.
The state of an existing service is inherited. The state of a service created in SRM is None.
Services
- Your Services: Count of all the services you or your team manages and monitors for reliability.
- Services with active incidents: Services with one or more open incidents, sorted first by business criticality, most critical at the top; then sorted by number of active incidents, highest number at the top; and finally sorted by % of error budget remaining, lowest at the top.
- Services with critical alerts: Services with open alerts, sorted first by business criticality, most critical at the top; then sorted by number of alerts, highest number at the top; and finally sorted by % of error budget remaining, lowest at the top.
- Services with open changes: Services that your team manages and monitors that have one or more open changes.
- Services with low error budget: Services with less than 25% of their error budget remaining.
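The three-level sort order described for the Services with active incidents list can be sketched in Python as a single composite sort key. This is only an illustration: the field names (`criticality`, `active_incidents`, `error_budget_remaining_pct`) and the sample records are hypothetical and do not reflect SRM's internal data model.

```python
# Hypothetical records; the field names are illustrative, not SRM's schema.
services = [
    {"name": "payments", "criticality": 1, "active_incidents": 2, "error_budget_remaining_pct": 40.0},
    {"name": "networking", "criticality": 1, "active_incidents": 5, "error_budget_remaining_pct": 60.0},
    {"name": "hr-portal", "criticality": 3, "active_incidents": 9, "error_budget_remaining_pct": 10.0},
    {"name": "auth", "criticality": 1, "active_incidents": 2, "error_budget_remaining_pct": 15.0},
    {"name": "wiki", "criticality": 4, "active_incidents": 0, "error_budget_remaining_pct": 90.0},
]


def incident_sort_key(service):
    """Build a tuple that sorts in the order the list description gives."""
    return (
        service["criticality"],                 # 1 is most critical, so ascending
        -service["active_incidents"],           # negate so the highest count comes first
        service["error_budget_remaining_pct"],  # lowest remaining budget first
    )


def services_with_active_incidents(all_services):
    """Keep services with at least one open incident, then apply the sort."""
    with_incidents = [s for s in all_services if s["active_incidents"] > 0]
    return sorted(with_incidents, key=incident_sort_key)


ordered_names = [s["name"] for s in services_with_active_incidents(services)]
print(ordered_names)
```

The Services with critical alerts list follows the same pattern, with the alert count taking the place of the incident count as the second element of the key.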
The error budget is the amount of unreliability that the service level objective (SLO) lets you spend over a specified time period. You can use it to manage release velocity.
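As a worked example of the arithmetic, the sketch below uses the common time-based definition of an error budget: the share of the window that the SLO target leaves unprotected. It is an assumption that this matches how SRM computes the value; the function names and the 30-day window are hypothetical. The 25% threshold comes from the Services with low error budget list.

```python
LOW_BUDGET_THRESHOLD_PCT = 25.0  # threshold used by the low error budget list


def error_budget_minutes(slo_target_pct, window_days):
    """Minutes of unreliability the SLO allows over the window."""
    window_minutes = window_days * 24 * 60
    return window_minutes * (100.0 - slo_target_pct) / 100.0


def error_budget_remaining_pct(slo_target_pct, window_days, downtime_minutes):
    """Percentage of the budget still unspent, floored at zero."""
    budget = error_budget_minutes(slo_target_pct, window_days)
    if budget <= 0:
        raise ValueError("an SLO target of 100% or more leaves no error budget")
    remaining = 100.0 * (1.0 - downtime_minutes / budget)
    return max(0.0, remaining)  # an overspent budget is reported as 0%


def has_low_error_budget(remaining_pct):
    """True when the service belongs in the low error budget list."""
    return remaining_pct < LOW_BUDGET_THRESHOLD_PCT


# A 99.9% SLO over 30 days allows roughly 43.2 minutes of downtime.
budget = error_budget_minutes(99.9, 30)
# After 36 minutes of downtime, about one sixth of the budget is left.
remaining = error_budget_remaining_pct(99.9, 30, 36)
print(round(budget, 1), round(remaining, 1), has_low_error_budget(remaining))
```

Flooring the result at zero is a design choice for display purposes; a team that wants to see how far a budget has been overspent could return the negative value instead.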
Each column in the list can be grouped or filtered.
Each list can be edited, sorted or exported.
For more information about an individual service, see Edit service details form.
Services list view metric definitions
- Service: Name of the service.
- Class: Application or Technical service.
- Business criticality: How important this service is to the business. Choices are:
  - 1 - most critical (default)
  - 2 - somewhat critical
  - 3 - less critical
  - 4 - not critical
- Open alerts: Number of open alerts assigned to the service.
- Open incidents: Number of open incidents assigned to the service.
- Error budget remaining: Percentage of error budget remaining for the service.
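The columns above can be modeled as a small record type, which may help when you work with exported list data. The following is a minimal sketch, not SRM's schema: the class name `ServiceRow`, its attribute names, and the zero and 100% defaults for counts and budget are all assumptions. Only the four business criticality choices and the default of 1 come from the definitions above.

```python
from dataclasses import dataclass

# Business criticality choices as listed in the column definitions.
CRITICALITY_LABELS = {
    1: "most critical",
    2: "somewhat critical",
    3: "less critical",
    4: "not critical",
}


@dataclass
class ServiceRow:
    """One row of the services list (hypothetical representation)."""

    service: str                          # name of the service
    service_class: str                    # Application or Technical service
    business_criticality: int = 1         # 1 is the documented default
    open_alerts: int = 0                  # assumed default
    open_incidents: int = 0               # assumed default
    error_budget_remaining: float = 100.0  # percentage, assumed default

    def __post_init__(self):
        # Reject values outside the four documented choices.
        if self.business_criticality not in CRITICALITY_LABELS:
            raise ValueError("business_criticality must be 1, 2, 3, or 4")
        if self.open_alerts < 0 or self.open_incidents < 0:
            raise ValueError("counts cannot be negative")

    def criticality_label(self):
        """Return the choice as it appears in the list, e.g. '1 - most critical'."""
        return f"{self.business_criticality} - {CRITICALITY_LABELS[self.business_criticality]}"


row = ServiceRow(service="payments", service_class="Application")
print(row.criticality_label())
```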