Identifying system issues with synthetic monitoring
Summarize
Summary of Identifying System Issues with Synthetic Monitoring
Synthetic monitoring allows service owners and operators to track the performance of service endpoints. It helps ensure that endpoints are available and functioning properly. When tests fail, notifications can be sent to facilitate quick issue resolution. Users can analyze trends in HTTP API success rates and response times.
Show less
Key Features
- Monitor Overview: The synthetic monitoring landing page provides a comprehensive view of all created monitors, including their status (active or inactive). Users can filter the list by the status of monitors, such as failures or unknown states, and sort by various columns.
- Detailed Monitor Insights: Selecting a monitor reveals its performance details, including test results, configuration, and alerts. This helps users gauge the monitor's health and test history.
- Metrics Visualization: The Metrics card displays two charts: one for failed tests (0 for success, 1 for failure) and another for response times in milliseconds, allowing easy identification of performance issues.
- Result History: A table lists the history of test results, sortable by various metrics, aiding in pinpointing issues such as failures or high latency. Links to alerts associated with failures are also available for quick access.
Key Outcomes
By utilizing synthetic monitoring, customers can quickly identify and address system issues, enhancing overall service reliability. The detailed insights and visual metrics empower users to make informed decisions regarding service performance and troubleshooting, ultimately leading to improved user experiences. Test results are stored for 21 days, providing sufficient time for analysis and response to potential issues.
Tests run by synthetic monitors enable service owners and operators to view service endpoint performance at scale.
Overview of using synthetic monitoring
As a service owner, you can use synthetic monitoring to monitor your service's endpoints, verifying that they're available and performing as expected. You can be notified when synthetic tests fail, enabling you to mitigate issues quickly. You can see trends in HTTP API success rates and response times.
As an operator, you can use synthetic monitoring as part of triaging issues. When you learn a service is reported to have issues, you can view test results for that service's endpoints. If tests are failing or response times are slow, that might be something to investigate.
View aggregate information about the monitors
The synthetic monitoring landing page shows an overview of all created monitors, including inactive monitors.
From here, you can see the status for all your monitors. Selecting a card at the top filters the list of monitors. For example, you can view only the monitors that have failed or that are in an unknown state.
By default, the list of monitors is sorted by the timestamp in the Updated column. You can select a different column header to sort by that category.
Selecting a monitor lets you view details about the monitor's tests, including details, configuration, and associated alerts.
View a monitor and its tests
To view a monitor's details and the results of its tests, select a monitor from the synthetic monitoring landing page. The Monitor details page provides key information needed to understand how the monitor is performing.
Use the information in the header to understand the basic health of the monitor, including its status and when it last ran a test.
View individual tests
The Metrics card displays two charts to help you understand the health of each test the monitor has run. The Failed tests chart displays each test with a value of 0 when it was
successful and a value of 1 when it failed. Hover over a point on the chart to view further details.
The Response time chart shows the amount of time in milliseconds that it took to receive a response from the endpoint. Hover over a point on the chart to view details.
The Monitor result history table lists each test. By default, the table is sorted from newest to oldest by the timestamp. You can sort by any column to help find issues. For example, you can sort by Result to see all failures together or you can sort by Response time to view the tests with the highest latency.
If a monitor was configured to generate an alert when it fails, a link to the alert displays in the Alert column. Select an alert link to view its details.