Run multiple scheduled jobs for alert grouping
Run multiple scheduled jobs in parallel to group alerts. This helps prevent overwhelming the system during surges (alert storms).
Before you begin
Role required: admin
About this task
The current scheduled job operates with a single thread. During alert surges (alert storms), processing takes significantly longer. To address this impact, now multiple jobs can process alerts within the scheduled job. This enhances scalability and improves overall performance during high-volume periods.
The number of jobs to be running in the alerts processing is defined in a new property sa_analytics.agg.alert_grouping.num_of_jobs. The job number is sent as a parameter to the relevant method.
- sa_analytics.agg.group_alert_with_same_assignment_group_only: Groups alerts that share the same assignment group. By default, the value is set to false. If you want to set this property to true, create a property with the same name and set the value to true.
- sa_analytics.agg.group_alert_with_same_domain_only: Groups alerts that belong to the same domain. By default, the value is set to true.
- sa_analytics.agg.group_alert_with_same_group_by_fields: “Group by” property, with comma-separated list of field names that need to have matching values across alerts to allow alerts to be grouped together. The property can contain alert field names (such as assignment_group), CI field names (such as alert_cmdb_ci.location), alert additional info field names (such as additional_info.state) or alert tags (such as t_data_center). When the specified field values match each other between alerts, those alerts can be grouped together.
Without setting at least one of these properties to true, the alerts will not be properly segregated. For instance, if you do not define any grouping logic (set any property to true) and both the assignment group and domain separation are set to false, the scale-out will not work, and all alerts will be processed by a single scheduled job.
Procedure
Result
The scheduled job Service Analytics group alerts using RCA/Alert Aggregation is set up to run multiple jobs simultaneously for alert grouping.