Value tab in the Evaluation dashboard
Summarize
Summary of Value tab in the Evaluation dashboard
The Value tab in the Evaluation dashboard provides ServiceNow customers with a detailed, quality-adjusted view of the virtual agent's impact on efficiency and time savings. It offers transparent metrics that realistically estimate the value delivered by the virtual agent by combining conversation volume, quality evaluations, and time saved for users.
Show less
Key Features
- Total conversations: Displays the total number of virtual agent conversations in the selected period, indicating user engagement.
- Estimated total hours: Shows cumulative time saved by users, adjusted by conversation quality.
- Estimated VA efficiency (%): Ratio of estimated time saved to total conversation time, measuring how effectively virtual agent interactions translate to value.
- Gauges by chat size: Breaks down time saved by conversation length—small (≤4 messages), medium (5–9 messages), and large (≥10 messages). These categories can be customized via a system property.
- Value calculations: Converts saved time into monetary value using hourly rates.
- User acceptance score and trends: Presents average user satisfaction from conversation evaluations on a 100-point scale and tracks weekly changes.
- Weekly efficiency trends: Displays efficiency percentage trends, segmented by chat size and content type (Knowledge Base articles or Service Catalog items).
How Value Is Calculated
The estimation of total hours saved involves several steps:
- Sampling: Only 10% of conversations (up to 50 per day) are evaluated to ensure scalability.
- Rolling 30-day dataset: Approximately 1,500 conversations are analyzed over a month to identify stable quality patterns.
- Quality categorization: Conversations are scored and classified into five satisfaction levels from extremely unsatisfied to extremely satisfied.
- Projection: Quality distributions from the 30-day sample are applied to daily conversation volumes to estimate quality across all chats.
- Quality-weighted time savings: Conversation durations are weighted by quality scores (e.g., 100% weight for extremely satisfied, 0% for extremely unsatisfied) to calculate realistic time saved.
- Aggregation: Daily time saved figures are rolled up weekly for trend analysis.
Important note: Only the virtual agent interaction time before any live agent takes over is included in value calculations.
Troubleshooting and Data Refresh
If dashboard data is not up to date, administrators can manually run key scheduled jobs to refresh conversation and evaluation data. This includes activating and executing the "CE Populate Value Aggregates Chats - Daily" job and the "[Conversation Evaluator] Evaluation Score Daily Data Collection" job, followed by verifying successful data collection in job logs.
Practical Benefits for ServiceNow Customers
By leveraging the Value tab, customers can:
- Understand and quantify the time savings and efficiency gains from their virtual agent deployments.
- Monitor user satisfaction trends to identify opportunities for improving virtual agent performance.
- Assess the impact of conversation complexity and content type on automation value.
- Make informed decisions about virtual agent improvements based on quality-weighted performance metrics.
The Value tab in the Evaluation dashboard displays the value, efficiency, and time savings of the virtual agent. The information on this tab gives you a transparent and reliable estimation of the value delivered by the virtual agent, focusing on a quality-adjusted calculation of the time saved for users.
The Value tab shows a robust and realistic evaluation of the virtual agent's value. By incorporating sampling for scalability, rolling averages for stability, and a quality-weighting framework for business relevance, the resulting metrics gives the automation impact and time saved for the organization.
- Total conversations: Shows the total volume of conversations initiated with the virtual agent for the selected date range. This metric is a direct indicator of user engagement and virtual agent usage.
- Estimated total hours: Shows the cumulative, quality-adjusted time saved by users interacting with the virtual agent.
- Estimated VA efficiency (%): Calculated as the ratio of the estimated total hours saved to the total duration of all conversations (Estimate Time Savings/Total Conversation Time). The estimate indicates how effectively conversation time is converted into value.
- Gauges by chat size: The gauges show a breakdown of the estimated total hours saved, segmented by the size of the conversation (long, medium, and short).
Conversations are segmented based on their size to accurately correlate effort with value. The principle is that longer, more complex chats that successfully resolve issues save more time than shorter, simpler ones.
- Large-size conversations: Conversations with more than 10 messages from the user.
- Medium-size conversations: Conversations with more than 4 but fewer than 10 messages from the user.
- Small-size conversations: Conversations with 4 or fewer messages from the user.
Note:To change the definition of small, medium, and large conversations, update the property sn_na_conv_eval.value_chat_classifier. For more information, see Components installed with the Evaluation dashboard. - Value calculations: Shows the amount saved based on the numbers of hours saved and the hourly rate.
- User acceptance score: The average satisfaction score derived from the conversation evaluator, scaled to 100 for easy interpretation. This score reflects the overall user perception of the virtual agent's quality of
assistance.
Weekly user acceptance trend
The line chart tracks the User acceptance score over time, providing insight into how user satisfaction is evolving on a weekly basis.
- Weekly efficiency trends:
The weekly efficiency trend charts display the weekly trend of the Estimated Efficiency (%) metric, further broken down by both chat size and the type of content (Knowledge Base (KB) article or Service Catalog item) used during the conversation.
Calculating the estimated total hours
The estimated total hours calculated is the result of a multi-step, daily process designed to adjust raw time based on the quality of the interaction. It’s calculated through the following steps:
- Data sampling for quality evaluation.
To limit number of assists, every conversation isn't evaluated. Instead, a daily sampling process is used.
- Sample size: 10% of all conversations are selected for evaluation from the previous day.
- Daily cap: The sample is capped at a maximum of 50 conversations daily.
- Analytical dataset: To identify stable patterns in virtual agent quality, a rolling 30-day window is used for these evaluated chats, which creates a dataset of approximately 1,500 evaluated conversations (50 chats/day for 30 days) for analysis.
- Categorizing conversation quality.
Each of the ~1,500 conversations in the 30-day analytical dataset is assigned a quality score by the conversation evaluator. Based on the score, the conversation is categorized into one of five satisfaction levels:
Quality category User satisfaction score range Extremely satisfied 4.5–5.0 Satisfied 3.5–4.5 Average 2.5–3.5 Unsatisfied 1.5–2.5 Extremely unsatisfied 1.0–1.5 - Projecting quality scores onto the daily conversations.
There might not be a quality score for all virtual agent conversations, which limits the ability to accurately estimate the time saved by the virtual agent. To solve this issue, the quality distribution from the 30-day analytical dataset is projected onto the daily virtual agent conversations.
- Calculate rolling averages: From the ~1,500 evaluated chats, the percentage distribution of total chat time across the five quality categories for each segment is calculated.
For example, for all short chats, or for all chats that invoked a KB article. Suppose that the 30-day analysis of short chats shows their total duration was 100 minutes. If 25 of those minutes came from chats rated as extremely satisfied, then the extremely satisfied category for short chats has a 25% time-based score.
- Apply averages to the daily conversations: The daily chats are grouped by their segment (for example, by chat size). The rolling average percentages are then applied to the total time of these groups.
For example, if the daily chats contain 400 short chats with a total duration of 4,000 minutes, the rolling average calculated is used to project that 25% of the time (1,000 minutes = 4000 minutes * 25%) was of extremely satisfied quality. The calculation is repeated for all quality levels and all segments.
- Calculate rolling averages: From the ~1,500 evaluated chats, the percentage distribution of total chat time across the five quality categories for each segment is calculated.
- Applying quality-adjustment weights.
To convert raw conversation time into a realistic measure of time saved, a weight is applied to the projected time in each quality category. Time from low-quality interactions contributes little to no savings, while time from high-quality interactions contributes 100% of its duration to savings.
Conversation quality category Weight applied to time Extremely satisfied 1.0 (100%) Satisfied 0.5 (50%) Average 0.25 (25%) Unsatisfied 0.1 (1%) Extremely unsatisfied 0.0 (0%) Example continued, the 1,000 minutes of extremely satisfied time calculated in Step 4 is multiplied by a weight of 1.0, resulting in 1,000 minutes of estimated time saved. If another 500 minutes were projected to be satisfied, they would contribute 500 * 0.5 = 250 minutes of time saved.
- Weekly aggregation:
The daily Final Daily Time Saved figures are aggregated at a weekly level for display on the dashboard's trend charts.
Conversation evaluations scores are tagged as of evaluation date and not the actual conversation date. Hence older or historical chat scores are considered as of the day of evaluation for value calculation and not as of the conversation created date.
If data isn’t updated on the dashboard, perform the following steps.
Role required: admin
- Navigate to .
- Search for and select the CE Populate Value Aggregates Chats - Daily scheduled job.
- Set it as Active.
- Execute the scheduled job.
- Navigate to .
- Search for and select the “[Conversation Evaluator] Evaluation Score Daily Data Collection scheduled job.
- Execute the job.
- In the Related Lists section, check the Job Logs tab to see if the data was collected successfully.