Value tab in the Evaluation dashboard

  • Release version: Zurich
  • Updated August 13, 2025
  • 5 minutes to read
  • Summarize
    Summarized using AI
    This content was generated using new OpenAI-powered functionality. Results are provided on an as is basis and are not guaranteed to be accurate or complete.

    Summary of Value tab in the Evaluation dashboard

    The Value tab in the Evaluation dashboard provides ServiceNow customers with a clear and reliable assessment of the virtual agent’s impact by measuring value, efficiency, and time savings. It uses a quality-adjusted methodology to estimate how much time users save through interactions with the virtual agent, offering insights into user engagement, satisfaction, and the overall automation benefit.

    Show full answer Show less

    Key Features

    • Total conversations: Displays the volume of virtual agent conversations within a selected date range, indicating user engagement levels.
    • Estimated total hours: Shows the cumulative, quality-adjusted time saved by users interacting with the virtual agent, reflecting real business value.
    • Estimated VA efficiency (%): Represents the ratio of time saved to the total conversation time, indicating how effectively conversation duration is converted into value.
    • Gauges by chat size: Breaks down time saved by conversation length categories—small (≤4 messages), medium (5-10 messages), and large (>10 messages)—to align effort with value.
    • Value calculations: Converts saved time into monetary savings based on hours saved and an hourly rate.
    • User acceptance score: Provides an average satisfaction score from conversation evaluations, scaled to 100 for easy interpretation of user perception.
    • Weekly trends: Tracks user acceptance and efficiency over time, segmented by chat size and content type (Knowledge Base articles or Service Catalog items) for detailed performance insights.

    How Estimated Total Hours Are Calculated

    The estimated time saved is derived through a multi-step process involving data sampling, quality evaluation, and time adjustment:

    • Sampling: 10% of conversations (up to 50 per day) are evaluated for quality to maintain scalability.
    • Rolling 30-day dataset: Creates a stable analytical dataset (~1,500 chats) to identify quality patterns.
    • Quality categorization: Conversations are scored and categorized into five satisfaction levels from “extremely satisfied” to “extremely unsatisfied.”
    • Projection: Quality distributions from the dataset are projected onto daily conversation volumes by chat size and content type.
    • Quality-adjusted weighting: Time saved is weighted by quality category, with high-quality interactions counted fully and low-quality ones discounted accordingly.
    • Weekly aggregation: Daily adjusted time saved values are aggregated weekly for trend analysis.

    Important notes: Only chat time before escalation to a live agent is considered for value calculation. Conversation evaluation scores are tagged by evaluation date, not the original conversation date, affecting how historical data is accounted for.

    Troubleshooting and Data Refresh

    If dashboard data is not current, administrators can refresh it manually by running the following scheduled jobs:

    • CE Populate Value Aggregates Chats - Daily: Activating and executing this job updates daily aggregates.
    • [Conversation Evaluator] Evaluation Score Daily Data Collection: Running this job collects the latest evaluation scores.

    Admins can verify successful data collection by reviewing job logs in the respective scheduled jobs interface.

    The Value tab in the Evaluation dashboard displays the value, efficiency, and time savings of the virtual agent. The information on this tab gives you a transparent and reliable estimation of the value delivered by the virtual agent, focusing on a quality-adjusted calculation of the time saved for users.

    The Value tab shows a robust and realistic evaluation of the virtual agent's value. By incorporating sampling for scalability, rolling averages for stability, and a quality-weighting framework for business relevance, the resulting metrics gives the automation impact and time saved for the organization.

    Value tab.

    The following widgets are available:
    • Total conversations: Shows the total volume of conversations initiated with the virtual agent for the selected date range. This metric is a direct indicator of user engagement and virtual agent usage.
    • Estimated total hours: Shows the cumulative, quality-adjusted time saved by users interacting with the virtual agent.
    • Estimated VA efficiency (%): Calculated as the ratio of the estimated total hours saved to the total duration of all conversations (Estimate Time Savings/Total Conversation Time). The estimate indicates how effectively conversation time is converted into value.
    • Gauges by chat size: The gauges show a breakdown of the estimated total hours saved, segmented by the size of the conversation (long, medium, and short).

      Conversations are segmented based on their size to accurately correlate effort with value. The principle is that longer, more complex chats that successfully resolve issues save more time than shorter, simpler ones.

      • Large-size conversations: Conversations with more than 10 messages from the user.
      • Medium-size conversations: Conversations with more than 4 but fewer than 10 messages from the user.
      • Small-size conversations: Conversations with 4 or fewer messages from the user.
      Note:
      To change the definition of small, medium, and large conversations, update the property sn_na_conv_eval.value_chat_classifier. For more information, see .

      Breakdown by conversation chat sizes.

    • Value calculations: Shows the amount saved based on the numbers of hours saved and the hourly rate.

      Value calculations widget.

    • User acceptance score: The average satisfaction score derived from the conversation evaluator, scaled to 100 for easy interpretation. This score reflects the overall user perception of the virtual agent's quality of assistance.

      Weekly user acceptance trend

      Weekly user acceptance score.

      The line chart tracks the User acceptance score over time, providing insight into how user satisfaction is evolving on a weekly basis.

    • Weekly efficiency trends:

      Value by conversation size.

      Value by content type.

    The weekly efficiency trend charts display the weekly trend of the Estimated Efficiency (%) metric, further broken down by both chat size and the type of content (Knowledge Base (KB) article or Service Catalog item) used during the conversation.

    Note:
    Conversation evaluation scores are tagged as of evaluation date and not the actual conversation date. Hence older or historical chat scores are considered as of the day of evaluation for value calculation and not as of the conversation created date.

    Calculating the estimated total hours

    The estimated total hours calculated is the result of a multi-step, daily process designed to adjust raw time based on the quality of the interaction. It’s calculated through the following steps:

    1. Data sampling for quality evaluation.

      To limit number of assists, every conversation isn't evaluated. Instead, a daily sampling process is used.

      • Sample size: 10% of all conversations are selected for evaluation from the previous day.
      • Daily cap: The sample is capped at a maximum of 50 conversations daily.
      • Analytical dataset: To identify stable patterns in virtual agent quality, a rolling 30-day window is used for these evaluated chats, which creates a dataset of approximately 1,500 evaluated conversations (50 chats/day for 30 days) for analysis.
    2. Categorizing conversation quality.

      Each of the ~1,500 conversations in the 30-day analytical dataset is assigned a quality score by the conversation evaluator. Based on the score, the conversation is categorized into one of five satisfaction levels:

      Quality category User satisfaction score range
      Extremely satisfied 4.5–5.0
      Satisfied 3.5–4.5
      Average 2.5–3.5
      Unsatisfied 1.5–2.5
      Extremely unsatisfied 1.0–1.5
    3. Projecting quality scores onto the daily conversations.

      There might not be a quality score for all virtual agent conversations, which limits the ability to accurately estimate the time saved by the virtual agent. To solve this issue, the quality distribution from the 30-day analytical dataset is projected onto the daily virtual agent conversations.

      1. Calculate rolling averages: From the ~1,500 evaluated chats, the percentage distribution of total chat time across the five quality categories for each segment is calculated.

        For example, for all short chats, or for all chats that invoked a KB article. Suppose that the 30-day analysis of short chats shows their total duration was 100 minutes. If 25 of those minutes came from chats rated as extremely satisfied, then the extremely satisfied category for short chats has a 25% time-based score.

      2. Apply averages to the daily conversations: The daily chats are grouped by their segment (for example, by chat size). The rolling average percentages are then applied to the total time of these groups.

        For example, if the daily chats contain 400 short chats with a total duration of 4,000 minutes, the rolling average calculated is used to project that 25% of the time (1,000 minutes = 4000 minutes * 25%) was of extremely satisfied quality. The calculation is repeated for all quality levels and all segments.

    4. Applying quality-adjustment weights.

      To convert raw conversation time into a realistic measure of time saved, a weight is applied to the projected time in each quality category. Time from low-quality interactions contributes little to no savings, while time from high-quality interactions contributes 100% of its duration to savings.

      Conversation quality category Weight applied to time
      Extremely satisfied 1.0 (100%)
      Satisfied 0.5 (50%)
      Average 0.25 (25%)
      Unsatisfied 0.1 (1%)
      Extremely unsatisfied 0.0 (0%)

      Example continued, the 1,000 minutes of extremely satisfied time calculated in Step 4 is multiplied by a weight of 1.0, resulting in 1,000 minutes of estimated time saved. If another 500 minutes were projected to be satisfied, they would contribute 500 * 0.5 = 250 minutes of time saved.

    5. Weekly aggregation:

      The daily Final Daily Time Saved figures are aggregated at a weekly level for display on the dashboard's trend charts.

    Important:
    Only the chat time before the first live agent invocation is considered for value calculation. That is, the time the user has spent interacting with the virtual agent. The chat time with the live agent isn’t considered for value calculation.

    Conversation evaluations scores are tagged as of evaluation date and not the actual conversation date. Hence older or historical chat scores are considered as of the day of evaluation for value calculation and not as of the conversation created date.

    If data isn’t updated on the dashboard, perform the following steps.

    Role required: admin

    1. Navigate to All > System Definitions > Scheduled Jobs.
    2. Search for and select the CE Populate Value Aggregates Chats - Daily scheduled job.
    3. Set it as Active.
    4. Execute the scheduled job.
    5. Navigate to All > Platform Analytics > Platform Analytics Administration > Data Collector > Jobs.
    6. Search for and select the “[Conversation Evaluator] Evaluation Score Daily Data Collection scheduled job.
    7. Execute the job.
    8. In the Related Lists section, check the Job Logs tab to see if the data was collected successfully.