Agentic evaluation parser tool

  • Release version: Australia
  • Updated August 19, 2025
  • 3 minutes to read
  • Summarize
    Summarized using AI
    This content was generated using new OpenAI-powered functionality. Results are provided on an as is basis and are not guaranteed to be accurate or complete.

    Summary of Agentic evaluation parser tool

    The Agentic evaluation parser tool is designed to extract structured execution data from logs of agentic workflows or AI agents within ServiceNow. This data enables customers to create custom metrics by scripting tailored evaluations of AI agent performance and agentic workflows, enhancing insight into AI-driven processes.

    Show full answer Show less

    Key Features

    • Structured Data Extraction: The tool outputs structured information in the output.payload object, including:
      • executionInputs: Initial workflow setup details such as agent names, tool names, instructions, and the initial user utterance.
      • executionOutputs: AI agent actions and results from tool executions.
      • executionMessages: Ordered conversation flow messages between user and agent.
      • executionPlanDetails: Metadata about the execution status, related tasks, run type, and other contextual information.
    • Testing and Development Support: Customers can run tests in the Script Editor to view complete JSON outputs before designing their custom metrics, facilitating accurate script development.
    • Script Access: The parser tool’s output is accessible within metric scripts via the context parameter, allowing developers to parse and utilize execution data programmatically.

    Practical Application for ServiceNow Customers

    By leveraging the Agentic evaluation parser tool, customers can:

    • Gain detailed insights into AI agent and workflow executions through structured, accessible data.
    • Develop customized metrics that reflect specific evaluation criteria for AI effectiveness and workflow success.
    • Debug and monitor agentic workflows more effectively by analyzing execution metadata.
    • Correlate AI agent performances with particular tasks or conversations using executionPlanDetails metadata.

    Ultimately, this tool empowers customers to tailor AI evaluations, improving operational effectiveness and enabling precise measurement of AI-driven processes within ServiceNow.

    Use the outputs of the agentic evaluation parser tool in your scripts for custom metrics to customize the criteria for effective AI agents and agentic workflows.

    Parser tool overview

    The agentic evaluation parser tool extracts structured execution data from the execution logs of an agentic workflow or AI agent. You can use the information gathered by the tool to create custom metrics that use scripts to evaluate agentic workflows.

    The parser tool returns structured AI agent or agentic workflow execution data within the output.payload object, which contains the following:

    • executionInputs: A JSON object that contains initial workflow setup information, such as the names and instructions of agents and tools and the initial user utterance
    • executionOutputs: A JSON object with AI agent actions and tool execution results
    • executionMessages: A JSON object array of user-facing conversation flow and system responses
    • executionPlanDetails: A JSON object of execution metadata, such as status, sys_ids, and configuration values
    {
         "output": {
     "payload": {
      "executionInputs": { ... },
      "executionOutputs": { ... },
      "executionMessages": [ ... ],
      "executionPlanDetails": { ... }
     }
    }
    

    Accessing the parser tool's output

    To view the complete parser tool output for testing and development, perform the following steps.

    • Navigate to the Script Editor view in your custom metric guided setup.
    • Select Run Test.
    • Wait for the test to complete.
    • View the complete JSON output in the Tools section of the test results.

    Reviewing the parser tool's output before designing your custom metric enables you to inspect the data structure before implementing any specific logic.

    executionInputs data structure

    The executionInputs attribute contains a JSON object with the following structure:

    "executionInputs": {
     "agenticWorkflow": "(name of agentic workflow)",
     "description": "(descriptions for agentic workflow)",
     "instructions": "(list of steps for agentic workflow)", 
     "utterance": "(initial user utterance)", 
     "agents": [ 
      { 
       "name": "(AI agent name)",  
       "instructions": "(list of steps for AI agent)", 
       "tools": [ 
        { 
         "name": "(tool name)", 
         "description": "(tool description)", 
         "executionMode": "(execution mode, either Autonomous or Supervised)", 
         "inputs": { ... } 
        }, 
        { ... }, ... 
       ]
      }, 
      { ... }, ... 
     ]
    }

    executionOutputs data structure

    The executionOutputs attribute contains a JSON object with the following structure:

    "executionOutputs": {
     "agents": [ 
      { 
       "name": "(AI agent name)", 
       "subTask": { ... }, 
       "tools": [ 
        { 
         "name": "(tool name)", 
         "inputs": { ... }, 
         "output": { ... }
        }
       ]
      }, 
      { ... }, ... 
     ]
    }

    executionMessages data structure

    The executionOutputs attribute contains an array of JSON objects with the following structure:

    "executionMessages": [ 
     { 
      "role": "(Message sender, either 'agent' or 'user')", 
       "message": "(Content of message)", 
       "order": "(Sequence number indicating order of message in the conversation)"
      }, 
      { ... }, ... 
    ]
       

    executionPlanDetails data structure

    The executionPlanDetails attribute contains a JSON object with the following structure:

    "executionPlanDetails": { 
        "state": "(Current execution status)", 
        "runType": "(Type of execution)", 
        "conversationId": "(sys_id of conversation)", 
        "relatedTask": "(sys_id of the associated task or record)", 
        "relatedTaskTable": "(Table name where the related task is stored)", 
        "context": { ... } (May be null)
        "builtInTools": [ { ... } ]
        }
       

    This section provides execution metadata for tracking workflow performance, debugging issues, and correlating executions with specific tasks or conversations.

    The possibilities for runType include the following:

    • API
    • Chat
    • Evaluation
    • Testing
    • Trigger

    Using parser tool output in metric scripts

    The parser tool data is available in your metric script through the context parameter. Access the structured data using the following code:

    // Access the parser tool output from context
    var parserToolOutput = context['AgenticExecutionParserTool.output'];
    if (typeof parserToolOutput == "string") {
     parserToolOutput = JSON.parse(parserToolOutput);
    }
    var parserToolPayload = parserToolOutput.payload;
    var parserToolStatus = parserToolOutput.status;
        
    // Extract individual sections from payload
    var inputs = parserToolPayload.executionInputs;
    var outputs = parserToolPayload.executionOutputs;
    var messages = parserToolPayload.executionMessages;
    var planDetails = parserToolPayload.executionPlanDetails;