Evaluate

Australia Enable AI

Release: australia
ft:locale: en-US
ft:publication_title: Australia Enable AI
ft:clusterId: platai
bundleId: platai
workflow: Platform

Evaluating agentic AI assets

Release version: Australia

Updated March 18, 2026

1 minute to read

Find guidance for every stage of the agentic evaluation lifecycle, from initial setup to reevaluation.

Overview of agentic evaluations

To evaluate your agentic AI at scale, follow the workflow described below:

Create your first automated evaluation run.
Get acquainted with the agentic evaluations homepage and the guided setup for an automated evaluation.
Track and monitor progress.
In-progress automated evaluations can provide important information about agentic AI performance. See any initial problems before all the results come through.
Review the result outputs.
1. See LLM-judged scores.
2. Identify consistent issues.
3. Trace issues back to their source.
4. Apply optimizations.
Create automated evaluation runs for other agentic workflows or AI agents.
Create custom metrics to evaluate against your specific business needs.

Additional information