ServiceNow AI Research

DRBench: A Realistic Benchmark for Enterprise Deep Research

Abstract

We introduce DRBench, a benchmark for evaluating AI agents on complex, open-ended deep research tasks in enterprise settings. Unlike prior benchmarks that focus on simple questions or web-only queries, DRBench evaluates agents on multi-step queries (for example, “What changes should we make to our product roadmap to ensure compliance with this standard?”) that require identifying supporting facts from both the public web and a private company knowledge base. Each task is grounded in realistic user personas and enterprise context, spanning a heterogeneous search space that includes productivity software, cloud file systems, emails, chat conversations, and the open web. Tasks are generated through a carefully designed synthesis pipeline with human-in-the-loop verification, and agents are evaluated on their ability to recall relevant insights, maintain factual accuracy, and produce coherent, well-structured reports. We release 15 deep research tasks across 10 domains, such as Sales, Cybersecurity, and Compliance. We demonstrate the effectiveness of DRBench by evaluating diverse deep research (DR) agents across open- and closed-source models (such as GPT, Llama, and Qwen) and DR strategies, highlighting their strengths, weaknesses, and the critical path for advancing enterprise deep research.

Publication
International Conference on Learning Representations
Amirhossein Abaskohi
Visiting Researcher

Visiting Researcher at Frontier AI Research located at Vancouver, BC, Canada.

Tianyi Chen
Applied Research Scientist

Applied Research Scientist at AI Research Deployment located at Toronto, ON, Canada.

Miguel Muñoz-Mármol
AI Developer

AI Developer at AI Research Deployment located at Toronto, ON, Canada.

Étienne Marcotte
Applied Research Scientist

Applied Research Scientist at Frontier AI Research located at Montreal, QC, Canada.

Xing Han Lu
Visiting Researcher

Visiting Researcher at Frontier AI Research located at Montreal, QC, Canada.

Spandana Gella
Research Manager

Research Manager at Frontier AI Research located at Montreal, QC, Canada.

Christopher Pal
Distinguished Scientist

Distinguished Scientist at AI Research Partnerships & Ecosystem located at Montreal, QC, Canada.

Alexandre Drouin
Head of Frontier AI Research

Head of Frontier AI Research at Frontier AI Research located at Montreal, QC, Canada.

Issam H. Laradji
Research Manager

Research Manager at Frontier AI Research located at Vancouver, BC, Canada.