Framing LLMs as products of complex supply chains rather than monolithic entities facilitates the creation of nuanced approaches to …

Agathe Balayn, Fanny Rancourt, Fabio Casati, Ujwal Gadiraju

Computer-Supported Cooperative Work and Social Computing (CSCW), 2026.

Training-time privileged information (PI) can enable language models to succeed on tasks they would otherwise fail, making it a …

International Conference on Machine Learning (ICML), 2026.

The rapid evolution of software libraries presents a significant challenge for code generation models, which must adapt to frequent …

Annual Meeting of the Association for Computational Linguistics (ACL), 2026.

Recent progress in large language models (LLMs) has focused on producing responses that meet human expectations and align with shared …

ACM Conference on Fairness, Accountability, and Transparency (FAccT), 2026.

Iterative RAG for multi-hop question answering faces challenges with lengthy contexts and the buildup of irrelevant information. This …

Language Resources and Evaluation Conference (LREC), 2026.

We introduce DRBench, a benchmark for evaluating AI agents on complex, open-ended deep research tasks in enterprise settings. Unlike …

International Conference on Learning Representations (ICLR), 2026.

Building reliable computer-use agents requires grounding: accurately connecting natural language instructions to the correct on-screen …

International Conference on Learning Representations (ICLR), 2026.

Leading language model (LM) providers like OpenAI and Anthropic allow customers to fine-tune frontier LMs for specific use cases. To …

International Conference on Learning Representations (ICLR), 2026.

Scientific research often seeks to understand the causal structure underlying high-level variables in a system. For example, climate …

Causal Learning and Reasoning (CLeaR), 2026.

Workflows are a fundamental component of automation in enterprise platforms, enabling the orchestration of tasks, data processing, and …

European Chapter of the Association for Computational Linguistics (EACL), 2026.