About
People
Publications
Open Source
Demos
Events
Blog
Careers
Contact
English
English
Français
ServiceNow
ServiceNow AI Research
Tags
Large Language Models
ServiceNow AI Research
Large Language Models
Grounding Computer Use Agents on Human Demonstrations
Building reliable computer-use agents requires grounding: accurately connecting natural language instructions to the correct on-screen …
Aarash Feizi
,
Shravan Nayak
,
Xiangru Jian
,
Kevin Qinghong Lin
,
Kaixin Li
,
Rabiul Awal
,
Xing Han Lu
,
Johan Obando
,
Juan A. Rodriguez
,
Nicolas Chapados
,
David Vazquez
,
Adriana Romero Soriano
,
Reihaneh Rabbany
,
Perouz Taslakian
,
Christopher Pal
,
Spandana Gella
,
Sai Rajeswar Mudumba
International Conference on Learning Representations, 2026.
PDF
Cite
Causal Differentiating Concepts: Interpreting LM Behavior via Causal Representation Learning
Language model activations entangle concepts that mediate their behavior, making it difficult to interpret these factors, which has …
Navita Goyal
,
Hal Daumé III
,
Alexandre Drouin
,
Dhanya Sridhar
Neural Information Processing Systems (NeurIPS), 2025.
PDF
Cite
Breaking the Bottleneck with DiffuApriel: High-Throughput Diffusion LMs with Mamba Backbone
Diffusion-based language models have recently emerged as a promising alternative to autoregressive generation, yet their reliance on …
Vaibhav Singh
,
Oleksiy Ostapenko
,
Pierre-André Noël
,
Torsten Scholak
arXiv, 2025.
PDF
Cite
Apriel-MTP: Multi-Token Prediction for Faster and More Efficient Language
We introduce multi-token prediction (MTP) variants of the Apriel model family, designed to generate multiple to- kens per forward pass. …
Raymond Li
,
Nanda Harishankar Krishna
,
Oleksiy Ostapenko
,
Luke Kumar
,
Torsten Scholak
NOW AI, 2025.
Cite
Apriel-SSM: Converting Pre-Trained Transformer LLMs Into Subquadratic Hybrid Models Through Iterative End-to-End Distillation
Large Language Models achieve their success through transformer architectures with attention mechanisms that compute token …
Oleksiy Ostapenko
,
Shambhavi Mishra
,
Luke Kumar
,
Denis Kocetkov
,
Raymond Li
,
Joel Lamy Poirier
,
Sébastien Paquet
,
Torsten Scholak
NOW AI, 2025.
Cite
Faster On-Policy Reinforcement Learning for Long Sequence Generation
Reinforcement Learning (RL) is increasingly utilized to enhance the reasoning capabilities of Large Language Models (LLMs). However, …
Alexandre Piche
,
Ehsan Kamalloo
,
Rafael Pardinas
,
Xiaoyin Chen
,
Dzmitry Bahdanau
NOW AI, 2025.
Cite
StarVLM ReRank: Better UI Grounding via Enhanced Visual Input and Element Position Perception
UI grounding is a fundamental task for enterprise workflow automation. This task maps natural language instructions to precise pixel …
Suyuchen Wang
,
Tianyu Zhang
,
Ahmed Masry
,
Christopher Pal
,
Bang Liu
,
Perouz Taslakian
,
Spandana Gella
NOW AI, 2025.
Cite
Unifying Autoregressive and Diffusion-Based Sequence Generation
We present significant extensions to diffusion-based language models, blurring the line with autoregressive ones. We introduce …
Nima Fathi
,
Torsten Scholak
,
Pierre-André Noël
NOW AI, 2025.
Cite
WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation
We present WebMMU, a multilingual benchmark that evaluates three core web tasks: (1) website visual question answering, (2) code …
Sai Rajeswar Mudumba
,
Christopher Pal
,
Perouz Taslakian
,
Spandana Gella
,
Rabiul Awal
,
Aarash Feizi
,
Mahsa Massoud
,
Zichao Li
,
Siva Reddy
,
David Vazquez
,
Suyuchen Wang
NOW AI, 2025.
Cite
Using Scaling Laws for Data Source Utility Estimation in Domain-Specific Pre-Training
We introduce a framework for optimizing domain-specific dataset construction in foundation model training. Specifically, we seek a …
Oleksiy Ostapenko
,
Charles Guille-Escuret
,
Luke Kumar
,
Max Tian
,
Denis Kocetkov
,
Gopeshh Subbaraj
,
Raymond Li
,
Joel Lamy Poirier
,
Sébastien Paquet
,
Torsten Scholak
Conference on Language Modeling Workshops, 2025.
PDF
Cite
»
Cite
×