About
People
Publications
Open source
Demos
Events
Blog
Careers
Contact
English
English
Français
ServiceNow
ServiceNow AI Research
Publication_types
1
ServiceNow AI Research
1
Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning
Decoder-only Transformers often struggle with complex reasoning tasks, particularly arithmetic reasoning requiring multiple sequential …
Md Rifat Arefin
,
Gopeshh Subbaraj
,
Nicolas Gontier
,
Yann LeCun
,
Irina Rish
,
Ravid Shwartz-Ziv
,
Christopher Pal
International Conference of Learning Representations (ICLR), 2025.
PDF
Cite
Slides
Video
VCR: Visual Caption Restoration
We introduce Visual Caption Restoration (VCR), a novel vision-language task that challenges models to accurately restore partially …
Tianyu Zhang
,
Suyuchen Wang
,
Lu Li
,
Ge Zhang
,
Perouz Taslakian
,
Sai Rajeswar Mudumba
,
Jie Fu
,
Bang Liu
,
Yoshua Bengio
International Conference of Learning Representations (ICLR), 2025.
PDF
Cite
StarVector: Generating Scalable Vector Graphics Code from Images and Text
Scalable Vector Graphics (SVGs) are vital for modern image rendering due to their scalability and versatility. Previous SVG generation …
Juan A. Rodriguez
,
Abhay Puri
,
Shubham Agarwal
,
Issam H. Laradji
,
Pau Rodriguez
,
Sai Rajeswar Mudumba
,
David Vazquez
,
Christopher Pal
,
Marco Pedersoli
AAAI Demos, 2025.
PDF
Cite
Video
Do LLMs Know When to NOT Answer? Investigating Abstention Abilities of Large Language Models
Abstention Ability (AA) is a critical aspect of Large Language Model (LLM) reliability, referring to an LLM’s capability to …
Nishanth Madhusudhan
,
Sathwik Tejaswi Madhusudhan
,
Vikas Yadav
,
Masoud Hashemi
International Conference on Computational Linguistics (COLING), 2025.
PDF
Cite
Few-shot Learning for Sign Language Recognition with Embedding Propagation
https://nafath.mada.org.qa/nafath-article/mcn2704/
Amjad Alsulami,
,
KHAWLAH BAJBAA
,
Hamzah Luqman
,
Issam H. Laradji
Nafath, 2024.
PDF
Cite
Expecting The Unexpected: Towards Broad Out-Of-Distribution Detection
Deployed machine learning systems require some mechanism to detect out-of-distribution (OOD) inputs. Existing research mainly focuses …
Charles Guille-Escuret
,
Pierre-André Noël
,
Ioannis Mitliagkas
,
David Vazquez
,
João Monteiro
NeurIPS Datasets and Benchmarks Track (NeurIPS Datasets), 2024.
PDF
Cite
Multimodal foundation world models for generalist embodied agents
Learning generalist agents, able to solve multitudes of tasks in different domains is a long-standing problem. Reinforcement learning …
Pietro Mazzaglia
,
Tim Verbelen
,
Bart Dhoedt
,
Aaron Courville
,
Sai Rajeswar Mudumba
Neural Information Processing Systems (NeurIPS), 2024.
PDF
Cite
RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content
Large Language Models (LLMs) are trained on vast amounts of data, most of which is automatically scraped from the internet. This data …
João Monteiro
,
Pierre-André Noël
,
Étienne Marcotte
,
Sai Rajeswar Mudumba
,
Valentina Zantedeschi
,
David Vazquez
,
Nicolas Chapados
,
Christopher Pal
,
Perouz Taslakian
NeurIPS Datasets and Benchmarks Track (NeurIPS Datasets), 2024.
PDF
Cite
Video
WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
The ability of large language models (LLMs) to mimic human-like intelligence has led to a surge in LLM-based autonomous agents. Though …
Léo Boisvert
,
Megh Thakkar
,
Maxime Gasse
,
Massimo Caccia
,
Thibault Le Sellier De Chezelles
,
Quentin Cappart
,
Nicolas Chapados
,
Alexandre Lacoste
,
Alexandre Drouin
NeurIPS Datasets and Benchmarks Track (NeurIPS Datasets), 2024.
PDF
Cite
Video
Change Is the Only Constant: Dynamic LLM Slicing based on Layer Redundancy
This paper introduces a novel model compression approach through dynamic layer-specific pruning in Large Language Models (LLMs), …
Razvan-Gabriel Dumitru
,
Paul-Ioan Clotan
,
Vikas Yadav
,
Darius Peteleaza
,
Mihai Surdeanu
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
PDF
Cite
«
»
Cite
×