About
People
Publications
Open source
Demos
Events
Blog
Careers
Contact
English
English
Français
ServiceNow
ServiceNow AI Research
Publication_types
3
ServiceNow AI Research
3
Breaking the Bottleneck with DiffuApriel: High-Throughput Diffusion LMs with Mamba Backbone
Diffusion-based language models have recently emerged as a promising alternative to autoregressive generation, yet their reliance on …
Vaibhav Singh
,
Oleksiy Ostapenko
,
Pierre-André Noël
,
Torsten Scholak
arXiv, 2025.
PDF
Cite
3rd Continual Learning Workshop Challenge on Egocentric Category and Instance Level Object Understanding
Continual Learning, also known as Lifelong or Incremental Learning, has recently gained renewed interest among the Artificial …
Lorenzo Pellegrini
,
Chenchen Zhu
,
Fanyi Xiao
,
Zhicheng Yan
,
Antonio Carta
,
Matthias De Lange
,
Vincenzo Lomonaco
,
Roshan Sumbaly
,
Pau Rodriguez
,
David Vazquez
ArXiv, 2024.
PDF
Cite
Automatic Data Augmentation Learning using Bilevel Optimization for Histopathological Images
One of the main challenges faced when training a deep learning based model to classify histopathological images is the color and shape …
Saypraseuth Mounsaveng
,
Issam H. Laradji
,
David Vazquez
,
Marco Pedersoli
,
Ismail Ben Ayed
ArXiv, 2024.
PDF
Cite
FM2DS: Few-Shot Multimodal Multihop Data Synthesis with Knowledge Distillation for Question Answering
Multimodal multihop question answering is a complex task that requires reasoning over multiple sources of information, such as images …
Issam H. Laradji
,
Amirhossein Abaskohi
,
Giuseppe Carenini
,
Spandana Gella
ArXiv, 2024.
PDF
Cite
Video
InCoRo: In-Context Learning for Robotics Control with Feedback Loops
One of the challenges in robotics is to enable robotic units with the reasoning capability that would be robust enough to execute …
Jiaquiang Ye Zhu
,
Carla Gomez
,
David Vazquez
,
Michal Drozdzal
ArXiv, 2024.
PDF
Cite
Language Decision Transformers with Exponential Tilt for Interactive Text Environments
Text-based game environments are challenging because agents must deal with long sequences of text, execute compositional actions using …
Nicolas Gontier
,
Pau Rodriguez
,
Issam H. Laradji
,
David Vazquez
,
Christopher Pal
ArXiv, 2024.
PDF
Cite
Layer-Wise Quantization: A Pragmatic and Effective Method for Quantizing LLMs Beyond Integer Bit-Levels
We present a simple meta quantization approach that quantizes different layers of a large language model (LLM) at different bit levels, …
Razvan-Gabriel Dumitru
,
Vikas Yadav
,
Rishabh Maheshwary
,
Paul-Ioan Clotan
,
Sathwik Tejaswi Madhusudhan
,
Mihai Surdeanu
ArXiv, 2024.
PDF
Cite
Layered gradient accumulation and modular pipeline parallelism: fast and efficient training of large language models
The advent of the transformer has sparked a quick growth in the size of language models, far outpacing hardware improvements. (Dense) …
Joel Lamy Poirier
ArXiv, 2024.
PDF
Cite
LitLLM: A Toolkit for Scientific Literature Review
Literature reviews are an essential component of scientific research. We explore the zero-shot abilities of recent large language …
Shubham Agarwal
,
Abhay Puri
,
Issam H. Laradji
,
Laurent Charlin
,
Christopher Pal
ArXiv, 2024.
PDF
Cite
M-RewardBench: Evaluating Reward Models in Multilingual Settings
Reward models (RMs) have driven the state-of-the-art performance of LLMs today by enabling the integration of human feedback into the …
Srishti Gureja
,
Lester James V. Miranda
,
Shayekh Bin Islam
,
Rishabh Maheshwary
,
Drishti Sharma
,
Gusti Winata
,
Nathan Lambert
,
Sebastian Ruder
,
Sara Hooker
,
Marzieh Fadaee
ArXiv, 2024.
PDF
Cite
»
Cite
×