About
People
Publications
Open Source
Demos
Events
Blog
Careers
Contact
English
English
Français
ServiceNow
ServiceNow Research
Tags
Computer Vision
ServiceNow Research
Computer Vision
GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks
While numerous recent benchmarks focus on evaluating generic Vision-Language Models (VLMs), they fall short in addressing the unique …
Muhammad Sohail Danish
,
Muhammad Akhtar Munir
,
Syed Roshaan Ali Shah
,
Kartik Kuckreja
,
Fahad Shahbaz Khan
,
Paolo Fraccaro
,
Alexandre Lacoste
,
Salman Khan
International Conference on Computer Vision (ICCV), 2025.
PDF
Cite
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
Multimodal AI has the potential to significantly enhance document-understanding tasks, such as processing receipts, understanding …
Juan A. Rodriguez
,
Xiangru Jian
,
Siba Smarak Panigrahi
,
Tianyu Zhang
,
Aarash Feizi
,
Abhay Puri
,
Akshay Kalkunte
,
Francois Savard
,
Ahmed Masry
,
Shravan Nayak
,
Rabiul Awal
,
Mahsa Massoud
,
Amirhossein Abaskohi
,
Zichao Li
,
Suyuchen Wang
,
Pierre-André Noël
,
Mats L. Richter
,
Saverio Vadacchino
,
Shubham Agarwal
,
Sanket Biswas
,
Sara Shanian
,
Ying Zhang
,
Sathwik Tejaswi Madhusudhan
,
João Monteiro
,
Krishnamurthy (Dj) Dvijotham
,
Torsten Scholak
,
Nicolas Chapados
,
Sepideh Kharaghani
,
Sean Hughes
,
Tamer Özsu
,
Siva Reddy
,
Marco Pedersoli
,
Yoshua Bengio
,
Christopher Pal
,
Issam H. Laradji
,
Spandana Gella
,
Perouz Taslakian
,
David Vazquez
,
Sai Rajeswar Mudumba
International Conference of Learning Representations (ICLR), 2025.
PDF
Cite
Code
Video
Deep Learning in Ultrasound localization Microscopy: Applications and perspectives
Ultrasound Localization Microscopy (ULM) is a novel super-resolution imaging technique that can image the vasculature in vivo at depth …
Brice Rauby
,
Paul Xing
,
Maxime Gasse
,
Jean Provost
IEEE Transactions on Ultrasonics, Ferroelectrics, and Frequency Control (IEEE TUFFC), 2025.
PDF
Cite
Pruning Sparse Tensor Neural Networks Enables Deep Learning for 3D Ultrasound Localization Microscopy
Ultrasound Localization Microscopy (ULM) is a non-invasive technique that allows for the imaging of micro-vessels in vivo, at depth and …
Brice Rauby
,
Paul Xing
,
Jonathan Porée
,
Maxime Gasse
,
Jean Provost
IEEE Transactions on Image Processing (IEEE TIP), 2025.
PDF
Cite
VCR: Visual Caption Restoration
We introduce Visual Caption Restoration (VCR), a novel vision-language task that challenges models to accurately restore partially …
Tianyu Zhang
,
Suyuchen Wang
,
Lu Li
,
Ge Zhang
,
Perouz Taslakian
,
Sai Rajeswar Mudumba
,
Jie Fu
,
Bang Liu
,
Yoshua Bengio
Workshop at the Neural Information Processing Systems (NeurIPS), 2024.
PDF
Cite
Code
Few-shot Learning for Sign Language Recognition with Embedding Propagation
Sign language is a primary channel for the deaf and hard-hearing to communicate. Sign language consists of many signs with different …
Amjad Alsulami,
,
KHAWLAH BAJBAA
,
Issam H. Laradji
,
Hamzah Luqman
ArXiv, 2024.
PDF
Cite
FM2DS: Few-Shot Multimodal Multihop Data Synthesis with Knowledge Distillation for Question Answering
Multimodal multihop question answering is a complex task that requires reasoning over multiple sources of information, such as images …
Issam H. Laradji
,
Amirhossein Abaskohi
,
Giuseppe Carenini
,
Spandana Gella
ArXiv, 2024.
PDF
Cite
Code
Video
Pruning Sparse Tensor Neural Networks Enables Deep Learning for 3D Ultrasound Localization Microscopy
Ultrasound Localization Microscopy (ULM) is a non-invasive technique that allows for the imaging of micro-vessels in vivo, at depth and …
Brice Rauby
,
Paul Xing
,
Jonathan Porée
,
Maxime Gasse
,
Jean Provost
ArXiv, 2024.
PDF
Cite
Code
StarVector: Generating Scalable Vector Graphics Code from Images and Text
Scalable Vector Graphics (SVGs) have become integral in modern image rendering and graphic design applications due to their infinite …
Juan A. Rodriguez
,
Shubham Agarwal
,
Abhay Puri
,
Issam H. Laradji
,
Sai Rajeswar Mudumba
,
Pau Rodriguez
,
David Vazquez
,
Christopher Pal
,
Marco Pedersoli
ArXiv, 2024.
PDF
Cite
Egocentric Planning for Scalable Embodied Task Achievement
Embodied agents face significant challenges when tasked with performing actions in diverse environments, particularly in generalizing …
Xiaotian Liu
,
Hector Palacios
,
Christian Muise
Conference on Neural Information Processing Systems (NeurIPS), 2023.
PDF
Cite
»
Cite
×