ServiceNow Research
Visual Question Answering
WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation
Understanding diverse web data and automating web development presents an exciting challenge for agentic AI. While existing benchmarks …
Rabiul Awal, Mahsa Massoud, Zichao Li, Aarash Feizi, Suyuchen Wang, Christopher Pal, Aishwarya Agrawal, David Vazquez, Siva Reddy, Juan A. Rodriguez, Perouz Taslakian, Sai Rajeswar Mudumba
Workshop at the Computer Vision and Pattern Recognition Conference (CVPR), 2025.
WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation
Understanding diverse web data and automating web development presents an exciting challenge for agentic AI. While existing benchmarks …
Rabiul Awal, Mahsa Massoud, Zichao Li, Aarash Feizi, Suyuchen Wang, Christopher Pal, Aishwarya Agrawal, David Vazquez, Siva Reddy, Juan A. Rodriguez, Perouz Taslakian, Spandana Gella, Sai Rajeswar Mudumba
Workshop at the International Conference on Learning Representations (ICLR), 2025.
Visual Question Answering From Another Perspective: CLEVR Mental Rotation Tests
Different types of mental rotation tests have been used extensively in psychology to understand human visual reasoning and perception. …
Christopher Beckham, Martin Weiss, Florian Golemo, Sina Honari, Derek Nowrouzezahrai, Christopher Pal
Pattern Recognition (PR), 2022.