ServiceNow Research

Multi-modal Learning

Neural Multisensory Scene Inference
For embodied agents to infer representations of the underlying 3D physical world they inhabit, they should efficiently combine …

Integrating Vision and Language in Social Networks for Identifying Visual Patterns of Personality Traits
Social media, as a major platform for communication and information exchange, is a rich repository of the opinions and sentiments of …

BabyAI: A Platform to Study the Sample Efficiency of Grounded Language Learning
Allowing humans to interactively train artificial agents to understand language instructions is desirable for both practical and …