ServiceNow recherche

Multi-modal Learning

Adaptive Cross-Modal Few-shot Learning
Metric-based meta-learning techniques have successfully been applied to few-shot classification problems. In this paper, we propose to …
Neural Multisensory Scene Inference
For embodied agents to infer representations of the underlying 3D physical world they inhabit, they should efficiently combine …
Integrating Vision and Language in Social Networks for Identifying Visual Patterns of Personality Traits
Social media, as a major platform for communication and information exchange, is a rich repository of the opinions and sentiments of …
BabyAI: A Platform to Study the Sample Efficiency of Grounded Language Learning
Allowing humans to interactively train artificial agents to understand language instructions is desirable for both practical and …