Multi-modal Learning

One of the challenges in robotics is to enable robotic units with the reasoning capability that would be robust enough to execute …

ArXiv, 2024.

Scalable Vector Graphics (SVGs) have become integral in modern image rendering and graphic design applications due to their infinite …

ArXiv, 2024.

Text-conditioned image generation models have recently shown immense qualitative success using denoising diffusion processes. However, …

Conference on Neural Information Processing Systems (NeurIPS), 2023.

Large pre-trained models have proved to be remarkable zero- and (prompt-based) few-shot learners in unimodal vision and language tasks. …

European Chapter of the Association for Computational Linguistics (EACL), 2023.

The generative modeling landscape has experienced tremendous growth in recent years, particularly in generating natural images and art. …

International Conference of Learning Representations (ICLR), 2023.

Robots in many real-world settings have access to force/torque sensors in their gripper and tactile sensing is often necessary in tasks …

Conference on Robot Learning (CoRL), 2022.

Metric-based meta-learning techniques have successfully been applied to few-shot classification problems. In this paper, we propose to …

Conference on Neural Information Processing Systems (NeurIPS), 2019.

For embodied agents to infer representations of the underlying 3D physical world they inhabit, they should efficiently combine …

Conference on Neural Information Processing Systems (NeurIPS), 2019.

Social media, as a major platform for communication and information exchange, is a rich repository of the opinions and sentiments of …

International Journal of Social Science and Humanity (IJSSH), 2019.

Allowing humans to interactively train artificial agents to understand language instructions is desirable for both practical and …

International Conference on Learning Representations (ICLR), 2019.