Agents

The ability of large language models (LLMs) to mimic human-like intelligence has led to a surge in LLM-based autonomous agents. Though …

NeurIPS Datasets and Benchmarks Track (NeurIPS Datasets), 2024.

The ability to predict outcomes of interactions between embodied agents and objects is paramount in the robotic setting. While …

Learning Effective Abstractions for Planning, 2024.

Recent advancements in large language models (LLMs) have sparked interest in developing autonomous web agents capable of performing …

NOW AI Conference (NOWAI), 2024.

The BrowserGym ecosystem addresses the growing need for efficient evaluation and benchmarking of web agents, particularly those …

Montreal AI Symposium (MAIS), 2024.

Learning generalist embodied agents, able to solve multitudes of tasks in different domains is a long-standing problem. Reinforcement …

Workshop at the International Conference of Machine Learning (ICML), 2024.

We study the use of large language model-based agents for interacting with software via web browsers. Unlike prior work, we focus on …

International Conference on Machine Learning (ICML), 2024.

Contemporary Large Language Models (LLMs) exhibit a high degree of code generation and comprehension capability. A particularly …

Arkil Patel, Siva Reddy, Dzmitry Bahdanau, Pradeep Dasigi

North American Chapter of the Association for Computational Linguistics (NAACL), 2024.

The accurate modeling of dynamics in interactive environments is critical for successful long-range prediction. Such a capability could …

International Conference of Learning Representations (ICLR), 2024.

In today’s digitally driven world, dialogue systems play a pivotal role in enhancing user interactions, from customer service to …

Workshop at the International Conference of Learning Representation (ICLR), 2024.

We study the use of large language model-based agents for interacting with software via web browsers. Unlike prior work, we focus on …

Workshop at the International Conference of Learning Representation (ICLR), 2024.