Large Language Models

We present TapeAgents, an agent framework that leverages a structured, replayable log (tape) of the agent session to facilitate all …

arXiv, 2024.

This document serves as an overview of the different mechanisms and areas of governance in the BigCode project. It aims to support …

arXiv, 2024.

The extraction of a small number of relevant insights from vast amounts of data is a crucial component of data-driven decision-making. …

Workshop at the Conference on Neural Information Processing Systems (NeurIPS), 2023.

In this work, we present Lag-Llama, a general-purpose probabilistic time series forecasting model trained on a large collection of time …

Workshop at the Conference on Neural Information Processing Systems (NeurIPS), 2023.

In this work, we investigate the challenges associated with developing goal-driven AI agents capable of performing open-ended tasks in …

Workshop at the Conference on Neural Information Processing Systems (NeurIPS), 2023.

Generating high-quality summaries for chat dialogs often requires large labeled datasets. We propose a method to efficiently use …

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.

Humans possess a remarkable ability to assign novel interpretations to linguistic expressions, enabling them to learn new words and …

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.

Data augmentation is a widely used technique to address the problem of text classification when there is a limited amount of training …

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.

Equivariant networks are specifically designed to ensure consistent behavior with respect to a set of input transformations, leading to …

Conference on Neural Information Processing Systems (NeurIPS), 2023.

Understanding the causal relationships that underlie a system is a fundamental prerequisite to accurate decision-making. In this work, …

Workshop on Structured Probabilistic Inference & Generative Modeling (ICML), 2023.