Large Language Models

Decoder-only Transformers often struggle with complex reasoning tasks, particularly arithmetic reasoning requiring multiple sequential …

International Conference of Learning Representations (ICLR), 2025.

Literature reviews are an essential component of scientific research, but they remain time-intensive and challenging to write, …

Transactions on Machine Learning Research (TMLR), 2025.

Abstention Ability (AA) is a critical aspect of Large Language Model (LLM) reliability, referring to an LLM’s capability to …

International Conference on Computational Linguistics (COLING), 2025.

Recent advancements in large language models (LLMs) have spurred interest in developing autonomous agents capable of performing complex …

Workshop at the Neural Information Processing Systems (NeurIPS), 2024.

Forecasting is a critical task in decision making across various domains. While numerical data provides a foundation, it often lacks …

Workshop at the Neural Information Processing Systems (NeurIPS), 2024.

Numerous decision-making tasks require estimating causal effects under interventions on different parts of a system. As practitioners …

Workshop at the Neural Information Processing Systems (NeurIPS), 2024.

Large Language Models (LLMs) are trained on vast amounts of data, most of which is automatically scraped from the internet. This data …

NeurIPS Datasets and Benchmarks Track (NeurIPS Datasets), 2024.

In-context learning (ICL) approaches typically leverage prompting to condition decoder-only language model generation on reference …

Workshop at the Neural Information Processing Systems (NeurIPS), 2024.

This paper introduces a novel model compression approach through dynamic layer-specific pruning in Large Language Models (LLMs), …

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.

Direct Preference Optimization (DPO) is an effective technique that leverages pairwise preference data (usually one chosen and rejected …

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.