3

MixSumm: Topic-based Data Augmentation using LLMs for Low-resource Extractive Text Summarization

MixSumm: Topic-based Data Augmentation using LLMs for Low-resource Extractive Text Summarization

Low-resource extractive text summarization is a vital but heavily underexplored area of research. Prior literature either focuses on …

Issam H. Laradji, Gaurav Sahu

ArXiv, 2024.

NNetscape Navigator: Complex Demonstrations for Web Agents Without a Demonstrator

NNetscape Navigator: Complex Demonstrations for Web Agents Without a Demonstrator

We introduce NNetscape Navigator (NNetnav), a method for training web agents entirely through synthetic demonstrations. These …

Shikhar Murty, Hao Zhu, Dzmitry Bahdanau, Chris Manning

ArXiv, 2024.

Pruning Sparse Tensor Neural Networks Enables Deep Learning for 3D Ultrasound Localization Microscopy

Pruning Sparse Tensor Neural Networks Enables Deep Learning for 3D Ultrasound Localization Microscopy

Ultrasound Localization Microscopy (ULM) is a non-invasive technique that allows for the imaging of micro-vessels in vivo, at depth and …

Brice Rauby, Paul Xing, Jonathan Porée, Maxime Gasse, Jean Provost

ArXiv, 2024.

StarCoder 2 and The Stack v2: The Next Generation

StarCoder 2 and The Stack v2: The Next Generation

The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code …

Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar, Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Dmitry Abulkhanov, Indraneil Paul, Zhuang Li, Wen-Ding Li, Megan Risdal, Jia Li, Terry Yue Zhuo, Nii Osae Osae Dade, Lucas Krauß, Naman Jain, Yixuan Su, Xuanli He, Edoardo Abati, Yekun Chai, Xiangru Tang, Christopher Akiki, Chenghao Mou, Binyuan Hui, Nicolas Patry, Canwen Xu, Julian McAuley, Han Hu, Torsten Scholak, Sébastien Paquet, Jennifer Robinson, Carolyn Jane Anderson, Nicolas Chapados, Mostofa Patwary, Nima Tajbakhsh, Yacine Jernite, Carlos Muñoz Ferrandis, Lingming Zhang, Sean Hughes, Thomas Wolf , Arjun Guha, Leandro von Werra, Harm de Vries, Alex Gu, Armel Zebaze, Evgenii Zheltonozhskii, Jian Zhu, Manan Dey, Marc Marone, Mayank Mishra, Muhtasham Oblokulov, Olivier Dehaene, Qian Liu, Tri Dao, Wenhao Yu, Niklas Muennighoff

ArXiv, 2024.

StarVector: Generating Scalable Vector Graphics Code from Images and Text

Scalable Vector Graphics (SVGs) have become integral in modern image rendering and graphic design applications due to their infinite …

Juan A. Rodriguez, Shubham Agarwal, Abhay Puri, Issam H. Laradji, Sai Rajeswar Mudumba, Pau Rodriguez, David Vazquez, Christopher Pal, Marco Pedersoli

ArXiv, 2024.

TapeAgents: a Holistic Framework for Agent Development and Optimization

TapeAgents: a Holistic Framework for Agent Development and Optimization

We present TapeAgents, an agent framework that leverages a structured, replayable log (tape) of the agent session to facilitate all …

Dzmitry Bahdanau, Nicolas Gontier, Gabriel Huang, Ehsan Kamalloo, Rafael Pardinas, Alexandre Piche, Torsten Scholak, Oleh Shliazhko, Jordan Prince Tremblay, Karam Ghanem, Soham Parikh, Mitul Tiwari, Quaizar Vohra

ArXiv, 2024.

The BigCode Project Governance Card

The BigCode Project Governance Card

This document serves as an overview of the different mechanisms and areas of governance in the BigCode project. It aims to support …

Sean Hughes, Harm de Vries, Jennifer Robinson, Carlos Muñoz Ferrandis, Loubna Ben Allal, Leandro von Werra, Jennifer Ding, Sébastien Paquet, Yacine Jernite

ArXiv, 2024.

RepoFusion: Training Code Models to Understand Your Repository

Despite the huge success of Large Language Models (LLMs) in coding assistants like GitHub Copilot, these models struggle to understand …

Disha Shrivastava, Denis Kocetkov, Harm de Vries, Dzmitry Bahdanau, Torsten Scholak

ArXiv, 2023.

Hierarchical Residual Attention Network for Single Image Super-Resolution

Hierarchical Residual Attention Network for Single Image Super-Resolution

Convolutional neural networks are the most successful models in single image super-resolution. Deeper networks, residual connections, …

Parichehr Behjati, Pau Rodriguez, Armin Mehri, Isabelle Hupont, Carles Fernandez, Jordi Gonzalez

ArXiv, 2020.

On the Information Complexity of Proper Learners for VC Classes in the Realizable Case

We provide a negative resolution to a conjecture of Steinke and Zakynthinou (2020a), by showing that their bound on the conditional …

Mahdi Haghifam, Gintare Karolina Dziugaite, Shay Moran, Daniel M. Roy

ArXiv, 2020.