ServiceNow IA recherche

S-LLM: Semi-Supervised Large Language Model for Chat Summarization

Résumé

As producing high-quality summaries of chat dialogues currently requires large labeled datasets, we propose a method to efficiently leverage unlabeled data. Using a pseudo-labeling approach and post-processing to improve the quality of the pseudo-summaries, we are able to improve the Rouge-2 score of DistilBART by more than 6 points when using only 1% of labeled data on the TWEETSUMM dataset.

Publication
Montreal AI Symposium (MAIS)
Issam H. Laradji
Issam H. Laradji
Research Manager

Research Manager at Frontier AI Research located at [‘Vancouver, Canada’].

David Vazquez
David Vazquez
Director of AI Research

Director of AI Research at AI Research Management located at [‘Montreal, Canada’].