Related papers: Summarization is (Almost) Dead

Benchmarking Large Language Models for News Summarization

Large language models (LLMs) have shown promise for automatic summarization but the reasons behind their successes are poorly understood. By conducting a human evaluation on ten LLMs across different pretraining methods, prompts, and model…

Computation and Language · Computer Science 2023-02-01 Tianyi Zhang , Faisal Ladhak , Esin Durmus , Percy Liang , Kathleen McKeown , Tatsunori B. Hashimoto

Exploring Large Language Models for Code Explanation

Automating code documentation through explanatory text can prove highly beneficial in code understanding. Large Language Models (LLMs) have made remarkable strides in Natural Language Processing, especially within software engineering tasks…

Software Engineering · Computer Science 2023-10-26 Paheli Bhattacharya , Manojit Chakraborty , Kartheek N S N Palepu , Vikas Pandey , Ishan Dindorkar , Rakesh Rajpurohit , Rishabh Gupta

A Comparative Study of Quality Evaluation Methods for Text Summarization

Evaluating text summarization has been a challenging task in natural language processing (NLP). Automatic metrics which heavily rely on reference summaries are not suitable in many situations, while human evaluation is time-consuming and…

Computation and Language · Computer Science 2024-07-02 Huyen Nguyen , Haihua Chen , Lavanya Pobbathi , Junhua Ding

Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization

While large language models (LLMs) can already achieve strong performance on standard generic summarization benchmarks, their performance on more complex summarization task settings is less studied. Therefore, we benchmark LLMs on…

Computation and Language · Computer Science 2024-07-15 Yixin Liu , Alexander R. Fabbri , Jiawen Chen , Yilun Zhao , Simeng Han , Shafiq Joty , Pengfei Liu , Dragomir Radev , Chien-Sheng Wu , Arman Cohan

Zero-shot Conversational Summarization Evaluations with small Large Language Models

Large Language Models (LLMs) exhibit powerful summarization abilities. However, their capabilities on conversational summarization remains under explored. In this work we evaluate LLMs (approx. 10 billion parameters) on conversational…

Computation and Language · Computer Science 2023-12-01 Ramesh Manuvinakurike , Saurav Sahay , Sangeeta Manepalli , Lama Nachman

The Lay Person's Guide to Biomedicine: Orchestrating Large Language Models

Automated lay summarisation (LS) aims to simplify complex technical documents into a more accessible format to non-experts. Existing approaches using pre-trained language models, possibly augmented with external background knowledge, tend…

Computation and Language · Computer Science 2024-02-22 Zheheng Luo , Qianqian Xie , Sophia Ananiadou

Large Language Models and Book Summarization: Reading or Remembering, Which Is Better?

Summarization is a core task in Natural Language Processing (NLP). Recent advances in Large Language Models (LLMs) and the introduction of large context windows reaching millions of tokens make it possible to process entire books in a…

Computation and Language · Computer Science 2026-03-12 Tairan Fu , Javier Conde , Pedro Reviriego , Javier Coronado-Blázquez , Nina Melero , Elena Merino-Gómez

Large Language Models for Automated Literature Review: An Evaluation of Reference Generation, Abstract Writing, and Review Composition

Large language models (LLMs) have emerged as a potential solution to automate the complex processes involved in writing literature reviews, such as literature collection, organization, and summarization. However, it is yet unclear how good…

Computation and Language · Computer Science 2025-08-22 Xuemei Tang , Xufeng Duan , Zhenguang G. Cai

LLMs as Evaluators: A Novel Approach to Evaluate Bug Report Summarization

Summarizing software artifacts is an important task that has been thoroughly researched. For evaluating software summarization approaches, human judgment is still the most trusted evaluation. However, it is time-consuming and fatiguing for…

Software Engineering · Computer Science 2024-09-04 Abhishek Kumar , Sonia Haiduc , Partha Pratim Das , Partha Pratim Chakrabarti

Can Large Language Model Summarizers Adapt to Diverse Scientific Communication Goals?

In this work, we investigate the controllability of large language models (LLMs) on scientific summarization tasks. We identify key stylistic and content coverage factors that characterize different types of summaries such as paper reviews,…

Computation and Language · Computer Science 2024-06-28 Marcio Fonseca , Shay B. Cohen

Leveraging Large Language Models for Zero-shot Lay Summarisation in Biomedicine and Beyond

In this work, we explore the application of Large Language Models to zero-shot Lay Summarisation. We propose a novel two-stage framework for Lay Summarisation based on real-life processes, and find that summaries generated with this method…

Computation and Language · Computer Science 2025-01-10 Tomas Goldsack , Carolina Scarton , Chenghua Lin

Not too long do read: Evaluating LLM-generated extreme scientific summaries

High-quality scientific extreme summary (TLDR) facilitates effective science communication. How do large language models (LLMs) perform in generating them? How are LLM-generated summaries different from those written by human experts?…

Computation and Language · Computer Science 2025-12-30 Zhuoqi Lyu , Qing Ke

Comparing Approaches to Automatic Summarization in Less-Resourced Languages

Automatic text summarization has achieved high performance in high-resourced languages like English, but comparatively less attention has been given to summarization in less-resourced languages. This work compares a variety of different…

Computation and Language · Computer Science 2026-01-01 Chester Palen-Michel , Constantine Lignos

On Learning to Summarize with Large Language Models as References

Recent studies have found that summaries generated by large language models (LLMs) are favored by human annotators over the original reference summaries in commonly used summarization datasets. Therefore, we study an LLM-as-reference…

Computation and Language · Computer Science 2024-07-19 Yixin Liu , Kejian Shi , Katherine S He , Longtian Ye , Alexander R. Fabbri , Pengfei Liu , Dragomir Radev , Arman Cohan

An Empirical Study of Many-to-Many Summarization with Large Language Models

Many-to-many summarization (M2MS) aims to process documents in any language and generate the corresponding summaries also in any language. Recently, large language models (LLMs) have shown strong multi-lingual abilities, giving them the…

Computation and Language · Computer Science 2025-05-20 Jiaan Wang , Fandong Meng , Zengkui Sun , Yunlong Liang , Yuxuan Cao , Jiarong Xu , Haoxiang Shi , Jie Zhou

LLM-ReSum: A Framework for LLM Reflective Summarization through Self-Evaluation

Reliable evaluation of large language model (LLM)-generated summaries remains an open challenge, particularly across heterogeneous domains and document lengths. We conduct a comprehensive meta-evaluation of 14 automatic summarization…

Computation and Language · Computer Science 2026-04-29 Huyen Nguyen , Haoxuan Zhang , Yang Zhang , Junhua Ding , Haihua Chen

Reranking-based Generation for Unbiased Perspective Summarization

Generating unbiased summaries in real-world settings such as political perspective summarization remains a crucial application of Large Language Models (LLMs). Yet, existing evaluation frameworks rely on traditional metrics for measuring…

Computation and Language · Computer Science 2025-06-23 Narutatsu Ri , Nicholas Deas , Kathleen McKeown

Zero-Shot Strategies for Length-Controllable Summarization

Large language models (LLMs) struggle with precise length control, particularly in zero-shot settings. We conduct a comprehensive study evaluating LLMs' length control capabilities across multiple measures and propose practical methods to…

Computation and Language · Computer Science 2025-02-12 Fabian Retkowski , Alexander Waibel

Applicability of Large Language Models and Generative Models for Legal Case Judgement Summarization

Automatic summarization of legal case judgements, which are known to be long and complex, has traditionally been tried via extractive summarization models. In recent years, generative models including abstractive summarization models and…

Computation and Language · Computer Science 2024-07-23 Aniket Deroy , Kripabandhu Ghosh , Saptarshi Ghosh

Adapted Large Language Models Can Outperform Medical Experts in Clinical Text Summarization

Analyzing vast textual data and summarizing key information from electronic health records imposes a substantial burden on how clinicians allocate their time. Although large language models (LLMs) have shown promise in natural language…

Computation and Language · Computer Science 2024-04-15 Dave Van Veen , Cara Van Uden , Louis Blankemeier , Jean-Benoit Delbrouck , Asad Aali , Christian Bluethgen , Anuj Pareek , Malgorzata Polacin , Eduardo Pontes Reis , Anna Seehofnerova , Nidhi Rohatgi , Poonam Hosamani , William Collins , Neera Ahuja , Curtis P. Langlotz , Jason Hom , Sergios Gatidis , John Pauly , Akshay S. Chaudhari