Related papers: PoMo: Generating Entity-Specific Post-Modifiers in…

MOPO: Multi-Objective Prompt Optimization for Affective Text Generation

How emotions are expressed depends on the context and domain. On X (formerly Twitter), for instance, an author might simply use the hashtag #anger, while in a news headline, emotions are typically written in a more polite, indirect manner.…

Computation and Language · Computer Science 2024-12-18 Yarik Menchaca Resendiz, Roman Klinger

The ApposCorpus: A new multilingual, multi-domain dataset for factual appositive generation

News articles, image captions, product reviews and many other texts mention people and organizations whose name recognition could vary for different audiences. In such cases, background information about the named entities could be provided…

Computation and Language · Computer Science 2020-11-09 Yova Kementchedjhieva, Di Lu, Joel Tetreault

Improving Factual Consistency in Summarization with Compression-Based Post-Editing

State-of-the-art summarization models still struggle to be factually consistent with the input text. A model-agnostic way to address this problem is post-editing the generated summaries. However, existing approaches typically fail to remove…

Computation and Language · Computer Science 2022-11-14 Alexander R. Fabbri, Prafulla Kumar Choubey, Jesse Vig, Chien-Sheng Wu, Caiming Xiong

POQue: Asking Participant-specific Outcome Questions for a Deeper Understanding of Complex Events

Knowledge about outcomes is critical for complex event understanding but is hard to acquire. We show that by pre-identifying a participant in a complex event, crowd workers are able to (1) infer the collective impact of salient events that…

Computation and Language · Computer Science 2022-12-07 Sai Vallurupalli, Sayontan Ghosh, Katrin Erk, Niranjan Balasubramanian, Francis Ferraro

Modular Techniques for Synthetic Long-Context Data Generation in Language Model Training and Evaluation

The ability of large language models (LLMs) to process and reason over long textual inputs is critical for a wide range of real-world applications. However, progress in this area is significantly constrained by the absence of high-quality,…

Computation and Language · Computer Science 2025-09-05 Seganrasan Subramanian, Abhigya Verma

A Dataset for Tracking Entities in Open Domain Procedural Text

We present the first dataset for tracking state changes in procedural text from arbitrary domains by using an unrestricted (open) vocabulary. For example, in a text describing fog removal using potatoes, a car window may transition between…

Computation and Language · Computer Science 2020-11-17 Niket Tandon, Keisuke Sakaguchi, Bhavana Dalvi Mishra, Dheeraj Rajagopal, Peter Clark, Michal Guerquin, Kyle Richardson, Eduard Hovy

Modelling Adjectival Modification Effects on Semantic Plausibility

While the task of assessing the plausibility of events such as "news is relevant" has been addressed by a growing body of work, less attention has been paid to capturing changes in plausibility as triggered by event modification.…

Computation and Language · Computer Science 2025-07-30 Anna Golub, Beate Zywietz, Annerose Eichel

Modeling Preconditions in Text with a Crowd-sourced Dataset

Preconditions provide a form of logical connection between events that explains why some events occur together and information that is complementary to the more widely studied relations such as causation, temporal ordering, entailment, and…

Computation and Language · Computer Science 2020-10-15 Heeyoung Kwon, Mahnaz Koupaee, Pratyush Singh, Gargi Sawhney, Anmol Shukla, Keerthi Kumar Kallur, Nathanael Chambers, Niranjan Balasubramanian

Context-Situated Pun Generation

Previous work on pun generation commonly begins with a given pun word (a pair of homophones for heterographic pun generation and a polyseme for homographic pun generation) and seeks to generate an appropriate pun. While this may enable…

Computation and Language · Computer Science 2022-10-26 Jiao Sun, Anjali Narayan-Chen, Shereen Oraby, Shuyang Gao, Tagyoung Chung, Jing Huang, Yang Liu, Nanyun Peng

#MeTooMA: Multi-Aspect Annotations of Tweets Related to the MeToo Movement

In this paper, we present a dataset containing 9,973 tweets related to the MeToo movement that were manually annotated for five different linguistic aspects: relevance, stance, hate speech, sarcasm, and dialogue acts. We present a detailed…

Computation and Language · Computer Science 2020-04-21 Akash Gautam, Puneet Mathur, Rakesh Gosangi, Debanjan Mahata, Ramit Sawhney, Rajiv Ratn Shah

Transform, Contrast and Tell: Coherent Entity-Aware Multi-Image Captioning

Coherent entity-aware multi-image captioning aims to generate coherent captions for neighboring images in a news document. There are coherence relationships among neighboring images because they often describe the same entities or events. These…

Computer Vision and Pattern Recognition · Computer Science 2023-11-30 Jingqiang Chen

Improving Large-scale Paraphrase Acquisition and Generation

This paper addresses the quality issues in existing Twitter-based paraphrase datasets, and discusses the necessity of using two separate definitions of paraphrase for identification and generation tasks. We present a new Multi-Topic…

Computation and Language · Computer Science 2022-11-09 Yao Dou, Chao Jiang, Wei Xu

Mention Memory: incorporating textual knowledge into Transformers through entity mention attention

Natural language understanding tasks such as open-domain question answering often require retrieving and assimilating factual information from multiple sources. We propose to address this problem by integrating a semi-parametric…

Computation and Language · Computer Science 2022-04-21 Michiel de Jong, Yury Zemlyanskiy, Nicholas FitzGerald, Fei Sha, William Cohen

Measuring the Effect of Influential Messages on Varying Personas

Predicting how a user responds to news events enables important applications such as allowing intelligent agents or content producers to estimate the effect on different communities and revise unreleased messages to prevent unexpected bad…

Computation and Language · Computer Science 2023-05-29 Chenkai Sun, Jinning Li, Hou Pong Chan, ChengXiang Zhai, Heng Ji

Understanding Politics via Contextualized Discourse Processing

Politicians often have underlying agendas when reacting to events. Arguments in contexts of various events reflect a fairly consistent set of agendas for a given entity. In spite of recent advances in Pretrained Language Models (PLMs),…

Computation and Language · Computer Science 2021-09-20 Rajkumar Pujari, Dan Goldwasser

The Fellowship of the LLMs: Multi-Model Workflows for Synthetic Preference Optimization Dataset Generation

This paper presents a novel methodology for generating synthetic Preference Optimization (PO) datasets using multi-model workflows. We evaluate the effectiveness and potential of these workflows in automating and enhancing the dataset…

Computation and Language · Computer Science 2025-08-18 Samee Arif, Sualeha Farid, Abdul Hameed Azeemi, Awais Athar, Agha Ali Raza

Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models

While Multi-modal Language Models (MLMs) demonstrate impressive multimodal ability, they still struggle to provide factual and precise responses for tasks like visual question answering (VQA). In this paper, we address this challenge from…

Artificial Intelligence · Computer Science 2023-12-13 Shitian Zhao, Zhuowan Li, Yadong Lu, Alan Yuille, Yan Wang

Word Usage Similarity Estimation with Sentence Representations and Automatic Substitutes

Usage similarity estimation addresses the semantic proximity of word instances in different contexts. We apply contextualized (ELMo and BERT) word and sentence embeddings to this task, and propose supervised models that leverage these…

Computation and Language · Computer Science 2019-05-22 Aina Garí Soler, Marianna Apidianaki, Alexandre Allauzen

Dynamic Social Media Monitoring for Fast-Evolving Online Discussions

Tracking and collecting fast-evolving online discussions provides vast data for studying social media usage and its role in people's public lives. However, collecting social media data using a static set of keywords fails to satisfy the…

Social and Information Networks · Computer Science 2021-02-26 Maya Srikanth, Anqi Liu, Nicholas Adams-Cohen, Jian Cao, R. Michael Alvarez, Anima Anandkumar

Detecting Future-related Contexts of Entity Mentions

The ability to automatically identify whether an entity is referenced in a future context can have multiple applications including decision making, planning and trend forecasting. This paper focuses on detecting implicit future references…

Computation and Language · Computer Science 2025-02-24 Puneet Prashar, Krishna Mohan Shukla, Adam Jatowt