Related papers: Task-Oriented Paraphrase Analytics

What Makes a Good Paraphrase: Do Automated Evaluations Work?

Paraphrasing is the task of expressing an essential idea or meaning in different words. But how different should the words be in order to be considered an acceptable paraphrase? And can we exclusively use automated metrics to evaluate the…

Computation and Language · Computer Science 2023-07-28 Anna Moskvina , Bhushan Kotnis , Chris Catacata , Michael Janz , Nasrin Saef

Cross-lingual paraphrase identification

The paraphrase identification task involves measuring semantic similarity between two short sentences. It is a tricky task, and multilingual paraphrase identification is even more challenging. In this work, we train a bi-encoder model in a…

Computation and Language · Computer Science 2024-06-24 Inessa Fedorova , Aleksei Musatow

Paraphrase Types for Generation and Detection

Current approaches in paraphrase generation and detection heavily rely on a single general similarity score, ignoring the intricate linguistic properties of language. This paper introduces two new tasks to address this shortcoming by…

Computation and Language · Computer Science 2024-07-17 Jan Philip Wahle , Bela Gipp , Terry Ruas

Crowdsourcing Diverse Paraphrases for Training Task-oriented Bots

A prominent approach to build datasets for training task-oriented bots is crowd-based paraphrasing. Current approaches, however, assume the crowd would naturally provide diverse paraphrases or focus only on lexical diversity. In this WiP we…

Computation and Language · Computer Science 2021-09-21 Jorge Ramírez , Auday Berro , Marcos Baez , Boualem Benatallah , Fabio Casati

Understanding Task Design Trade-offs in Crowdsourced Paraphrase Collection

Linguistically diverse datasets are critical for training and evaluating robust machine learning systems, but data collection is a costly process that often requires experts. Crowdsourcing the process of paraphrase generation is an…

Computation and Language · Computer Science 2020-06-05 Youxuan Jiang , Jonathan K. Kummerfeld , Walter S. Lasecki

PARAPHRASUS : A Comprehensive Benchmark for Evaluating Paraphrase Detection Models

The task of determining whether two texts are paraphrases has long been a challenge in NLP. However, the prevailing notion of paraphrase is often quite simplistic, offering only a limited view of the vast spectrum of paraphrase phenomena.…

Computation and Language · Computer Science 2024-12-17 Andrianos Michail , Simon Clematide , Juri Opitz

Pointwise Paraphrase Appraisal is Potentially Problematic

The prevailing approach for training and evaluating paraphrase identification models is constructed as a binary classification problem: the model is given a pair of sentences, and is judged by how accurately it classifies pairs as either…

Computation and Language · Computer Science 2020-06-25 Hannah Chen , Yangfeng Ji , David Evans

GAPX: Generalized Autoregressive Paraphrase-Identification X

Paraphrase Identification is a fundamental task in Natural Language Processing. While much progress has been made in the field, the performance of many state-of-the-art models often suffer from distribution shift during inference time. We…

Computation and Language · Computer Science 2022-10-06 Yifei Zhou , Renyu Li , Hayden Housen , Ser-Nam Lim

Exploring Multitask Learning for Low-Resource AbstractiveSummarization

This paper explores the effect of using multitask learning for abstractive summarization in the context of small training corpora. In particular, we incorporate four different tasks (extractive summarization, language modeling, concept…

Computation and Language · Computer Science 2021-09-20 Ahmed Magooda , Mohamed Elaraby , Diane Litman

Paraphrasing, textual entailment, and semantic similarity above word level

This dissertation explores the linguistic and computational aspects of the meaning relations that can hold between two or more complex linguistic expressions (phrases, clauses, sentences, paragraphs). In particular, it focuses on…

Computation and Language · Computer Science 2022-08-11 Venelin Kovatchev

Reverse Probing: Evaluating Knowledge Transfer via Finetuned Task Embeddings for Coreference Resolution

In this work, we reimagine classical probing to evaluate knowledge transfer from simple source to more complex target tasks. Instead of probing frozen representations from a complex source task on diverse simple target probing tasks (as…

Computation and Language · Computer Science 2025-03-24 Tatiana Anikina , Arne Binder , David Harbecke , Stalin Varanasi , Leonhard Hennig , Simon Ostermann , Sebastian Möller , Josef van Genabith

LAMPAT: Low-Rank Adaption for Multilingual Paraphrasing Using Adversarial Training

Paraphrases are texts that convey the same meaning while using different words or sentence structures. It can be used as an automatic data augmentation tool for many Natural Language Processing tasks, especially when dealing with…

Computation and Language · Computer Science 2024-06-25 Khoi M. Le , Trinh Pham , Tho Quan , Anh Tuan Luu

Paragraph-based Transformer Pre-training for Multi-Sentence Inference

Inference tasks such as answer sentence selection (AS2) or fact verification are typically solved by fine-tuning transformer-based models as individual sentence-pair classifiers. Recent studies show that these tasks benefit from modeling…

Computation and Language · Computer Science 2022-07-08 Luca Di Liello , Siddhant Garg , Luca Soldaini , Alessandro Moschitti

Distributional semantics beyond words: Supervised learning of analogy and paraphrase

There have been several efforts to extend distributional semantics beyond individual words, to measure the similarity of word pairs, phrases, and sentences (briefly, tuples; ordered sets of words, contiguous or noncontiguous). One way to…

Machine Learning · Computer Science 2013-10-21 Peter D. Turney

Revision for Concision: A Constrained Paraphrase Generation Task

Academic writing should be concise as concise sentences better keep the readers' attention and convey meaning clearly. Writing concisely is challenging, for writers often struggle to revise their drafts. We introduce and formulate revising…

Computation and Language · Computer Science 2022-10-27 Wenchuan Mu , Kwan Hui Lim

Cross-domain Semantic Parsing via Paraphrasing

Existing studies on semantic parsing mainly focus on the in-domain setting. We formulate cross-domain semantic parsing as a domain adaptation problem: train a semantic parser on some source domains and then adapt it to the target domain.…

Computation and Language · Computer Science 2017-07-26 Yu Su , Xifeng Yan

Universal Model for Paraphrasing -- Using Transformation Based on a Defined Criteria --

This paper describes a universal model for paraphrasing that transforms according to defined criteria. We showed that by using different criteria we could construct different kinds of paraphrasing systems including one for answering…

Computation and Language · Computer Science 2007-05-23 Masaki Murata , Hitoshi Isahara

Exemplar-Controllable Paraphrasing and Translation using Bitext

Most prior work on exemplar-based syntactically controlled paraphrase generation relies on automatically-constructed large-scale paraphrase datasets, which are costly to create. We sidestep this prerequisite by adapting models from prior…

Computation and Language · Computer Science 2021-09-21 Mingda Chen , Sam Wiseman , Kevin Gimpel

Learning to Paraphrase Sentences to Different Complexity Levels

While sentence simplification is an active research topic in NLP, its adjacent tasks of sentence complexification and same-level paraphrasing are not. To train models on all three tasks, we present two new unsupervised datasets. We compare…

Computation and Language · Computer Science 2023-11-22 Alison Chi , Li-Kuang Chen , Yi-Chen Chang , Shu-Hui Lee , Jason S. Chang

Language Modeling and Understanding Through Paraphrase Generation and Detection

Language enables humans to share knowledge, reason about the world, and pass on strategies for survival and innovation across generations. At the heart of this process is not just the ability to communicate but also the remarkable…

Computation and Language · Computer Science 2026-02-25 Jan Philip Wahle