Related papers: Intention-based Segmentation: Human Reliability an…

A Focused Study on Sequence Length for Dialogue Summarization

Output length is critical to dialogue summarization systems. The dialogue summary length is determined by multiple factors, including dialogue complexity, summary objective, and personal preferences. In this work, we approach dialogue…

Computation and Language · Computer Science 2022-10-28 Bin Wang, Chen Zhang, Chengwei Wei, Haizhou Li

Improving Implicit Discourse Relation Classification by Modeling Inter-dependencies of Discourse Units in a Paragraph

We argue that semantic meanings of a sentence or clause cannot be interpreted independently from the rest of a paragraph, or independently from all discourse relations and the overall paragraph-level discourse structure. With the goal of…

Computation and Language · Computer Science 2018-04-18 Zeyu Dai, Ruihong Huang

A Novel Corpus of Discourse Structure in Humans and Computers

We present a novel corpus of 445 human- and computer-generated documents, comprising about 27,000 clauses, annotated for semantic clause types and coherence relations that allow for nuanced comparison of artificial and natural discourse…

Computation and Language · Computer Science 2021-11-12 Babak Hemmatian, Sheridan Feucht, Rachel Avram, Alexander Wey, Muskaan Garg, Kate Spitalnic, Carsten Eickhoff, Ellie Pavlick, Bjorn Sandstede, Steven Sloman

Linguistically Motivated Sign Language Segmentation

Sign language segmentation is a crucial task in sign language processing systems. It enables downstream tasks such as sign recognition, transcription, and machine translation. In this work, we consider two kinds of segmentation:…

Computation and Language · Computer Science 2023-10-31 Amit Moryossef, Zifan Jiang, Mathias Müller, Sarah Ebling, Yoav Goldberg

Towards Understanding Spontaneous Speech: Word Accuracy vs. Concept Accuracy

In this paper we describe an approach to automatic evaluation of both the speech recognition and understanding capabilities of a spoken dialogue system for train time table information. We use word accuracy for recognition and concept…

cmp-lg · Computer Science 2008-02-03 M. Boros, W. Eckert, F. Gallwitz, G. Goerz, G. Hanrieder, H. Niemann

Computational Sentence-level Metrics Predicting Human Sentence Comprehension

The majority of research in computational psycholinguistics has concentrated on the processing of words. This study introduces innovative methods for computing sentence-level metrics using multilingual large language models. The metrics…

Computation and Language · Computer Science 2024-04-17 Kun Sun, Rong Wang

Unsupervised Induction of Contingent Event Pairs from Film Scenes

Human engagement in narrative is partially driven by reasoning about discourse relations between narrative events, and the expectations about what is likely to happen next that result from such reasoning. Researchers in NLP have tackled…

Computation and Language · Computer Science 2017-09-01 Zhichao Hu, Elahe Rahimtoroghi, Larissa Munishkina, Reid Swanson, Marilyn A. Walker

Emergent Linguistic Rules from Inducing Decision Trees: Disambiguating Discourse Clue Words

We apply decision tree induction to the problem of discourse clue word sense disambiguation with a genetic algorithm. The automatic partitioning of the training set which is intrinsic to decision tree induction gives rise to linguistically…

cmp-lg · Computer Science 2008-02-03 Eric V. Siegel, Kathleen R. McKeown

A Neural Network-Based Linguistic Similarity Measure for Entrainment in Conversations

Linguistic entrainment is a phenomenon where people tend to mimic each other in conversation. The core instrument to quantify entrainment is a linguistic similarity measure between conversational partners. Most of the current similarity…

Computation and Language · Computer Science 2021-09-07 Mingzhi Yu, Diane Litman, Shuang Ma, Jian Wu

SPECTRA: Sparse Structured Text Rationalization

Selective rationalization aims to produce decisions along with rationales (e.g., text highlights or word alignments between two sentences). Commonly, rationales are modeled as stochastic binary masks, requiring sampling-based gradient…

Computation and Language · Computer Science 2021-09-13 Nuno Miguel Guerreiro, André F. T. Martins

Discourse Coherence, Reference Grounding and Goal Oriented Dialogue

Prior approaches to realizing mixed-initiative human-computer referential communication have adopted information-state or collaborative problem-solving approaches. In this paper, we argue for a new approach, inspired by coherence-based…

Computation and Language · Computer Science 2020-07-10 Baber Khalid, Malihe Alikhani, Michael Fellner, Brian McMahan, Matthew Stone

Explainable Human-in-the-Loop Segmentation via Critic Feedback Signals

Segmentation models achieve high accuracy on benchmarks but often fail in real-world domains by relying on spurious correlations instead of true object boundaries. We propose a human-in-the-loop interactive framework that enables…

Computer Vision and Pattern Recognition · Computer Science 2025-10-14 Pouya Shaeri, Ryan T. Woo, Yasaman Mohammadpour, Ariane Middel

An Attention-Based Model for Predicting Contextual Informativeness and Curriculum Learning Applications

Both humans and machines learn the meaning of unknown words through contextual information in a sentence, but not all contexts are equally helpful for learning. We introduce an effective method for capturing the level of contextual…

Computation and Language · Computer Science 2023-11-10 Sungjin Nam, David Jurgens, Gwen Frishkoff, Kevyn Collins-Thompson

In Tree Structure Should Sentence Be Generated

Generative models reliant on sequential autoregression have been at the forefront of language generation for an extensive period, particularly following the introduction of widely acclaimed transformers. Despite its excellent performance,…

Computation and Language · Computer Science 2024-06-21 Yaguang Li, Xin Chen

Automatically Selecting Useful Phrases for Dialogue Act Tagging

We present an empirical investigation of various ways to automatically identify phrases in a tagged corpus that are useful for dialogue act tagging. We found that a new method (which measures a phrase's deviation from an…

Artificial Intelligence · Computer Science 2007-05-23 Ken Samuel, Sandra Carberry, K. Vijay-Shanker

Sequence-dependent sensitivity explains the accuracy of decisions when cues are separated with a gap

Most decisions require information gathering from a stimulus presented with different gaps. Indeed, the brain process of this integration is rarely ambiguous. Recently, it has been claimed that humans can optimally integrate the information…

Neurons and Cognition · Quantitative Biology 2018-10-29 Maryam Tohidi-Moghaddam, Sajjad Zabbah, Farzaneh Olianezhad, Reza Ebrahimpour

Evaluating topic coherence measures

Topic models extract representative word sets - called topics - from word counts in documents without requiring any semantic annotations. Topics are not guaranteed to be well interpretable; therefore, coherence measures have been proposed…

Machine Learning · Computer Science 2014-03-26 Frank Rosner, Alexander Hinneburg, Michael Röder, Martin Nettling, Andreas Both

Speech Segmentation Optimization using Segmented Bilingual Speech Corpus for End-to-end Speech Translation

Speech segmentation, which splits long speech into short segments, is essential for speech translation (ST). Popular VAD tools like WebRTC VAD have generally relied on pause-based segmentation. Unfortunately, pauses in speech do not…

Computation and Language · Computer Science 2022-07-14 Ryo Fukuda, Katsuhito Sudoh, Satoshi Nakamura

Identifying Breakdowns in Conversational Recommender Systems using User Simulation

We present a methodology to systematically test conversational recommender systems with regard to conversational breakdowns. It involves examining conversations generated between the system and simulated users for a set of pre-defined…

Information Retrieval · Computer Science 2024-05-24 Nolwenn Bernard, Krisztian Balog

Strings from the Library of Babel: Random Sampling as a Strong Baseline for Prompt Optimisation

Recent prompt optimisation approaches use the generative nature of language models to produce prompts, even rivaling the performance of human-curated prompts. In this paper, we demonstrate that randomly sampling tokens from the model…

Computation and Language · Computer Science 2024-04-18 Yao Lu, Jiayi Wang, Raphael Tang, Sebastian Riedel, Pontus Stenetorp