Related papers: Intention-based Segmentation: Human Reliability an…

LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning

Understanding human instructions to identify the target objects is vital for perception systems. In recent years, the advancements of Large Language Models (LLMs) have introduced new possibilities for image segmentation. In this work, we…

Computer Vision and Pattern Recognition · Computer Science 2024-04-16 Junchi Wang, Lei Ke

Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing

We introduce Segment-Phrase Table (SPT), a large collection of bijective associations between textual phrases and their corresponding segmentations. Leveraging recent progress in object recognition and natural language semantics, we show…

Computer Vision and Pattern Recognition · Computer Science 2015-09-29 Hamid Izadinia, Fereshteh Sadeghi, Santosh Kumar Divvala, Yejin Choi, Ali Farhadi

Don't Discard Fixed-Window Audio Segmentation in Speech-to-Text Translation

For real-life applications, it is crucial that end-to-end spoken language translation models perform well on continuous audio, without relying on human-supplied segmentation. For online spoken language translation, where models need to…

Computation and Language · Computer Science 2022-10-25 Chantal Amrhein, Barry Haddow

Machines Getting with the Program: Understanding Intent Arguments of Non-Canonical Directives

Modern dialog managers face the challenge of having to fulfill human-level conversational skills as part of common user expectations, including but not limited to discourse with no clear objective. Along with these requirements, agents are…

Computation and Language · Computer Science 2020-10-08 Won Ik Cho, Young Ki Moon, Sangwhan Moon, Seok Min Kim, Nam Soo Kim

Revisiting Conversation Discourse for Dialogue Disentanglement

Dialogue disentanglement aims to detach the chronologically ordered utterances into several independent sessions. Conversation utterances are essentially organized and described by the underlying discourse, and thus dialogue disentanglement…

Computation and Language · Computer Science 2023-06-13 Bobo Li, Hao Fei, Fei Li, Shengqiong Wu, Lizi Liao, Yinwei Wei, Tat-Seng Chua, Donghong Ji

A statistical model for word discovery in child directed speech

A statistical model for segmentation and word discovery in child directed speech is presented. An incremental unsupervised learning algorithm to infer word boundaries based on this model is described and results of empirical tests showing…

Computation and Language · Computer Science 2007-05-23 Anand Venkataraman

TransSent: Towards Generation of Structured Sentences with Discourse Marker

Structured sentences are important expressions in human writings and dialogues. Previous works on neural text generation fused semantic and structural information by encoding the entire sentence into a mixed hidden representation. However,…

Computation and Language · Computer Science 2020-05-11 Xing Wu, Dongjun Wei, Liangjun Zang, Jizhong Han, Songlin Hu

Measuring Semantic Coherence of a Conversation

Conversational systems have become increasingly popular as a way for humans to interact with computers. To be able to provide intelligent responses, conversational systems must correctly model the structure and semantics of a conversation.…

Computation and Language · Computer Science 2018-06-19 Svitlana Vakulenko, Maarten de Rijke, Michael Cochez, Vadim Savenkov, Axel Polleres

Discourse Structure in Machine Translation Evaluation

In this article, we explore the potential of using sentence-level discourse structure for machine translation evaluation. We first design discourse-aware similarity measures, which use all-subtree kernels to compare discourse parse trees in…

Computation and Language · Computer Science 2017-10-05 Shafiq Joty, Francisco Guzmán, Lluís Màrquez, Preslav Nakov

Iterative Utterance Segmentation for Neural Semantic Parsing

Neural semantic parsers usually fail to parse long and complex utterances into correct meaning representations, due to the lack of exploiting the principle of compositionality. To address this issue, we present a novel framework for…

Computation and Language · Computer Science 2020-12-15 Yinuo Guo, Zeqi Lin, Jian-Guang Lou, Dongmei Zhang

Unsupervised Word Segmentation from Speech with Attention

We present a first attempt to perform attentional word segmentation directly from the speech signal, with the final goal to automatically identify lexical units in a low-resource, unwritten language (UL). Our methodology assumes a pairing…

Computation and Language · Computer Science 2018-06-19 Pierre Godard, Marcely Zanon-Boito, Lucas Ondel, Alexandre Berard, François Yvon, Aline Villavicencio, Laurent Besacier

Event Segmentation Applications in Large Language Model Enabled Automated Recall Assessments

Understanding how individuals perceive and recall information in their natural environments is critical to understanding potential failures in perception (e.g., sensory loss) and memory (e.g., dementia). Event segmentation, the process of…

Computation and Language · Computer Science 2025-10-20 Ryan A. Panela, Alex J. Barnett, Morgan D. Barense, Björn Herrmann

Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study

Speech signals, typically sampled at rates in the tens of thousands per second, contain redundancies, evoking inefficiencies in sequence modeling. High-dimensional speech features such as spectrograms are often used as the input for the…

Computation and Language · Computer Science 2023-09-28 Xuankai Chang, Brian Yan, Kwanghee Choi, Jeeweon Jung, Yichen Lu, Soumi Maiti, Roshan Sharma, Jiatong Shi, Jinchuan Tian, Shinji Watanabe, Yuya Fujita, Takashi Maekaku, Pengcheng Guo, Yao-Fei Cheng, Pavel Denisov, Kohei Saijo, Hsiu-Hsuan Wang

Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation

Providing emotional support through dialogue systems is becoming increasingly important in today's world, as it can support both mental health and social interactions in many conversation scenarios. Previous works have shown that using…

Computation and Language · Computer Science 2024-03-08 Seunghee Han, Se Jin Park, Chae Won Kim, Yong Man Ro

Speaker Extraction with Co-Speech Gestures Cue

Speaker extraction seeks to extract the clean speech of a target speaker from a multi-talker mixture speech. There have been studies to use a pre-recorded speech sample or face image of the target speaker as the speaker cue. In human…

Audio and Speech Processing · Electrical Eng. & Systems 2022-07-20 Zexu Pan, Xinyuan Qian, Haizhou Li

Understanding User Intent Modeling for Conversational Recommender Systems: A Systematic Literature Review

Context: User intent modeling is a crucial process in Natural Language Processing that aims to identify the underlying purpose behind a user's request, enabling personalized responses. With a vast array of approaches introduced in the…

Information Retrieval · Computer Science 2023-08-17 Siamak Farshidi, Kiyan Rezaee, Sara Mazaheri, Amir Hossein Rahimi, Ali Dadashzadeh, Morteza Ziabakhsh, Sadegh Eskandari, Slinger Jansen

CORA: Consistency-Guided Semi-Supervised Framework for Reasoning Segmentation

Reasoning segmentation seeks pixel-accurate masks for targets referenced by complex, often implicit instructions, requiring context-dependent reasoning over the scene. Recent multimodal language models have advanced instruction following…

Computer Vision and Pattern Recognition · Computer Science 2025-11-25 Prantik Howlader, Hoang Nguyen-Canh, Srijan Das, Jingyi Xu, Hieu Le, Dimitris Samaras

Generating Segment Durations in a Text-To-Speech System: A Hybrid Rule-Based/Neural Network Approach

A combination of a neural network with rule firing information from a rule-based system is used to generate segment durations for a text-to-speech system. The system shows a slight improvement in performance over a neural network system…

Neural and Evolutionary Computing · Computer Science 2007-05-23 Gerald Corrigan, Noel Massey, Orhan Karaali

Evaluating Discourse in Structured Text Representations

Discourse structure is integral to understanding a text and is helpful in many NLP tasks. Learning latent representations of discourse is an attractive alternative to acquiring expensive labeled discourse data. Liu and Lapata (2018) propose…

Computation and Language · Computer Science 2019-06-11 Elisa Ferracane, Greg Durrett, Junyi Jessy Li, Katrin Erk

Towards Trustworthy Explanation: On Causal Rationalization

With recent advances in natural language processing, rationalization becomes an essential self-explaining diagram to disentangle the black box by selecting a subset of input texts to account for the major variation in prediction. Yet,…

Machine Learning · Computer Science 2023-09-12 Wenbo Zhang, Tong Wu, Yunlong Wang, Yong Cai, Hengrui Cai