Related papers: Text Segmentation as a Supervised Learning Task

Structured Summarization: Unified Text Segmentation and Segment Labeling as a Generation Task

Text segmentation aims to divide text into contiguous, semantically coherent segments, while segment labeling deals with producing labels for each segment. Past work has shown success in tackling segmentation and labeling for documents and…

Computation and Language · Computer Science 2022-09-29 Hakan Inan , Rashi Rungta , Yashar Mehdad

Recent Trends in Linear Text Segmentation: a Survey

Linear Text Segmentation is the task of automatically tagging text documents with topic shifts, i.e. the places in the text where the topics change. A well-established area of research in Natural Language Processing, drawing from…

Computation and Language · Computer Science 2024-11-26 Iacopo Ghinassi , Lin Wang , Chris Newell , Matthew Purver

Segmenting Messy Text: Detecting Boundaries in Text Derived from Historical Newspaper Images

Text segmentation, the task of dividing a document into sections, is often a prerequisite for performing additional natural language processing tasks. Existing text segmentation methods have typically been developed and tested using clean,…

Computer Vision and Pattern Recognition · Computer Science 2023-12-21 Carol Anderson , Phil Crone

Text segmentation on multilabel documents: A distant-supervised approach

Segmenting text into semantically coherent segments is an important task with applications in information retrieval and text summarization. Developing accurate topical segmentation requires the availability of training data with ground…

Computation and Language · Computer Science 2019-04-16 Saurav Manchanda , George Karypis

From Text Segmentation to Smart Chaptering: A Novel Benchmark for Structuring Video Transcriptions

Text segmentation is a fundamental task in natural language processing, where documents are split into contiguous sections. However, prior research in this area has been constrained by limited datasets, which are either small in scale,…

Computation and Language · Computer Science 2024-02-28 Fabian Retkowski , Alexander Waibel

OntoSeg: a Novel Approach to Text Segmentation using Ontological Similarity

Text segmentation (TS) aims at dividing long text into coherent segments which reflect the subtopic structure of the text. It is beneficial to many natural language processing tasks, such as Information Retrieval (IR) and document…

Computation and Language · Computer Science 2015-11-30 Mostafa Bayomi , Killian Levacher , M. Rami Ghorab , Séamus Lawless

SelfDocSeg: A Self-Supervised vision-based Approach towards Document Segmentation

Document layout analysis is a known problem to the documents research community and has been vastly explored yielding a multitude of solutions ranging from text mining, and recognition to graph-based representation, visual feature…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Subhajit Maity , Sanket Biswas , Siladittya Manna , Ayan Banerjee , Josep Lladós , Saumik Bhattacharya , Umapada Pal

Toward Unifying Text Segmentation and Long Document Summarization

Text segmentation is important for signaling a document's structure. Without segmenting a long document into topically coherent sections, it is difficult for readers to comprehend the text, let alone find important information. The problem…

Computation and Language · Computer Science 2022-11-01 Sangwoo Cho , Kaiqiang Song , Xiaoyang Wang , Fei Liu , Dong Yu

Language Segmentation

Language segmentation consists in finding the boundaries where one language ends and another language begins in a text written in more than one language. This is important for all natural language processing tasks. The problem can be solved…

Computation and Language · Computer Science 2015-10-07 David Alfter

Text Classification: A Perspective of Deep Learning Methods

In recent years, with the rapid development of information on the Internet, the number of complex texts and documents has increased exponentially, which requires a deeper understanding of deep learning methods in order to accurately…

Computation and Language · Computer Science 2023-09-26 Zhongwei Wan

Automating Easy Read Text Segmentation

Easy Read text is one of the main forms of access to information for people with reading difficulties. One of the key characteristics of this type of text is the requirement to split sentences into smaller grammatical segments, to…

Computation and Language · Computer Science 2025-07-21 Jesús Calleja , Thierry Etchegoyhen , David Ponce

Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation

This paper addresses text-supervised semantic segmentation, aiming to learn a model capable of segmenting arbitrary visual concepts within images by using only image-text pairs without dense annotations. Existing methods have demonstrated…

Computer Vision and Pattern Recognition · Computer Science 2024-04-08 Ji-Jia Wu , Andy Chia-Hao Chang , Chieh-Yu Chuang , Chun-Pei Chen , Yu-Lun Liu , Min-Hung Chen , Hou-Ning Hu , Yung-Yu Chuang , Yen-Yu Lin

Self-supervised Scene Text Segmentation with Object-centric Layered Representations Augmented by Text Regions

Text segmentation tasks have a very wide range of application values, such as image editing, style transfer, watermark removal, etc.However, existing public datasets are of poor quality of pixel-level labels that have been shown to be…

Computer Vision and Pattern Recognition · Computer Science 2023-08-28 Yibo Wang , Yunhu Ye , Yuanpeng Mao , Yanwei Yu , Yuanping Song

Improving Segmentation for Technical Support Problems

Technical support problems are often long and complex. They typically contain user descriptions of the problem, the setup, and steps for attempted resolution. Often they also contain various non-natural language text elements like outputs…

Computation and Language · Computer Science 2020-05-25 Kushal Chauhan , Abhirut Gupta

Self-Supervised Learning for Image Segmentation: A Comprehensive Survey

Supervised learning demands large amounts of precisely annotated data to achieve promising results. Such data curation is labor-intensive and imposes significant overhead regarding time and costs. Self-supervised learning (SSL) partially…

Computer Vision and Pattern Recognition · Computer Science 2025-05-21 Thangarajah Akilan , Nusrat Jahan , Wandong Zhang

Text Line Segmentation of Historical Documents: a Survey

There is a huge amount of historical documents in libraries and in various National Archives that have not been exploited electronically. Although automatic reading of complete pages remains, in most cases, a long-term objective, tasks such…

Computer Vision and Pattern Recognition · Computer Science 2007-05-23 Laurence Likforman-Sulem , Abderrazak Zahour , Bruno Taconet

A Survey on Semi-Supervised Semantic Segmentation

Semantic segmentation is one of the most challenging tasks in computer vision. However, in many applications, a frequent obstacle is the lack of labeled images, due to the high cost of pixel-level labeling. In this scenario, it makes sense…

Computer Vision and Pattern Recognition · Computer Science 2023-02-21 Adrian Peláez-Vegas , Pablo Mesejo , Julián Luengo

Classifying text using machine learning models and determining conversation drift

Text classification helps analyse texts for semantic meaning and relevance, by mapping the words against this hierarchy. An analysis of various types of texts is invaluable to understanding both their semantic meaning, as well as their…

Machine Learning · Computer Science 2022-11-16 Chaitanya Chadha , Vandit Gupta , Deepak Gupta , Ashish Khanna

Training-Free Semantic Segmentation via LLM-Supervision

Recent advancements in open vocabulary models, like CLIP, have notably advanced zero-shot classification and segmentation by utilizing natural language for class-specific embeddings. However, most research has focused on improving model…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Wenfang Sun , Yingjun Du , Gaowen Liu , Ramana Kompella , Cees G. M. Snoek

Weakly-Supervised Text Instance Segmentation

Text segmentation is a challenging vision task with many downstream applications. Current text segmentation methods require pixel-level annotations, which are expensive in the cost of human labor and limited in application scenarios. In…

Computer Vision and Pattern Recognition · Computer Science 2023-03-24 Xinyan Zu , Haiyang Yu , Bin Li , Xiangyang Xue