Related papers: Genetic Programming for Document Segmentation and …

Document Image Coding and Clustering for Script Discrimination

The paper introduces a new method for discrimination of documents given in different scripts. The document is mapped into a uniformly coded text of numerical values. It is derived from the position of the letters in the text line, based on…

Computer Vision and Pattern Recognition · Computer Science 2016-09-22 Darko Brodic , Alessia Amelio , Zoran N. Milivojevic , Milena Jevtic

Page Segmentation using Visual Adjacency Analysis

Page segmentation is a web page analysis process that divides a page into cohesive segments, such as sidebars, headers, and footers. Current page segmentation approaches use either the DOM, textual content, or rendering style information of…

Computer Vision and Pattern Recognition · Computer Science 2021-12-23 Mohammad Bajammal , Ali Mesbah

Semi Automatic Color Segmentation of Document Pages

-This paper presents a semi automatic method used to segment color documents into different uniform color plans. The practical application is dedicated to administrative documents segmentation. In these documents, like in many other cases,…

Computer Vision and Pattern Recognition · Computer Science 2016-09-28 Stéphane Bres , Véronique Eglin , Vincent Poulain

Document classification methods

Information on different fields which are collected by users requires appropriate management and organization to be structured in a standard way and retrieved fast and more easily. Document classification is a conventional method to…

Information Retrieval · Computer Science 2019-09-18 Madjid Khalilian , Shiva Hassanzadeh

Text Segmentation as a Supervised Learning Task

Text segmentation, the task of dividing a document into contiguous segments based on its semantic structure, is a longstanding challenge in language understanding. Previous work on text segmentation focused on unsupervised methods such as…

Computation and Language · Computer Science 2018-03-28 Omri Koshorek , Adir Cohen , Noam Mor , Michael Rotman , Jonathan Berant

Robust Text Line Detection in Historical Documents: Learning and Evaluation Methods

Text line segmentation is one of the key steps in historical document understanding. It is challenging due to the variety of fonts, contents, writing styles and the quality of documents that have degraded through the years. In this paper,…

Computer Vision and Pattern Recognition · Computer Science 2022-10-24 Mélodie Boillet , Christopher Kermorvant , Thierry Paquet

Image Colour Segmentation by Genetic Algorithms

Segmentation of a colour image composed of different kinds of texture regions can be a hard problem, namely to compute for an exact texture fields and a decision of the optimum number of segmentation areas in an image when it contains…

Artificial Intelligence · Computer Science 2007-05-23 Vitorino Ramos , Fernando Muge

A view of computational models for image segmentation

Image segmentation is a central topic in image processing and computer vision and a key issue in many applications, e.g., in medical imaging, microscopy, document analysis and remote sensing. According to the human perception, image…

Numerical Analysis · Mathematics 2022-06-24 Laura Antonelli , Valentina De Simone , Daniela di Serafino

Scene Parsing with Multiscale Feature Learning, Purity Trees, and Optimal Covers

Scene parsing, or semantic segmentation, consists in labeling each pixel in an image with the category of the object it belongs to. It is a challenging task that involves the simultaneous detection, segmentation and recognition of all the…

Computer Vision and Pattern Recognition · Computer Science 2015-06-09 Clément Farabet , Camille Couprie , Laurent Najman , Yann LeCun

Leveraging GenAI for Segmenting and Labeling Centuries-old Technical Documents

Image segmentation and image recognition are well established computational techniques in the broader discipline of image processing. Segmentation allows to locate areas in an image, while recognition identifies specific objects within an…

Computer Vision and Pattern Recognition · Computer Science 2026-03-04 Carlos Monroy , Benjamin Navarro

Convolutional Neural Networks for Page Segmentation of Historical Document Images

This paper presents a Convolutional Neural Network (CNN) based page segmentation method for handwritten historical document images. We consider page segmentation as a pixel labeling problem, i.e., each pixel is classified as one of the…

Computer Vision and Pattern Recognition · Computer Science 2017-04-10 Kai Chen , Mathias Seuret

Labeling, Cutting, Grouping: an Efficient Text Line Segmentation Method for Medieval Manuscripts

This paper introduces a new way for text-line extraction by integrating deep-learning based pre-classification and state-of-the-art segmentation methods. Text-line extraction in complex handwritten documents poses a significant challenge,…

Computer Vision and Pattern Recognition · Computer Science 2019-07-02 Michele Alberti , Lars Vögtlin , Vinaychandran Pondenkandath , Mathias Seuret , Rolf Ingold , Marcus Liwicki

DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning

Document image segmentation is crucial for document analysis and recognition but remains challenging due to the diversity of document formats and segmentation tasks. Existing methods often address these tasks separately, resulting in…

Computer Vision and Pattern Recognition · Computer Science 2025-04-08 Xiao-Hui Li , Fei Yin , Cheng-Lin Liu

Combining Morphological and Histogram based Text Line Segmentation in the OCR Context

Text line segmentation is one of the pre-stages of modern optical character recognition systems. The algorithmic approach proposed by this paper has been designed for this exact purpose. Its main characteristic is the combination of two…

Computer Vision and Pattern Recognition · Computer Science 2023-06-22 Pit Schneider

Segmenting Messy Text: Detecting Boundaries in Text Derived from Historical Newspaper Images

Text segmentation, the task of dividing a document into sections, is often a prerequisite for performing additional natural language processing tasks. Existing text segmentation methods have typically been developed and tested using clean,…

Computer Vision and Pattern Recognition · Computer Science 2023-12-21 Carol Anderson , Phil Crone

Distraction-Based Neural Networks for Document Summarization

Distributed representation learned with neural networks has recently shown to be effective in modeling natural languages at fine granularities such as words, phrases, and even sentences. Whether and how such an approach can be extended to…

Computation and Language · Computer Science 2016-10-27 Qian Chen , Xiaodan Zhu , Zhenhua Ling , Si Wei , Hui Jiang

Automatic Page Segmentation Without Decompressing the Run-Length Compressed Text Documents

Page segmentation is considered to be the crucial stage for the automatic analysis of documents with complex layouts. This has traditionally been carried out in uncompressed documents, although most of the documents in real life exist in a…

Computer Vision and Pattern Recognition · Computer Science 2020-07-03 Mohammed Javed , P. Nagabhushan

A method for the segmentation of images based on thresholding and applied to vesicular textures

In image processing, a segmentation is a process of partitioning an image into multiple sets of pixels, that are defined as super-pixels. Each super-pixel is characterized by a label or parameter. Here, we are proposing a method for…

Computer Vision and Pattern Recognition · Computer Science 2016-12-06 Amelia Carolina Sparavigna

Structured Summarization: Unified Text Segmentation and Segment Labeling as a Generation Task

Text segmentation aims to divide text into contiguous, semantically coherent segments, while segment labeling deals with producing labels for each segment. Past work has shown success in tackling segmentation and labeling for documents and…

Computation and Language · Computer Science 2022-09-29 Hakan Inan , Rashi Rungta , Yashar Mehdad

A comprehensive review and new taxonomy on superpixel segmentation

Superpixel segmentation consists of partitioning images into regions composed of similar and connected pixels. Its methods have been widely used in many computer vision applications since it allows for reducing the workload, removing…

Computer Vision and Pattern Recognition · Computer Science 2024-10-01 I. B. Barcelos , F. de C. Belém , L. de M. João , Z. K. G. do Patrocínio , A. X. Falcão , S. J. F. Guimarães