English
Related papers

Related papers: Efficient Document Image Classification Using Regi…

200 papers

In this work, a region-based Deep Convolutional Neural Network framework is proposed for document structure learning. The contribution of this work involves efficient training of region based classifiers and effective ensembling for…

Computer Vision and Pattern Recognition · Computer Science 2018-09-03 Arindam Das , Saikat Roy , Ujjwal Bhattacharya , Swapan Kumar Parui

Text classification is a fundamental task in NLP applications. Latest research in this field has largely been divided into two major sub-fields. Learning representations is one sub-field and learning deeper models, both sequential and…

Computation and Language · Computer Science 2018-11-09 Mithun Das Gupta

Document structure analysis, such as zone segmentation and table recognition, is a complex problem in document processing and is an active area of research. The recent success of deep learning in solving various computer vision and machine…

Computer Vision and Pattern Recognition · Computer Science 2019-07-04 Shah Rukh Qasim , Hassan Mahmood , Faisal Shafait

For management, documents are categorized into a specific category, and to do these, most of the organizations use manual labor. In today's automation era, manual efforts on such a task are not justified, and to avoid this, we have so many…

Machine Learning · Computer Science 2020-04-20 Ritu Yadav

The rapid progress in image classification has been largely driven by the adoption of Graph Convolutional Networks (GCNs), which offer a robust framework for handling complex data structures. This study introduces a novel approach that…

Computer Vision and Pattern Recognition · Computer Science 2025-08-22 Mustafa Mohammadi Gharasuie , Luis Rueda

Graph-based neural network models are gaining traction in the field of representation learning due to their ability to uncover latent topological relationships between entities that are otherwise challenging to identify. These models have…

Image and Video Processing · Electrical Eng. & Systems 2023-07-25 Aryan Singh , Pepijn Van de Ven , Ciarán Eising , Patrick Denny

This paper presents a new state-of-the-art for document image classification and retrieval, using features learned by deep convolutional neural networks (CNNs). In object and scene analysis, deep neural nets are capable of learning a…

Computer Vision and Pattern Recognition · Computer Science 2015-02-26 Adam W. Harley , Alex Ufkes , Konstantinos G. Derpanis

Document classification is a challenging task with important applications. The deep learning approaches to the problem have gained much attention recently. Despite the progress, the proposed models do not incorporate the knowledge of the…

Computation and Language · Computer Science 2019-10-15 Jader Abreu , Luis Fred , David Macêdo , Cleber Zanchettin

In the field of natural language processing, text classification, as a basic task, has important research value and application prospects. Traditional text classification methods usually rely on feature representations such as the bag of…

Computation and Language · Computer Science 2024-08-29 Erdi Gao , Haowei Yang , Dan Sun , Haohao Xia , Yuhan Ma , Yuanjing Zhu

The outcome of text recognition for degraded color documents is often unsatisfactory due to interference from various contaminants. To extract information more efficiently for text recognition, document image enhancement and binarization…

Computer Vision and Pattern Recognition · Computer Science 2025-12-02 Rui-Yang Ju , KokSheik Wong , Jen-Shiun Chiang

The state-of-the-art approaches for image classification are based on neural networks. Mathematically, the task of classifying images is equivalent to finding the function that maps an image to the label it is associated with. To rigorously…

Machine Learning · Computer Science 2017-11-15 Yichen Huang

Document classification tasks were primarily tackled at word level. Recent research that works with character-level inputs shows several benefits over word-level approaches such as natural incorporation of morphemes and better handling of…

Computation and Language · Computer Science 2016-02-02 Yijun Xiao , Kyunghyun Cho

Object recognition and detection are well-studied problems with a developed set of almost standard solutions. Identity documents recognition, classification, detection, and localization are the tasks required in a number of applications,…

Computer Vision and Pattern Recognition · Computer Science 2025-03-04 Mykola Kozlenko , Volodymyr Sendetskyi , Oleksiy Simkiv , Nazar Savchenko , Andy Bosyi

In this paper we investigate image classification with computational resource limits at test time. Two such settings are: 1. anytime classification, where the network's prediction for a test example is progressively updated, facilitating…

Machine Learning · Computer Science 2018-06-08 Gao Huang , Danlu Chen , Tianhong Li , Felix Wu , Laurens van der Maaten , Kilian Q. Weinberger

Deep learning models have demonstrated remarkable results for various computer vision tasks, including the realm of medical imaging. However, their application in the medical domain is limited due to the requirement for large amounts of…

Computer Vision and Pattern Recognition · Computer Science 2023-11-14 Aryan Singh , Pepijn Van de Ven , Ciarán Eising , Patrick Denny

Text classification is fundamental in natural language processing (NLP), and Graph Neural Networks (GNN) are recently applied in this task. However, the existing graph-based works can neither capture the contextual word relationships within…

Computation and Language · Computer Science 2020-05-13 Yufeng Zhang , Xueli Yu , Zeyu Cui , Shu Wu , Zhongzhen Wen , Liang Wang

Text Classification is the most essential and fundamental problem in Natural Language Processing. While numerous recent text classification models applied the sequential deep learning technique, graph neural network-based models can…

Computation and Language · Computer Science 2024-07-08 Kunze Wang , Yihao Ding , Soyeon Caren Han

In this paper, we introduce a fully convolutional network for the document layout analysis task. While state-of-the-art methods are using models pre-trained on natural scene images, our method Doc-UFCN relies on a U-shaped model trained…

Computer Vision and Pattern Recognition · Computer Science 2021-09-20 Mélodie Boillet , Christopher Kermorvant , Thierry Paquet

Document layout analysis (DLA) is the task of detecting the distinct, semantic content within a document and correctly classifying these items into an appropriate category (e.g., text, title, figure). DLA pipelines enable users to convert…

Machine Learning · Computer Science 2023-08-07 Jilin Wang , Michael Krumdick , Baojia Tong , Hamima Halim , Maxim Sokolov , Vadym Barda , Delphine Vendryes , Chris Tanner

Text in natural images contains rich semantics that are often highly relevant to objects or scene. In this paper, we focus on the problem of fully exploiting scene text for visual understanding. The main idea is combining word…

Computer Vision and Pattern Recognition · Computer Science 2017-05-31 Xiang Bai , Mingkun Yang , Pengyuan Lyu , Yongchao Xu , Jiebo Luo
‹ Prev 1 2 3 10 Next ›