Related papers: Efficient Document Image Classification Using Regi…

Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks

In this work, a region-based Deep Convolutional Neural Network framework is proposed for document structure learning. The contribution of this work involves efficient training of region based classifiers and effective ensembling for…

Computer Vision and Pattern Recognition · Computer Science 2018-09-03 Arindam Das, Saikat Roy, Ujjwal Bhattacharya, Swapan Kumar Parui

Doc2Im: document to image conversion through self-attentive embedding

Text classification is a fundamental task in NLP applications. Latest research in this field has largely been divided into two major sub-fields. Learning representations is one sub-field and learning deeper models, both sequential and…

Computation and Language · Computer Science 2018-11-09 Mithun Das Gupta

Rethinking Table Recognition using Graph Neural Networks

Document structure analysis, such as zone segmentation and table recognition, is a complex problem in document processing and is an active area of research. The recent success of deep learning in solving various computer vision and machine…

Computer Vision and Pattern Recognition · Computer Science 2019-07-04 Shah Rukh Qasim, Hassan Mahmood, Faisal Shafait

Light-Weighted CNN for Text Classification

For management, documents are categorized into a specific category, and to do these, most of the organizations use manual labor. In today's automation era, manual efforts on such a task are not justified, and to avoid this, we have so many…

Machine Learning · Computer Science 2020-04-20 Ritu Yadav

Fast Graph Neural Network for Image Classification

The rapid progress in image classification has been largely driven by the adoption of Graph Convolutional Networks (GCNs), which offer a robust framework for handling complex data structures. This study introduces a novel approach that…

Computer Vision and Pattern Recognition · Computer Science 2025-08-22 Mustafa Mohammadi Gharasuie, Luis Rueda

Compact & Capable: Harnessing Graph Neural Networks and Edge Convolution for Medical Image Classification

Graph-based neural network models are gaining traction in the field of representation learning due to their ability to uncover latent topological relationships between entities that are otherwise challenging to identify. These models have…

Image and Video Processing · Electrical Eng. & Systems 2023-07-25 Aryan Singh, Pepijn Van de Ven, Ciarán Eising, Patrick Denny

Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval

This paper presents a new state-of-the-art for document image classification and retrieval, using features learned by deep convolutional neural networks (CNNs). In object and scene analysis, deep neural nets are capable of learning a…

Computer Vision and Pattern Recognition · Computer Science 2015-02-26 Adam W. Harley, Alex Ufkes, Konstantinos G. Derpanis

Hierarchical Attentional Hybrid Neural Networks for Document Classification

Document classification is a challenging task with important applications. The deep learning approaches to the problem have gained much attention recently. Despite the progress, the proposed models do not incorporate the knowledge of the…

Computation and Language · Computer Science 2019-10-15 Jader Abreu, Luis Fred, David Macêdo, Cleber Zanchettin

Text classification optimization algorithm based on graph neural network

In the field of natural language processing, text classification, as a basic task, has important research value and application prospects. Traditional text classification methods usually rely on feature representations such as the bag of…

Computation and Language · Computer Science 2024-08-29 Erdi Gao, Haowei Yang, Dan Sun, Haohao Xia, Yuhan Ma, Yuanjing Zhu

Efficient Generative Adversarial Networks for Color Document Image Enhancement and Binarization Using Multi-scale Feature Extraction

The outcome of text recognition for degraded color documents is often unsatisfactory due to interference from various contaminants. To extract information more efficiently for text recognition, document image enhancement and binarization…

Computer Vision and Pattern Recognition · Computer Science 2025-12-02 Rui-Yang Ju, KokSheik Wong, Jen-Shiun Chiang

Provably efficient neural network representation for image classification

The state-of-the-art approaches for image classification are based on neural networks. Mathematically, the task of classifying images is equivalent to finding the function that maps an image to the label it is associated with. To rigorously…

Machine Learning · Computer Science 2017-11-15 Yichen Huang

Efficient Character-level Document Classification by Combining Convolution and Recurrent Layers

Document classification tasks were primarily tackled at word level. Recent research that works with character-level inputs shows several benefits over word-level approaches such as natural incorporation of morphemes and better handling of…

Computation and Language · Computer Science 2016-02-02 Yijun Xiao, Kyunghyun Cho

Identity documents recognition and detection using semantic segmentation with convolutional neural network

Object recognition and detection are well-studied problems with a developed set of almost standard solutions. Identity documents recognition, classification, detection, and localization are the tasks required in a number of applications,…

Computer Vision and Pattern Recognition · Computer Science 2025-03-04 Mykola Kozlenko, Volodymyr Sendetskyi, Oleksiy Simkiv, Nazar Savchenko, Andy Bosyi

Multi-Scale Dense Networks for Resource Efficient Image Classification

In this paper we investigate image classification with computational resource limits at test time. Two such settings are: 1. anytime classification, where the network's prediction for a test example is progressively updated, facilitating…

Machine Learning · Computer Science 2018-06-08 Gao Huang, Danlu Chen, Tianhong Li, Felix Wu, Laurens van der Maaten, Kilian Q. Weinberger

Connecting the Dots: Graph Neural Network Powered Ensemble and Classification of Medical Images

Deep learning models have demonstrated remarkable results for various computer vision tasks, including the realm of medical imaging. However, their application in the medical domain is limited due to the requirement for large amounts of…

Computer Vision and Pattern Recognition · Computer Science 2023-11-14 Aryan Singh, Pepijn Van de Ven, Ciarán Eising, Patrick Denny

Every Document Owns Its Structure: Inductive Text Classification via Graph Neural Networks

Text classification is fundamental in natural language processing (NLP), and Graph Neural Networks (GNN) are recently applied in this task. However, the existing graph-based works can neither capture the contextual word relationships within…

Computation and Language · Computer Science 2020-05-13 Yufeng Zhang, Xueli Yu, Zeyu Cui, Shu Wu, Zhongzhen Wen, Liang Wang

Graph Neural Networks for Text Classification: A Survey

Text Classification is the most essential and fundamental problem in Natural Language Processing. While numerous recent text classification models applied the sequential deep learning technique, graph neural network-based models can…

Computation and Language · Computer Science 2024-07-08 Kunze Wang, Yihao Ding, Soyeon Caren Han

Multiple Document Datasets Pre-training Improves Text Line Detection With Deep Neural Networks

In this paper, we introduce a fully convolutional network for the document layout analysis task. While state-of-the-art methods are using models pre-trained on natural scene images, our method Doc-UFCN relies on a U-shaped model trained…

Computer Vision and Pattern Recognition · Computer Science 2021-09-20 Mélodie Boillet, Christopher Kermorvant, Thierry Paquet

A Graphical Approach to Document Layout Analysis

Document layout analysis (DLA) is the task of detecting the distinct, semantic content within a document and correctly classifying these items into an appropriate category (e.g., text, title, figure). DLA pipelines enable users to convert…

Machine Learning · Computer Science 2023-08-07 Jilin Wang, Michael Krumdick, Baojia Tong, Hamima Halim, Maxim Sokolov, Vadym Barda, Delphine Vendryes, Chris Tanner

Integrating Scene Text and Visual Appearance for Fine-Grained Image Classification

Text in natural images contains rich semantics that are often highly relevant to objects or scene. In this paper, we focus on the problem of fully exploiting scene text for visual understanding. The main idea is combining word…

Computer Vision and Pattern Recognition · Computer Science 2017-05-31 Xiang Bai, Mingkun Yang, Pengyuan Lyu, Yongchao Xu, Jiebo Luo