Related papers: FaSTExt: Fast and Small Text Extractor

EnsNet: Ensconce Text in the Wild

A new method is proposed for removing text from natural images. The challenge is to first accurately localize text on the stroke-level and then replace it with a visually plausible background. Unlike previous methods that require image…

Computer Vision and Pattern Recognition · Computer Science 2018-12-04 Shuaitao Zhang , Yuliang Liu , Lianwen Jin , Yaoxiong Huang , Songxuan Lai

Text Detection and Recognition in the Wild: A Review

Detection and recognition of text in natural images are two main problems in the field of computer vision that have a wide variety of applications in analysis of sports videos, autonomous driving, industrial automation, to name a few. They…

Computer Vision and Pattern Recognition · Computer Science 2020-07-02 Zobeir Raisi , Mohamed A. Naiel , Paul Fieguth , Steven Wardell , John Zelek

Scene Parsing with Multiscale Feature Learning, Purity Trees, and Optimal Covers

Scene parsing, or semantic segmentation, consists in labeling each pixel in an image with the category of the object it belongs to. It is a challenging task that involves the simultaneous detection, segmentation and recognition of all the…

Computer Vision and Pattern Recognition · Computer Science 2015-06-09 Clément Farabet , Camille Couprie , Laurent Najman , Yann LeCun

Effective Use of Word Order for Text Categorization with Convolutional Neural Networks

Convolutional neural network (CNN) is a neural network that can make use of the internal structure of data such as the 2D structure of image data. This paper studies CNN on text categorization to exploit the 1D structure (namely, word…

Computation and Language · Computer Science 2015-03-27 Rie Johnson , Tong Zhang

A Novel Scene Text Detection Algorithm Based On Convolutional Neural Network

Candidate text region extraction plays a critical role in convolutional neural network (CNN) based text detection from natural images. In this paper, we propose a CNN based scene text detection algorithm with a new text region extractor.…

Computer Vision and Pattern Recognition · Computer Science 2016-04-08 Xiaohang Ren , Kai Chen , Jun Sun

A Fast Hierarchical Method for Multi-script and Arbitrary Oriented Scene Text Extraction

Typography and layout lead to the hierarchical organisation of text in words, text lines, paragraphs. This inherent structure is a key property of text in any script and language, which has nonetheless been minimally leveraged by existing…

Computer Vision and Pattern Recognition · Computer Science 2024-10-30 Lluis Gomez , Dimosthenis Karatzas

Object Skeleton Extraction in Natural Images by Fusing Scale-associated Deep Side Outputs

Object skeleton is a useful cue for object detection, complementary to the object contour, as it provides a structural representation to describe the relationship among object parts. While object skeleton extraction in natural images is a…

Computer Vision and Pattern Recognition · Computer Science 2016-04-07 Wei Shen , Kai Zhao , Yuan Jiang , Yan Wang , Zhijiang Zhang , Xiang Bai

Reading Scene Text with Attention Convolutional Sequence Modeling

Reading text in the wild is a challenging task in the field of computer vision. Existing approaches mainly adopted Connectionist Temporal Classification (CTC) or Attention models based on Recurrent Neural Network (RNN), which is…

Computer Vision and Pattern Recognition · Computer Science 2017-09-14 Yunze Gao , Yingying Chen , Jinqiao Wang , Hanqing Lu

A Binary Convolutional Encoder-decoder Network for Real-time Natural Scene Text Processing

In this paper, we develop a binary convolutional encoder-decoder network (B-CEDNet) for natural scene text processing (NSTP). It converts a text image to a class-distinguished salience map that reveals the categorical, spatial and…

Computer Vision and Pattern Recognition · Computer Science 2016-12-13 Zichuan Liu , Yixing Li , Fengbo Ren , Hao Yu

Text Classification using Graph Convolutional Networks: A Comprehensive Survey

Text classification is a quintessential and practical problem in natural language processing with applications in diverse domains such as sentiment analysis, fake news detection, medical diagnosis, and document classification. A sizable…

Computation and Language · Computer Science 2024-10-15 Syed Mustafa Haider Rizvi , Ramsha Imran , Arif Mahmood

Focus Entirety and Perceive Environment for Arbitrary-Shaped Text Detection

Due to the diversity of scene text in aspects such as font, color, shape, and size, accurately and efficiently detecting text is still a formidable challenge. Among the various detection approaches, segmentation-based approaches have…

Computer Vision and Pattern Recognition · Computer Science 2024-09-26 Xu Han , Junyu Gao , Chuang Yang , Yuan Yuan , Qi Wang

Visual Mesh: Real-time Object Detection Using Constant Sample Density

This paper proposes an enhancement of convolutional neural networks for object detection in resource-constrained robotics through a geometric input transformation called Visual Mesh. It uses object geometry to create a graph in vision…

Computer Vision and Pattern Recognition · Computer Science 2018-07-24 Trent Houliston , Stephan K. Chalup

Enhanced Techniques for PDF Image Segmentation and Text Extraction

Extracting text objects from the PDF images is a challenging problem. The text data present in the PDF images contain certain useful information for automatic annotation, indexing etc. However variations of the text due to differences in…

Computer Vision and Pattern Recognition · Computer Science 2012-10-02 D. Sasirekha , E. Chandra

Shift Variance in Scene Text Detection

Theory of convolutional neural networks suggests the property of shift equivariance, i.e., that a shifted input causes an equally shifted output. In practice, however, this is not always the case. This poses a great problem for scene text…

Computer Vision and Pattern Recognition · Computer Science 2022-08-22 Markus Glitzner , Jan-Hendrik Neudeck , Philipp Härtinger

Deep Learning Applied to Image and Text Matching

The ability to describe images with natural language sentences is the hallmark for image and language understanding. Such a system has wide ranging applications such as annotating images and using natural sentences to search for images.In…

Machine Learning · Computer Science 2016-01-15 Afroze Ibrahim Baqapuri

Image retrieval method based on CNN and dimension reduction

An image retrieval method based on convolution neural network and dimension reduction is proposed in this paper. Convolution neural network is used to extract high-level features of images, and to solve the problem that the extracted…

Computer Vision and Pattern Recognition · Computer Science 2019-01-15 Zhihao Cao , Shaomin Mu , Yongyu Xu , Mengping Dong

Real-World Font Recognition Using Deep Network and Domain Adaptation

We address a challenging fine-grain classification problem: recognizing a font style from an image of text. In this task, it is very easy to generate lots of rendered font examples but very hard to obtain real-world labeled images. This…

Computer Vision and Pattern Recognition · Computer Science 2015-04-02 Zhangyang Wang , Jianchao Yang , Hailin Jin , Eli Shechtman , Aseem Agarwala , Jonathan Brandt , Thomas S. Huang

HTR-ConvText: Leveraging Convolution and Textual Information for Handwritten Text Recognition

Handwritten Text Recognition remains challenging due to the limited data, high writing style variance, and scripts with complex diacritics. Existing approaches, though partially address these issues, often struggle to generalize without…

Computer Vision and Pattern Recognition · Computer Science 2025-12-05 Pham Thach Thanh Truc , Dang Hoai Nam , Huynh Tong Dang Khoa , Vo Nguyen Le Duy

An Intelligent CNN-VAE Text Representation Technology Based on Text Semantics for Comprehensive Big Data

In the era of big data, a large number of text data generated by the Internet has given birth to a variety of text representation methods. In natural language processing (NLP), text representation transforms text into vectors that can be…

Machine Learning · Computer Science 2020-08-31 Genggeng Liu , Canyang Guo , Lin Xie , Wenxi Liu , Naixue Xiong , Guolong Chen

Memory Matters: Convolutional Recurrent Neural Network for Scene Text Recognition

Text recognition in natural scene is a challenging problem due to the many factors affecting text appearance. In this paper, we presents a method that directly transcribes scene text images to text without needing of sophisticated character…

Computer Vision and Pattern Recognition · Computer Science 2016-01-07 Guo Qiang , Tu Dan , Li Guohui , Lei Jun