Related papers: FaSTExt: Fast and Small Text Extractor

Visual Instance Retrieval with Deep Convolutional Networks

This paper provides an extensive study on the availability of image representations based on convolutional networks (ConvNets) for the task of visual instance retrieval. Besides the choice of convolutional layers, we present an efficient…

Computer Vision and Pattern Recognition · Computer Science 2016-05-10 Ali Sharif Razavian , Josephine Sullivan , Stefan Carlsson , Atsuto Maki

An End-to-end Video Text Detector with Online Tracking

Video text detection is considered as one of the most difficult tasks in document analysis due to the following two challenges: 1) the difficulties caused by video scenes, i.e., motion blur, illumination changes, and occlusion; 2) the…

Computer Vision and Pattern Recognition · Computer Science 2019-08-21 Hongyuan Yu , Chengquan Zhang , Xuan Li , Junyu Han , Errui Ding , Liang Wang

Text-Pass Filter: An Efficient Scene Text Detector

To pursue an efficient text assembling process, existing methods detect texts via the shrink-mask expansion strategy. However, the shrinking operation loses the visual features of text margins and confuses the foreground and background…

Computer Vision and Pattern Recognition · Computer Science 2026-01-27 Chuang Yang , Haozhao Ma , Xu Han , Yuan Yuan , Qi Wang

Classifying text using machine learning models and determining conversation drift

Text classification helps analyse texts for semantic meaning and relevance, by mapping the words against this hierarchy. An analysis of various types of texts is invaluable to understanding both their semantic meaning, as well as their…

Machine Learning · Computer Science 2022-11-16 Chaitanya Chadha , Vandit Gupta , Deepak Gupta , Ashish Khanna

A Fast Template-based Approach to Automatically Identify Primary Text Content of a Web Page

Search engines have become an indispensable tool for browsing information on the Internet. The user, however, is often annoyed by redundant results from irrelevant Web pages. One reason is because search engines also look at non-informative…

Information Retrieval · Computer Science 2019-11-27 Dat Quoc Nguyen , Dai Quoc Nguyen , Son Bao Pham , The Duy Bui

Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing

Visual text, a pivotal element in both document and scene images, speaks volumes and attracts significant attention in the computer vision domain. Beyond visual text detection and recognition, the field of visual text processing has…

Computer Vision and Pattern Recognition · Computer Science 2024-02-06 Yan Shu , Weichao Zeng , Zhenhang Li , Fangmin Zhao , Yu Zhou

MT: Multi-Perspective Feature Learning Network for Scene Text Detection

Text detection, the key technology for understanding scene text, has become an attractive research topic. For detecting various scene texts, researchers propose plenty of detectors with different advantages: detection-based models enjoy…

Computer Vision and Pattern Recognition · Computer Science 2022-01-25 Chuang Yang , Mulin Chen , Yuan Yuan , Qi Wang

FACLSTM: ConvLSTM with Focused Attention for Scene Text Recognition

Scene text recognition has recently been widely treated as a sequence-to-sequence prediction problem, where traditional fully-connected-LSTM (FC-LSTM) has played a critical role. Due to the limitation of FC-LSTM, existing methods have to…

Computer Vision and Pattern Recognition · Computer Science 2020-01-07 Qingqing Wang , Wenjing Jia , Xiangjian He , Yue Lu , Michael Blumenstein , Ye Huang

Text Detection on Roughly Placed Books by Leveraging a Learning-based Model Trained with Another Domain Data

Text detection enables us to extract rich information from images. In this paper, we focus on how to generate bounding boxes that are appropriate to grasp text areas on books to help implement automatic text detection. We attempt not to…

Computer Vision and Pattern Recognition · Computer Science 2020-06-29 Riku Anegawa , Masayoshi Aritsugi

Integrating Scene Text and Visual Appearance for Fine-Grained Image Classification

Text in natural images contains rich semantics that are often highly relevant to objects or scene. In this paper, we focus on the problem of fully exploiting scene text for visual understanding. The main idea is combining word…

Computer Vision and Pattern Recognition · Computer Science 2017-05-31 Xiang Bai , Mingkun Yang , Pengyuan Lyu , Yongchao Xu , Jiebo Luo

Recurrent Models of Visual Attention

Applying convolutional neural networks to large images is computationally expensive because the amount of computation scales linearly with the number of image pixels. We present a novel recurrent neural network model that is capable of…

Machine Learning · Computer Science 2014-06-25 Volodymyr Mnih , Nicolas Heess , Alex Graves , Koray Kavukcuoglu

Convolutional Sparse Coding Fast Approximation with Application to Seismic Reflectivity Estimation

In sparse coding, we attempt to extract features of input vectors, assuming that the data is inherently structured as a sparse superposition of basic building blocks. Similarly, neural networks perform a given task by learning features of…

Machine Learning · Computer Science 2022-02-16 Deborah Pereg , Israel Cohen , Anthony A. Vassiliou

Exploiting Linear Structure Within Convolutional Networks for Efficient Evaluation

We present techniques for speeding up the test-time evaluation of large convolutional networks, designed for object recognition tasks. These models deliver impressive accuracy but each image evaluation requires millions of floating point…

Computer Vision and Pattern Recognition · Computer Science 2024-03-15 Remi Denton , Wojciech Zaremba , Joan Bruna , Yann LeCun , Rob Fergus

Learning Convolutional Text Representations for Visual Question Answering

Visual question answering is a recently proposed artificial intelligence task that requires a deep understanding of both images and texts. In deep learning, images are typically modeled through convolutional neural networks, and texts are…

Machine Learning · Computer Science 2018-09-05 Zhengyang Wang , Shuiwang Ji

A Single Shot Text Detector with Scale-adaptive Anchors

Currently, most top-performing text detection networks tend to employ fixed-size anchor boxes to guide the search for text instances. They usually rely on a large amount of anchors with different scales to discover texts in scene images,…

Computer Vision and Pattern Recognition · Computer Science 2018-07-06 Qi Yuan , Bingwang Zhang , Haojie Li , Zhihui Wang , Zhongxuan Luo

AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting

Scene text spotting aims to detect and recognize the entire word or sentence with multiple characters in natural images. It is still challenging because ambiguity often occurs when the spacing between characters is large or the characters…

Computer Vision and Pattern Recognition · Computer Science 2021-07-07 Wenhai Wang , Xuebo Liu , Xiaozhong Ji , Enze Xie , Ding Liang , Zhibo Yang , Tong Lu , Chunhua Shen , Ping Luo

Discovering influential text using convolutional neural networks

Experimental methods for estimating the impacts of text on human evaluation have been widely used in the social sciences. However, researchers in experimental settings are usually limited to testing a small number of pre-specified text…

Computation and Language · Computer Science 2024-12-04 Megan Ayers , Luke Sanford , Margaret Roberts , Eddie Yang

Rethinking Irregular Scene Text Recognition

Reading text from natural images is challenging due to the great variety in text font, color, size, complex background and etc.. The perspective distortion and non-linear spatial arrangement of characters make it further difficult. While…

Computer Vision and Pattern Recognition · Computer Science 2019-11-12 Shangbang Long , Yushuo Guan , Bingxuan Wang , Kaigui Bian , Cong Yao

Context-Free TextSpotter for Real-Time and Mobile End-to-End Text Detection and Recognition

In the deployment of scene-text spotting systems on mobile platforms, lightweight models with low computation are preferable. In concept, end-to-end (E2E) text spotting is suitable for such purposes because it performs text detection and…

Computer Vision and Pattern Recognition · Computer Science 2021-06-11 Ryota Yoshihashi , Tomohiro Tanaka , Kenji Doi , Takumi Fujino , Naoaki Yamashita

CUTIE: Learning to Understand Documents with Convolutional Universal Text Information Extractor

Extracting key information from documents, such as receipts or invoices, and preserving the interested texts to structured data is crucial in the document-intensive streamline processes of office automation in areas that includes but not…

Computer Vision and Pattern Recognition · Computer Science 2019-06-21 Xiaohui Zhao , Endi Niu , Zhuo Wu , Xiaoguang Wang