Related papers: FaSTExt: Fast and Small Text Extractor

Learning to Extract Semantic Structure from Documents Using Multimodal Fully Convolutional Neural Network

We present an end-to-end, multimodal, fully convolutional network for extracting semantic structures from document images. We consider document semantic structure extraction as a pixel-wise segmentation task, and propose a unified model…

Computer Vision and Pattern Recognition · Computer Science 2017-06-09 Xiao Yang, Ersin Yumer, Paul Asente, Mike Kraley, Daniel Kifer, C. Lee Giles

Synthetic Data for Text Localisation in Natural Images

In this paper we introduce a new method for text detection in natural images. The method comprises two contributions: First, a fast and scalable engine to generate synthetic images of text in clutter. This engine overlays synthetic text to…

Computer Vision and Pattern Recognition · Computer Science 2016-04-25 Ankush Gupta, Andrea Vedaldi, Andrew Zisserman

Detecting Text in the Wild with Deep Character Embedding Network

Most text detection methods hypothesize texts are horizontal or multi-oriented and thus define quadrangles as the basic detection unit. However, text in the wild is usually perspectively distorted or curved, which can not be easily tackled…

Computer Vision and Pattern Recognition · Computer Science 2019-01-03 Jiaming Liu, Chengquan Zhang, Yipeng Sun, Junyu Han, Errui Ding

Extracting textual overlays from social media videos using neural networks

Textual overlays are often used in social media videos as people who watch them without the sound would otherwise miss essential information conveyed in the audio stream. This is why extraction of those overlays can serve as an important…

Computer Vision and Pattern Recognition · Computer Science 2018-05-02 Adam Słucki, Tomasz Trzcinski, Adam Bielski, Paweł Cyrta

Text Classification Algorithms: A Survey

In recent years, there has been an exponential growth in the number of complex documents and texts that require a deeper understanding of machine learning methods to be able to accurately classify texts in many applications. Many machine…

Machine Learning · Computer Science 2020-05-21 Kamran Kowsari, Kiana Jafari Meimandi, Mojtaba Heidarysafa, Sanjana Mendu, Laura E. Barnes, Donald E. Brown

Learning to Predict More Accurate Text Instances for Scene Text Detection

At present, multi-oriented text detection methods based on deep neural network have achieved promising performances on various benchmarks. Nevertheless, there are still some difficulties for arbitrary shape text detection, especially for a…

Computer Vision and Pattern Recognition · Computer Science 2020-04-17 XiaoQian Li, Jie Liu, ShuWu Zhang, GuiXuan Zhang

Detecting Text in Natural Image with Connectionist Text Proposal Network

We propose a novel Connectionist Text Proposal Network (CTPN) that accurately localizes text lines in natural image. The CTPN detects a text line in a sequence of fine-scale text proposals directly in convolutional feature maps. We develop…

Computer Vision and Pattern Recognition · Computer Science 2016-10-02 Zhi Tian, Weilin Huang, Tong He, Pan He, Yu Qiao

Deep Scene Text Detection with Connected Component Proposals

A growing demand for natural-scene text detection has been witnessed by the computer vision community since text information plays a significant role in scene understanding and image indexing. Deep neural networks are being used due to…

Computer Vision and Pattern Recognition · Computer Science 2017-08-18 Fan Jiang, Zhihui Hao, Xinran Liu

PSSTRNet: Progressive Segmentation-guided Scene Text Removal Network

Scene text removal (STR) is a challenging task due to the complex text fonts, colors, sizes, and background textures in scene images. However, most previous methods learn both text location and background inpainting implicitly within a…

Computer Vision and Pattern Recognition · Computer Science 2023-06-14 Guangtao Lyu, Anna Zhu

Cascaded Segmentation-Detection Networks for Word-Level Text Spotting

We introduce an algorithm for word-level text spotting that is able to accurately and reliably determine the bounding regions of individual words of text "in the wild". Our system is formed by the cascade of two convolutional neural…

Computer Vision and Pattern Recognition · Computer Science 2017-04-05 Siyang Qin, Roberto Manduchi

Few Could Be Better Than All: Feature Sampling and Grouping for Scene Text Detection

Recently, transformer-based methods have achieved promising progresses in object detection, as they can eliminate the post-processes like NMS and enrich the deep representations. However, these methods cannot well cope with scene text due…

Computer Vision and Pattern Recognition · Computer Science 2022-03-31 Jingqun Tang, Wenqing Zhang, Hongye Liu, MingKun Yang, Bo Jiang, Guanglong Hu, Xiang Bai

Text Detection and Recognition in images: A survey

Text Detection and recognition is one of the important aspects of image processing. This paper analyzes and compares the methods to handle this task. It summarizes the fundamental problems and enumerates factors that need consideration…

Computer Vision and Pattern Recognition · Computer Science 2018-05-03 Tanvi Goswami, Zankhana Barad, Prof. Nikita P. Desai

Shape Robust Text Detection with Progressive Scale Expansion Network

The challenges of shape robust text detection lie in two aspects: 1) most existing quadrangular bounding box based detectors are difficult to locate texts with arbitrary shapes, which are hard to be enclosed perfectly in a rectangle; 2)…

Computer Vision and Pattern Recognition · Computer Science 2018-06-08 Xiang Li, Wenhai Wang, Wenbo Hou, Ruo-Ze Liu, Tong Lu, Jian Yang

Arbitrary-Shaped Text Detection with Adaptive Text Region Representation

Text detection/localization, as an important task in computer vision, has witnessed substantial advancements in methodology and performance with convolutional neural networks. However, the vast majority of popular methods use rectangles or…

Computer Vision and Pattern Recognition · Computer Science 2021-04-02 Xiufeng Jiang, Shugong Xu, Shunqing Zhang, Shan Cao

Detecting Oriented Text in Natural Images by Linking Segments

Most state-of-the-art text detection methods are specific to horizontal Latin text and are not fast enough for real-time applications. We introduce Segment Linking (SegLink), an oriented text detection method. The main idea is to decompose…

Computer Vision and Pattern Recognition · Computer Science 2017-04-14 Baoguang Shi, Xiang Bai, Serge Belongie

Modelling, Visualising and Summarising Documents with a Single Convolutional Neural Network

Capturing the compositional process which maps the meaning of words to that of documents is a central challenge for researchers in Natural Language Processing and Information Retrieval. We introduce a model that is able to represent the…

Computation and Language · Computer Science 2014-06-17 Misha Denil, Alban Demiraj, Nal Kalchbrenner, Phil Blunsom, Nando de Freitas

Text detection and recognition based on a lensless imaging system

Lensless cameras are characterized by several advantages (e.g., miniaturization, ease of manufacture, and low cost) as compared with conventional cameras. However, they have not been extensively employed due to their poor image clarity and…

Computer Vision and Pattern Recognition · Computer Science 2022-10-11 Yinger Zhang, Zhouyi Wu, Peiying Lin, Yuting Wu, Lusong Wei, Zhengjie Huang, Jiangtao Huangfu

Scale-Invariant Multi-Oriented Text Detection in Wild Scene Images

Automatic detection of scene texts in the wild is a challenging problem, particularly due to the difficulties in handling (i) occlusions of varying percentages, (ii) widely different scales and orientations, (iii) severe degradations in the…

Computer Vision and Pattern Recognition · Computer Science 2020-02-18 Kinjal Dasgupta, Sudip Das, Ujjwal Bhattacharya

DeepSkeleton: Learning Multi-task Scale-associated Deep Side Outputs for Object Skeleton Extraction in Natural Images

Object skeletons are useful for object representation and object detection. They are complementary to the object contour, and provide extra information, such as how object scale (thickness) varies among object parts. But object skeleton…

Computer Vision and Pattern Recognition · Computer Science 2017-10-11 Wei Shen, Kai Zhao, Yuan Jiang, Yan Wang, Xiang Bai, Alan Yuille

ReLaText: Exploiting Visual Relationships for Arbitrary-Shaped Scene Text Detection with Graph Convolutional Networks

We introduce a new arbitrary-shaped text detection approach named ReLaText by formulating text detection as a visual relationship detection problem. To demonstrate the effectiveness of this new formulation, we start from using a "link"…

Computer Vision and Pattern Recognition · Computer Science 2020-03-17 Chixiang Ma, Lei Sun, Zhuoyao Zhong, Qiang Huo