Related papers: A Binary Convolutional Encoder-decoder Network for…

Text-Attentional Convolutional Neural Networks for Scene Text Detection

Recent deep learning models have demonstrated strong capabilities for classifying text and non-text components in natural images. They extract a high-level feature computed globally from a whole image component (patch), where the cluttered…

Computer Vision and Pattern Recognition · Computer Science 2016-05-04 Tong He , Weilin Huang , Yu Qiao , Jian Yao

Accurate Text Localization in Natural Image with Cascaded Convolutional Text Network

We introduce a new top-down pipeline for scene text detection. We propose a novel Cascaded Convolutional Text Network (CCTN) that joints two customized convolutional networks for coarse-to-fine text localization. The CCTN fast detects text…

Computer Vision and Pattern Recognition · Computer Science 2016-04-01 Tong He , Weilin Huang , Yu Qiao , Jian Yao

Contextual Encoder-Decoder Network for Visual Saliency Prediction

Predicting salient regions in natural images requires the detection of objects that are present in a scene. To develop robust representations for this challenging task, high-level visual features at multiple spatial scales must be extracted…

Computer Vision and Pattern Recognition · Computer Science 2024-04-08 Alexander Kroner , Mario Senden , Kurt Driessens , Rainer Goebel

Scene Text Eraser

The character information in natural scene images contains various personal information, such as telephone numbers, home addresses, etc. It is a high risk of leakage the information if they are published. In this paper, we proposed a scene…

Computer Vision and Pattern Recognition · Computer Science 2017-05-09 Toshiki Nakamura , Anna Zhu , Keiji Yanai , Seiichi Uchida

Rethinking BiSeNet For Real-time Semantic Segmentation

BiSeNet has been proved to be a popular two-stream network for real-time segmentation. However, its principle of adding an extra path to encode spatial information is time-consuming, and the backbones borrowed from pretrained tasks, e.g.,…

Computer Vision and Pattern Recognition · Computer Science 2021-04-28 Mingyuan Fan , Shenqi Lai , Junshi Huang , Xiaoming Wei , Zhenhua Chai , Junfeng Luo , Xiaolin Wei

Binary Patterns Encoded Convolutional Neural Networks for Texture Recognition and Remote Sensing Scene Classification

Designing discriminative powerful texture features robust to realistic imaging conditions is a challenging computer vision problem with many applications, including material recognition and analysis of satellite or aerial imagery. In the…

Computer Vision and Pattern Recognition · Computer Science 2018-03-28 Rao Muhammad Anwer , Fahad Shahbaz Khan , Joost van de Weijer , Matthieu Molinier , Jorma Laaksonen

ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network

Scene text detection and recognition has received increasing research attention. Existing methods can be roughly categorized into two groups: character-based and segmentation-based. These methods either are costly for character annotation…

Computer Vision and Pattern Recognition · Computer Science 2020-02-26 Yuliang Liu , Hao Chen , Chunhua Shen , Tong He , Lianwen Jin , Liangwei Wang

SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition

Scene text recognition is a hot research topic in computer vision. Recently, many recognition methods based on the encoder-decoder framework have been proposed, and they can handle scene texts of perspective distortion and curve shape.…

Computer Vision and Pattern Recognition · Computer Science 2020-05-25 Zhi Qiao , Yu Zhou , Dongbao Yang , Yucan Zhou , Weiping Wang

Scene Text Recognition with Sliding Convolutional Character Models

Scene text recognition has attracted great interests from the computer vision and pattern recognition community in recent years. State-of-the-art methods use concolutional neural networks (CNNs), recurrent neural networks with long…

Computer Vision and Pattern Recognition · Computer Science 2017-09-07 Fei Yin , Yi-Chao Wu , Xu-Yao Zhang , Cheng-Lin Liu

Sequence to sequence learning for unconstrained scene text recognition

In this work we present a state-of-the-art approach for unconstrained natural scene text recognition. We propose a cascade approach that incorporates a convolutional neural network (CNN) architecture followed by a long short term memory…

Computer Vision and Pattern Recognition · Computer Science 2016-07-22 Ahmed Mamdouh A. Hassanien

Composite Binary Decomposition Networks

Binary neural networks have great resource and computing efficiency, while suffer from long training procedure and non-negligible accuracy drops, when comparing to the full-precision counterparts. In this paper, we propose the composite…

Machine Learning · Computer Science 2018-11-19 You Qiaoben , Zheng Wang , Jianguo Li , Yinpeng Dong , Yu-Gang Jiang , Jun Zhu

Efficient and Accurate Scene Text Recognition with Cascaded-Transformers

In recent years, vision transformers with text decoder have demonstrated remarkable performance on Scene Text Recognition (STR) due to their ability to capture long-range dependencies and contextual relationships with high learning…

Computer Vision and Pattern Recognition · Computer Science 2025-03-25 Savas Ozkan , Andrea Maracani , Hyowon Kim , Sijun Cho , Eunchung Noh , Jeongwon Min , Jung Min Cho , Mete Ozay

Fast-SCNN: Fast Semantic Segmentation Network

The encoder-decoder framework is state-of-the-art for offline semantic image segmentation. Since the rise in autonomous systems, real-time computation is increasingly desirable. In this paper, we introduce fast segmentation convolutional…

Computer Vision and Pattern Recognition · Computer Science 2019-02-13 Rudra P K Poudel , Stephan Liwicki , Roberto Cipolla

Towards End-to-end Text Spotting with Convolutional Recurrent Neural Networks

In this work, we jointly address the problem of text detection and recognition in natural scene images based on convolutional recurrent neural networks. We propose a unified network that simultaneously localizes and recognizes text with a…

Computer Vision and Pattern Recognition · Computer Science 2017-07-14 Hui Li , Peng Wang , Chunhua Shen

Reading Scene Text with Attention Convolutional Sequence Modeling

Reading text in the wild is a challenging task in the field of computer vision. Existing approaches mainly adopted Connectionist Temporal Classification (CTC) or Attention models based on Recurrent Neural Network (RNN), which is…

Computer Vision and Pattern Recognition · Computer Science 2017-09-14 Yunze Gao , Yingying Chen , Jinqiao Wang , Hanqing Lu

Primitive Representation Learning for Scene Text Recognition

Scene text recognition is a challenging task due to diverse variations of text instances in natural scene images. Conventional methods based on CNN-RNN-CTC or encoder-decoder with attention mechanism may not fully investigate stable and…

Computer Vision and Pattern Recognition · Computer Science 2021-05-11 Ruijie Yan , Liangrui Peng , Shanyu Xiao , Gang Yao

NRTR: A No-Recurrence Sequence-to-Sequence Model For Scene Text Recognition

Scene text recognition has attracted a great many researches due to its importance to various applications. Existing methods mainly adopt recurrence or convolution based networks. Though have obtained good performance, these methods still…

Computer Vision and Pattern Recognition · Computer Science 2019-10-11 Fenfen Sheng , Zhineng Chen , Bo Xu

Deep Scene Text Detection with Connected Component Proposals

A growing demand for natural-scene text detection has been witnessed by the computer vision community since text information plays a significant role in scene understanding and image indexing. Deep neural networks are being used due to…

Computer Vision and Pattern Recognition · Computer Science 2017-08-18 Fan Jiang , Zhihui Hao , Xinran Liu

Improved Bengali Image Captioning via deep convolutional neural network based encoder-decoder model

Image Captioning is an arduous task of producing syntactically and semantically correct textual descriptions of an image in natural language with context related to the image. Existing notable pieces of research in Bengali Image Captioning…

Computer Vision and Pattern Recognition · Computer Science 2021-02-16 Mohammad Faiyaz Khan , S. M. Sadiq-Ur-Rahman Shifath , Md. Saiful Islam

Bilinear CNNs for Fine-grained Visual Recognition

We present a simple and effective architecture for fine-grained visual recognition called Bilinear Convolutional Neural Networks (B-CNNs). These networks represent an image as a pooled outer product of features derived from two CNNs and…

Computer Vision and Pattern Recognition · Computer Science 2017-06-02 Tsung-Yu Lin , Aruni RoyChowdhury , Subhransu Maji