Related papers: Location-Aware Feature Selection Text Detection Ne…

Learning to detect and localize many objects from few examples

The current trend in object detection and localization is to learn predictions with high capacity deep neural networks trained on a very large amount of annotated data and using a high amount of processing power. In this work, we propose a…

Computer Vision and Pattern Recognition · Computer Science 2016-11-18 Bastien Moysset , Christoper Kermorvant , Christian Wolf

Location-Aware Box Reasoning for Anchor-Based Single-Shot Object Detection

In the majority of object detection frameworks, the confidence of instance classification is used as the quality criterion of predicted bounding boxes, like the confidence-based ranking in non-maximum suppression (NMS). However, the quality…

Computer Vision and Pattern Recognition · Computer Science 2020-07-14 Wenchi Ma , Kaidong Li , Guanghui Wang

LocNet: Improving Localization Accuracy for Object Detection

We propose a novel object localization methodology with the purpose of boosting the localization accuracy of state-of-the-art object detection systems. Our model, given a search region, aims at returning the bounding box of an object of…

Computer Vision and Pattern Recognition · Computer Science 2016-04-08 Spyros Gidaris , Nikos Komodakis

Latency-aware Spatial-wise Dynamic Networks

Spatial-wise dynamic convolution has become a promising approach to improving the inference efficiency of deep networks. By allocating more computation to the most informative pixels, such an adaptive inference paradigm reduces the spatial…

Computer Vision and Pattern Recognition · Computer Science 2022-10-13 Yizeng Han , Zhihang Yuan , Yifan Pu , Chenhao Xue , Shiji Song , Guangyu Sun , Gao Huang

Learning Instance-Aware Object Detection Using Determinantal Point Processes

Recent object detectors find instances while categorizing candidate regions. As each region is evaluated independently, the number of candidate regions from a detector is usually larger than the number of objects. Since the final goal of…

Computer Vision and Pattern Recognition · Computer Science 2019-09-04 Nuri Kim , Donghoon Lee , Songhwai Oh

Single Shot Text Detector with Regional Attention

We present a novel single-shot text detector that directly outputs word-level bounding boxes in a natural image. We propose an attention mechanism which roughly identifies text regions via an automatically learned attentional map. This…

Computer Vision and Pattern Recognition · Computer Science 2017-09-04 Pan He , Weilin Huang , Tong He , Qile Zhu , Yu Qiao , Xiaolin Li

LRANet: Towards Accurate and Efficient Scene Text Detection with Low-Rank Approximation Network

Recently, regression-based methods, which predict parameterized text shapes for text localization, have gained popularity in scene text detection. However, the existing parameterized text shape methods still have limitations in modeling…

Computer Vision and Pattern Recognition · Computer Science 2024-01-25 Yuchen Su , Zhineng Chen , Zhiwen Shao , Yuning Du , Zhilong Ji , Jinfeng Bai , Yong Zhou , Yu-Gang Jiang

PixelLink: Detecting Scene Text via Instance Segmentation

Most state-of-the-art scene text detection algorithms are deep learning based methods that depend on bounding box regression and perform at least two kinds of predictions: text/non-text classification and location regression. Regression…

Computer Vision and Pattern Recognition · Computer Science 2018-01-08 Dan Deng , Haifeng Liu , Xuelong Li , Deng Cai

NENET: An Edge Learnable Network for Link Prediction in Scene Text

Text detection in scenes based on deep neural networks have shown promising results. Instead of using word bounding box regression, recent state-of-the-art methods have started focusing on character bounding box and pixel-level prediction.…

Machine Learning · Computer Science 2020-05-26 Mayank Kumar Singh , Sayan Banerjee , Shubhasis Chaudhuri

Text Detection on Roughly Placed Books by Leveraging a Learning-based Model Trained with Another Domain Data

Text detection enables us to extract rich information from images. In this paper, we focus on how to generate bounding boxes that are appropriate to grasp text areas on books to help implement automatic text detection. We attempt not to…

Computer Vision and Pattern Recognition · Computer Science 2020-06-29 Riku Anegawa , Masayoshi Aritsugi

Explore Faster Localization Learning For Scene Text Detection

Generally pre-training and long-time training computation are necessary for obtaining a good-performance text detector based on deep networks. In this paper, we present a new scene text detection network (called FANet) with a Fast…

Computer Vision and Pattern Recognition · Computer Science 2022-07-05 Yuzhong Zhao , Yuanqiang Cai , Weijia Wu , Weiqiang Wang

Shape Robust Text Detection with Progressive Scale Expansion Network

The challenges of shape robust text detection lie in two aspects: 1) most existing quadrangular bounding box based detectors are difficult to locate texts with arbitrary shapes, which are hard to be enclosed perfectly in a rectangle; 2)…

Computer Vision and Pattern Recognition · Computer Science 2018-06-08 Xiang Li , Wenhai Wang , Wenbo Hou , Ruo-Ze Liu , Tong Lu , Jian Yang

Location-Sensitive Visual Recognition with Cross-IOU Loss

Object detection, instance segmentation, and pose estimation are popular visual recognition tasks which require localizing the object by internal or boundary landmarks. This paper summarizes these tasks as location-sensitive visual…

Computer Vision and Pattern Recognition · Computer Science 2021-04-13 Kaiwen Duan , Lingxi Xie , Honggang Qi , Song Bai , Qingming Huang , Qi Tian

Industrial Scene Text Detection with Refined Feature-attentive Network

Detecting the marking characters of industrial metal parts remains challenging due to low visual contrast, uneven illumination, corroded character structures, and cluttered background of metal part images. Affected by these factors,…

Computer Vision and Pattern Recognition · Computer Science 2022-03-30 Tongkun Guan , Chaochen Gu , Changsheng Lu , Jingzheng Tu , Qi Feng , Kaijie Wu , Xinping Guan

LSANet: Feature Learning on Point Sets by Local Spatial Aware Layer

Directly learning features from the point cloud has become an active research direction in 3D understanding. Existing learning-based methods usually construct local regions from the point cloud and extract the corresponding features.…

Computer Vision and Pattern Recognition · Computer Science 2019-06-21 Lin-Zhuo Chen , Xuan-Yi Li , Deng-Ping Fan , Kai Wang , Shao-Ping Lu , Ming-Ming Cheng

LASFNet: A Lightweight Attention-Guided Self-Modulation Feature Fusion Network for Multimodal Object Detection

Effective deep feature extraction via feature-level fusion is crucial for multimodal object detection. However, previous studies often involve complex training processes that integrate modality-specific features by stacking multiple…

Computer Vision and Pattern Recognition · Computer Science 2025-06-27 Lei Hao , Lina Xu , Chang Liu , Yanni Dong

R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection

In this paper, we propose a novel method called Rotational Region CNN (R2CNN) for detecting arbitrary-oriented texts in natural scene images. The framework is based on Faster R-CNN [1] architecture. First, we use the Region Proposal Network…

Computer Vision and Pattern Recognition · Computer Science 2017-07-03 Yingying Jiang , Xiangyu Zhu , Xiaobing Wang , Shuli Yang , Wei Li , Hua Wang , Pei Fu , Zhenbo Luo

LRANet++: Low-Rank Approximation Network for Accurate and Efficient Text Spotting

End-to-end text spotting aims to jointly optimize text detection and recognition within a unified framework. Despite significant progress, designing an accurate and efficient end-to-end text spotter for arbitrary-shaped text remains…

Computer Vision and Pattern Recognition · Computer Science 2026-01-06 Yuchen Su , Zhineng Chen , Yongkun Du , Zuxuan Wu , Hongtao Xie , Yu-Gang Jiang

STELA: A Real-Time Scene Text Detector with Learned Anchor

To achieve high coverage of target boxes, a normal strategy of conventional one-stage anchor-based detectors is to utilize multiple priors at each spatial position, especially in scene text detection tasks. In this work, we present a simple…

Computer Vision and Pattern Recognition · Computer Science 2019-09-24 Linjie Deng , Yanxiang Gong , Xinchen Lu , Yi Lin , Zheng Ma , Mei Xie

Adaptive Segmentation Network for Scene Text Detection

Inspired by deep convolution segmentation algorithms, scene text detectors break the performance ceiling of datasets steadily. However, these methods often encounter threshold selection bottlenecks and have poor performance on text…

Computer Vision and Pattern Recognition · Computer Science 2023-08-17 Guiqin Zhao