Related papers: SNIPER: Efficient Multi-Scale Training

Scale Normalized Image Pyramids with AutoFocus for Object Detection

We present an efficient foveal framework to perform object detection. A scale normalized image pyramid (SNIP) is generated that, like human vision, only attends to objects within a fixed size range at different scales. Such a restriction of…

Computer Vision and Pattern Recognition · Computer Science 2021-02-11 Bharat Singh, Mahyar Najibi, Abhishek Sharma, Larry S. Davis

Snipper: A Spatiotemporal Transformer for Simultaneous Multi-Person 3D Pose Estimation Tracking and Forecasting on a Video Snippet

Multi-person pose understanding from RGB videos involves three complex tasks: pose estimation, tracking and motion forecasting. Intuitively, accurate multi-person pose estimation facilitates robust tracking, and robust tracking builds…

Computer Vision and Pattern Recognition · Computer Science 2023-09-14 Shihao Zou, Yuanlu Xu, Chao Li, Lingni Ma, Li Cheng, Minh Vo

AutoFocus: Efficient Multi-Scale Inference

This paper describes AutoFocus, an efficient multi-scale inference algorithm for deep-learning based object detectors. Instead of processing an entire image pyramid, AutoFocus adopts a coarse to fine approach and only processes regions…

Computer Vision and Pattern Recognition · Computer Science 2019-08-02 Mahyar Najibi, Bharat Singh, Larry S. Davis

Instance Scale Normalization for image understanding

Scale variation remains a challenging problem for object detection. Common paradigms usually adopt multiscale training & testing (image pyramid) or FPN (feature pyramid network) to process objects in a wide scale range. However, multi-scale…

Computer Vision and Pattern Recognition · Computer Science 2020-06-11 Zewen He, He Huang, Yudong Wu, Guan Huang, Wensheng Zhang

Convolutional Neural Pyramid for Image Processing

We propose a principled convolutional neural pyramid (CNP) framework for general low-level vision and image processing tasks. It is based on the essential finding that many applications require large receptive fields for structure…

Computer Vision and Pattern Recognition · Computer Science 2017-04-10 Xiaoyong Shen, Ying-Cong Chen, Xin Tao, Jiaya Jia

Single Pixel Reconstruction for One-stage Instance Segmentation

Object instance segmentation is one of the most fundamental but challenging tasks in computer vision, and it requires the pixel-level image understanding. Most existing approaches address this problem by adding a mask prediction branch to a…

Computer Vision and Pattern Recognition · Computer Science 2019-05-20 Jun Yu, Jinghan Yao, Jian Zhang, Zhou Yu, Dacheng Tao

LinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation

Pixel-wise semantic segmentation for visual scene understanding not only needs to be accurate, but also efficient in order to find any use in real-time application. Existing algorithms even though are accurate but they do not focus on…

Computer Vision and Pattern Recognition · Computer Science 2018-04-03 Abhishek Chaurasia, Eugenio Culurciello

Parameter-Inverted Image Pyramid Networks

Image pyramids are commonly used in modern computer vision tasks to obtain multi-scale features for precise understanding of images. However, image pyramids process multiple resolutions of images using the same large-scale model, which…

Computer Vision and Pattern Recognition · Computer Science 2024-10-29 Xizhou Zhu, Xue Yang, Zhaokai Wang, Hao Li, Wenhan Dou, Junqi Ge, Lewei Lu, Yu Qiao, Jifeng Dai

GraphFPN: Graph Feature Pyramid Network for Object Detection

Feature pyramids have been proven powerful in image understanding tasks that require multi-scale features. State-of-the-art methods for multi-scale feature learning focus on performing feature interactions across space and scales using…

Computer Vision and Pattern Recognition · Computer Science 2022-01-12 Gangming Zhao, Weifeng Ge, Yizhou Yu

Iterative Patch Selection for High-Resolution Image Recognition

High-resolution images are prevalent in various applications, such as autonomous driving and computer-aided diagnosis. However, training neural networks on such images is computationally challenging and easily leads to out-of-memory errors…

Computer Vision and Pattern Recognition · Computer Science 2023-03-08 Benjamin Bergner, Christoph Lippert, Aravindh Mahendran

Fast Image Processing with Fully-Convolutional Networks

We present an approach to accelerating a wide variety of image processing operators. Our approach uses a fully-convolutional network that is trained on input-output pairs that demonstrate the operator's action. After training, the original…

Computer Vision and Pattern Recognition · Computer Science 2017-09-05 Qifeng Chen, Jia Xu, Vladlen Koltun

Efficient Deep Spiking Multi-Layer Perceptrons with Multiplication-Free Inference

Advancements in adapting deep convolution architectures for Spiking Neural Networks (SNNs) have significantly enhanced image classification performance and reduced computational burdens. However, the inability of Multiplication-Free…

Neural and Evolutionary Computing · Computer Science 2024-04-29 Boyan Li, Luziwei Leng, Shuaijie Shen, Kaixuan Zhang, Jianguo Zhang, Jianxing Liao, Ran Cheng

An Analysis of Scale Invariance in Object Detection - SNIP

An analysis of different techniques for recognizing and detecting objects under extreme scale variation is presented. Scale specific and scale invariant design of detectors are compared by training them with different configurations of…

Computer Vision and Pattern Recognition · Computer Science 2018-05-28 Bharat Singh, Larry S. Davis

Scanner: Efficient Video Analysis at Scale

A growing number of visual computing applications depend on the analysis of large video collections. The challenge is that scaling applications to operate on these datasets requires efficient systems for pixel data access and parallel…

Computer Vision and Pattern Recognition · Computer Science 2018-05-21 Alex Poms, Will Crichton, Pat Hanrahan, Kayvon Fatahalian

Superpixel Sampling Networks

Superpixels provide an efficient low/mid-level representation of image data, which greatly reduces the number of image primitives for subsequent vision tasks. Existing superpixel algorithms are not differentiable, making them difficult to…

Computer Vision and Pattern Recognition · Computer Science 2018-07-27 Varun Jampani, Deqing Sun, Ming-Yu Liu, Ming-Hsuan Yang, Jan Kautz

Training Multiscale-CNN for Large Microscopy Image Classification in One Hour

Existing approaches to train neural networks that use large images require to either crop or down-sample data during pre-processing, use small batch sizes, or split the model across devices mainly due to the prohibitively limited memory…

Image and Video Processing · Electrical Eng. & Systems 2020-03-12 Kushal Datta, Imtiaz Hossain, Sun Choi, Vikram Saletore, Kyle Ambert, William J. Godinez, Xian Zhang

ReSpike: Residual Frames-based Hybrid Spiking Neural Networks for Efficient Action Recognition

Spiking Neural Networks (SNNs) have emerged as a compelling, energy-efficient alternative to traditional Artificial Neural Networks (ANNs) for static image tasks such as image classification and segmentation. However, in the more complex…

Computer Vision and Pattern Recognition · Computer Science 2024-09-04 Shiting Xiao, Yuhang Li, Youngeun Kim, Donghyun Lee, Priyadarshini Panda

Concatenated Feature Pyramid Network for Instance Segmentation

Low level features like edges and textures play an important role in accurately localizing instances in neural networks. In this paper, we propose an architecture which improves feature pyramid networks commonly used instance segmentation…

Computer Vision and Pattern Recognition · Computer Science 2019-04-02 Yongqing Sun, Pranav Shenoy K P, Jun Shimamura, Atsushi Sagata

SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization

Convolutional neural networks typically encode an input image into a series of intermediate features with decreasing resolutions. While this structure is suited to classification tasks, it does not perform well for tasks requiring…

Computer Vision and Pattern Recognition · Computer Science 2020-06-18 Xianzhi Du, Tsung-Yi Lin, Pengchong Jin, Golnaz Ghiasi, Mingxing Tan, Yin Cui, Quoc V. Le, Xiaodan Song

Training Energy-Efficient Deep Spiking Neural Networks with Single-Spike Hybrid Input Encoding

Spiking Neural Networks (SNNs) have emerged as an attractive alternative to traditional deep learning frameworks, since they provide higher computational efficiency in event driven neuromorphic hardware. However, the state-of-the-art (SOTA)…

Neural and Evolutionary Computing · Computer Science 2021-09-05 Gourav Datta, Souvik Kundu, Peter A. Beerel