Related papers: EffiPerception: an Efficient Framework for Various…

EfficientDet: Scalable and Efficient Object Detection

Model efficiency has become increasingly important in computer vision. In this paper, we systematically study neural network architecture design choices for object detection and propose several key optimizations to improve efficiency.…

Computer Vision and Pattern Recognition · Computer Science 2020-07-28 Mingxing Tan, Ruoming Pang, Quoc V. Le

Resource Efficient Perception for Vision Systems

Despite the rapid advancement in the field of image recognition, the processing of high-resolution imagery remains a computational challenge. However, this processing is pivotal for extracting detailed object insights in areas ranging from…

Computer Vision and Pattern Recognition · Computer Science 2024-05-14 A V Subramanyam, Niyati Singal, Vinay K Verma

UniVision: A Unified Framework for Vision-Centric 3D Perception

The past few years have witnessed the rapid development of vision-centric 3D perception in autonomous driving. Although the 3D perception models share many structural and conceptual similarities, there still exist gaps in their feature…

Computer Vision and Pattern Recognition · Computer Science 2024-01-17 Yu Hong, Qian Liu, Huayuan Cheng, Danjiao Ma, Hang Dai, Yu Wang, Guangzhi Cao, Yong Ding

Efficient Feature Transformations for Discriminative and Generative Continual Learning

As neural networks are increasingly being applied to real-world applications, mechanisms to address distributional shift and sequential task learning without forgetting are critical. Methods incorporating network expansion have shown…

Machine Learning · Computer Science 2021-03-26 Vinay Kumar Verma, Kevin J Liang, Nikhil Mehta, Piyush Rai, Lawrence Carin

EffiScene: Efficient Per-Pixel Rigidity Inference for Unsupervised Joint Learning of Optical Flow, Depth, Camera Pose and Motion Segmentation

This paper addresses the challenging unsupervised scene flow estimation problem by jointly learning four low-level vision sub-tasks: optical flow $\textbf{F}$, stereo-depth $\textbf{D}$, camera pose $\textbf{P}$ and motion segmentation…

Computer Vision and Pattern Recognition · Computer Science 2021-05-18 Yang Jiao, Trac D. Tran, Guangming Shi

CoopTrack: Exploring End-to-End Learning for Efficient Cooperative Sequential Perception

Cooperative perception aims to address the inherent limitations of single-vehicle autonomous driving systems through information exchange among multiple agents. Previous research has primarily focused on single-frame perception tasks.…

Computer Vision and Pattern Recognition · Computer Science 2025-07-28 Jiaru Zhong, Jiahao Wang, Jiahui Xu, Xiaofan Li, Zaiqing Nie, Haibao Yu

Efficiency 360: Efficient Vision Transformers

Transformers are widely used for solving tasks in natural language processing, computer vision, speech, and music domains. In this paper, we talk about the efficiency of transformers in terms of memory (the number of parameters),…

Computer Vision and Pattern Recognition · Computer Science 2023-02-27 Badri N. Patro, Vijay Srinivas Agneeswaran

EfficientTrain++: Generalized Curriculum Learning for Efficient Visual Backbone Training

The superior performance of modern visual backbones usually comes with a costly training procedure. We contribute to this issue by generalizing the idea of curriculum learning beyond its original formulation, i.e., training models using…

Computer Vision and Pattern Recognition · Computer Science 2024-05-15 Yulin Wang, Yang Yue, Rui Lu, Yizeng Han, Shiji Song, Gao Huang

Efficient Feature Extraction for High-resolution Video Frame Interpolation

Most deep learning methods for video frame interpolation consist of three main components: feature extraction, motion estimation, and image synthesis. Existing approaches are mainly distinguishable in terms of how these modules are…

Computer Vision and Pattern Recognition · Computer Science 2022-11-28 Moritz Nottebaum, Stefan Roth, Simone Schaub-Meyer

ePose: Let's Make EfficientPose More Generally Applicable

EfficientPose is an impressive 3D object detection model. It has been demonstrated to be quick, scalable, and accurate, especially when considering that it uses only RGB inputs. In this paper we try to improve on EfficientPose by giving it…

Computer Vision and Pattern Recognition · Computer Science 2021-12-01 Austin Lally, Robert Bain, Mazen Alotaibi

3D-FFS: Faster 3D object detection with Focused Frustum Search in sensor fusion based networks

In this work we propose 3D-FFS, a novel approach to make sensor fusion based 3D object detection networks significantly faster using a class of computationally inexpensive heuristics. Existing sensor fusion based networks generate 3D region…

Computer Vision and Pattern Recognition · Computer Science 2021-10-05 Aniruddha Ganguly, Tasin Ishmam, Khandker Aftarul Islam, Md Zahidur Rahman, Md. Shamsuzzoha Bayzid

EfficientViT: Multi-Scale Linear Attention for High-Resolution Dense Prediction

High-resolution dense prediction enables many appealing real-world applications, such as computational photography, autonomous driving, etc. However, the vast computational cost makes deploying state-of-the-art high-resolution dense…

Computer Vision and Pattern Recognition · Computer Science 2024-02-07 Han Cai, Junyan Li, Muyan Hu, Chuang Gan, Song Han

Learned Image Compression for Machine Perception

Recent work has shown that learned image compression strategies can outperform standard hand-crafted compression algorithms that have been developed over decades of intensive research on the rate-distortion trade-off. With growing…

Image and Video Processing · Electrical Eng. & Systems 2021-11-04 Felipe Codevilla, Jean Gabriel Simard, Ross Goroshin, Chris Pal

Visual Perception Engine: Fast and Flexible Multi-Head Inference for Robotic Vision Tasks

Deploying multiple machine learning models on resource-constrained robotic platforms for different perception tasks often results in redundant computations, large memory footprints, and complex integration challenges. In response, this work…

Robotics · Computer Science 2025-08-19 Jakub Łucki, Jonathan Becktor, Georgios Georgakis, Rob Royce, Shehryar Khattak

Dynamic Proposals for Efficient Object Detection

Object detection is a basic computer vision task to localize and categorize objects in a given image. Most state-of-the-art detection methods utilize a fixed number of proposals as an intermediate representation of object candidates, which…

Computer Vision and Pattern Recognition · Computer Science 2022-07-13 Yiming Cui, Linjie Yang, Ding Liu

EfficientDepth: A Fast and Detail-Preserving Monocular Depth Estimation Model

Monocular depth estimation (MDE) plays a pivotal role in various computer vision applications, such as robotics, augmented reality, and autonomous driving. Despite recent advancements, existing methods often fail to meet key requirements…

Computer Vision and Pattern Recognition · Computer Science 2025-09-29 Andrii Litvynchuk, Ivan Livinsky, Anand Ravi, Nima Kalantari, Andrii Tsarov

Universal Instance Perception as Object Discovery and Retrieval

All instance perception tasks aim at finding certain objects specified by some queries such as category names, language expressions, and target annotations, but this complete field has been split into multiple independent subtasks. In this…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Bin Yan, Yi Jiang, Jiannan Wu, Dong Wang, Ping Luo, Zehuan Yuan, Huchuan Lu

Beyond Discrete Selection: Continuous Embedding Space Optimization for Generative Feature Selection

The goal of Feature Selection - comprising filter, wrapper, and embedded approaches - is to find the optimal feature subset for designated downstream tasks. Nevertheless, current feature selection methods are limited by: 1) the selection…

Machine Learning · Computer Science 2023-09-18 Meng Xiao, Dongjie Wang, Min Wu, Pengfei Wang, Yuanchun Zhou, Yanjie Fu

EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones

The superior performance of modern deep networks usually comes with a costly training procedure. This paper presents a new curriculum learning approach for the efficient training of visual backbones (e.g., vision Transformers). Our work is…

Computer Vision and Pattern Recognition · Computer Science 2023-08-17 Yulin Wang, Yang Yue, Rui Lu, Tianjiao Liu, Zhao Zhong, Shiji Song, Gao Huang

ExtPerFC: An Efficient 2D and 3D Perception Hardware-Software Framework for Mobile Cobot

As the reliability of the robot's perception correlates with the number of integrated sensing modalities to tackle uncertainty, a practical solution to manage these sensors from different computers, operate them simultaneously, and maintain…

Robotics · Computer Science 2023-06-09 Tuan Dang, Khang Nguyen, Manfred Huber