English
Related papers

Related papers: Visual Accommodation: Rethinking Image Scale as a …

200 papers

We propose a Dynamic Scale Training paradigm (abbreviated as DST) to mitigate scale variation challenge in object detection. Previous strategies like image pyramid, multi-scale training, and their variants are aiming at preparing…

Computer Vision and Pattern Recognition · Computer Science 2021-03-16 Yukang Chen , Peizhen Zhang , Zeming Li , Yanwei Li , Xiangyu Zhang , Lu Qi , Jian Sun , Jiaya Jia

Recently, object detection models have witnessed notable performance improvements, particularly with transformer-based models. However, new objects frequently appear in the real world, requiring detection models to continually learn without…

Computer Vision and Pattern Recognition · Computer Science 2025-06-11 Duc Thanh Pham , Hong Dang Nguyen , Nhat Minh Nguyen Quoc , Linh Ngo Van , Sang Dinh Viet , Duc Anh Nguyen

This paper introduces a new fundamental characteristic, \ie, the dynamic range, from real-world metric tools to deep visual recognition. In metrology, the dynamic range is a basic quality of a metric tool, indicating its flexibility to…

Computer Vision and Pattern Recognition · Computer Science 2021-03-23 Yifan Sun , Yuke Zhu , Yuhan Zhang , Pengkun Zheng , Xi Qiu , Chi Zhang , Yichen Wei

It is a common practice to exploit pyramidal feature representation to tackle the problem of scale variation in object instances. However, most of them still predict the objects in a certain range of scales based solely or mainly on a…

Computer Vision and Pattern Recognition · Computer Science 2020-12-10 Zehui Gong , Dong Li

Given the variety of the visual world there is not one true scale for recognition: objects may appear at drastically different sizes across the visual field. Rather than enumerate variations across filter channels or pyramid levels, dynamic…

Computer Vision and Pattern Recognition · Computer Science 2019-08-09 Dequan Wang , Evan Shelhamer , Bruno Olshausen , Trevor Darrell

Object detection in aerial imagery presents a significant challenge due to large scale variations among objects. This paper proposes an evolutionary reinforcement learning agent, integrated within a coarse-to-fine object detection…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Jialu Zhang , Xiaoying Yang , Wentao He , Jianfeng Ren , Qian Zhang , Titian Zhao , Ruibin Bai , Xiangjian He , Jiang Liu

Scale variation has been a challenge from traditional to modern approaches in computer vision. Most solutions to scale issues have a similar theme: a set of intuitive and manually designed policies that are generic and fixed (e.g. SIFT or…

Computer Vision and Pattern Recognition · Computer Science 2019-04-10 Huiyu Wang , Aniruddha Kembhavi , Ali Farhadi , Alan Yuille , Mohammad Rastegari

Detection Transformer (DETR) has redefined object detection by casting it as a set prediction task within an end-to-end framework. Despite its elegance, DETR and its variants still rely on fixed learnable queries and suffer from severe…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Zhengjian Kang , Jun Zhuang , Kangtong Mo , Qi Chen , Rui Liu , Ye Zhang

In the domain of moment retrieval, accurately identifying temporal segments within videos based on natural language queries remains challenging. Traditional methods often employ pre-trained models that struggle with fine-grained information…

Computer Vision and Pattern Recognition · Computer Science 2025-12-02 Haojian Huang , Kaijing Ma , Jin Chen , Haodong Chen , Zhou Wu , Xianghao Zang , Han Fang , Chao Ban , Hao Sun , Mulin Chen , Zhongjiang He

This paper revisits the problem of orientation estimation for rigid bodies through a novel framework based on scalar measurements. Unlike traditional vector-based methods, the proposed approach enables selective utilization of only the…

Systems and Control · Electrical Eng. & Systems 2025-06-10 Hassan Alnahhal , Sifeddine Benahmed , Soulaimane Berkane , Tarel Hamel

Selectivity estimation aims at estimating the number of database objects that satisfy a selection criterion. Answering this problem accurately and efficiently is essential to many applications, such as density estimation, outlier detection,…

Databases · Computer Science 2021-05-28 Yaoshu Wang , Chuan Xiao , Jianbin Qin , Rui Mao , Onizuka Makoto , Wei Wang , Rui Zhang , Yoshiharu Ishikawa

We address the problem of class incremental learning, which is a core step towards achieving adaptive vision intelligence. In particular, we consider the task setting of incremental learning with limited memory and aim to achieve better…

Computer Vision and Pattern Recognition · Computer Science 2021-04-01 Shipeng Yan , Jiangwei Xie , Xuming He

For object detection detectors, enhancing model performance hinges on the ability to simultaneously consider inconsistencies across tasks and focus on difficult-to-train samples. Achieving this necessitates incorporating information from…

Computer Vision and Pattern Recognition · Computer Science 2024-09-02 Yanquan Huang , Liu Wei Zhen , Yun Hao , Mengyuan Zhang , Qingyao Wu , Zikun Deng , Xueming Liu , Hong Deng

High-dimensional, heterogeneous data with complex feature interactions pose significant challenges for traditional predictive modeling approaches. While Projection to Latent Structures (PLS) remains a popular technique, it struggles to…

Machine Learning · Computer Science 2025-10-21 Farwa Abbas , Hussain Ahmad , Claudia Szabo

In this paper, we seek to develop a versatile test-time adaptation (TTA) objective for a variety of tasks - classification and regression across image-, object-, and pixel-level predictions. We achieve this through a self-bootstrapping…

Computer Vision and Pattern Recognition · Computer Science 2025-04-14 Shuaicheng Niu , Guohao Chen , Peilin Zhao , Tianyi Wang , Pengcheng Wu , Zhiqi Shen

Scale variation is a deep-rooted problem in object counting, which has not been effectively addressed by existing scale-aware algorithms. An important factor is that they typically involve cooperative learning across multi-resolutions,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Tao Han , Lei Bai , Lingbo Liu , Wanli Ouyang

We propose a one-stage framework for real-time multi-person 3D human mesh estimation from a single RGB image. While current one-stage methods, which follow a DETR-style pipeline, achieve state-of-the-art (SOTA) performance with…

Computer Vision and Pattern Recognition · Computer Science 2025-04-16 Chi Su , Xiaoxuan Ma , Jiajun Su , Yizhou Wang

Geometric variations of objects, which do not modify the object class, pose a major challenge for object recognition. These variations could be rigid as well as non-rigid transformations. In this paper, we design a framework for training…

Machine Learning · Statistics 2017-12-20 Jiajun Shen , Yali Amit

LiDAR 3D object detection models are inevitably biased towards their training dataset. The detector clearly exhibits this bias when employed on a target dataset, particularly towards object sizes. However, object sizes vary heavily between…

Computer Vision and Pattern Recognition · Computer Science 2022-10-18 Dušan Malić , Christian Fruhwirth-Reisinger , Horst Possegger , Horst Bischof

In this paper, we introduce a shape-based, time-scale invariant feature descriptor for 1-D sensor signals. The time-scale invariance of the feature allows us to use feature from one training event to describe events of the same semantic…

Multimedia · Computer Science 2011-05-31 Jierui Xie , Mandis S. Beigi
‹ Prev 1 2 3 10 Next ›