Related papers: Coding local and global binary visual features ext…

Hybrid coding of visual content and local image features

Distributed visual analysis applications, such as mobile visual search or Visual Sensor Networks (VSNs), require the transmission of visual content on a bandwidth-limited network, from a peripheral node to a processing unit. Traditionally, a…

Multimedia · Computer Science 2016-11-17 Luca Baroffio, Matteo Cesana, Alessandro Redondi, Marco Tagliasacchi, Stefano Tubaro

Towards Real-World Visual Tracking with Temporal Contexts

Visual tracking has made significant improvements in the past few decades. Most existing state-of-the-art trackers 1) merely aim for performance in ideal conditions while overlooking the real-world conditions; 2) adopt the…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Ziang Cao, Ziyuan Huang, Liang Pan, Shiwei Zhang, Ziwei Liu, Changhong Fu

Large-margin Learning of Compact Binary Image Encodings

The use of high-dimensional features has become a normal practice in many computer vision applications. The large dimension of these features is a limiting factor upon the number of data points which may be effectively stored and processed,…

Computer Vision and Pattern Recognition · Computer Science 2015-06-18 Sakrapee Paisitkriangkrai, Chunhua Shen, Anton van den Hengel

BTEL: A Binary Tree Encoding Approach for Visual Localization

Visual localization algorithms have achieved significant improvements in performance thanks to recent advances in camera technology and vision-based techniques. However, there remains one critical caveat: all current approaches that are…

Computer Vision and Pattern Recognition · Computer Science 2019-07-01 Huu Le, Tuan Hoang, Michael Milford

ATC: an Advanced Tucker Compression library for multidimensional data

We present ATC, a C++ library for advanced Tucker-based lossy compression of dense multidimensional numerical data in a shared-memory parallel setting, based on the sequentially truncated higher-order singular value decomposition (ST-HOSVD)…

Mathematical Software · Computer Science 2024-07-02 Wouter Baert, Nick Vannieuwenhoven

Visual Encoding and Debiasing for CTR Prediction

Extracting expressive visual features is crucial for accurate Click-Through-Rate (CTR) prediction in visual search advertising systems. Current commercial systems use off-the-shelf visual encoders to facilitate fast online service. However,…

Information Retrieval · Computer Science 2022-05-10 Si Chen, Chen Lin, Wanxian Guan, Jiayi Wei, Xingyuan Bu, He Guo, Hui Li, Xubin Li, Jian Xu, Bo Zheng

Video Coding for Machine: Compact Visual Representation Compression for Intelligent Collaborative Analytics

Video Coding for Machines (VCM) is committed to bridging to an extent separate research tracks of video/image compression and feature compression, and attempts to optimize compactness and efficiency jointly from a unified perspective of…

Computer Vision and Pattern Recognition · Computer Science 2021-10-19 Wenhan Yang, Haofeng Huang, Yueyu Hu, Ling-Yu Duan, Jiaying Liu

Auto-Encoding Twin-Bottleneck Hashing

Conventional unsupervised hashing methods usually take advantage of similarity graphs, which are either pre-computed in the high-dimensional space or obtained from random anchor points. On the one hand, existing methods uncouple the…

Computer Vision and Pattern Recognition · Computer Science 2020-03-17 Yuming Shen, Jie Qin, Jiaxin Chen, Mengyang Yu, Li Liu, Fan Zhu, Fumin Shen, Ling Shao

Simultaneous Feature Aggregating and Hashing for Compact Binary Code Learning

Representing images by compact hash codes is an attractive approach for large-scale content-based image retrieval. In most state-of-the-art hashing-based image retrieval systems, for each image, local descriptors are first aggregated as a…

Computer Vision and Pattern Recognition · Computer Science 2019-09-04 Thanh-Toan Do, Khoa Le, Tuan Hoang, Huu Le, Tam V. Nguyen, Ngai-Man Cheung

Enhancing Learned Image Compression via Cross Window-based Attention

In recent years, learned image compression methods have demonstrated superior rate-distortion performance compared to traditional image compression methods. Recent methods utilize convolutional neural networks (CNN), variational…

Computer Vision and Pattern Recognition · Computer Science 2025-02-14 Priyanka Mudgal, Feng Liu

Quantised Global Autoencoder: A Holistic Approach to Representing Visual Data

In quantised autoencoders, images are usually split into local patches, each encoded by one token. This representation is redundant in the sense that the same number of tokens is spent per region, regardless of the visual information…

Computer Vision and Pattern Recognition · Computer Science 2024-08-06 Tim Elsner, Paula Usinger, Victor Czech, Gregor Kobsik, Yanjiang He, Isaak Lim, Leif Kobbelt

Optimized Feature Space Learning for Generating Efficient Binary Codes for Image Retrieval

In this paper we propose an approach for learning low dimensional optimized feature space with minimum intra-class variance and maximum inter-class variance. We address the problem of high-dimensionality of feature vectors extracted from…

Image and Video Processing · Electrical Eng. & Systems 2020-01-31 Abin Jose, Erik Stefan Ottlik, Christian Rohlfing, Jens-Rainer Ohm

Revisit Visual Representation in Analytics Taxonomy: A Compression Perspective

Visual analytics have played an increasingly critical role in the Internet of Things, where massive visual signals have to be compressed and fed into machines. But facing such big data and constrained bandwidth capacity, existing…

Computer Vision and Pattern Recognition · Computer Science 2021-06-17 Yueyu Hu, Wenhan Yang, Haofeng Huang, Jiaying Liu

An Efficient Token Compression Framework for Visual Object Tracking

Refining visual representations by eliminating their internal feature-level redundancy is crucial for simultaneously optimizing the performance and computational cost of models in visual tracking. To enhance their performance, many…

Computer Vision and Pattern Recognition · Computer Science 2026-05-12 Weijing Wu, Qihua Liang, Bineng Zhong, Haiying Xia, Zhiyi Mo, Shuxiang Song

CTA-Net: A CNN-Transformer Aggregation Network for Improving Multi-Scale Feature Extraction

Convolutional neural networks (CNNs) and vision transformers (ViTs) have become essential in computer vision for local and global feature extraction. However, aggregating these architectures in existing methods often results in…

Computer Vision and Pattern Recognition · Computer Science 2024-10-16 Chunlei Meng, Jiacheng Yang, Wei Lin, Bowen Liu, Hongda Zhang, Chun Ouyang, Zhongxue Gan

Content-aware Token Sharing for Efficient Semantic Segmentation with Vision Transformers

This paper introduces Content-aware Token Sharing (CTS), a token reduction approach that improves the computational efficiency of semantic segmentation networks that use Vision Transformers (ViTs). Existing works have proposed token…

Computer Vision and Pattern Recognition · Computer Science 2023-06-06 Chenyang Lu, Daan de Geus, Gijs Dubbelman

Context Adaptive Extended Chain Coding for Semantic Map Compression

Semantic maps are increasingly utilized in areas such as robotics, autonomous systems, and extended reality, motivating the investigation of efficient compression methods that preserve structured semantic information. This paper studies…

Image and Video Processing · Electrical Eng. & Systems 2026-03-30 Runyu Yang, Junqi Liao, Hyomin Choi, Fabien Racapé, Ivan V. Bajić

Data coding, as a building block of several image processing algorithms, has received great attention recently. Indeed, the importance of the locality assumption in coding approaches is studied in numerous works and several methods are…

Computer Vision and Pattern Recognition · Computer Science 2014-03-06 Amirreza Shaban, Hamid R. Rabiee, Mahyar Najibi

Graph-Collaborated Auto-Encoder Hashing for Multi-view Binary Clustering

Unsupervised hashing methods have attracted widespread attention with the explosive growth of large-scale data, which can greatly reduce storage and computation by learning compact binary codes. Existing unsupervised hashing methods attempt…

Computer Vision and Pattern Recognition · Computer Science 2023-01-09 Huibing Wang, Mingze Yao, Guangqi Jiang, Zetian Mi, Xianping Fu

Accelerated Event-Based Feature Detection and Compression for Surveillance Video Systems

The strong temporal consistency of surveillance video enables compelling compression performance with traditional methods, but downstream vision applications operate on decoded image frames with a high data rate. Since it is not…

Multimedia · Computer Science 2024-02-09 Andrew C. Freeman, Ketan Mayer-Patel, Montek Singh