Related papers: Compositional Model based Fisher Vector Coding for…

Encoding High Dimensional Local Features by Sparse Coding Based Fisher Vectors

Deriving from the gradient vector of a generative model of local features, Fisher vector coding (FVC) has been identified as an effective coding method for image classification. Most, if not all, % FVC implementations employ the Gaussian…

Computer Vision and Pattern Recognition · Computer Science 2014-11-25 Lingqiao Liu , Chunhua Shen , Lei Wang , Anton van den Hengel , Chao Wang

Probing the Intra-Component Correlations within Fisher Vector for Material Classification

Fisher vector (FV) has become a popular image representation. One notable underlying assumption of the FV framework is that local descriptors are well decorrelated within each cluster so that the covariance matrix for each Gaussian can be…

Computer Vision and Pattern Recognition · Computer Science 2016-04-18 Xiaopeng Hong , Xianbiao Qi , Guoying Zhao , Matti Pietikäinen

Fisher Vectors Derived from Hybrid Gaussian-Laplacian Mixture Models for Image Annotation

In the traditional object recognition pipeline, descriptors are densely sampled over an image, pooled into a high dimensional non-linear representation and then passed to a classifier. In recent years, Fisher Vectors have proven empirically…

Computer Vision and Pattern Recognition · Computer Science 2015-01-27 Benjamin Klein , Guy Lev , Gil Sadeh , Lior Wolf

Efficient Image Categorization with Sparse Fisher Vector

In object recognition, Fisher vector (FV) representation is one of the state-of-art image representations ways at the expense of dense, high dimensional features and increased computation time. A simplification of FV is attractive, so we…

Computer Vision and Pattern Recognition · Computer Science 2014-10-16 Xiankai Lu , Zheng Fang , Tao Xu , Haiting Zhang , Hongya Tuo

Deep neural networks with Fisher vector encoding for medical image classification

Orderless encoding methods have shown to improve Convolutional Neural Networks (CNNs) for image classification in the context of limited availability of data. Additionally, hybrid CNN + Vision Transformers (ViT) models have been recently…

Computer Vision and Pattern Recognition · Computer Science 2026-05-05 Lucas O. Lyra , Antonio E. Fabris , Joao B. Florindo

Deep FisherNet for Object Classification

Despite the great success of convolutional neural networks (CNN) for the image classification task on datasets like Cifar and ImageNet, CNN's representation power is still somewhat limited in dealing with object images that have large…

Computer Vision and Pattern Recognition · Computer Science 2016-08-02 Peng Tang , Xinggang Wang , Baoguang Shi , Xiang Bai , Wenyu Liu , Zhuowen Tu

Deep Quantization: Encoding Convolutional Activations with Deep Generative Model

Deep convolutional neural networks (CNNs) have proven highly effective for visual recognition, where learning a universal representation from activations of convolutional layer plays a fundamental problem. In this paper, we present Fisher…

Computer Vision and Pattern Recognition · Computer Science 2016-11-30 Zhaofan Qiu , Ting Yao , Tao Mei

FVC: A New Framework towards Deep Video Compression in Feature Space

Learning based video compression attracts increasing attention in the past few years. The previous hybrid coding approaches rely on pixel space operations to reduce spatial and temporal redundancy, which may suffer from inaccurate motion…

Image and Video Processing · Electrical Eng. & Systems 2021-08-24 Zhihao Hu , Guo Lu , Dong Xu

Gaussian Mixture Vector Quantization with Aggregated Categorical Posterior

The vector quantization is a widely used method to map continuous representation to discrete space and has important application in tokenization for generative mode, bottlenecking information and many other tasks in machine learning. Vector…

Machine Learning · Computer Science 2024-10-15 Mingyuan Yan , Jiawei Wu , Rushi Shah , Dianbo Liu

A Factorial Mixture Prior for Compositional Deep Generative Models

We assume that a high-dimensional datum, like an image, is a compositional expression of a set of properties, with a complicated non-linear relationship between the datum and its properties. This paper proposes a factorial mixture prior for…

Machine Learning · Statistics 2018-12-19 Ulrich Paquet , Sumedh K. Ghaisas , Olivier Tieleman

Compositional Coding for Collaborative Filtering

Efficiency is crucial to the online recommender systems. Representing users and items as binary vectors for Collaborative Filtering (CF) can achieve fast user-item affinity computation in the Hamming space, in recent years, we have…

Information Retrieval · Computer Science 2019-05-10 Chenghao Liu , Tao Lu , Xin Wang , Zhiyong Cheng , Jianling Sun , Steven C. H. Hoi

Saccadic Vision for Fine-Grained Visual Classification

Fine-grained visual classification (FGVC) requires distinguishing between visually similar categories through subtle, localized features - a task that remains challenging due to high intra-class variability and limited inter-class…

Computer Vision and Pattern Recognition · Computer Science 2025-09-22 Johann Schmidt , Sebastian Stober , Joachim Denzler , Paul Bodesheim

A Compositional Feature Embedding and Similarity Metric for Ultra-Fine-Grained Visual Categorization

Fine-grained visual categorization (FGVC), which aims at classifying objects with small inter-class variances, has been significantly advanced in recent years. However, ultra-fine-grained visual categorization (ultra-FGVC), which targets at…

Computer Vision and Pattern Recognition · Computer Science 2021-10-28 Yajie Sun , Miaohua Zhang , Xiaohan Yu , Yi Liao , Yongsheng Gao

Controllable Generative Video Compression

Perceptual video compression adopts generative video modeling to improve perceptual realism but frequently sacrifices signal fidelity, diverging from the goal of video compression to faithfully reproduce visual signal. To alleviate the…

Computer Vision and Pattern Recognition · Computer Science 2026-04-09 Ding Ding , Daowen Li , Ying Chen , Yixin Gao , Ruixiao Dong , Kai Li , Li Li

Fast forward feature selection for the nonlinear classification of hyperspectral images

A fast forward feature selection algorithm is presented in this paper. It is based on a Gaussian mixture model (GMM) classifier. GMM are used for classifying hyperspectral images. The algorithm selects iteratively spectral features that…

Computer Vision and Pattern Recognition · Computer Science 2015-01-06 Mathieu Fauvel , Clement Dechesne , Anthony Zullo , Frédéric Ferraty

End-to-end Learning of a Fisher Vector Encoding for Part Features in Fine-grained Recognition

Part-based approaches for fine-grained recognition do not show the expected performance gain over global methods, although explicitly focusing on small details that are relevant for distinguishing highly similar classes. We assume that…

Computer Vision and Pattern Recognition · Computer Science 2023-07-31 Dimitri Korsch , Paul Bodesheim , Joachim Denzler

FlashGMM: Fast Gaussian Mixture Entropy Model for Learned Image Compression

High-performance learned image compression codecs require flexible probability models to fit latent representations. Gaussian Mixture Models (GMMs) were proposed to satisfy this demand, but suffer from a significant runtime performance…

Image and Video Processing · Electrical Eng. & Systems 2025-09-24 Shimon Murai , Fangzheng Lin , Jiro Katto

Gaussian Approximation of Collective Graphical Models

The Collective Graphical Model (CGM) models a population of independent and identically distributed individuals when only collective statistics (i.e., counts of individuals) are observed. Exact inference in CGMs is intractable, and previous…

Machine Learning · Computer Science 2014-05-21 Li-Ping Liu , Daniel Sheldon , Thomas G. Dietterich

Compressive Sensing via Low-Rank Gaussian Mixture Models

We develop a new compressive sensing (CS) inversion algorithm by utilizing the Gaussian mixture model (GMM). While the compressive sensing is performed globally on the entire image as implemented in our lensless camera, a low-rank GMM is…

Machine Learning · Statistics 2015-08-28 Xin Yuan , Hong Jiang , Gang Huang , Paul A. Wilford

Gaussian Mixture Models with Component Means Constrained in Pre-selected Subspaces

We investigate a Gaussian mixture model (GMM) with component means constrained in a pre-selected subspace. Applications to classification and clustering are explored. An EM-type estimation algorithm is derived. We prove that the subspace…

Machine Learning · Statistics 2015-08-27 Mu Qiao , Jia Li