Related papers: Deep Tensor Encoding

End-to-end Learning of a Fisher Vector Encoding for Part Features in Fine-grained Recognition

Part-based approaches for fine-grained recognition do not show the expected performance gain over global methods, although explicitly focusing on small details that are relevant for distinguishing highly similar classes. We assume that…

Computer Vision and Pattern Recognition · Computer Science 2023-07-31 Dimitri Korsch , Paul Bodesheim , Joachim Denzler

Deep TEN: Texture Encoding Network

We propose a Deep Texture Encoding Network (Deep-TEN) with a novel Encoding Layer integrated on top of convolutional layers, which ports the entire dictionary learning and encoding pipeline into a single model. Current methods build from…

Computer Vision and Pattern Recognition · Computer Science 2016-12-12 Hang Zhang , Jia Xue , Kristin Dana

Compositional Factorization of Visual Scenes with Convolutional Sparse Coding and Resonator Networks

We propose a system for visual scene analysis and recognition based on encoding the sparse, latent feature-representation of an image into a high-dimensional vector that is subsequently factorized to parse scene content. The sparse feature…

Computer Vision and Pattern Recognition · Computer Science 2024-07-01 Christopher J. Kymn , Sonia Mazelet , Annabel Ng , Denis Kleyko , Bruno A. Olshausen

Encoding High Dimensional Local Features by Sparse Coding Based Fisher Vectors

Deriving from the gradient vector of a generative model of local features, Fisher vector coding (FVC) has been identified as an effective coding method for image classification. Most, if not all, % FVC implementations employ the Gaussian…

Computer Vision and Pattern Recognition · Computer Science 2014-11-25 Lingqiao Liu , Chunhua Shen , Lei Wang , Anton van den Hengel , Chao Wang

Deep FisherNet for Object Classification

Despite the great success of convolutional neural networks (CNN) for the image classification task on datasets like Cifar and ImageNet, CNN's representation power is still somewhat limited in dealing with object images that have large…

Computer Vision and Pattern Recognition · Computer Science 2016-08-02 Peng Tang , Xinggang Wang , Baoguang Shi , Xiang Bai , Wenyu Liu , Zhuowen Tu

Learnable Fourier Features for Multi-Dimensional Spatial Positional Encoding

Attentional mechanisms are order-invariant. Positional encoding is a crucial component to allow attention-based deep model architectures such as Transformer to address sequences or images where the position of information matters. In this…

Machine Learning · Computer Science 2021-11-10 Yang Li , Si Si , Gang Li , Cho-Jui Hsieh , Samy Bengio

Probabilistic Tracking with Deep Factors

In many applications of computer vision it is important to accurately estimate the trajectory of an object over time by fusing data from a number of sources, of which 2D and 3D imagery is only one. In this paper, we show how to use a deep…

Computer Vision and Pattern Recognition · Computer Science 2021-12-06 Fan Jiang , Andrew Marmon , Ildebrando De Courten , Marc Rasi , Frank Dellaert

Score Function Features for Discriminative Learning: Matrix and Tensor Framework

Feature learning forms the cornerstone for tackling challenging learning problems in domains such as speech, computer vision and natural language processing. In this paper, we consider a novel class of matrix and tensor-valued features,…

Machine Learning · Computer Science 2014-12-12 Majid Janzamin , Hanie Sedghi , Anima Anandkumar

Compressive Sensing with Tensorized Autoencoder

Deep networks can be trained to map images into a low-dimensional latent space. In many cases, different images in a collection are articulated versions of one another; for example, same object with different lighting, background, or pose.…

Computer Vision and Pattern Recognition · Computer Science 2023-03-14 Rakib Hyder , M. Salman Asif

Discriminative feature encoding for intrinsic image decomposition

Intrinsic image decomposition is an important and long-standing computer vision problem. Given an input image, recovering the physical scene properties is ill-posed. Several physically motivated priors have been used to restrict the…

Computer Vision and Pattern Recognition · Computer Science 2022-09-27 Zongji Wang , Yunfei Liu , Feng Lu

On Surgical Fine-tuning for Language Encoders

Fine-tuning all the layers of a pre-trained neural language encoder (either using all the parameters or using parameter-efficient methods) is often the de-facto way of adapting it to a new task. We show evidence that for different…

Computation and Language · Computer Science 2023-10-27 Abhilasha Lodha , Gayatri Belapurkar , Saloni Chalkapurkar , Yuanming Tao , Reshmi Ghosh , Samyadeep Basu , Dmitrii Petrov , Soundararajan Srinivasan

Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features

Text contained in an image carries high-level semantics that can be exploited to achieve richer image understanding. In particular, the mere presence of text provides strong guiding content that should be employed to tackle a diversity of…

Computer Vision and Pattern Recognition · Computer Science 2020-01-15 Andres Mafla , Sounak Dey , Ali Furkan Biten , Lluis Gomez , Dimosthenis Karatzas

Score Function Features for Discriminative Learning

Feature learning forms the cornerstone for tackling challenging learning problems in domains such as speech, computer vision and natural language processing. In this paper, we consider a novel class of matrix and tensor-valued features,…

Machine Learning · Computer Science 2015-04-21 Majid Janzamin , Hanie Sedghi , Anima Anandkumar

Deep Fishing: Gradient Features from Deep Nets

Convolutional Networks (ConvNets) have recently improved image recognition performance thanks to end-to-end learning of deep feed-forward models from raw pixels. Deep learning is a marked departure from the previous state of the art, the…

Computer Vision and Pattern Recognition · Computer Science 2015-07-24 Albert Gordo , Adrien Gaidon , Florent Perronnin

Deep Feature Embedding for Tabular Data

Tabular data learning has extensive applications in deep learning but its existing embedding techniques are limited in numerical and categorical features such as the inability to capture complex relationships and engineering. This paper…

Machine Learning · Computer Science 2024-09-02 Yuqian Wu , Hengyi Luo , Raymond S. T. Lee

Learning Mixtures of Separable Dictionaries for Tensor Data: Analysis and Algorithms

This work addresses the problem of learning sparse representations of tensor data using structured dictionary learning. It proposes learning a mixture of separable dictionaries to better capture the structure of tensor data by generalizing…

Machine Learning · Computer Science 2020-06-16 Mohsen Ghassemi , Zahra Shakeri , Anand D. Sarwate , Waheed U. Bajwa

Structural Deep Encoding for Table Question Answering

Although Transformers-based architectures excel at processing textual information, their naive adaptation for tabular data often involves flattening the table structure. This simplification can lead to the loss of essential…

Computation and Language · Computer Science 2025-03-04 Raphaël Mouravieff , Benjamin Piwowarski , Sylvain Lamprier

Tensor Methods in Computer Vision and Deep Learning

Tensors, or multidimensional arrays, are data structures that can naturally represent visual data of multiple dimensions. Inherently able to efficiently capture structured, latent semantic spaces and high-order interactions, tensors have a…

Computer Vision and Pattern Recognition · Computer Science 2021-07-09 Yannis Panagakis , Jean Kossaifi , Grigorios G. Chrysos , James Oldfield , Mihalis A. Nicolaou , Anima Anandkumar , Stefanos Zafeiriou

Deep learning based dictionary learning and tomographic image reconstruction

This work presents an approach for image reconstruction in clinical low-dose tomography that combines principles from sparse signal processing with ideas from deep learning. First, we describe sparse signal representation in terms of…

Machine Learning · Statistics 2023-11-27 Jevgenija Rudzusika , Thomas Koehler , Ozan Öktem

Dictionary learning for clustering on hyperspectral images

Dictionary learning and sparse coding have been widely studied as mechanisms for unsupervised feature learning. Unsupervised learning could bring enormous benefit to the processing of hyperspectral images and to other remote sensing data…

Image and Video Processing · Electrical Eng. & Systems 2022-02-03 Joshua Bruton , Hairong Wang