English
Related papers

Related papers: Deep Tensor Encoding

200 papers

Part-based approaches for fine-grained recognition do not show the expected performance gain over global methods, although explicitly focusing on small details that are relevant for distinguishing highly similar classes. We assume that…

Computer Vision and Pattern Recognition · Computer Science 2023-07-31 Dimitri Korsch , Paul Bodesheim , Joachim Denzler

We propose a Deep Texture Encoding Network (Deep-TEN) with a novel Encoding Layer integrated on top of convolutional layers, which ports the entire dictionary learning and encoding pipeline into a single model. Current methods build from…

Computer Vision and Pattern Recognition · Computer Science 2016-12-12 Hang Zhang , Jia Xue , Kristin Dana

We propose a system for visual scene analysis and recognition based on encoding the sparse, latent feature-representation of an image into a high-dimensional vector that is subsequently factorized to parse scene content. The sparse feature…

Computer Vision and Pattern Recognition · Computer Science 2024-07-01 Christopher J. Kymn , Sonia Mazelet , Annabel Ng , Denis Kleyko , Bruno A. Olshausen

Deriving from the gradient vector of a generative model of local features, Fisher vector coding (FVC) has been identified as an effective coding method for image classification. Most, if not all, % FVC implementations employ the Gaussian…

Computer Vision and Pattern Recognition · Computer Science 2014-11-25 Lingqiao Liu , Chunhua Shen , Lei Wang , Anton van den Hengel , Chao Wang

Despite the great success of convolutional neural networks (CNN) for the image classification task on datasets like Cifar and ImageNet, CNN's representation power is still somewhat limited in dealing with object images that have large…

Computer Vision and Pattern Recognition · Computer Science 2016-08-02 Peng Tang , Xinggang Wang , Baoguang Shi , Xiang Bai , Wenyu Liu , Zhuowen Tu

Attentional mechanisms are order-invariant. Positional encoding is a crucial component to allow attention-based deep model architectures such as Transformer to address sequences or images where the position of information matters. In this…

Machine Learning · Computer Science 2021-11-10 Yang Li , Si Si , Gang Li , Cho-Jui Hsieh , Samy Bengio

In many applications of computer vision it is important to accurately estimate the trajectory of an object over time by fusing data from a number of sources, of which 2D and 3D imagery is only one. In this paper, we show how to use a deep…

Computer Vision and Pattern Recognition · Computer Science 2021-12-06 Fan Jiang , Andrew Marmon , Ildebrando De Courten , Marc Rasi , Frank Dellaert

Feature learning forms the cornerstone for tackling challenging learning problems in domains such as speech, computer vision and natural language processing. In this paper, we consider a novel class of matrix and tensor-valued features,…

Machine Learning · Computer Science 2014-12-12 Majid Janzamin , Hanie Sedghi , Anima Anandkumar

Deep networks can be trained to map images into a low-dimensional latent space. In many cases, different images in a collection are articulated versions of one another; for example, same object with different lighting, background, or pose.…

Computer Vision and Pattern Recognition · Computer Science 2023-03-14 Rakib Hyder , M. Salman Asif

Intrinsic image decomposition is an important and long-standing computer vision problem. Given an input image, recovering the physical scene properties is ill-posed. Several physically motivated priors have been used to restrict the…

Computer Vision and Pattern Recognition · Computer Science 2022-09-27 Zongji Wang , Yunfei Liu , Feng Lu

Fine-tuning all the layers of a pre-trained neural language encoder (either using all the parameters or using parameter-efficient methods) is often the de-facto way of adapting it to a new task. We show evidence that for different…

Text contained in an image carries high-level semantics that can be exploited to achieve richer image understanding. In particular, the mere presence of text provides strong guiding content that should be employed to tackle a diversity of…

Computer Vision and Pattern Recognition · Computer Science 2020-01-15 Andres Mafla , Sounak Dey , Ali Furkan Biten , Lluis Gomez , Dimosthenis Karatzas

Feature learning forms the cornerstone for tackling challenging learning problems in domains such as speech, computer vision and natural language processing. In this paper, we consider a novel class of matrix and tensor-valued features,…

Machine Learning · Computer Science 2015-04-21 Majid Janzamin , Hanie Sedghi , Anima Anandkumar

Convolutional Networks (ConvNets) have recently improved image recognition performance thanks to end-to-end learning of deep feed-forward models from raw pixels. Deep learning is a marked departure from the previous state of the art, the…

Computer Vision and Pattern Recognition · Computer Science 2015-07-24 Albert Gordo , Adrien Gaidon , Florent Perronnin

Tabular data learning has extensive applications in deep learning but its existing embedding techniques are limited in numerical and categorical features such as the inability to capture complex relationships and engineering. This paper…

Machine Learning · Computer Science 2024-09-02 Yuqian Wu , Hengyi Luo , Raymond S. T. Lee

This work addresses the problem of learning sparse representations of tensor data using structured dictionary learning. It proposes learning a mixture of separable dictionaries to better capture the structure of tensor data by generalizing…

Machine Learning · Computer Science 2020-06-16 Mohsen Ghassemi , Zahra Shakeri , Anand D. Sarwate , Waheed U. Bajwa

Although Transformers-based architectures excel at processing textual information, their naive adaptation for tabular data often involves flattening the table structure. This simplification can lead to the loss of essential…

Computation and Language · Computer Science 2025-03-04 Raphaël Mouravieff , Benjamin Piwowarski , Sylvain Lamprier

Tensors, or multidimensional arrays, are data structures that can naturally represent visual data of multiple dimensions. Inherently able to efficiently capture structured, latent semantic spaces and high-order interactions, tensors have a…

Computer Vision and Pattern Recognition · Computer Science 2021-07-09 Yannis Panagakis , Jean Kossaifi , Grigorios G. Chrysos , James Oldfield , Mihalis A. Nicolaou , Anima Anandkumar , Stefanos Zafeiriou

This work presents an approach for image reconstruction in clinical low-dose tomography that combines principles from sparse signal processing with ideas from deep learning. First, we describe sparse signal representation in terms of…

Machine Learning · Statistics 2023-11-27 Jevgenija Rudzusika , Thomas Koehler , Ozan Öktem

Dictionary learning and sparse coding have been widely studied as mechanisms for unsupervised feature learning. Unsupervised learning could bring enormous benefit to the processing of hyperspectral images and to other remote sensing data…

Image and Video Processing · Electrical Eng. & Systems 2022-02-03 Joshua Bruton , Hairong Wang
‹ Prev 1 2 3 10 Next ›