Related papers: Sparse Coding and Autoencoders

Subgradient Descent Learns Orthogonal Dictionaries

This paper concerns dictionary learning, i.e., sparse coding, a fundamental representation learning problem. We show that a subgradient descent algorithm, with random initialization, can provably recover orthogonal dictionaries on a natural…

Machine Learning · Computer Science 2019-07-02 Yu Bai , Qijia Jiang , Ju Sun

Pivotal Auto-Encoder via Self-Normalizing ReLU

Sparse auto-encoders are useful for extracting low-dimensional representations from high-dimensional data. However, their performance degrades sharply when the input noise at test time differs from the noise employed during training. This…

Machine Learning · Computer Science 2024-07-01 Nelson Goldenstein , Jeremias Sulam , Yaniv Romano

On the convergence of group-sparse autoencoders

Recent approaches in the theoretical analysis of model-based deep learning architectures have studied the convergence of gradient descent in shallow ReLU networks that arise from generative models whose hidden layers are sparse. Motivated…

Machine Learning · Computer Science 2022-01-24 Emmanouil Theodosis , Bahareh Tolooshams , Pranay Tankala , Abiy Tasissa , Demba Ba

Dictionary Learning and Tensor Decomposition via the Sum-of-Squares Method

We give a new approach to the dictionary learning (also known as "sparse coding") problem of recovering an unknown $n\times m$ matrix $A$ (for $m \geq n$) from examples of the form \[ y = Ax + e, \] where $x$ is a random vector in $\mathbb…

Data Structures and Algorithms · Computer Science 2014-11-11 Boaz Barak , Jonathan A. Kelner , David Steurer

Compression of Structured Data with Autoencoders: Provable Benefit of Nonlinearities and Depth

Autoencoders are a prominent model in many empirical branches of machine learning and lossy data compression. However, basic theoretical questions remain unanswered even in a shallow two-layer setting. In particular, to what degree does a…

Machine Learning · Computer Science 2024-02-08 Kevin Kögler , Alexander Shevchenko , Hamed Hassani , Marco Mondelli

Scaling and evaluating sparse autoencoders

Sparse autoencoders provide a promising unsupervised approach for extracting interpretable features from a language model by reconstructing activations from a sparse bottleneck layer. Since language models learn many concepts, autoencoders…

Machine Learning · Computer Science 2024-06-07 Leo Gao , Tom Dupré la Tour , Henk Tillman , Gabriel Goh , Rajan Troll , Alec Radford , Ilya Sutskever , Jan Leike , Jeffrey Wu

Deep Learning Optimization Using Self-Adaptive Weighted Auxiliary Variables

In this paper, we develop a new optimization framework for the least squares learning problem via fully connected neural networks or physics-informed neural networks. The gradient descent sometimes behaves inefficiently in deep learning…

Machine Learning · Computer Science 2025-05-01 Yaru Liu , Yiqi Gu , Michael K. Ng

Beyond L2-Loss Functions for Learning Sparse Models

Incorporating sparsity priors in learning tasks can give rise to simple, and interpretable models for complex high dimensional data. Sparse models have found widespread use in structure discovery, recovering data from corruptions, and a…

Machine Learning · Statistics 2014-03-27 Karthikeyan Natesan Ramamurthy , Aleksandr Y. Aravkin , Jayaraman J. Thiagarajan

Sparse Recovery and Dictionary Learning from Nonlinear Compressive Measurements

Sparse coding and dictionary learning are popular techniques for linear inverse problems such as denoising or inpainting. However in many cases, the measurement process is nonlinear, for example for clipped, quantized or 1-bit measurements.…

Signal Processing · Electrical Eng. & Systems 2020-01-08 Lucas Rencker , Francis Bach , Wenwu Wang , Mark D. Plumbley

Gradient Descent for One-Hidden-Layer Neural Networks: Polynomial Convergence and SQ Lower Bounds

We study the complexity of training neural network models with one hidden nonlinear activation layer and an output weighted sum layer. We analyze Gradient Descent applied to learning a bounded target function on $n$ real-valued inputs. We…

Machine Learning · Computer Science 2019-05-28 Santosh Vempala , John Wilmes

Learning Dictionaries with Bounded Self-Coherence

Sparse coding in learned dictionaries has been established as a successful approach for signal denoising, source separation and solving inverse problems in general. A dictionary learning method adapts an initial dictionary to a particular…

Machine Learning · Statistics 2012-10-18 Christian D. Sigg , Tomas Dikk , Joachim M. Buhmann

Towards Learning Sparsely Used Dictionaries with Arbitrary Supports

Dictionary learning is a popular approach for inferring a hidden basis or dictionary in which data has a sparse representation. Data generated from the dictionary A (an n by m matrix, with m > n in the over-complete setting) is given by Y =…

Machine Learning · Computer Science 2018-05-09 Pranjal Awasthi , Aravindan Vijayaraghavan

Autoencoders Learn Generative Linear Models

We provide a series of results for unsupervised learning with autoencoders. Specifically, we study shallow two-layer autoencoder architectures with shared weights. We focus on three generative models for data that are common in statistical…

Machine Learning · Statistics 2019-02-18 Thanh V. Nguyen , Raymond K. W. Wong , Chinmay Hegde

New Algorithms for Learning Incoherent and Overcomplete Dictionaries

In sparse recovery we are given a matrix $A$ (the dictionary) and a vector of the form $A X$ where $X$ is sparse, and the goal is to recover $X$. This is a central notion in signal processing, statistics and machine learning. But in…

Data Structures and Algorithms · Computer Science 2014-05-27 Sanjeev Arora , Rong Ge , Ankur Moitra

Deeply-Sparse Signal rePresentations ($\text{D}\text{S}^2\text{P}$)

A recent line of work shows that a deep neural network with ReLU nonlinearities arises from a finite sequence of cascaded sparse coding models, the outputs of which, except for the last element in the cascade, are sparse and unobservable.…

Signal Processing · Electrical Eng. & Systems 2020-04-27 Demba Ba

On Optimality Conditions for Auto-Encoder Signal Recovery

Auto-Encoders are unsupervised models that aim to learn patterns from observed data by minimizing a reconstruction cost. The useful representations learned are often found to be sparse and distributed. On the other hand, compressed sensing…

Machine Learning · Statistics 2017-07-14 Devansh Arpit , Yingbo Zhou , Hung Q. Ngo , Nils Napp , Venu Govindaraju

Features that Make a Difference: Leveraging Gradients for Improved Dictionary Learning

Sparse Autoencoders (SAEs) are a promising approach for extracting neural network representations by learning a sparse and overcomplete decomposition of the network's internal activations. However, SAEs are traditionally trained considering…

Machine Learning · Computer Science 2025-04-02 Jeffrey Olmo , Jared Wilson , Max Forsey , Bryce Hepner , Thomas Vin Howe , David Wingate

Dictionary Adaptation in Sparse Recovery Based on Different Types of Coherence

In sparse recovery, the unique sparsest solution to an under-determined system of linear equations is of main interest. This scheme is commonly proposed to be applied to signal acquisition. In most cases, the signals are not sparse…

Information Theory · Computer Science 2013-07-16 Henning Zörlein , Faisal Akram , Martin Bossert

Learning Retrieval Models with Sparse Autoencoders

Sparse autoencoders (SAEs) provide a powerful mechanism for decomposing the dense representations produced by Large Language Models (LLMs) into interpretable latent features. We posit that SAEs constitute a natural foundation for Learned…

Machine Learning · Computer Science 2026-03-17 Thibault Formal , Maxime Louis , Hervé Dejean , Stéphane Clinchant

Scalable Convolutional Dictionary Learning with Constrained Recurrent Sparse Auto-encoders

Given a convolutional dictionary underlying a set of observed signals, can a carefully designed auto-encoder recover the dictionary in the presence of noise? We introduce an auto-encoder architecture, termed constrained recurrent sparse…

Machine Learning · Computer Science 2018-07-13 Bahareh Tolooshams , Sourav Dey , Demba Ba