Related papers: Binary autoencoder with random binary weights

Switched linear encoding with rectified linear autoencoders

Several recent results in machine learning have established formal connections between autoencoders---artificial neural network models that attempt to reproduce their inputs---and other coding models like sparse coding and K-means. This…

Machine Learning · Computer Science 2013-01-22 Leif Johnson , Craig Corcoran

Zero-bias autoencoders and the benefits of co-adapting features

Regularized training of an autoencoder typically results in hidden unit biases that take on large negative values. We show that negative biases are a natural result of using a hidden layer whose responsibility is to both represent the input…

Machine Learning · Statistics 2015-04-09 Kishore Konda , Roland Memisevic , David Krueger

Binary Autoencoder for Mechanistic Interpretability of Large Language Models

Existing works are dedicated to untangling atomized numerical components (features) from the hidden states of Large Language Models (LLMs). However, they typically rely on autoencoders constrained by some training-time regularization on…

Machine Learning · Computer Science 2026-02-13 Hakaze Cho , Haolin Yang , Yanshu Li , Brian M. Kurkoski , Naoya Inoue

Binary Sparse Coding for Interpretability

Sparse autoencoders (SAEs) are used to decompose neural network activations into sparsely activating features, but many SAE features are only interpretable at high activation strengths. To address this issue we propose to use binary sparse…

Machine Learning · Computer Science 2025-10-01 Lucia Quirke , Stepan Shabalin , Nora Belrose

Optimal Binary Autoencoding with Pairwise Correlations

We formulate learning of a binary autoencoder as a biconvex optimization problem which learns from the pairwise correlations between encoded and decoded bits. Among all possible algorithms that use this information, ours finds the…

Machine Learning · Computer Science 2016-11-08 Akshay Balsubramani

Exponential Capacity in an Autoencoder Neural Network with a Hidden Layer

A fundamental aspect of limitations in learning any computation in neural architectures is characterizing their optimal capacities. An important, widely-used neural architecture is known as autoencoders where the network reconstructs the…

Neurons and Cognition · Quantitative Biology 2017-05-23 Alireza Alemi , Alia Abbara

On Optimality Conditions for Auto-Encoder Signal Recovery

Auto-Encoders are unsupervised models that aim to learn patterns from observed data by minimizing a reconstruction cost. The useful representations learned are often found to be sparse and distributed. On the other hand, compressed sensing…

Machine Learning · Statistics 2017-07-14 Devansh Arpit , Yingbo Zhou , Hung Q. Ngo , Nils Napp , Venu Govindaraju

k-Sparse Autoencoders

Recently, it has been observed that when representations are learnt in a way that encourages sparsity, improved performance is obtained on classification tasks. These methods involve combinations of activation functions, sampling steps and…

Machine Learning · Computer Science 2014-03-25 Alireza Makhzani , Brendan Frey

Deep Autoencoder Model Construction Based on Pytorch

This paper proposes a deep autoencoder model based on Pytorch. This algorithm introduces the idea of Pytorch into the auto-encoder, and randomly clears the input weights connected to the hidden layer neurons with a certain probability, so…

Machine Learning · Computer Science 2022-08-18 Junan Pan , Zhihao Zhao

Sparse Autoencoders Find Highly Interpretable Features in Language Models

One of the roadblocks to a better understanding of neural networks' internals is \textit{polysemanticity}, where neurons appear to activate in multiple, semantically distinct contexts. Polysemanticity prevents us from identifying concise,…

Machine Learning · Computer Science 2023-10-05 Hoagy Cunningham , Aidan Ewart , Logan Riggs , Robert Huben , Lee Sharkey

Finding Manifolds With Bilinear Autoencoders

Sparse autoencoders are a standard tool for uncovering interpretable latent representations in neural networks. Yet, their interpretation depends on the inputs, making their isolated study incomplete. Polynomials offer a solution; they…

Machine Learning · Computer Science 2025-10-21 Thomas Dooms , Ward Gauderis

Discriminative Recurrent Sparse Auto-Encoders

We present the discriminative recurrent sparse auto-encoder model, comprising a recurrent encoder of rectified linear units, unrolled for a fixed number of iterations, and connected to two linear decoders that reconstruct the input and…

Machine Learning · Computer Science 2013-03-20 Jason Tyler Rolfe , Yann LeCun

Stacked autoencoders based machine learning for noise reduction and signal reconstruction in geophysical data

Autoencoders are neural network formulations where the input and output of the network are identical and the goal is to identify the hidden representation in the provided datasets. Generally, autoencoders project the data nonlinearly onto a…

Signal Processing · Electrical Eng. & Systems 2019-07-10 Debjani Bhowick , Deepak K. Gupta , Saumen Maiti , Uma Shankar

Rank Ordered Autoencoders

A new method for the unsupervised learning of sparse representations using autoencoders is proposed and implemented by ordering the output of the hidden units by their activation value and progressively reconstructing the input in this…

Machine Learning · Computer Science 2016-05-09 Paul Bertens

Bilinear autoencoders find interpretable manifolds

Sparse autoencoders have become a standard tool for uncovering interpretable latent representations in neural networks. Yet salient concepts often span manifolds that current linear methods cannot capture without post hoc analysis. This…

Machine Learning · Computer Science 2026-05-12 Thomas Dooms , Ward Gauderis , Geraint Wiggins , Jose Oramas

From Principal Subspaces to Principal Components with Linear Autoencoders

The autoencoder is an effective unsupervised learning model which is widely used in deep learning. It is well known that an autoencoder with a single fully-connected hidden layer, a linear activation function and a squared error cost…

Machine Learning · Statistics 2019-01-01 Elad Plaut

Autoencoders Learn Generative Linear Models

We provide a series of results for unsupervised learning with autoencoders. Specifically, we study shallow two-layer autoencoder architectures with shared weights. We focus on three generative models for data that are common in statistical…

Machine Learning · Statistics 2019-02-18 Thanh V. Nguyen , Raymond K. W. Wong , Chinmay Hegde

Sparse, Geometric Autoencoder Models of V1

The classical sparse coding model represents visual stimuli as a linear combination of a handful of learned basis functions that are Gabor-like when trained on natural image data. However, the Gabor-like filters learned by classical sparse…

Artificial Intelligence · Computer Science 2023-02-23 Jonathan Huml , Abiy Tasissa , Demba Ba

How a Bilingual LM Becomes Bilingual: Tracing Internal Representations with Sparse Autoencoders

This study explores how bilingual language models develop complex internal representations. We employ sparse autoencoders to analyze internal representations of bilingual language models with a focus on the effects of training steps,…

Computation and Language · Computer Science 2025-10-13 Tatsuro Inaba , Go Kamoda , Kentaro Inui , Masaru Isonuma , Yusuke Miyao , Yohei Oseki , Benjamin Heinzerling , Yu Takagi

Improving Autoencoder Training Performance for Hyperspectral Unmixing with Network Reinitialisation

Neural networks, in particular autoencoders, are one of the most promising solutions for unmixing hyperspectral data, i.e. reconstructing the spectra of observed substances (endmembers) and their relative mixing fractions (abundances),…

Image and Video Processing · Electrical Eng. & Systems 2022-04-13 Kamil Książek , Przemysław Głomb , Michał Romaszewski , Michał Cholewa , Bartosz Grabowski , Krisztián Búza