Related papers: An Integrated Autoencoder-Based Filter for Sparse …

Anomaly Detection and Interpretation using Multimodal Autoencoder and Sparse Optimization

Automated anomaly detection is essential for managing information and communications technology (ICT) systems to maintain reliable services with minimum burden on operators. For detecting varying and continually emerging anomalies as…

Machine Learning · Statistics 2018-12-19 Yasuhiro Ikeda , Keisuke Ishibashi , Yuusuke Nakano , Keishiro Watanabe , Ryoichi Kawahara

Sparse Autoencoders, Again?

Is there really much more to say about sparse autoencoders (SAEs)? Autoencoders in general, and SAEs in particular, represent deep architectures that are capable of modeling low-dimensional latent structure in data. Such structure could…

Machine Learning · Computer Science 2025-06-09 Yin Lu , Xuening Zhu , Tong He , David Wipf

Data Augmentation with Variational Autoencoder for Imbalanced Dataset

Learning from an imbalanced distribution presents a major challenge in predictive modeling, as it generally leads to a reduction in the performance of standard algorithms. Various approaches exist to address this issue, but many of them…

Machine Learning · Computer Science 2024-12-11 Samuel Stocksieker , Denys Pommeret , Arthur Charpentier

An information theoretic approach to the autoencoder

We present a variation of the Autoencoder (AE) that explicitly maximizes the mutual information between the input data and the hidden representation. The proposed model, the InfoMax Autoencoder (IMAE), by construction is able to learn a…

Machine Learning · Computer Science 2019-01-24 Vincenzo Crescimanna , Bruce Graham

From Knots to Knobs: Towards Steerable Collaborative Filtering Using Sparse Autoencoders

Sparse autoencoders (SAEs) have recently emerged as pivotal tools for introspection into large language models. SAEs can uncover high-quality, interpretable features at different levels of granularity and enable targeted steering of the…

Information Retrieval · Computer Science 2026-01-19 Martin Spišák , Ladislav Peška , Petr Škoda , Vojtěch Vančura , Rodrigo Alves

Efficient Data Compression for 3D Sparse TPC via Bicephalous Convolutional Autoencoder

Real-time data collection and analysis in large experimental facilities present a great challenge across multiple domains, including high energy physics, nuclear physics, and cosmology. To address this, machine learning (ML)-based methods…

Machine Learning · Computer Science 2021-11-11 Yi Huang , Yihui Ren , Shinjae Yoo , Jin Huang

Retrospective: EIE: Efficient Inference Engine on Sparse and Compressed Neural Network

EIE proposed to accelerate pruned and compressed neural networks, exploiting weight sparsity, activation sparsity, and 4-bit weight-sharing in neural network accelerators. Since published in ISCA'16, it opened a new design space to…

Hardware Architecture · Computer Science 2023-06-19 Song Han , Xingyu Liu , Huizi Mao , Jing Pu , Ardavan Pedram , Mark A. Horowitz , William J. Dally

Diversity-driven Data Selection for Language Model Tuning through Sparse Autoencoder

Instruction tuning data are often quantity-saturated due to the large volume of data collection and fast model iteration, leaving data selection important but underexplored. Existing quality-driven data selection methods, such as LIMA…

Computation and Language · Computer Science 2025-04-02 Xianjun Yang , Shaoliang Nie , Lijuan Liu , Suchin Gururangan , Ujjwal Karn , Rui Hou , Madian Khabsa , Yuning Mao

Embarrassingly Shallow Autoencoders for Sparse Data

Combining simple elements from the literature, we define a linear model that is geared toward sparse data, in particular implicit feedback data for recommender systems. We show that its training objective has a closed-form solution, and…

Information Retrieval · Computer Science 2019-05-10 Harald Steck

Progressive VAE Training on Highly Sparse and Imbalanced Data

In this paper, we present a novel approach for training a Variational Autoencoder (VAE) on a highly imbalanced data set. The proposed training of a high-resolution VAE model begins with the training of a low-resolution core model, which can…

Computer Vision and Pattern Recognition · Computer Science 2019-12-19 Dmitry Utyamishev , Inna Partin-Vaisband

Hybrid Embedded Deep Stacked Sparse Autoencoder with w_LPPD SVM Ensemble

Deep learning is a kind of feature learning method with strong nonliear feature transformation and becomes more and more important in many fields of artificial intelligence. Deep autoencoder is one representative method of the deep learning…

Machine Learning · Computer Science 2020-02-18 Yongming Li , Yan Lei , Pin Wang , Yuchuan Liu

Variable Rate Neural Compression for Sparse Detector Data

High-energy large-scale particle colliders generate data at extraordinary rates. Developing real-time high-throughput data compression algorithms to reduce data volume and meet the bandwidth requirement for storage has become increasingly…

Instrumentation and Detectors · Physics 2024-11-20 Yi Huang , Yeonju Go , Jin Huang , Shuhang Li , Xihaier Luo , Thomas Marshall , Joseph Osborn , Christopher Pinkenburg , Yihui Ren , Evgeny Shulga , Shinjae Yoo , Byung-Jun Yoon

Learning sparse auto-encoders for green AI image coding

Recently, convolutional auto-encoders (CAE) were introduced for image coding. They achieved performance improvements over the state-of-the-art JPEG2000 method. However, these performances were obtained using massive CAEs featuring a large…

Image and Video Processing · Electrical Eng. & Systems 2022-09-13 Cyprien Gille , Frédéric Guyard , Marc Antonini , Michel Barlaud

Deep Learning of Part-based Representation of Data Using Sparse Autoencoders with Nonnegativity Constraints

We demonstrate a new deep learning autoencoder network, trained by a nonnegativity constraint algorithm (NCAE), that learns features which show part-based representation of data. The learning algorithm is based on constraining negative…

Machine Learning · Computer Science 2016-01-13 Ehsan Hosseini-Asl , Jacek M. Zurada , Olfa Nasraoui

Binary Autoencoder for Mechanistic Interpretability of Large Language Models

Existing works are dedicated to untangling atomized numerical components (features) from the hidden states of Large Language Models (LLMs). However, they typically rely on autoencoders constrained by some training-time regularization on…

Machine Learning · Computer Science 2026-02-13 Hakaze Cho , Haolin Yang , Yanshu Li , Brian M. Kurkoski , Naoya Inoue

Compute Optimal Inference and Provable Amortisation Gap in Sparse Autoencoders

A recent line of work has shown promise in using sparse autoencoders (SAEs) to uncover interpretable features in neural network representations. However, the simple linear-nonlinear encoding mechanism in SAEs limits their ability to perform…

Machine Learning · Computer Science 2025-01-31 Charles O'Neill , Alim Gumran , David Klindt

Handling Incomplete Heterogeneous Data using VAEs

Variational autoencoders (VAEs), as well as other generative models, have been shown to be efficient and accurate for capturing the latent structure of vast amounts of complex high-dimensional data. However, existing VAEs can still not…

Machine Learning · Computer Science 2020-05-25 Alfredo Nazabal , Pablo M. Olmos , Zoubin Ghahramani , Isabel Valera

Interpretable Embeddings with Sparse Autoencoders: A Data Analysis Toolkit

Analyzing large-scale text corpora is a core challenge in machine learning, crucial for tasks like identifying undesirable model behaviors or biases in training data. Current methods often rely on costly LLM-based techniques (e.g.…

Artificial Intelligence · Computer Science 2025-12-12 Nick Jiang , Xiaoqing Sun , Lisa Dunlap , Lewis Smith , Neel Nanda

Fusing Climate Data Products using a Spatially Varying Autoencoder

Autoencoders are powerful machine learning models used to compress information from multiple data sources. However, autoencoders, like all artificial neural networks, are often unidentifiable and uninterpretable. This research focuses on…

Applications · Statistics 2024-03-13 Jacob A. Johnson , Matthew J. Heaton , William F. Christensen , Lynsie R. Warr , Summer B. Rupper

Knowledge-integrated AutoEncoder Model

Data encoding is a common and central operation in most data analysis tasks. The performance of other models downstream in the computational process highly depends on the quality of data encoding. One of the most powerful ways to encode…

Machine Learning · Computer Science 2025-09-03 Teddy Lazebnik , Liron Simon-Keren