Related papers: Minimal Random Code Learning with Mean-KL Paramete…

Minimal Random Code Learning: Getting Bits Back from Compressed Model Parameters

While deep neural networks are a highly successful model class, their large memory footprint puts considerable strain on energy consumption, communication bandwidth, and storage requirements. Consequently, model size reduction has become an…

Machine Learning · Statistics 2018-10-02 Marton Havasi , Robert Peharz , José Miguel Hernández-Lobato

Bandwidth-constrained Variational Message Encoding for Cooperative Multi-agent Reinforcement Learning

Graph-based multi-agent reinforcement learning (MARL) enables coordinated behavior under partial observability by modeling agents as nodes and communication links as edges. While recent methods excel at learning sparse coordination…

Machine Learning · Computer Science 2026-04-13 Wei Duan , Jie Lu , En Yu , Junyu Xuan

Regularization Strategies and Empirical Bayesian Learning for MKL

Multiple kernel learning (MKL), structured sparsity, and multi-task learning have recently received considerable attention. In this paper, we show how different MKL algorithms can be understood as applications of either regularization on…

Machine Learning · Statistics 2011-03-03 Ryota Tomioka , Taiji Suzuki

Motion-Compensated Weight Compression

Neural network weights are increasingly a bottleneck for deployment, yet most compression pipelines treat layers independently and overlook cross-layer redundancy induced by function-preserving symmetries. We propose Motion-Compensated…

Computer Vision and Pattern Recognition · Computer Science 2026-05-26 Ismail Lamaakal

Learning, compression, and leakage: Minimising classification error via meta-universal compression principles

Learning and compression are driven by the common aim of identifying and exploiting statistical regularities in data, which opens the door for fertile collaboration between these areas. A promising group of compression techniques for…

Machine Learning · Computer Science 2021-02-02 Fernando E. Rosas , Pedro A. M. Mediano , Michael Gastpar

Compressing Large Language Models using Low Rank and Low Precision Decomposition

The prohibitive sizes of Large Language Models (LLMs) today make it difficult to deploy them on memory-constrained edge devices. This work introduces $\rm CALDERA$ -- a new post-training LLM compression algorithm that harnesses the inherent…

Machine Learning · Computer Science 2024-11-05 Rajarshi Saha , Naomi Sagan , Varun Srivastava , Andrea J. Goldsmith , Mert Pilanci

Approximate maximum likelihood decoding with $K$ minimum weight matchings

The minimum weight matching (MWM) and maximum likelihood decoding (MLD) are two widely used and distinct decoding strategies for quantum error correction. For a given syndrome, the MWM decoder finds the most probable physical error…

Quantum Physics · Physics 2025-10-29 Mao Lin

Covariance-Domain Near-Field Channel Estimation under Hybrid Compression: USW/Fresnel Model, Curvature Learning, and KL Covariance Fitting

Near-field propagation in extremely large aperture arrays requires joint angle-range estimation. In hybrid architectures, only $N_\mathrm{RF}\ll M$ compressed snapshots are available per slot, making the $N_\mathrm{RF}\times N_\mathrm{RF}$…

Signal Processing · Electrical Eng. & Systems 2026-04-01 Rıfat Volkan Şenyuva

Variational method for estimating the rate of convergence of Markov Chain Monte Carlo algorithms

We demonstrate the use of a variational method to determine a quantitative lower bound on the rate of convergence of Markov Chain Monte Carlo (MCMC) algorithms as a function of the target density and proposal density. The bound relies on…

Data Analysis, Statistics and Probability · Physics 2013-05-29 Fergal P. Casey , Joshua J. Waterfall , Ryan N. Gutenkunst , Christopher R. Myers , James P. Sethna

Compression with Bayesian Implicit Neural Representations

Many common types of data can be represented as functions that map coordinates to signal values, such as pixel locations to RGB values in the case of an image. Based on this view, data can be compressed by overfitting a compact neural…

Machine Learning · Computer Science 2023-10-31 Zongyu Guo , Gergely Flamich , Jiajun He , Zhibo Chen , José Miguel Hernández-Lobato

Quantized criterion-based kernel recursive least squares adaptive filtering for time series prediction

The robustness of the kernel recursive least square (KRLS) algorithm has recently been improved by combining them with more robust information-theoretic learning criteria, such as minimum error entropy (MEE) and generalized MEE (GMEE),…

Information Theory · Computer Science 2023-09-07 Jiacheng He , Gang Wang , Kun Zhang , Shan Zhong , Bei Peng

Uncertainty-based Continual Learning with Adaptive Regularization

We introduce a new neural network-based continual learning algorithm, dubbed as Uncertainty-regularized Continual Learning (UCL), which builds on traditional Bayesian online learning framework with variational inference. We focus on two…

Machine Learning · Computer Science 2019-11-15 Hongjoon Ahn , Sungmin Cha , Donggyu Lee , Taesup Moon

Double-Weighting for Covariate Shift Adaptation

Supervised learning is often affected by a covariate shift in which the marginal distributions of instances (covariates $x$) of training and testing samples $\mathrm{p}_\text{tr}(x)$ and $\mathrm{p}_\text{te}(x)$ are different but the label…

Machine Learning · Statistics 2023-06-12 José I. Segovia-Martín , Santiago Mazuelas , Anqi Liu

SQS: Bayesian DNN Compression through Sparse Quantized Sub-distributions

Compressing large-scale neural networks is essential for deploying models on resource-constrained devices. Most existing methods adopt weight pruning or low-bit quantization individually, often resulting in suboptimal compression rates to…

Machine Learning · Computer Science 2025-10-13 Ziyi Wang , Nan Jiang , Guang Lin , Qifan Song

Training with Fewer Bits: Unlocking Edge LLMs Training with Stochastic Rounding

LLM training is resource-intensive. Quantized training improves computational and memory efficiency but introduces quantization noise, which can hinder convergence and degrade model accuracy. Stochastic Rounding (SR) has emerged as a…

Machine Learning · Computer Science 2025-11-04 Taowen Liu , Marta Andronic , Deniz Gündüz , George A. Constantinides

The k-tied Normal Distribution: A Compact Parameterization of Gaussian Mean Field Posteriors in Bayesian Neural Networks

Variational Bayesian Inference is a popular methodology for approximating posterior distributions over Bayesian neural network weights. Recent work developing this class of methods has explored ever richer parameterizations of the…

Machine Learning · Computer Science 2020-07-07 Jakub Swiatkowski , Kevin Roth , Bastiaan S. Veeling , Linh Tran , Joshua V. Dillon , Jasper Snoek , Stephan Mandt , Tim Salimans , Rodolphe Jenatton , Sebastian Nowozin

Efficient Maximal Coding Rate Reduction by Variational Forms

The principle of Maximal Coding Rate Reduction (MCR$^2$) has recently been proposed as a training objective for learning discriminative low-dimensional structures intrinsic to high-dimensional data to allow for more robust training than…

Machine Learning · Computer Science 2022-04-04 Christina Baek , Ziyang Wu , Kwan Ho Ryan Chan , Tianjiao Ding , Yi Ma , Benjamin D. Haeffele

Learning Distributions via Monte-Carlo Marginalization

We propose a novel method to learn intractable distributions from their samples. The main idea is to use a parametric distribution model, such as a Gaussian Mixture Model (GMM), to approximate intractable distributions by minimizing the…

Machine Learning · Computer Science 2023-08-15 Chenqiu Zhao , Guanfang Dong , Anup Basu

Attributed Graph Clustering with Multi-Scale Weight-Based Pairwise Coarsening and Contrastive Learning

This study introduces the Multi-Scale Weight-Based Pairwise Coarsening and Contrastive Learning (MPCCL) model, a novel approach for attributed graph clustering that effectively bridges critical gaps in existing methods, including long-range…

Machine Learning · Computer Science 2025-07-29 Binxiong Li , Yuefei Wang , Binyu Zhao , Heyang Gao , Benhan Yang , Quanzhou Luo , Xue Li , Xu Xiang , Yujie Liu , Huijie Tang

Quantum-Inspired Weight-Constrained Neural Network: Reducing Variable Numbers by 100x Compared to Standard Neural Networks

Although quantum machine learning has shown great promise, the practical application of quantum computers remains constrained in the noisy intermediate-scale quantum era. To take advantage of quantum machine learning, we investigate the…

Quantum Physics · Physics 2026-02-20 Shaozhi Li , M Sabbir Salek , Mashrur Chowdhury , Yao Wang