Related papers: Minimal Random Code Learning with Mean-KL Paramete…

200 papers

While deep neural networks are a highly successful model class, their large memory footprint puts considerable strain on energy consumption, communication bandwidth, and storage requirements. Consequently, model size reduction has become an…

Machine Learning · Statistics 2018-10-02 Marton Havasi, Robert Peharz, José Miguel Hernández-Lobato

Graph-based multi-agent reinforcement learning (MARL) enables coordinated behavior under partial observability by modeling agents as nodes and communication links as edges. While recent methods excel at learning sparse coordination…

Machine Learning · Computer Science 2026-04-13 Wei Duan, Jie Lu, En Yu, Junyu Xuan

Multiple kernel learning (MKL), structured sparsity, and multi-task learning have recently received considerable attention. In this paper, we show how different MKL algorithms can be understood as applications of either regularization on…

Machine Learning · Statistics 2011-03-03 Ryota Tomioka, Taiji Suzuki

Neural network weights are increasingly a bottleneck for deployment, yet most compression pipelines treat layers independently and overlook cross-layer redundancy induced by function-preserving symmetries. We propose Motion-Compensated…

Computer Vision and Pattern Recognition · Computer Science 2026-05-26 Ismail Lamaakal

Learning and compression are driven by the common aim of identifying and exploiting statistical regularities in data, which opens the door for fertile collaboration between these areas. A promising group of compression techniques for…

Machine Learning · Computer Science 2021-02-02 Fernando E. Rosas, Pedro A. M. Mediano, Michael Gastpar

The prohibitive sizes of Large Language Models (LLMs) today make it difficult to deploy them on memory-constrained edge devices. This work introduces $\rm CALDERA$ -- a new post-training LLM compression algorithm that harnesses the inherent…

Machine Learning · Computer Science 2024-11-05 Rajarshi Saha, Naomi Sagan, Varun Srivastava, Andrea J. Goldsmith, Mert Pilanci

The minimum weight matching (MWM) and maximum likelihood decoding (MLD) are two widely used and distinct decoding strategies for quantum error correction. For a given syndrome, the MWM decoder finds the most probable physical error…

Quantum Physics · Physics 2025-10-29 Mao Lin

Near-field propagation in extremely large aperture arrays requires joint angle-range estimation. In hybrid architectures, only $N_\mathrm{RF}\ll M$ compressed snapshots are available per slot, making the $N_\mathrm{RF}\times N_\mathrm{RF}$…

Signal Processing · Electrical Eng. & Systems 2026-04-01 Rıfat Volkan Şenyuva

We demonstrate the use of a variational method to determine a quantitative lower bound on the rate of convergence of Markov Chain Monte Carlo (MCMC) algorithms as a function of the target density and proposal density. The bound relies on…

Data Analysis, Statistics and Probability · Physics 2013-05-29 Fergal P. Casey, Joshua J. Waterfall, Ryan N. Gutenkunst, Christopher R. Myers, James P. Sethna

Many common types of data can be represented as functions that map coordinates to signal values, such as pixel locations to RGB values in the case of an image. Based on this view, data can be compressed by overfitting a compact neural…

Machine Learning · Computer Science 2023-10-31 Zongyu Guo, Gergely Flamich, Jiajun He, Zhibo Chen, José Miguel Hernández-Lobato

The robustness of the kernel recursive least squares (KRLS) algorithm has recently been improved by combining it with more robust information-theoretic learning criteria, such as minimum error entropy (MEE) and generalized MEE (GMEE),…

Information Theory · Computer Science 2023-09-07 Jiacheng He, Gang Wang, Kun Zhang, Shan Zhong, Bei Peng

We introduce a new neural network-based continual learning algorithm, dubbed Uncertainty-regularized Continual Learning (UCL), which builds on the traditional Bayesian online learning framework with variational inference. We focus on two…

Machine Learning · Computer Science 2019-11-15 Hongjoon Ahn, Sungmin Cha, Donggyu Lee, Taesup Moon

Supervised learning is often affected by a covariate shift in which the marginal distributions of instances (covariates $x$) of training and testing samples $\mathrm{p}_\text{tr}(x)$ and $\mathrm{p}_\text{te}(x)$ are different but the label…

Machine Learning · Statistics 2023-06-12 José I. Segovia-Martín, Santiago Mazuelas, Anqi Liu

Compressing large-scale neural networks is essential for deploying models on resource-constrained devices. Most existing methods adopt weight pruning or low-bit quantization individually, often resulting in suboptimal compression rates to…

Machine Learning · Computer Science 2025-10-13 Ziyi Wang, Nan Jiang, Guang Lin, Qifan Song

LLM training is resource-intensive. Quantized training improves computational and memory efficiency but introduces quantization noise, which can hinder convergence and degrade model accuracy. Stochastic Rounding (SR) has emerged as a…

Machine Learning · Computer Science 2025-11-04 Taowen Liu, Marta Andronic, Deniz Gündüz, George A. Constantinides

Variational Bayesian Inference is a popular methodology for approximating posterior distributions over Bayesian neural network weights. Recent work developing this class of methods has explored ever richer parameterizations of the…

The principle of Maximal Coding Rate Reduction (MCR$^2$) has recently been proposed as a training objective for learning discriminative low-dimensional structures intrinsic to high-dimensional data to allow for more robust training than…

Machine Learning · Computer Science 2022-04-04 Christina Baek, Ziyang Wu, Kwan Ho Ryan Chan, Tianjiao Ding, Yi Ma, Benjamin D. Haeffele

We propose a novel method to learn intractable distributions from their samples. The main idea is to use a parametric distribution model, such as a Gaussian Mixture Model (GMM), to approximate intractable distributions by minimizing the…

Machine Learning · Computer Science 2023-08-15 Chenqiu Zhao, Guanfang Dong, Anup Basu

This study introduces the Multi-Scale Weight-Based Pairwise Coarsening and Contrastive Learning (MPCCL) model, a novel approach for attributed graph clustering that effectively bridges critical gaps in existing methods, including long-range…

Machine Learning · Computer Science 2025-07-29 Binxiong Li, Yuefei Wang, Binyu Zhao, Heyang Gao, Benhan Yang, Quanzhou Luo, Xue Li, Xu Xiang, Yujie Liu, Huijie Tang

Although quantum machine learning has shown great promise, the practical application of quantum computers remains constrained in the noisy intermediate-scale quantum era. To take advantage of quantum machine learning, we investigate the…

Quantum Physics · Physics 2026-02-20 Shaozhi Li, M Sabbir Salek, Mashrur Chowdhury, Yao Wang