Related papers: Differential Description Length for Hyperparameter…

Extending the Use of MDL for High-Dimensional Problems: Variable Selection, Robust Fitting, and Additive Modeling

In the signal processing and statistics literature, the minimum description length (MDL) principle is a popular tool for choosing model complexity. Successful examples include signal denoising and variable selection in linear regression,…

Signal Processing · Electrical Eng. & Systems 2022-01-28 Zhenyu Wei, Raymond K. W. Wong, Thomas C. M. Lee

Minimum Description Length Revisited

This is an up-to-date introduction to and overview of the Minimum Description Length (MDL) Principle, a theory of inductive inference that can be applied to general problems in statistics, machine learning and pattern recognition. While MDL…

Methodology · Statistics 2019-12-19 Peter Grünwald, Teemu Roos

The Minimum Description Length Principle for Pattern Mining: A Survey

This is about the Minimum Description Length (MDL) principle applied to pattern mining. The length of this description is kept to the minimum. Mining patterns is a core task in data analysis and, beyond issues of efficient enumeration, the…

Databases · Computer Science 2022-07-29 Esther Galbrun

Minimum Description Length Principle for Maximum Entropy Model Selection

Model selection is central to statistics, and many learning problems can be formulated as model selection problems. In this paper, we treat the problem of selecting a maximum entropy model given various feature subsets and their moments, as…

Information Theory · Computer Science 2013-11-28 Gaurav Pandey, Ambedkar Dukkipati

Asymptotics of Discrete MDL for Online Prediction

Minimum Description Length (MDL) is an important principle for induction and prediction, with strong relations to optimal Bayesian learning. This paper deals with learning non-i.i.d. processes by means of two-part MDL, where the underlying…

Information Theory · Computer Science 2007-07-13 Jan Poland, Marcus Hutter

Discrete MDL Predicts in Total Variation

The Minimum Description Length (MDL) principle selects the model that has the shortest code for data plus model. We show that for a countable class of models, MDL predictions are close to the true distribution in a strong sense. The result…

Probability · Mathematics 2010-12-30 Marcus Hutter

Minimum Encoding Approaches for Predictive Modeling

We analyze differences between two information-theoretically motivated approaches to statistical inference and model selection: the Minimum Description Length (MDL) principle, and the Minimum Message Length (MML) principle. Based on this…

Machine Learning · Computer Science 2013-02-01 Peter D Grunwald, Petri Kontkanen, Petri Myllymaki, Tomi Silander, Henry Tirri

Descriptive Dimensionality and Its Characterization of MDL-based Learning and Change Detection

This paper introduces a new notion of dimensionality of probabilistic models from an information-theoretic viewpoint. We call it the "descriptive dimension" (Ddim). We show that Ddim coincides with the number of independent parameters for…

Machine Learning · Computer Science 2019-10-28 Kenji Yamanishi

A Minimum Description Length Approach to Regularization in Neural Networks

State-of-the-art neural networks can be trained to become remarkable solutions to many problems. But while these architectures can express symbolic, perfect solutions, trained models often arrive at approximations instead. We show that the…

Machine Learning · Computer Science 2025-09-09 Matan Abudy, Orr Well, Emmanuel Chemla, Roni Katzir, Nur Lan

Using Causal Information and Local Measures to Learn Bayesian Networks

In previous work we developed a method of learning Bayesian Network models from raw data. This method relies on the well known minimal description length (MDL) principle. The MDL principle is particularly well suited to this task as it…

Artificial Intelligence · Computer Science 2013-03-08 Wai Lam, Fahiem Bacchus

Applying MDL to Learning Best Model Granularity

The Minimum Description Length (MDL) principle is solidly based on a provably ideal method of inference using Kolmogorov complexity. We test how the theory behaves in practice on a general problem in model selection: that of learning the…

Data Analysis, Statistics and Probability · Physics 2007-05-23 Qiong Gao, Ming Li, Paul Vitanyi

Sequential Learning Of Neural Networks for Prequential MDL

Minimum Description Length (MDL) provides a framework and an objective for principled model evaluation. It formalizes Occam's Razor and can be applied to data from non-stationary sources. In the prequential formulation of MDL, the objective…

Machine Learning · Statistics 2022-10-17 Jorg Bornschein, Yazhe Li, Marcus Hutter

Information-Theoretic Probing with Minimum Description Length

To measure how well pretrained representations encode some linguistic property, it is common to use accuracy of a probe, i.e. a classifier trained to predict the property from the representations. Despite widespread adoption of probes,…

Computation and Language · Computer Science 2020-03-30 Elena Voita, Ivan Titov

Minimum Description Length Control

We propose a novel framework for multitask reinforcement learning based on the minimum description length (MDL) principle. In this approach, which we term MDL-control (MDL-C), the agent learns the common structure among the tasks with which…

Machine Learning · Computer Science 2022-07-26 Ted Moskovitz, Ta-Chu Kao, Maneesh Sahani, Matthew M. Botvinick

Sparsification and feature selection by compressive linear regression

The Minimum Description Length (MDL) principle states that the optimal model for a given data set is that which compresses it best. Due to practical limitations the model can be restricted to a class such as linear regression models, which…

Machine Learning · Statistics 2015-03-13 Florin Popescu, Daniel Renz

Minimum Description Length and Generalization Guarantees for Representation Learning

A major challenge in designing efficient statistical supervised learning algorithms is finding representations that perform well not only on available training samples but also on unseen data. While the study of representation learning has…

Machine Learning · Statistics 2024-02-06 Milad Sefidgaran, Abdellatif Zaidi, Piotr Krasnowski

Evaluating Representations with Readout Model Switching

Although much of the success of Deep Learning builds on learning good representations, a rigorous method to evaluate their quality is lacking. In this paper, we treat the evaluation of representations as a model selection problem and…

Machine Learning · Computer Science 2024-11-19 Yazhe Li, Jorg Bornschein, Marcus Hutter

High-dimensional Penalty Selection via Minimum Description Length Principle

We tackle the problem of penalty selection of regularization on the basis of the minimum description length (MDL) principle. In particular, we consider that the design space of the penalty function is high-dimensional. In this situation,…

Machine Learning · Statistics 2018-04-27 Kohei Miyaguchi, Kenji Yamanishi

An MDL framework for sparse coding and dictionary learning

The power of sparse signal modeling with learned over-complete dictionaries has been demonstrated in a variety of applications and fields, from signal processing to statistical inference and machine learning. However, the statistical…

Information Theory · Computer Science 2017-04-26 Ignacio Ramírez, Guillermo Sapiro

Machine Learning vs Deep Learning: The Generalization Problem

The capacity to generalize beyond the range of training data is a pivotal challenge, often synonymous with a model's utility and robustness. This study investigates the comparative abilities of traditional machine learning (ML) models and…

Machine Learning · Computer Science 2024-03-05 Yong Yi Bay, Kathleen A. Yearick