English
Related papers

Related papers: Minimum Description Length codes are critical

200 papers

Model selection is central to statistics, and many learning problems can be formulated as model selection problems. In this paper, we treat the problem of selecting a maximum entropy model given various feature subsets and their moments, as…

Information Theory · Computer Science 2013-11-28 Gaurav Pandey , Ambedkar Dukkipati

The Minimum Description Length (MDL) principle selects the model that has the shortest code for data plus model. We show that for a countable class of models, MDL predictions are close to the true distribution in a strong sense. The result…

Probability · Mathematics 2010-12-30 Marcus Hutter

In the signal processing and statistics literature, the minimum description length (MDL) principle is a popular tool for choosing model complexity. Successful examples include signal denoising and variable selection in linear regression,…

Signal Processing · Electrical Eng. & Systems 2022-01-28 Zhenyu Wei , Raymond K. W. Wong , Thomas C. M. Lee

The power of sparse signal modeling with learned over-complete dictionaries has been demonstrated in a variety of applications and fields, from signal processing to statistical inference and machine learning. However, the statistical…

Information Theory · Computer Science 2017-04-26 Ignacio Ramírez , Guillermo Sapiro

This is an up-to-date introduction to and overview of the Minimum Description Length (MDL) Principle, a theory of inductive inference that can be applied to general problems in statistics, machine learning and pattern recognition. While MDL…

Methodology · Statistics 2019-12-19 Peter Grünwald , Teemu Roos

We analyze differences between two information-theoretically motivated approaches to statistical inference and model selection: the Minimum Description Length (MDL) principle, and the Minimum Message Length (MML) principle. Based on this…

Machine Learning · Computer Science 2013-02-01 Peter D Grunwald , Petri Kontkanen , Petri Myllymaki , Tomi Silander , Henry Tirri

The power of sparse signal coding with learned dictionaries has been demonstrated in a variety of applications and fields, from signal processing to statistical inference and machine learning. However, the statistical properties of these…

Information Theory · Computer Science 2010-10-25 Ignacio Ramírez , Guillermo Sapiro

The Minimum Description Length (MDL) principle is solidly based on a provably ideal method of inference using Kolmogorov complexity. We test how the theory behaves in practice on a general problem in model selection: that of learning the…

Data Analysis, Statistics and Probability · Physics 2007-05-23 Qiong Gao , Ming Li , Paul Vitanyi

Complexity is a fundamental concept underlying statistical learning theory that aims to inform generalization performance. Parameter count, while successful in low-dimensional settings, is not well-justified for overparameterized settings…

Machine Learning · Computer Science 2023-10-16 Raaz Dwivedi , Chandan Singh , Bin Yu , Martin J. Wainwright

Minimum Description Length (MDL) is an important principle for induction and prediction, with strong relations to optimal Bayesian learning. This paper deals with learning non-i.i.d. processes by means of two-part MDL, where the underlying…

Information Theory · Computer Science 2007-07-13 Jan Poland , Marcus Hutter

This is about the Minimum Description Length (MDL) principle applied to pattern mining. The length of this description is kept to the minimum. Mining patterns is a core task in data analysis and, beyond issues of efficient enumeration, the…

Databases · Computer Science 2022-07-29 Esther Galbrun

An efficient representation of observed data has many benefits in various domains of engineering and science. Representing static data sets, such as images, is a living branch in machine learning and eases downstream tasks, such as…

Systems and Control · Computer Science 2018-09-28 Friedrich Solowjow , Arash Mehrjou , Bernhard Schölkopf , Sebastian Trimpe

A major challenge in designing efficient statistical supervised learning algorithms is finding representations that perform well not only on available training samples but also on unseen data. While the study of representation learning has…

Machine Learning · Statistics 2024-02-06 Milad Sefidgaran , Abdellatif Zaidi , Piotr Krasnowski

This paper introduces a new notion of dimensionality of probabilistic models from an information-theoretic view point. We call it the "descriptive dimension"(Ddim). We show that Ddim coincides with the number of independent parameters for…

Machine Learning · Computer Science 2019-10-28 Kenji Yamanishi

To measure how well pretrained representations encode some linguistic property, it is common to use accuracy of a probe, i.e. a classifier trained to predict the property from the representations. Despite widespread adoption of probes,…

Computation and Language · Computer Science 2020-03-30 Elena Voita , Ivan Titov

The (non-)equivalence of canonical and microcanonical ensembles is a fundamental question in statistical physics, concerning whether the use of soft and hard constraints in the maximum-entropy construction leads to the same description of a…

Statistical Mechanics · Physics 2025-11-25 Francesca Giuffrida , Tiziano Squartini , Peter Grünwald , Diego Garlaschelli

State-of-the-art neural networks can be trained to become remarkable solutions to many problems. But while these architectures can express symbolic, perfect solutions, trained models often arrive at approximations instead. We show that the…

Machine Learning · Computer Science 2025-09-09 Matan Abudy , Orr Well , Emmanuel Chemla , Roni Katzir , Nur Lan

The Minimum Description Length (MDL) principle states that the optimal model for a given data set is that which compresses it best. Due to practial limitations the model can be restricted to a class such as linear regression models, which…

Machine Learning · Statistics 2015-03-13 Florin Popescu , Daniel Renz

It is shown that the two-part Minimum Description Length Principle can be used to discriminate among different models that can explain a given observed dataset. The description length is chosen to be the sum of the lengths of the message…

Astrophysics · Physics 2008-11-26 A. Asensio Ramos

When constructing models of the world, we aim for optimal compressions: models that include as few details as possible while remaining as accurate as possible. But which details -- or features measured in data -- should we choose to include…

Quantitative Methods · Quantitative Biology 2025-05-06 David P. Carcamo , Nicholas J. Weaver , Purushottam D. Dixit , Christopher W. Lynn
‹ Prev 1 2 3 10 Next ›