Related papers: Falsification and future performance

Complex-Valued Random Vectors and Channels: Entropy, Divergence, and Capacity

Recent research has demonstrated significant achievable performance gains by exploiting circularity/non-circularity or propeness/improperness of complex-valued signals. In this paper, we investigate the influence of these properties on…

Information Theory · Computer Science 2016-11-17 Georg Tauboeck

Learning Capacity: A Measure of the Effective Dimensionality of a Model

We use a formal correspondence between thermodynamics and inference, where the number of samples can be thought of as the inverse temperature, to study a quantity called ``learning capacity'' which is a measure of the effective…

Machine Learning · Computer Science 2024-10-22 Daiwei Chen , Wei-Kai Chang , Pratik Chaudhari

Information, learning and falsification

There are (at least) three approaches to quantifying information. The first, algorithmic information or Kolmogorov complexity, takes events as strings and, given a universal Turing machine, quantifies the information content of a string as…

Information Theory · Computer Science 2011-11-29 David Balduzzi

On Measuring Excess Capacity in Neural Networks

We study the excess capacity of deep networks in the context of supervised classification. That is, given a capacity measure of the underlying hypothesis class - in our case, empirical Rademacher complexity - to what extent can we (a…

Machine Learning · Computer Science 2023-01-20 Florian Graf , Sebastian Zeng , Bastian Rieck , Marc Niethammer , Roland Kwitt

The Coverage Principle: How Pre-Training Enables Post-Training

Language models demonstrate remarkable abilities when pre-trained on large text corpora and fine-tuned for specific tasks, but how and why pre-training shapes the success of the final model remains poorly understood. Notably, although…

Machine Learning · Statistics 2025-10-23 Fan Chen , Audrey Huang , Noah Golowich , Sadhika Malladi , Adam Block , Jordan T. Ash , Akshay Krishnamurthy , Dylan J. Foster

Variational Encoder-based Reliable Classification

Machine learning models provide statistically impressive results which might be individually unreliable. To provide reliability, we propose an Epistemic Classifier (EC) that can provide justification of its belief using support from the…

Machine Learning · Computer Science 2020-10-20 Chitresh Bhushan , Zhaoyuan Yang , Nurali Virani , Naresh Iyer

Information Theoretic Perspective on Representation Learning

An information-theoretic framework is introduced to analyze last-layer embedding, focusing on learned representations for regression tasks. We define representation-rate and derive limits on the reliability with which input-output…

Information Theory · Computer Science 2026-05-27 Deborah Pereg , Michael Wand

Complexity and Second Moment of the Mathematical Theory of Communication

The performance of an error correcting code is evaluated by its error probability, rate, and en/decoding complexity. The performance of a series of codes is evaluated by, as the block lengths approach infinity, whether their error…

Information Theory · Computer Science 2021-07-15 Hsin-Po Wang

The Combinatorics of Falsification and Hypothesis Testing

The present paper is concerned with the question of how falsifiable a single proposition is in the short and long run. Formal Learning theorists such as Schulte and Juhl have argued that long-run falsifiability is characterized by the…

Logic · Mathematics 2022-09-27 Reid Dale

Optimality Implies Kernel Sum Classifiers are Statistically Efficient

We propose a novel combination of optimization tools with learning theory bounds in order to analyze the sample complexity of optimal kernel sum classifiers. This contrasts the typical learning theoretic results which hold for all…

Machine Learning · Computer Science 2019-06-04 Raphael Arkady Meyer , Jean Honorio

Entropy Reweighted Conformal Classification

Conformal Prediction (CP) is a powerful framework for constructing prediction sets with guaranteed coverage. However, recent studies have shown that integrating confidence calibration with CP can lead to a degradation in efficiency. In this…

Machine Learning · Computer Science 2024-07-25 Rui Luo , Nicolo Colombo

Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis

Recent work has explored sequence-to-sequence latent variable models for expressive speech synthesis (supporting control and transfer of prosody and style), but has not presented a coherent framework for understanding the trade-offs between…

Computation and Language · Computer Science 2019-10-29 Eric Battenberg , Soroosh Mariooryad , Daisy Stanton , RJ Skerry-Ryan , Matt Shannon , David Kao , Tom Bagby

Conformal Correction for Efficiency May be at Odds with Entropy

Conformal prediction (CP) provides a comprehensive framework to produce statistically rigorous uncertainty sets for black-box machine learning models. To further improve the efficiency of CP, conformal correction is proposed to fine-tune or…

Machine Learning · Computer Science 2025-12-03 Senrong Xu , Tianyu Wang , Zenan Li , Yuan Yao , Taolue Chen , Feng Xu , Xiaoxing Ma

A Unifying Information-theoretic Perspective on Evaluating Generative Models

Considering the difficulty of interpreting generative model output, there is significant current research focused on determining meaningful evaluation metrics. Several recent approaches utilize "precision" and "recall," borrowed from the…

Machine Learning · Computer Science 2025-02-28 Alexis Fox , Samarth Swarup , Abhijin Adiga

Practical Estimation of Renyi Entropy

Entropy Estimation is an important problem with many applications in cryptography, statistic,machine learning. Although the estimators optimal with respect to the sample complexity have beenrecently developed, there are still some…

Data Structures and Algorithms · Computer Science 2020-02-24 Maciej Skorski

Entropy and information in neural spike trains: Progress on the sampling problem

The major problem in information theoretic analysis of neural responses and other biological data is the reliable estimation of entropy--like quantities from small samples. We apply a recently introduced Bayesian entropy estimator to…

Data Analysis, Statistics and Probability · Physics 2009-09-29 Ilya Nemenman , William Bialek , Rob de Ruyter van Steveninck

Monotonic Learning in the PAC Framework: A New Perspective

Monotone learning describes learning processes in which expected performance consistently improves as the amount of training data increases. However, recent studies challenge this conventional wisdom, revealing significant gaps in the…

Machine Learning · Computer Science 2025-05-22 Ming Li , Chenyi Zhang , Qin Li

One-Shot Classical-Quantum Capacity and Hypothesis Testing

The one-shot classical capacity of a quantum channel quantifies the amount of classical information that can be transmitted through a single use of the channel such that the error probability is below a certain threshold. In this work, we…

Quantum Physics · Physics 2013-01-29 Ligong Wang , Renato Renner

The $\varphi$ Curve: The Shape of Generalization through the Lens of Norm-based Capacity Control

Understanding how the test risk scales with model complexity is a central question in machine learning. Classical theory is challenged by the learning curves observed for large over-parametrized deep networks. Capacity measures based on…

Machine Learning · Statistics 2025-10-22 Yichen Wang , Yudong Chen , Lorenzo Rosasco , Fanghui Liu

One-shot and asymptotic classical capacity in general physical theories

With the recent development of quantum information theory, some attempts exist to construct information theory beyond quantum theory. Here we consider hypothesis testing relative entropy and one-shot classical capacity, that is, the optimal…

Quantum Physics · Physics 2024-06-14 Shintaro Minagawa , Hayato Arai