Related papers: Information-Theoretic Foundations for Machine Lear…

A Theory of the Mechanics of Information: Generalization Through Measurement of Uncertainty (Learning is Measuring)

Traditional machine learning relies on explicit models and domain assumptions, limiting flexibility and interpretability. We introduce a model-free framework using surprisal (information theoretic uncertainty) to directly analyze and…

Machine Learning · Computer Science 2025-10-28 Christopher J. Hazard , Michael Resnick , Jacob Beel , Jack Xia , Cade Mack , Dominic Glennie , Matthew Fulp , David Maze , Andrew Bassett , Martin Koistinen

Machine learning and information theory concepts towards an AI Mathematician

The current state-of-the-art in artificial intelligence is impressive, especially in terms of mastery of language, but not so much in terms of mathematical reasoning. What could be missing? Can we learn something useful about that gap from…

Artificial Intelligence · Computer Science 2024-03-08 Yoshua Bengio , Nikolay Malkin

Generalizing Information to the Evolution of Rational Belief

Information theory provides a mathematical foundation to measure uncertainty in belief. Belief is represented by a probability distribution that captures our understanding of an outcome's plausibility. Information measures based on…

Information Theory · Computer Science 2020-01-17 Jed A. Duersch , Thomas A. Catanach

Information-Theoretic Framework for Understanding Modern Machine-Learning

We introduce an information-theoretic framework that views learning as universal prediction under log loss, characterized through regret bounds. Central to the framework is an effective notion of architecture-based model complexity, defined…

Machine Learning · Computer Science 2025-11-04 Meir Feder , Ruediger Urbanke , Yaniv Fogel

On Information Processing Limitations In Humans and Machines

Information theory is concerned with the study of transmission, processing, extraction, and utilization of information. In its most abstract form, information is conceived as a means of resolving uncertainty. Shannon and Weaver (1949) were…

Computers and Society · Computer Science 2021-12-08 Birgitta Dresp-Langley

A Bayesian Framework for Information-Theoretic Probing

Pimentel et al. (2020) recently analysed probing from an information-theoretic perspective. They argue that probing should be seen as approximating a mutual information. This led to the rather unintuitive conclusion that representations…

Computation and Language · Computer Science 2021-09-10 Tiago Pimentel , Ryan Cotterell

A Survey on Bayesian Deep Learning

A comprehensive artificial intelligence system needs to not only perceive the environment with different `senses' (e.g., seeing and hearing) but also infer the world's conditional (or even causal) relations and corresponding uncertainty.…

Machine Learning · Statistics 2021-01-07 Hao Wang , Dit-Yan Yeung

A Unified Information-Theoretic Framework for Meta-Learning Generalization

In recent years, information-theoretic generalization bounds have gained increasing attention for analyzing the generalization capabilities of meta-learning algorithms. However, existing results are confined to two-step bounds, failing to…

Machine Learning · Statistics 2025-10-14 Wen Wen , Tieliang Gong , Yuxin Dong , Zeyu Gao , Yong-Jin Liu

Information Theory for Complex Systems Scientists

In the 21st century, many of the crucial scientific and technical issues facing humanity can be understood as problems associated with understanding, modelling, and ultimately controlling complex systems: systems comprised of a large number…

Information Theory · Computer Science 2025-01-20 Thomas F. Varley

Generalization Bounds: Perspectives from Information Theory and PAC-Bayes

A fundamental question in theoretical machine learning is generalization. Over the past decades, the PAC-Bayesian approach has been established as a flexible framework to address the generalization capabilities of machine learning…

Machine Learning · Computer Science 2024-03-28 Fredrik Hellström , Giuseppe Durisi , Benjamin Guedj , Maxim Raginsky

Informed Machine Learning -- A Taxonomy and Survey of Integrating Knowledge into Learning Systems

Despite its great success, machine learning can have its limits when dealing with insufficient training data. A potential solution is the additional integration of prior knowledge into the training process which leads to the notion of…

Machine Learning · Statistics 2021-05-31 Laura von Rueden , Sebastian Mayer , Katharina Beckh , Bogdan Georgiev , Sven Giesselbach , Raoul Heese , Birgit Kirsch , Julius Pfrommer , Annika Pick , Rajkumar Ramamurthy , Michal Walczak , Jochen Garcke , Christian Bauckhage , Jannis Schuecker

Foundations of Bayesian Learning from Synthetic Data

There is significant growth and interest in the use of synthetic data as an enabler for machine learning in environments where the release of real data is restricted due to privacy or availability constraints. Despite a large number of…

Machine Learning · Computer Science 2020-11-25 Harrison Wilde , Jack Jewson , Sebastian Vollmer , Chris Holmes

Information theory and learning: a physical approach

We try to establish a unified information theoretic approach to learning and to explore some of its applications. First, we define {\em predictive information} as the mutual information between the past and the future of a time series,…

Data Analysis, Statistics and Probability · Physics 2007-05-23 Ilya Nemenman

Information Science Principles of Machine Learning: A Causal Chain Meta-Framework Based on Formalized Information Mapping

This paper addresses the current lack of a unified formal framework in machine learning theory, as well as the absence of robust theoretical foundations for interpretability and ethical safety assurance. We first construct a formal…

Logic in Computer Science · Computer Science 2025-11-11 Jianfeng Xu

A Deep Learning Framework for Lifelong Machine Learning

Humans can learn a variety of concepts and skills incrementally over the course of their lives while exhibiting many desirable properties, such as continual learning without forgetting, forward transfer and backward transfer of knowledge,…

Artificial Intelligence · Computer Science 2021-05-04 Charles X. Ling , Tanner Bohn

Towards Bayesian Deep Learning: A Framework and Some Existing Methods

While perception tasks such as visual object recognition and text understanding play an important role in human intelligence, the subsequent tasks that involve inference, reasoning and planning require an even higher level of intelligence.…

Machine Learning · Statistics 2016-09-06 Hao Wang , Dit-Yan Yeung

Machine Learning and the Future of Bayesian Computation

Bayesian models are a powerful tool for studying complex data, allowing the analyst to encode rich hierarchical dependencies and leverage prior information. Most importantly, they facilitate a complete characterization of uncertainty…

Machine Learning · Statistics 2023-04-25 Steven Winter , Trevor Campbell , Lizhen Lin , Sanvesh Srivastava , David B. Dunson

On the coherent extension of some Fano-type learning bounds

Information theory provides tools to predict the performance of a learning algorithm on a given dataset. For instance, the accuracy of learning an unknown parameter can be upper bounded by reducing the learning task to hypothesis testing…

Quantum Physics · Physics 2026-04-21 Evan Peters

Information theoretic analysis of computational models as a tool to understand the neural basis of behaviors

One of the greatest research challenges of this century is to understand the neural basis for how behavior emerges in brain-body-environment systems. To this end, research has flourished along several directions but have predominantly…

Neurons and Cognition · Quantitative Biology 2021-06-10 Madhavun Candadai

Can Information Behaviour Inform Machine Learning?

The objective of this paper is to explore the opportunities for human information behaviour research to inform and influence the field of machine learning and the resulting machine information behaviour. Using the development of foundation…

Machine Learning · Computer Science 2022-05-03 Michael Ridley