Related papers: Confidence-Constrained Maximum Entropy Framework f…

Mutual Information Learned Classifiers: an Information-theoretic Viewpoint of Training Deep Learning Classification Systems

Deep learning systems have been reported to achieve state-of-the-art performances in many applications, and a key is the existence of well trained classifiers on benchmark datasets. As a main-stream loss function, the cross entropy can…

Machine Learning · Computer Science 2022-09-22 Jirong Yi , Qiaosheng Zhang , Zhen Chen , Qiao Liu , Wei Shao

Self-Supervised Learning via Maximum Entropy Coding

A mainstream type of current self-supervised learning methods pursues a general-purpose representation that can be well transferred to downstream tasks, typically by optimizing on a given pretext task such as instance discrimination. In…

Computer Vision and Pattern Recognition · Computer Science 2022-10-21 Xin Liu , Zhongdao Wang , Yali Li , Shengjin Wang

Instance Cross Entropy for Deep Metric Learning

Loss functions play a crucial role in deep metric learning thus a variety of them have been proposed. Some supervise the learning process by pairwise or tripletwise similarity constraints while others take advantage of structured similarity…

Machine Learning · Computer Science 2019-11-25 Xinshao Wang , Elyor Kodirov , Yang Hua , Neil Robertson

MEP-Net: Generating Solutions to Scientific Problems with Limited Knowledge by Maximum Entropy Principle

Maximum entropy principle (MEP) offers an effective and unbiased approach to inferring unknown probability distributions when faced with incomplete information, while neural networks provide the flexibility to learn complex distributions…

Machine Learning · Statistics 2024-12-04 Wuyue Yang , Liangrong Peng , Guojie Li , Liu Hong

MEMe: An Accurate Maximum Entropy Method for Efficient Approximations in Large-Scale Machine Learning

Efficient approximation lies at the heart of large-scale machine learning problems. In this paper, we propose a novel, robust maximum entropy algorithm, which is capable of dealing with hundreds of moments and allows for computationally…

Machine Learning · Statistics 2019-06-05 Diego Granziol , Binxin Ru , Stefan Zohren , Xiaowen Doing , Michael Osborne , Stephen Roberts

DiME: Maximizing Mutual Information by a Difference of Matrix-Based Entropies

We introduce an information-theoretic quantity with similar properties to mutual information that can be estimated from data without making explicit assumptions on the underlying distribution. This quantity is based on a recently proposed…

Machine Learning · Computer Science 2023-07-31 Oscar Skean , Jhoan Keider Hoyos Osorio , Austin J. Brockmeier , Luis Gonzalo Sanchez Giraldo

MIM: Mutual Information Machine

We introduce the Mutual Information Machine (MIM), a probabilistic auto-encoder for learning joint distributions over observations and latent variables. MIM reflects three design principles: 1) low divergence, to encourage the encoder and…

Machine Learning · Computer Science 2020-02-24 Micha Livne , Kevin Swersky , David J. Fleet

Maximum Entropy and Bayesian Data Analysis: Entropic Priors

The problem of assigning probability distributions which objectively reflect the prior information available about experiments is one of the major stumbling blocks in the use of Bayesian methods of data analysis. In this paper the method of…

Data Analysis, Statistics and Probability · Physics 2009-11-10 Ariel Caticha , Roland Preuss

Entropy, Information, and the Updating of Probabilities

This paper is a review of a particular approach to the method of maximum entropy as a general framework for inference. The discussion emphasizes the pragmatic elements in the derivation. An epistemic notion of information is defined in…

Data Analysis, Statistics and Probability · Physics 2021-08-04 Ariel Caticha

Notes on Generalizing the Maximum Entropy Principle to Uncertain Data

The principle of maximum entropy is a broadly applicable technique for computing a distribution with the least amount of information possible constrained to match empirical data, for instance, feature expectations. We seek to generalize…

Information Theory · Computer Science 2022-05-30 Kenneth Bogert

Neural Entropic Estimation: A faster path to mutual information estimation

We point out a limitation of the mutual information neural estimation (MINE) where the network fails to learn at the initial training phase, leading to slow convergence in the number of training iterations. To solve this problem, we propose…

Information Theory · Computer Science 2019-06-03 Chung Chan , Ali Al-Bashabsheh , Hing Pang Huang , Michael Lim , Da Sun Handason Tam , Chao Zhao

Finding emergence in data by maximizing effective information

Quantifying emergence and modeling emergent dynamics in a data-driven manner for complex dynamical systems is challenging due to the lack of direct observations at the micro-level. Thus, it's crucial to develop a framework to identify…

Physics and Society · Physics 2024-08-16 Mingzhe Yang , Zhipeng Wang , Kaiwei Liu , Yingqi Rong , Bing Yuan , Jiang Zhang

COME: Test-time adaption by Conservatively Minimizing Entropy

Machine learning models must continuously self-adjust themselves for novel data distribution in the open world. As the predominant principle, entropy minimization (EM) has been proven to be a simple yet effective cornerstone in existing…

Machine Learning · Statistics 2024-10-16 Qingyang Zhang , Yatao Bian , Xinke Kong , Peilin Zhao , Changqing Zhang

Maximum Entropy Diverse Exploration: Disentangling Maximum Entropy Reinforcement Learning

Two hitherto disconnected threads of research, diverse exploration (DE) and maximum entropy RL have addressed a wide range of problems facing reinforcement learning algorithms via ostensibly distinct mechanisms. In this work, we identify a…

Machine Learning · Computer Science 2019-11-05 Andrew Cohen , Lei Yu , Xingye Qiao , Xiangrong Tong

Maximum Entropy: The Universal Method for Inference

In this thesis we start by providing some detail regarding how we arrived at our present understanding of probabilities and how we manipulate them - the product and addition rules by Cox. We also discuss the modern view of entropy and how…

Data Analysis, Statistics and Probability · Physics 2009-01-21 Adom Giffin

Restricted Minimum Error Entropy Criterion for Robust Classification

The minimum error entropy (MEE) criterion has been verified as a powerful approach for non-Gaussian signal processing and robust machine learning. However, the implementation of MEE on robust classification is rather a vacancy in the…

Machine Learning · Computer Science 2025-08-07 Yuanhao Li , Badong Chen , Natsue Yoshimura , Yasuharu Koike

Boltzmann Machine Learning with the Latent Maximum Entropy Principle

We present a new statistical learning paradigm for Boltzmann machines based on a new inference principle we have proposed: the latent maximum entropy principle (LME). LME is different both from Jaynes maximum entropy principle and from…

Machine Learning · Computer Science 2012-12-12 Shaojun Wang , Dale Schuurmans , Fuchun Peng , Yunxin Zhao

Instance-Dependent Continuous-Time Reinforcement Learning via Maximum Likelihood Estimation

Continuous-time reinforcement learning (CTRL) provides a natural framework for sequential decision-making in dynamic environments where interactions evolve continuously over time. While CTRL has shown growing empirical success, its ability…

Machine Learning · Computer Science 2025-12-04 Runze Zhao , Yue Yu , Ruhan Wang , Chunfeng Huang , Dongruo Zhou

IRL with Partial Observations using the Principle of Uncertain Maximum Entropy

The principle of maximum entropy is a broadly applicable technique for computing a distribution with the least amount of information possible while constrained to match empirically estimated feature expectations. However, in many real-world…

Machine Learning · Computer Science 2022-08-16 Kenneth Bogert , Yikang Gui , Prashant Doshi

How multiplicity determines entropy and the derivation of the maximum entropy principle for complex systems

The maximum entropy principle (MEP) is a method for obtaining the most likely distribution functions of observables from statistical systems, by maximizing entropy under constraints. The MEP has found hundreds of applications in ergodic and…

Classical Physics · Physics 2016-10-03 Rudolf Hanel , Stefan Thurner , Murray Gell-Mann