English
Related papers

Related papers: Generalized Maximum Entropy Differential Dynamic P…

200 papers

We study expected utility maximization problem with constant relative risk aversion utility function in a complete market under the reinforcement learning framework. To induce exploration, we introduce the Tsallis entropy regularizer, which…

Machine Learning · Computer Science 2025-02-04 Chen Ziyi , Gu Jia-wen

In this paper, we present a new class of Markov decision processes (MDPs), called Tsallis MDPs, with Tsallis entropy maximization, which generalizes existing maximum entropy reinforcement learning (RL). A Tsallis MDP provides a unified…

Machine Learning · Computer Science 2019-02-08 Kyungjae Lee , Sungyub Kim , Sungbin Lim , Sungjoon Choi , Songhwai Oh

By using the maximum entropy principle with Tsallis entropy we obtain a fragment size distribution function which undergoes a transition to scaling. This distribution function reduces to those obtained by other authors using Shannon…

Soft Condensed Matter · Physics 2015-06-24 Oscar Sotolongo-Costa , Arezky H. Rodriguez , G. J. Rodgers

A new method is proposed for analyzing complexity and studying the information in random geometric networks using Tsallis entropy tool. Tsallis entropy of the ensemble of random geometric networks is calculated based on the components of…

Statistical Mechanics · Physics 2025-02-20 O. K. Kazemi , S. M. Taheri

In this paper, we provide a generalized framework for Variational Inference-Stochastic Optimal Control by using thenon-extensive Tsallis divergence. By incorporating the deformed exponential function into the optimality likelihood function,…

Machine Learning · Computer Science 2021-04-02 Ziyi Wang , Oswin So , Jason Gibson , Bogdan Vlahov , Manan S. Gandhi , Guan-Horng Liu , Evangelos A. Theodorou

We study optimal control in models with latent factors where the agent controls the distribution over actions, rather than actions themselves, in both discrete and continuous time. To encourage exploration of the state space, we reward…

Mathematical Finance · Quantitative Finance 2024-01-03 Ryan Donnelly , Sebastian Jaimungal

In this paper, we consider the information content of maximum ranked set sampling procedure with unequal samples (MRSSU) in terms of Tsallis entropy which is a nonadditive generalization of Shannon entropy. We obtain several results of…

Statistics Theory · Mathematics 2020-11-04 S. Tahmasebi , M. Longobardi , M. R. Kazemi , M. Alizadeh

This study derived the vertical distribution of streamwise velocity in wide open channels by maximizing Tsallis entropy, in accordance with the maximum entropy principle, subject to the total probability rule and the conservation of mass,…

Computational Physics · Physics 2024-03-04 Manotosh Kumbhakar , Rajendra K. Ray , Suvra Kanti Chakraborty , Koeli Ghoshal , Vijay P. Singh

In this paper, we investigate new procedures for statistical testing based on Tsallis entropy, a parametric generalization of Shannon entropy. Focusing on multivariate generalized Gaussian and $q$-Gaussian distributions, we develop…

Methodology · Statistics 2025-06-18 Mehmet Sıddık Çadırcı

In this paper, we present a novel maximum entropy formulation of the Differential Dynamic Programming algorithm and derive two variants using unimodal and multimodal value functions parameterizations. By combining the maximum entropy…

Optimization and Control · Mathematics 2022-03-01 Oswin So , Ziyi Wang , Evangelos A. Theodorou

We introduce a variational algorithm based on Matrix Product States that is trained by minimizing a generalized free energy defined using Tsallis entropy instead of the standard Gibbs entropy. As a result, our model can generate the…

Statistical Mechanics · Physics 2024-09-16 Pablo Díez-Valle , Fernando Martínez-García , Juan José García-Ripoll , Diego Porras

Within a framework of utmost generality, we show that the entropy maximization procedure with linear constraints uniquely leads to the Shannon-Boltzmann-Gibbs entropy. Therefore, the use of this procedure with linear constraints should not…

Statistical Mechanics · Physics 2018-05-01 Thomas Oikonomou , G. Baris Bagci

This paper studies the continuous-time reinforcement learning in jump-diffusion models by featuring the q-learning (the continuous-time counterpart of Q-learning) under Tsallis entropy regularization. Contrary to the Shannon entropy, the…

Optimization and Control · Mathematics 2026-02-16 Lijun Bo , Yijie Huang , Xiang Yu , Tingting Zhang

We present a novel second-order trajectory optimization algorithm based on Stein Variational Newton's Method and Maximum Entropy Differential Dynamic Programming. The proposed algorithm, called Stein Variational Differential Dynamic…

Optimization and Control · Mathematics 2024-10-10 Yuichiro Aoyama , Peter Lehmamnn , Evangelos A. Theodorou

In this paper, inspired from our previous algorithm, which was based on the theory of Tsallis statistical mechanics, we develop a new evolving stochastic learning algorithm for neural networks. The new algorithm combines deterministic and…

Neural and Evolutionary Computing · Computer Science 2009-11-11 Aristoklis D. Anastasiadis , George D. Magoulas

The construction of efficient and effective decision trees remains a key topic in machine learning because of their simplicity and flexibility. A lot of heuristic algorithms have been proposed to construct near-optimal decision trees. ID3,…

Machine Learning · Statistics 2016-08-24 Yisen Wang , Chaobing Song , Shu-Tao Xia

Algorithmic entropy and Shannon entropy are two conceptually different information measures, as the former is based on size of programs and the later in probability distributions. However, it is known that, for any recursive probability…

Information Theory · Computer Science 2010-06-03 Andreia Teixeira , Andre Souto , Armando Matos , Luis Antunes

Sample-based trajectory optimisers are a promising tool for the control of robotics with non-differentiable dynamics and cost functions. Contemporary approaches derive from a restricted subclass of stochastic optimal control where the…

Robotics · Computer Science 2021-10-07 Tom Lefebvre , Guillaume Crevecoeur

An amended MaxEnt formulation for systems displaced from the conventional MaxEnt equilibrium is proposed. This formulation involves the minimization of the Kullback-Leibler divergence to a reference $Q$ (or maximization of Shannon…

Mathematical Physics · Physics 2009-11-11 Jean-François Bercher

Optimization of expensive computer models with the help of Gaussian process emulators in now commonplace. However, when several (competing) objectives are considered, choosing an appropriate sampling strategy remains an open question. We…

Optimization and Control · Mathematics 2013-10-03 Victor Picheny
‹ Prev 1 2 3 10 Next ›