Related papers: A Minimum Relative Entropy Principle for Learning …

200 papers

Adaptive control problems are notoriously difficult to solve even in the presence of plant-specific controllers. One way to by-pass the intractable computation of the optimal policy is to restate the adaptive control as the minimization of…

Artificial Intelligence · Computer Science 2010-02-09 Pedro A. Ortega, Daniel A. Braun

Explaining adaptive behavior is a central problem in artificial intelligence research. Here we formalize adaptive agents as mixture distributions over sequences of inputs and outputs (I/O). Each distribution of the mixture constitutes a…

Artificial Intelligence · Computer Science 2009-12-31 Pedro A. Ortega, Daniel A. Braun

Recently, new approaches to adaptive control have sought to reformulate the problem as a minimization of a relative entropy criterion to obtain tractable solutions. In particular, it has been shown that minimizing the expected deviation…

Artificial Intelligence · Computer Science 2010-02-17 Pedro A. Ortega, Daniel A. Braun

When deploying artificial agents in real-world environments where they interact with humans, it is crucial that their behavior is aligned with the values, social norms or other requirements of that environment. However, many environments…

Machine Learning · Computer Science 2023-05-05 Mattijs Baert, Pietro Mazzaglia, Sam Leroux, Pieter Simoens

Modeling the purposeful behavior of imperfect agents from a small number of observations is a challenging task. When restricted to the single-agent decision-theoretic setting, inverse optimal control techniques assume that observed behavior…

Computer Science and Game Theory · Computer Science 2013-08-19 Kevin Waugh, Brian D. Ziebart, J. Andrew Bagnell

Both entropy-minimizing and entropy-maximizing (curiosity) objectives for unsupervised reinforcement learning (RL) have been shown to be effective in different environments, depending on the environment's level of natural entropy. However,…

Machine Learning · Computer Science 2024-08-19 Adriana Hugessen, Roger Creus Castanyer, Faisal Mohamed, Glen Berseth

The principle of maximum entropy is a broadly applicable technique for computing a distribution with the least amount of information possible while constrained to match empirically estimated feature expectations. However, in many real-world…

Machine Learning · Computer Science 2022-08-16 Kenneth Bogert, Yikang Gui, Prashant Doshi

An agent choosing between various actions tends to take the one with the lowest cost. But this choice is arguably too rigid (not adaptive) to be useful in complex situations, e.g., where exploration-exploitation trade-off is relevant in…

Data Analysis, Statistics and Probability · Physics 2018-12-04 Armen E. Allahverdyan, Aram Galstyan, Ali E. Abbas, Zbigniew R. Struzik

This paper addresses the adaptive consensus problem in uncertain multi-agent systems, particularly under challenges posed by quantized communication. We consider agents with general linear dynamics subject to nonlinear uncertainties and…

Optimization and Control · Mathematics 2025-06-10 Woocheol Choi, Piljae Jang

Contextual policy search allows adapting robotic movement primitives to different situations. For instance, a locomotion primitive might be adapted to different terrain inclinations or desired walking speeds. Such an adaptation is often…

Machine Learning · Statistics 2015-11-17 Jan Hendrik Metzen

Modeling the purposeful behavior of imperfect agents from a small number of observations is a challenging task. When restricted to the single-agent decision-theoretic setting, inverse optimal control techniques assume that observed behavior…

Computer Science and Game Theory · Computer Science 2015-03-19 Kevin Waugh, Brian D. Ziebart, J. Andrew Bagnell

In this note, the problem of simultaneous leader-following consensus and parameter estimation is studied for a class of multi-agent systems subject to an uncertain leader system. The leader system is described by a sum of sinusoids with…

Systems and Control · Electrical Eng. & Systems 2020-07-28 Shimin Wang, Xiangyu Meng

We analyze the problem of learning a single user's preferences in an active learning setting, sequentially and adaptively querying the user over a finite time horizon. Learning is conducted via choice-based queries, where the user selects…

Machine Learning · Statistics 2017-02-27 Stephen N. Pallone, Peter I. Frazier, Shane G. Henderson

We address the problem of reinforcement learning in which observations may exhibit an arbitrary form of stochastic dependence on past observations and actions. The task for an agent is to attain the best possible asymptotic reward where the…

Machine Learning · Computer Science 2007-05-23 Daniil Ryabko, Marcus Hutter

We exhibit optimal control strategies for a simple toy problem in which the underlying dynamics depend on a parameter that is initially unknown and must be learned. We consider a cost function posed over a finite time interval, in contrast…

Optimization and Control · Mathematics 2020-02-27 Charles L. Fefferman, Bernat Guillen Pegueroles, Clarence W. Rowley, Melanie Weber

We pose an active perception problem where an autonomous agent actively interacts with a second agent with potentially adversarial behaviors. Given the uncertainty in the intent of the other agent, the objective is to collect further…

Artificial Intelligence · Computer Science 2019-09-20 Macheng Shen, Jonathan P How

Agents acting in the natural world aim at selecting appropriate actions based on noisy and partial sensory observations. Many behaviors leading to decision making and action selection in a closed loop setting are naturally phrased within…

Machine Learning · Statistics 2014-06-30 Alex Susemihl, Ron Meir, Manfred Opper

We describe the results of analytic calculations and computer simulations of adaptive predictors (predictive agents) responding to an evolving chaotic environment and to one another. Our simulations are designed to quantify adaptation and…

adap-org · Physics 2008-02-03 Alfred Hübler, David Pines

This paper modifies Jaynes's axioms of plausible reasoning and derives the minimum relative entropy principle, Bayes's rule, as well as maximum likelihood from first principles. The new axioms, which I call the Optimum Information…

Information Theory · Computer Science 2011-03-30 Alexis Akira Toda

The ideal Bayesian agent reasons from a global probability model, but real agents are restricted to simplified models which they know to be adequate only in restricted circumstances. Very little formal theory has been developed to help…

Artificial Intelligence · Computer Science 2013-03-25 Kathryn Blackmond Laskey