Related papers: On Avoidance Learning with Partial Observability

Avoidance Learning Using Observational Reinforcement Learning

Imitation learning seeks to learn an expert policy from sampled demonstrations. However, in the real world, it is often difficult to find a perfect expert and avoiding dangerous behaviors becomes relevant for safety reasons. We present the…

Machine Learning · Computer Science 2019-09-26 David Venuto, Leonard Boussioux, Junhao Wang, Rola Dali, Jhelum Chakravorty, Yoshua Bengio, Doina Precup

Learning to Act and Observe in Partially Observable Domains

We consider a learning agent in a partially observable environment, with which the agent has never interacted before, and about which it learns both what it can observe and how its actions affect the environment. The agent can learn about…

Artificial Intelligence · Computer Science 2021-09-14 Thomas Bolander, Nina Gierasimczuk, Andrés Occhipinti Liberman

Apprenticeship Learning for Model Parameters of Partially Observable Environments

We consider apprenticeship learning, i.e., having an agent learn a task by observing an expert demonstrating the task in a partially observable environment when the model of the environment is uncertain. This setting is useful in…

Machine Learning · Computer Science 2012-07-03 Takaki Makino, Johane Takeuchi

Learning from Rational Behavior: Predicting Solutions to Unknown Linear Programs

We define and study the problem of predicting the solution to a linear program (LP) given only partial information about its objective and constraints. This generalizes the problem of learning to predict the purchasing behavior of a…

Data Structures and Algorithms · Computer Science 2016-10-27 Shahin Jabbari, Ryan Rogers, Aaron Roth, Zhiwei Steven Wu

Predict Responsibly: Improving Fairness and Accuracy by Learning to Defer

In many machine learning applications, there are multiple decision-makers involved, both automated and human. The interaction between these agents often goes unaddressed in algorithmic development. In this work, we explore a simple version…

Machine Learning · Statistics 2018-09-10 David Madras, Toniann Pitassi, Richard Zemel

On Planning while Learning

This paper introduces a framework for Planning while Learning where an agent is given a goal to achieve in an environment whose behavior is only partially known to the agent. We discuss the tractability of various plan-design processes. We…

Artificial Intelligence · Computer Science 2014-11-17 S. Safra, M. Tennenholtz

Learning to Abstain from Binary Prediction

A binary classifier capable of abstaining from making a label prediction has two goals in tension: minimizing errors, and avoiding abstaining unnecessarily often. In this work, we exactly characterize the best achievable tradeoff between…

Machine Learning · Computer Science 2016-11-30 Akshay Balsubramani

Observational Learning by Reinforcement Learning

Observational learning is a type of learning that occurs as a function of observing, retaining and possibly replicating or imitating the behaviour of another agent. It is a core mechanism appearing in various instances of social learning…

Machine Learning · Computer Science 2017-06-22 Diana Borsa, Bilal Piot, Rémi Munos, Olivier Pietquin

A Deterministic Protocol for Sequential Asymptotic Learning

In the classic herding model, agents receive private signals about an underlying binary state of nature, and act sequentially to choose one of two possible actions, after observing the actions of their predecessors. We investigate what…

Computer Science and Game Theory · Computer Science 2018-02-21 Yu Cheng, Wade Hann-Caruthers, Omer Tamuz

Bayesian Nonparametric Feature and Policy Learning for Decision-Making

Learning from demonstrations has gained increasing interest in the recent past, enabling an agent to learn how to make decisions by observing an experienced teacher. While many approaches have been proposed to solve this problem, there is…

Machine Learning · Computer Science 2017-02-28 Jürgen Hahn, Abdelhak M. Zoubir

Active learning algorithm through the lens of rejection arguments

Active learning is a paradigm of machine learning which aims at reducing the amount of labeled data needed to train a classifier. Its overall principle is to sequentially select the most informative data points, which amounts to determining…

Statistics Theory · Mathematics 2022-09-01 Christophe Denis, Mohamed Hebiri, Boris Ndjia Njike, Xavier Siebert

Policy Learning with Abstention

Policy learning algorithms are widely used in areas such as personalized medicine and advertising to develop individualized treatment regimes. However, most methods force a decision even when predictions are uncertain, which is risky in…

Machine Learning · Computer Science 2026-01-30 Ayush Sawarni, Jikai Jin, Justin Whitehouse, Vasilis Syrgkanis

Intent-aware Multi-agent Reinforcement Learning

This paper proposes an intent-aware multi-agent planning framework as well as a learning algorithm. Under this framework, an agent plans in the goal space to maximize the expected utility. The planning process takes the belief of other…

Artificial Intelligence · Computer Science 2018-03-07 Siyuan Qi, Song-Chun Zhu

Switching to Learn

A network of agents attempts to learn some unknown state of the world drawn by nature from a finite set. Agents observe private signals conditioned on the true state, and form beliefs about the unknown state accordingly. Each agent may face…

Machine Learning · Computer Science 2015-03-13 Shahin Shahrampour, Mohammad Amin Rahimian, Ali Jadbabaie

On Assessing The Safety of Reinforcement Learning algorithms Using Formal Methods

The increasing adoption of Reinforcement Learning in safety-critical systems domains such as autonomous vehicles, health, and aviation raises the need for ensuring their safety. Existing safety mechanisms such as adversarial training,…

Machine Learning · Computer Science 2021-11-11 Paulina Stevia Nouwou Mindom, Amin Nikanjam, Foutse Khomh, John Mullins

Modeling the effects of environmental and perceptual uncertainty using deterministic reinforcement learning dynamics with partial observability

Assessing the systemic effects of uncertainty that arises from agents' partial observation of the true states of the world is critical for understanding a wide range of scenarios. Yet, previous modeling work on agent learning and…

Adaptation and Self-Organizing Systems · Physics 2022-04-15 Wolfram Barfuss, Richard P. Mann

Learning Purposeful Behaviour in the Absence of Rewards

Artificial intelligence is commonly defined as the ability to achieve goals in the world. In the reinforcement learning framework, goals are encoded as reward functions that guide agent behaviour, and the sum of observed rewards provides a…

Machine Learning · Computer Science 2016-05-26 Marlos C. Machado, Michael Bowling

Reinforcement Learning from Passive Data via Latent Intentions

Passive observational data, such as human videos, is abundant and rich in information, yet remains largely untapped by current RL methods. Perhaps surprisingly, we show that passive data, despite not having reward or action labels, can…

Machine Learning · Computer Science 2023-04-12 Dibya Ghosh, Chethan Bhateja, Sergey Levine

Efficient Active Learning with Abstention

The goal of active learning is to achieve the same accuracy achievable by passive learning, while using far fewer labels. Exponential savings in terms of label complexity have been proved in very special cases, but fundamental lower bounds…

Machine Learning · Statistics 2026-01-01 Yinglun Zhu, Robert Nowak

Learning to Make Adherence-Aware Advice

As artificial intelligence (AI) systems play an increasingly prominent role in human decision-making, challenges surface in the realm of human-AI interactions. One challenge arises from the suboptimal AI policies due to the inadequate…

Machine Learning · Statistics 2024-03-22 Guanting Chen, Xiaocheng Li, Chunlin Sun, Hanzhao Wang