Related papers: Reciprocal Learning

Reciprocal Learning Networks for Human Trajectory Prediction

We observe that the human trajectory is not only forward predictable, but also backward predictable. Both forward and backward trajectories follow the same social norms and obey the same physical constraints with the only difference in…

Computer Vision and Pattern Recognition · Computer Science 2020-04-10 Hao Sun, Zhiqun Zhao, Zhihai He

Reinforcement Learning in Economics and Finance

Reinforcement learning algorithms describe how an agent can learn an optimal action policy in a sequential decision process, through repeated experience. In a given environment, the agent policy provides him some running and terminal…

Theoretical Economics · Economics 2020-03-24 Arthur Charpentier, Romuald Elie, Carl Remlinger

Reinforcement Learning with an Abrupt Model Change

The problem of reinforcement learning is considered where the environment or the model undergoes a change. An algorithm is proposed that an agent can apply in such a problem to achieve the optimal long-time discounted reward. The algorithm…

Systems and Control · Electrical Eng. & Systems 2023-04-25 Wuxia Chen, Taposh Banerjee, Jemin George, Carl Busart

Contextual Bandit Learning with Predictable Rewards

Contextual bandit learning is a reinforcement learning problem where the learner repeatedly receives a set of features (context), takes an action and receives a reward based on the action and context. We consider this problem under a…

Machine Learning · Computer Science 2012-03-05 Alekh Agarwal, Miroslav Dudík, Satyen Kale, John Langford, Robert E. Schapire

Imitation Learning by Reinforcement Learning

Imitation learning algorithms learn a policy from demonstrations of expert behavior. We show that, for deterministic experts, imitation learning can be done by reduction to reinforcement learning with a stationary reward. Our theoretical…

Machine Learning · Statistics 2022-03-16 Kamil Ciosek

Continual Auxiliary Task Learning

Learning auxiliary tasks, such as multiple predictions about the world, can provide many benefits to reinforcement learning systems. A variety of off-policy learning algorithms have been developed to learn such predictions, but as yet there…

Machine Learning · Computer Science 2022-02-24 Matthew McLeod, Chunlok Lo, Matthew Schlegel, Andrew Jacobsen, Raksha Kumaraswamy, Martha White, Adam White

Learning Models for Actionable Recourse

As machine learning models are increasingly deployed in high-stakes domains such as legal and financial decision-making, there has been growing interest in post-hoc methods for generating counterfactual explanations. Such explanations…

Machine Learning · Computer Science 2022-03-22 Alexis Ross, Himabindu Lakkaraju, Osbert Bastani

Online Reciprocal Recommendation with Theoretical Performance Guarantees

A reciprocal recommendation problem is one where the goal of learning is not just to predict a user's preference towards a passive item (e.g., a book), but to recommend the targeted user on one side another user from the other side such…

Machine Learning · Computer Science 2018-06-05 Fabio Vitale, Nikos Parotsidis, Claudio Gentile

Generalization Bounds and Stopping Rules for Learning with Self-Selected Data

Many learning paradigms self-select training data in light of previously learned parameters. Examples include active learning, semi-supervised learning, bandits, or boosting. Rodemann et al. (2024) unify them under the framework of…

Machine Learning · Computer Science 2025-05-13 Julian Rodemann, James Bailie

Robust Learning with Private Information

Firms increasingly delegate decisions to learning algorithms in platform markets. Standard algorithms perform well when platform policies are stationary, but firms often face ambiguity about whether policies are stationary or adapt…

Theoretical Economics · Economics 2026-02-11 Kyohei Okumura

Agreement-based Learning

Model selection is a problem that has occupied machine learning researchers for a long time. Recently, its importance has become evident through applications in deep learning. We propose an agreement-based learning framework that prevents…

Machine Learning · Computer Science 2018-06-05 Emmanouil Antonios Platanios

Prospective Learning in Retrospect

In most real-world applications of artificial intelligence, the distributions of the data and the goals of the learners tend to change over time. The Probably Approximately Correct (PAC) learning framework, which underpins most machine…

Machine Learning · Computer Science 2025-11-13 Yuxin Bai, Cecelia Shuai, Ashwin De Silva, Siyu Yu, Pratik Chaudhari, Joshua T. Vogelstein

Learning the Truth From Only One Side of the Story

Learning under one-sided feedback (i.e., where we only observe the labels for examples we predicted positively on) is a fundamental problem in machine learning -- applications include lending and recommendation systems. Despite this, there…

Machine Learning · Computer Science 2020-10-14 Heinrich Jiang, Qijia Jiang, Aldo Pacchiano

Self-Paced Contextual Reinforcement Learning

Generalization and adaptation of learned skills to novel situations is a core requirement for intelligent autonomous robots. Although contextual reinforcement learning provides a principled framework for learning and generalization of…

Machine Learning · Computer Science 2019-10-08 Pascal Klink, Hany Abdulsamad, Boris Belousov, Jan Peters

Learning with Monotone Adversarial Corruptions

We study the extent to which standard machine learning algorithms rely on exchangeability and independence of data by introducing a monotone adversarial corruption model. In this model, an adversary, upon looking at a "clean" i.i.d.…

Machine Learning · Computer Science 2026-01-06 Kasper Green Larsen, Chirag Pabbaraju, Abhishek Shetty

Online Bandit Learning against an Adaptive Adversary: from Regret to Policy Regret

Online learning algorithms are designed to learn even when their input is generated by an adversary. The widely-accepted formal definition of an online algorithm's ability to learn is the game-theoretic notion of regret. We argue that the…

Machine Learning · Computer Science 2012-07-03 Raman Arora, Ofer Dekel, Ambuj Tewari

Theoretical Models of Learning to Learn

A machine can only learn if it is biased in some way. Typically the bias is supplied by hand, for example through the choice of an appropriate set of features. However, if the learning machine is embedded within an environment of…

Machine Learning · Computer Science 2020-03-02 Jonathan Baxter

Learning What's going on: reconstructing preferences and priorities from opaque transactions

We consider a setting where $n$ buyers, with combinatorial preferences over $m$ items, and a seller, running a priority-based allocation mechanism, repeatedly interact. Our goal, from observing limited information about the results of these…

Computer Science and Game Theory · Computer Science 2014-08-29 Avrim Blum, Yishay Mansour, Jamie Morgenstern

Topological Foundations of Reinforcement Learning

The goal of this work is to serve as a foundation for deep studies of the topology of state, action, and policy spaces in reinforcement learning. By studying these spaces from a mathematical perspective, we expect to gain more insight into…

Machine Learning · Computer Science 2024-10-08 David Krame Kadurha

Inverse Reinforcement Learning from Non-Stationary Learning Agents

In this paper, we study an inverse reinforcement learning problem that involves learning the reward function of a learning agent using trajectory data collected while this agent is learning its optimal policy. To address this problem, we…

Machine Learning · Computer Science 2024-10-21 Kavinayan P. Sivakumar, Yi Shen, Zachary Bell, Scott Nivison, Boyuan Chen, Michael M. Zavlanos