Aditya Modi — Scifaro

Provably Learning from Language Feedback

Interactively learning from observation and language feedback is an increasingly studied area driven by the emergence of large language model (LLM) agents. While impressive empirical demonstrations have been shown, so far a principled…

Machine Learning · Computer Science 2025-06-13 Wanqiao Xu , Allen Nie , Ruijie Zheng , Aditya Modi , Adith Swaminathan , Ching-An Cheng

How to Solve Contextual Goal-Oriented Problems with Offline Datasets?

We present a novel method, Contextual goal-Oriented Data Augmentation (CODA), which uses commonly available unlabeled trajectories and context-goal pairs to solve Contextual Goal-Oriented (CGO) problems. By carefully constructing an…

Machine Learning · Computer Science 2025-05-01 Ying Fan , Jingling Li , Adith Swaminathan , Aditya Modi , Ching-An Cheng

Joint Learning of Linear Time-Invariant Dynamical Systems

Linear time-invariant systems are very popular models in system theory and applications. A fundamental problem in system identification that remains rather unaddressed in extant literature is to leverage commonalities amongst related linear…

Machine Learning · Statistics 2024-01-03 Aditya Modi , Mohamad Kazem Shirani Faradonbeh , Ambuj Tewari , George Michailidis

On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RL

We study reward-free reinforcement learning (RL) under general non-linear function approximation, and establish sample efficiency and hardness results under various standard structural assumptions. On the positive side, we propose the…

Machine Learning · Computer Science 2022-10-25 Jinglin Chen , Aditya Modi , Akshay Krishnamurthy , Nan Jiang , Alekh Agarwal

Model-free Representation Learning and Exploration in Low-rank MDPs

The low rank MDP has emerged as an important model for studying representation learning and exploration in reinforcement learning. With a known representation, several model-free exploration strategies exist. In contrast, all algorithms for…

Machine Learning · Computer Science 2022-06-23 Aditya Modi , Jinglin Chen , Akshay Krishnamurthy , Nan Jiang , Alekh Agarwal

Joint Learning-Based Stabilization of Multiple Unknown Linear Systems

Learning-based control of linear systems received a lot of attentions recently. In popular settings, the true dynamical models are unknown to the decision-maker and need to be interactively learned by applying control inputs to the systems.…

Systems and Control · Electrical Eng. & Systems 2022-01-06 Mohamad Kazem Shirani Faradonbeh , Aditya Modi

Clinician-in-the-Loop Decision Making: Reinforcement Learning with Near-Optimal Set-Valued Policies

Standard reinforcement learning (RL) aims to find an optimal policy that identifies the best action for each state. However, in healthcare settings, many actions may be near-equivalent with respect to the reward (e.g., survival). We…

Machine Learning · Computer Science 2020-07-27 Shengpu Tang , Aditya Modi , Michael W. Sjoding , Jenna Wiens

No-regret Exploration in Contextual Reinforcement Learning

We consider the recently proposed reinforcement learning (RL) framework of Contextual Markov Decision Processes (CMDP), where the agent interacts with a (potentially adversarial) sequence of episodic tabular MDPs. In addition, a context…

Machine Learning · Computer Science 2020-06-19 Aditya Modi , Ambuj Tewari

Sample Complexity of Reinforcement Learning using Linearly Combined Model Ensembles

Reinforcement learning (RL) methods have been shown to be capable of learning intelligent behavior in rich domains. However, this has largely been done in simulated domains without adequate focus on the process of building the simulator. In…

Machine Learning · Computer Science 2019-10-24 Aditya Modi , Nan Jiang , Ambuj Tewari , Satinder Singh

Markov Decision Processes with Continuous Side Information

We consider a reinforcement learning (RL) setting in which the agent interacts with a sequence of episodic MDPs. At the start of each episode the agent has access to some side-information or context that determines the dynamics of the MDP…

Machine Learning · Statistics 2019-10-24 Aditya Modi , Nan Jiang , Satinder Singh , Ambuj Tewari

Metareasoning in Modular Software Systems: On-the-Fly Configuration using Reinforcement Learning with Rich Contextual Representations

Assemblies of modular subsystems are being pressed into service to perform sensing, reasoning, and decision making in high-stakes, time-critical tasks in such areas as transportation, healthcare, and industrial automation. We address the…

Machine Learning · Computer Science 2019-05-15 Aditya Modi , Debadeepta Dey , Alekh Agarwal , Adith Swaminathan , Besmira Nushi , Sean Andrist , Eric Horvitz