Related papers: Learning in Networked Control Systems

Minimal Expected Regret in Linear Quadratic Control

We consider the problem of online learning in Linear Quadratic Control systems whose state transition and state-action transition matrices $A$ and $B$ may be initially unknown. We devise an online learning algorithm and provide guarantees…

Machine Learning · Computer Science 2021-09-30 Yassir Jedra, Alexandre Proutiere

Regret Bounds for Adaptive Nonlinear Control

We study the problem of adaptively controlling a known discrete-time nonlinear system subject to unmodeled disturbances. We prove the first finite-time regret bounds for adaptive nonlinear control with matched uncertainty in the stochastic…

Machine Learning · Computer Science 2020-11-30 Nicholas M. Boffi, Stephen Tu, Jean-Jacques E. Slotine

Sequential Decision Problems with Missing Feedback

This paper investigates the challenges of optimal online policy learning under missing data. State-of-the-art algorithms implicitly assume that rewards are always observable. I show that when rewards are missing at random, the Upper…

Econometrics · Economics 2025-07-29 Filippo Palomba

Learning Decentralized Linear Quadratic Regulators with $\sqrt{T}$ Regret

We propose an online learning algorithm that adaptively designs a decentralized linear quadratic regulator when the system model is unknown a priori and new data samples from a single system trajectory become progressively available. The…

Optimization and Control · Mathematics 2024-07-08 Lintao Ye, Ming Chi, Ruiquan Liao, Vijay Gupta

Safe Adaptive Learning-based Control for Constrained Linear Quadratic Regulators with Regret Guarantees

We study the adaptive control of an unknown linear system with a quadratic cost function subject to safety constraints on both the states and actions. The challenges of this problem arise from the tension among safety, exploration,…

Systems and Control · Electrical Eng. & Systems 2021-11-02 Yingying Li, Subhro Das, Jeff Shamma, Na Li

Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits

By leveraging the representation power of deep neural networks, neural upper confidence bound (UCB) algorithms have shown success in contextual bandits. To further balance the exploration and exploitation, we propose…

Machine Learning · Computer Science 2025-03-12 Ha Manh Bui, Enrique Mallada, Anqi Liu

Regret Lower Bounds for Learning Linear Quadratic Gaussian Systems

We establish regret lower bounds for adaptively controlling an unknown linear Gaussian system with quadratic costs. We combine ideas from experiment design, estimation theory and a perturbation bound of certain information matrices to…

Machine Learning · Computer Science 2024-06-13 Ingvar Ziemann, Henrik Sandberg

Collaborative Multi-agent Stochastic Linear Bandits

We study a collaborative multi-agent stochastic linear bandit setting, where $N$ agents that form a network communicate locally to minimize their overall regret. In this setting, each agent has its own linear bandit problem (its own reward…

Machine Learning · Computer Science 2022-05-16 Ahmadreza Moradipari, Mohammad Ghavamzadeh, Mahnoosh Alizadeh

Continuous-time Online Learning via Mean-Field Neural Networks: Regret Analysis in Diffusion Environments

We study continuous-time online learning where data are generated by a diffusion process with unknown coefficients. The learner employs a two-layer neural network, continuously updating its parameters in a non-anticipative manner. The…

Machine Learning · Computer Science 2026-04-14 Erhan Bayraktar, Bingyan Han, Ziqing Zhang

Foundations of Safe Online Reinforcement Learning in the Linear Quadratic Regulator: $\sqrt{T}$-Regret

Understanding how to efficiently learn while adhering to safety constraints is essential for using online reinforcement learning in practical applications. However, proving rigorous regret bounds for safety-constrained reinforcement…

Machine Learning · Statistics 2025-04-29 Benjamin Schiffer, Lucas Janson

Cooperative Online Learning: Keeping your Neighbors Updated

We study an asynchronous online learning setting with a network of agents. At each time step, some of the agents are activated, requested to make a prediction, and pay the corresponding loss. The loss function is then revealed to these…

Machine Learning · Computer Science 2020-01-16 Nicolò Cesa-Bianchi, Tommaso R. Cesari, Claire Monteleoni

Control design and analysis of a stochastic network control system

A Network Control System (NCS) consists of control components that interact with the plant over a shared network. The system dynamics of a NCS could be subject to noise arising from randomness in the times at which the data is transmitted…

Systems and Control · Computer Science 2017-04-04 Mohammad Soltani, Abhyudai Singh

Balancing Exploration for Online Receding Horizon Learning Control with Provable Regret Guarantees

We address the problem of simultaneous learning and control in an online receding horizon control setting. We consider the control of an unknown linear dynamical system with general cost functions and affine constraints on the control…

Optimization and Control · Mathematics 2022-11-02 Deepan Muthirayan, Jianjun Yuan, Pramod P. Khargonekar

Learning to target with network interference

This paper studies adaptive targeting under network interference in a bandit setting, where treatments applied to one individual may affect others through spillover effects. We consider a linear model in a sparse regime, where each…

Machine Learning · Statistics 2026-05-28 Xiaomeng Wang, Hamsa Bastani, Osbert Bastani, Zhimei Ren

Steady-state Based Approach to Online Non-stochastic Control

We study the problem of online non-stochastic control (ONC), which is the control of a linear system under adversarial disturbances and adversarial cost functions, with the aim of minimizing the total cost incurred. A recent line of…

Optimization and Control · Mathematics 2026-04-21 Vijeth Hebbar, Spencer Hutchinson, Mahnoosh Alizadeh, Cédric Langbort

Adaptive Policy Learning Under Unknown Network Interference

Adaptive experimentation under unknown network interference requires solving two coupled problems: (i) learning the underlying dynamics of interference among units and (ii) using these dynamics to inform treatment allocation in order to…

Machine Learning · Statistics 2026-05-13 Aidan Gleich, Eric Laber, Alexander Volfovsky

A probabilistic algorithm for scheduling networked control systems under data losses

This paper deals with the design of scheduling logics for networked control systems (NCSs) whose communication networks have limited capacity and are prone to data losses. Our contributions are twofold. First, we present a probabilistic…

Systems and Control · Electrical Eng. & Systems 2024-02-23 Anubhab Dasgupta, Darsana Udayakumar, Atreyee Kundu

Logarithmic Regret for Nonlinear Control

We address the problem of learning to control an unknown nonlinear dynamical system through sequential interactions. Motivated by high-stakes applications in which mistakes can be catastrophic, such as robotics and healthcare, we study…

Machine Learning · Computer Science 2025-04-14 James Wang, Bruce D. Lee, Ingvar Ziemann, Nikolai Matni

Q-Learning with Fine-Grained Gap-Dependent Regret

We study fine-grained gap-dependent regret bounds for model-free reinforcement learning in episodic tabular Markov Decision Processes. Existing model-free algorithms achieve minimax worst-case regret, but their gap-dependent bounds remain…

Machine Learning · Statistics 2025-10-09 Haochen Zhang, Zhong Zheng, Lingzhou Xue

Online Control of Linear Systems under Unbounded Noise

This paper investigates the problem of controlling a linear system under possibly unbounded stochastic noise with unknown convex cost functions, known as an online control problem. In contrast to the existing work, which assumes the…

Systems and Control · Electrical Eng. & Systems 2025-06-03 Kaito Ito, Taira Tsuchiya