Related papers: Bandit Linear Control

Non-Stochastic Control with Bandit Feedback

We study the problem of controlling a linear dynamical system with adversarial perturbations where the only feedback available to the controller is the scalar loss, and the loss function itself is unknown. For this problem, with either a…

Machine Learning · Computer Science 2020-08-14 Paula Gradu , John Hallman , Elad Hazan

Tight Rates for Bandit Control Beyond Quadratics

Unlike classical control theory, such as Linear Quadratic Control (LQC), real-world control problems are highly complex. These problems often involve adversarial perturbations, bandit feedback models, and non-quadratic, adversarially chosen…

Machine Learning · Computer Science 2024-10-03 Y. Jennifer Sun , Zhou Lu

Safe Non-Stochastic Control of Linear Dynamical Systems

We study the problem of \textit{safe control of linear dynamical systems corrupted with non-stochastic noise}, and provide an algorithm that guarantees (i) zero constraint violation of convex time-varying constraints, and (ii) bounded…

Systems and Control · Electrical Eng. & Systems 2023-08-25 Hongyu Zhou , Vasileios Tzoumas

Efficient Online Linear Control with Stochastic Convex Costs and Unknown Dynamics

We consider the problem of controlling an unknown linear dynamical system under a stochastic convex cost and full feedback of both the state and cost function. We present a computationally efficient algorithm that attains an optimal…

Optimization and Control · Mathematics 2022-06-23 Asaf Cassel , Alon Cohen , Tomer Koren

Linear Bandits with Feature Feedback

This paper explores a new form of the linear bandit problem in which the algorithm receives the usual stochastic rewards as well as stochastic feedback about which features are relevant to the rewards, the latter feedback being the novel…

Machine Learning · Computer Science 2019-03-13 Urvashi Oswal , Aniruddha Bhargava , Robert Nowak

Adversarial Bandit Optimization with Globally Bounded Perturbations to Linear Losses

We study a class of adversarial bandit optimization problems in which the loss functions may be non-convex and non-smooth. In each round, the learner observes a loss that consists of an underlying linear component together with an…

Machine Learning · Computer Science 2026-03-30 Zhuoyu Cheng , Kohei Hatano , Eiji Takimoto

Geometric Exploration for Online Control

We study the control of an \emph{unknown} linear dynamical system under general convex costs. The objective is minimizing regret vs. the class of disturbance-feedback-controllers, which encompasses all stabilizing…

Machine Learning · Computer Science 2020-10-30 Orestis Plevrakis , Elad Hazan

Online Control with Adversarial Disturbances

We study the control of a linear dynamical system with adversarial disturbances (as opposed to statistical noise). The objective we consider is one of regret: we desire an online control procedure that can do nearly as well as that of a…

Machine Learning · Computer Science 2019-02-26 Naman Agarwal , Brian Bullins , Elad Hazan , Sham M. Kakade , Karan Singh

The Nonstochastic Control Problem

We consider the problem of controlling an unknown linear dynamical system in the presence of (nonstochastic) adversarial perturbations and adversarial convex loss functions. In contrast to classical control, the a priori determination of an…

Machine Learning · Computer Science 2020-01-22 Elad Hazan , Sham M. Kakade , Karan Singh

Online Control of Linear Systems under Unbounded Noise

This paper investigates the problem of controlling a linear system under possibly unbounded stochastic noise with unknown convex cost functions, known as an online control problem. In contrast to the existing work, which assumes the…

Systems and Control · Electrical Eng. & Systems 2025-06-03 Kaito Ito , Taira Tsuchiya

Adversarial bandit optimization for approximately linear functions

We consider a bandit optimization problem for nonconvex and non-smooth functions, where in each trial the loss function is the sum of a linear function and a small but arbitrary perturbation chosen after observing the player's choice. We…

Machine Learning · Computer Science 2026-01-07 Zhuoyu Cheng , Kohei Hatano , Eiji Takimoto

Directional Optimism for Safe Linear Bandits

The safe linear bandit problem is a version of the classical stochastic linear bandit problem where the learner's actions must satisfy an uncertain constraint at all rounds. Due its applicability to many real-world settings, this problem…

Machine Learning · Computer Science 2024-03-13 Spencer Hutchinson , Berkay Turan , Mahnoosh Alizadeh

Optimal Rates for Bandit Nonstochastic Control

Linear Quadratic Regulator (LQR) and Linear Quadratic Gaussian (LQG) control are foundational and extensively researched problems in optimal control. We investigate LQR and LQG problems with semi-adversarial perturbations and time-varying…

Machine Learning · Computer Science 2023-10-26 Y. Jennifer Sun , Stephen Newman , Elad Hazan

Stochastic Linear Bandits Robust to Adversarial Attacks

We consider a stochastic linear bandit problem in which the rewards are not only subject to random noise, but also adversarial attacks subject to a suitable budget $C$ (i.e., an upper bound on the sum of corruption magnitudes across the…

Machine Learning · Statistics 2020-10-29 Ilija Bogunovic , Arpan Losalka , Andreas Krause , Jonathan Scarlett

Online Stochastic Linear Optimization under One-bit Feedback

In this paper, we study a special bandit setting of online stochastic linear optimization, where only one-bit of information is revealed to the learner at each round. This problem has found many applications including online advertisement…

Machine Learning · Computer Science 2015-09-28 Lijun Zhang , Tianbao Yang , Rong Jin , Zhi-Hua Zhou

Linear Bandits with Non-i.i.d. Noise

We study the linear stochastic bandit problem, relaxing the standard i.i.d. assumption on the observation noise. As an alternative to this restrictive assumption, we allow the noise terms across rounds to be sub-Gaussian but interdependent,…

Machine Learning · Statistics 2025-05-28 Baptiste Abélès , Eugenio Clerico , Hamish Flynn , Gergely Neu

Rate-Optimal Online Convex Optimization in Adaptive Linear Control

We consider the problem of controlling an unknown linear dynamical system under adversarially changing convex costs and full feedback of both the state and cost function. We present the first computationally-efficient algorithm that attains…

Machine Learning · Computer Science 2022-06-06 Asaf Cassel , Alon Cohen , Tomer Koren

Making Non-Stochastic Control (Almost) as Easy as Stochastic

Recent literature has made much progress in understanding \emph{online LQR}: a modern learning-theoretic take on the classical control problem in which a learner attempts to optimally control an unknown linear dynamical system with fully…

Machine Learning · Computer Science 2020-10-06 Max Simchowitz

Linear bandits with polylogarithmic minimax regret

We study a noise model for linear stochastic bandits for which the subgaussian noise parameter vanishes linearly as we select actions on the unit sphere closer and closer to the unknown vector. We introduce an algorithm for this problem…

Machine Learning · Computer Science 2025-10-28 Josep Lumbreras , Marco Tomamichel

On the Complexity of Bandit and Derivative-Free Stochastic Convex Optimization

The problem of stochastic convex optimization with bandit feedback (in the learning community) or without knowledge of gradients (in the optimization community) has received much attention in recent years, in the form of algorithms and…

Machine Learning · Computer Science 2013-04-30 Ohad Shamir