English

Bandit problems with Levy processes

Probability 2015-08-23 v1

Abstract

Bandit problems model the trade-off between exploration and exploitation in various decision problems. We study two-armed bandit problems in continuous time, where the risky arm can have two types: High or Low; both types yield stochastic payoffs generated by a Levy process. We show that the optimal strategy is a cut-off strategy and we provide an explicit expression for the cut-off and for the optimal payoff.

Keywords

Cite

@article{arxiv.1407.7241,
  title  = {Bandit problems with Levy processes},
  author = {Asaf Cohen and Eilon Solan},
  journal= {arXiv preprint arXiv:1407.7241},
  year   = {2015}
}

Comments

arXiv admin note: text overlap with arXiv:0906.0835

R2 v1 2026-06-22T05:14:15.756Z