Related papers: Cognitive Medium Access: Exploration, Exploitation…

Optimal Medium Access Protocols for Cognitive Radio Networks

This paper focuses on the design of medium access control protocols for cognitive radio networks. The scenario in which a single cognitive user wishes to opportunistically exploit the availability of empty frequency bands within parts of…

Information Theory · Computer Science 2008-02-20 Lifeng Lai , Hesham El Gamal , Hai Jiang , H. Vincent Poor

Optimal Medium Access Control in Cognitive Radios: A Sequential Design Approach

The design of medium access control protocols for a cognitive user wishing to opportunistically exploit frequency bands within parts of the radio spectrum having multiple bands is considered. In the scenario under consideration, the…

Information Theory · Computer Science 2016-11-17 Lifeng Lai , Hesham El Gamal , Hai Jiang , H. Vincent Poor

Multi-user lax communications: a multi-armed bandit approach

Inspired by cognitive radio networks, we consider a setting where multiple users share several channels modeled as a multi-user multi-armed bandit (MAB) problem. The characteristics of each channel are unknown and are different for each…

Machine Learning · Computer Science 2015-12-03 Orly Avner , Shie Mannor

Multi-channel Opportunistic Access: A Case of Restless Bandits with Multiple Plays

This paper considers the following stochastic control problem that arises in opportunistic spectrum access: a system consists of n channels (Gilbert-Elliot channels)where the state (good or bad) of each channel evolves as independent and…

Information Theory · Computer Science 2009-10-13 Sahand Haji Ali Ahmad , Mingyan Liu

Bayesian Incentive-Compatible Bandit Exploration

Individual decision-makers consume information revealed by the previous decision makers, and produce information that may help in future decisions. This phenomenon is common in a wide range of scenarios in the Internet economy, as well as…

Computer Science and Game Theory · Computer Science 2019-05-06 Yishay Mansour , Aleksandrs Slivkins , Vasilis Syrgkanis

Channel Exploration and Exploitation with Imperfect Spectrum Sensing in Cognitive Radio Networks

In this paper, the problem of opportunistic channel sensing and access in cognitive radio networks when the sensing is imperfect and a secondary user has limited traffic to send at a time is investigated. Primary users' statistical…

Information Theory · Computer Science 2012-01-23 Zhou Zhang , Hai Jiang , Peng Tan , Jim Slevinsky

Decision Maker using Coupled Incompressible-Fluid Cylinders

The multi-armed bandit problem (MBP) is the problem of finding, as accurately and quickly as possible, the most profitable option from a set of options that gives stochastic rewards by referring to past experiences. Inspired by fluctuated…

Artificial Intelligence · Computer Science 2015-02-16 Song-Ju Kim , Masashi Aono

Multi-user Communication Networks: A Coordinated Multi-armed Bandit Approach

Communication networks shared by many users are a widespread challenge nowadays. In this paper we address several aspects of this challenge simultaneously: learning unknown stochastic network characteristics, sharing resources with other…

Machine Learning · Computer Science 2018-08-16 Orly Avner , Shie Mannor

Decentralized Stochastic Multi-Player Multi-Armed Walking Bandits

Multi-player multi-armed bandit is an increasingly relevant decision-making problem, motivated by applications to cognitive radio systems. Most research for this problem focuses exclusively on the settings that players have \textit{full…

Machine Learning · Computer Science 2022-12-14 Guojun Xiong , Jian Li

Multi-Armed Bandits in Brain-Computer Interfaces

The multi-armed bandit (MAB) problem models a decision-maker that optimizes its actions based on current and acquired new knowledge to maximize its reward. This type of online decision is prominent in many procedures of Brain-Computer…

Artificial Intelligence · Computer Science 2022-11-10 Frida Heskebeck , Carolina Bergeling , Bo Bernhardsson

Productization Challenges of Contextual Multi-Armed Bandits

Contextual Multi-Armed Bandits is a well-known and accepted online optimization algorithm, that is used in many Web experiences to tailor content or presentation to users' traffic. Much has been published on theoretical guarantees (e.g.…

Information Retrieval · Computer Science 2019-07-12 David Abensur , Ivan Balashov , Shaked Bar , Ronny Lempel , Nurit Moscovici , Ilan Orlov , Danny Rosenstein , Ido Tamir

On Optimality of Myopic Policy for Restless Multi-armed Bandit Problem with Non i.i.d. Arms and Imperfect Detection

We consider the channel access problem in a multi-channel opportunistic communication system with imperfect channel sensing, where the state of each channel evolves as a non independent and identically distributed Markov process. This…

Systems and Control · Computer Science 2015-06-05 Kehao Wang , Lin Chen , Quan Liu , Khaldoun Al Agha

Multi-user Cognitive Interference Channels: A Survey and New Capacity Results

This paper provides a survey of the state-of-the-art information theoretic analysis for overlay multi-user (more than two pairs) cognitive networks and reports new capacity results. In an overlay scenario, cognitive / secondary users share…

Information Theory · Computer Science 2015-10-26 Diana Maamari , Daniela Tuninetti , Natasha Devroye

Structure and Optimality of Myopic Policy in Opportunistic Access with Noisy Observations

A restless multi-armed bandit problem that arises in multichannel opportunistic communications is considered, where channels are modeled as independent and identical Gilbert-Elliot channels and channel state observations are subject to…

Networking and Internet Architecture · Computer Science 2008-11-13 Qing Zhao , Bhaskar Krishnamachari

Pure Exploration in Asynchronous Federated Bandits

We study the federated pure exploration problem of multi-armed bandits and linear bandits, where $M$ agents cooperatively identify the best arm via communicating with the central server. To enhance the robustness against latency and…

Machine Learning · Computer Science 2024-10-01 Zichen Wang , Chuanhao Li , Chenyu Song , Lianghui Wang , Quanquan Gu , Huazheng Wang

Decentralized Online Learning Algorithms for Opportunistic Spectrum Access

The fundamental problem of multiple secondary users contending for opportunistic spectrum access over multiple channels in cognitive radio networks has been formulated recently as a decentralized multi-armed bandit (D-MAB) problem. In a…

Machine Learning · Computer Science 2011-04-04 Yi Gai , Bhaskar Krishnamachari

An order optimal policy for exploiting idle spectrum in cognitive radio networks

In this paper a spectrum sensing policy employing recency-based exploration is proposed for cognitive radio networks. We formulate the problem of finding a spectrum sensing policy for multi-band dynamic spectrum access as a stochastic…

Signal Processing · Electrical Eng. & Systems 2017-09-18 Jan Oksanen , Visa Koivunen

Multi-Armed Bandits with Minimum Aggregated Revenue Constraints

We examine a multi-armed bandit problem with contextual information, where the objective is to ensure that each arm receives a minimum aggregated reward across contexts while simultaneously maximizing the total cumulative reward. This…

Machine Learning · Computer Science 2025-10-15 Ahmed Ben Yahmed , Hafedh El Ferchichi , Marc Abeille , Vianney Perchet

Solving Multi-Arm Bandit Using a Few Bits of Communication

The multi-armed bandit (MAB) problem is an active learning framework that aims to select the best among a set of actions by sequentially observing rewards. Recently, it has become popular for a number of applications over wireless networks,…

Machine Learning · Computer Science 2021-11-12 Osama A. Hanna , Lin F. Yang , Christina Fragouli

A Gang of Bandits

Multi-armed bandit problems are receiving a great deal of attention because they adequately formalize the exploration-exploitation trade-offs arising in several industrially relevant applications, such as online advertisement and, more…

Machine Learning · Computer Science 2013-11-05 Nicolò Cesa-Bianchi , Claudio Gentile , Giovanni Zappella