Author

Deepak Dev

results may include different authors with the same name

1 papers

Indexability of Finite State Restless Multi-Armed Bandit and Rollout Policy

We consider finite state restless multi-armed bandit problem. The decision maker can act on M bandits out of N bandits in each time step. The play of arm (active arm) yields state dependent rewards based on action and when the arm is not…

Machine Learning · Computer Science 2023-05-02 Vishesh Mittal , Rahul Meshram , Deepak Dev , Surya Prakash