Related papers: The Bayesian Linear Information Filtering Problem

Exploration vs. Exploitation in the Information Filtering Problem

We consider information filtering, in which we face a stream of items too voluminous to process by hand (e.g., scientific articles, blog posts, emails), and must rely on a computer system to automatically filter out irrelevant items. Such…

Optimization and Control · Mathematics 2015-02-10 Xiaoting Zhao , Peter I. Frazier

Bayesian Policy Optimization for Model Uncertainty

Addressing uncertainty is critical for autonomous systems to robustly adapt to the real world. We formulate the problem of model uncertainty as a continuous Bayes-Adaptive Markov Decision Process (BAMDP), where an agent maintains a…

Robotics · Computer Science 2019-05-09 Gilwoo Lee , Brian Hou , Aditya Mandalika , Jeongseok Lee , Sanjiban Choudhury , Siddhartha S. Srinivasa

A Markov Decision Process Analysis of the Cold Start Problem in Bayesian Information Filtering

We consider the information filtering problem, in which we face a stream of items, and must decide which ones to forward to a user to maximize the number of relevant items shown, minus a penalty for each irrelevant item shown. Forwarding…

Machine Learning · Computer Science 2014-10-31 Xiaoting Zhao , Peter I. Frazier

Bayes-Optimal Entropy Pursuit for Active Choice-Based Preference Learning

We analyze the problem of learning a single user's preferences in an active learning setting, sequentially and adaptively querying the user over a finite time horizon. Learning is conducted via choice-based queries, where the user selects…

Machine Learning · Statistics 2017-02-27 Stephen N. Pallone , Peter I. Frazier , Shane G. Henderson

Two-Stage Neural Contextual Bandits for Personalised News Recommendation

We consider the problem of personalised news recommendation where each user consumes news in a sequential fashion. Existing personalised news recommendation methods focus on exploiting user interests and ignores exploration in…

Information Retrieval · Computer Science 2022-06-30 Mengyan Zhang , Thanh Nguyen-Tang , Fangzhao Wu , Zhenyu He , Xing Xie , Cheng Soon Ong

Towards Bursting Filter Bubble via Contextual Risks and Uncertainties

A rising topic in computational journalism is how to enhance the diversity in news served to subscribers to foster exploration behavior in news reading. Despite the success of preference learning in personalized news recommendation, their…

Machine Learning · Statistics 2017-07-03 Rikiya Takahashi , Shunan Zhang

Bayesian Non-stationary Linear Bandits for Large-Scale Recommender Systems

Taking advantage of contextual information can potentially boost the performance of recommender systems. In the era of big data, such side information often has several dimensions. Thus, developing decision-making algorithms to cope with…

Machine Learning · Computer Science 2023-07-26 Saeed Ghoorchian , Evgenii Kortukov , Setareh Maghsudi

Deep Bayesian Bandits: Exploring in Online Personalized Recommendations

Recommender systems trained in a continuous learning fashion are plagued by the feedback loop problem, also known as algorithmic bias. This causes a newly trained model to act greedily and favor items that have already been engaged by…

Machine Learning · Computer Science 2020-08-04 Dalin Guo , Sofia Ira Ktena , Ferenc Huszar , Pranay Kumar Myana , Wenzhe Shi , Alykhan Tejani

Towards Bayesian Data Selection

A wide range of machine learning algorithms iteratively add data to the training sample. Examples include semi-supervised learning, active learning, multi-armed bandits, and Bayesian optimization. We embed this kind of data addition into…

Machine Learning · Statistics 2024-06-25 Julian Rodemann

Recurrent Neural Networks and Universal Approximation of Bayesian Filters

We consider the Bayesian optimal filtering problem: i.e. estimating some conditional statistics of a latent time-series signal from an observation sequence. Classical approaches often rely on the use of assumed or estimated transition and…

Machine Learning · Statistics 2023-03-16 Adrian N. Bishop , Edwin V. Bonilla

Bayesian Learning of Optimal Policies in Markov Decision Processes with Countably Infinite State-Space

Models of many real-life applications, such as queuing models of communication networks or computing systems, have a countably infinite state-space. Algorithmic and learning procedures that have been developed to produce optimal policies…

Systems and Control · Electrical Eng. & Systems 2024-03-19 Saghar Adler , Vijay Subramanian

An Analytically Tractable Bayesian Approximation to Optimal Point Process Filtering

The process of dynamic state estimation (filtering) based on point process observations is in general intractable. Numerical sampling techniques are often practically useful, but lead to limited conceptual insight about optimal…

Machine Learning · Statistics 2015-08-04 Yuval Harel , Ron Meir , Manfred Opper

Optimal Learning for Sequential Decision Making for Expensive Cost Functions with Stochastic Binary Feedbacks

We consider the problem of sequentially making decisions that are rewarded by "successes" and "failures" which can be predicted through an unknown relationship that depends on a partially controllable vector of attributes for each instance.…

Machine Learning · Statistics 2017-09-18 Yingfei Wang , Chu Wang , Warren Powell

Sequential Decision Making on Unmatched Data using Bayesian Kernel Embeddings

The problem of sequentially maximizing the expectation of a function seeks to maximize the expected value of a function of interest without having direct control on its features. Instead, the distribution of such features depends on a given…

Machine Learning · Statistics 2022-10-26 Diego Martinez-Taboada , Dino Sejdinovic

Bayesian Active Learning for Classification and Preference Learning

Information theoretic active learning has been widely studied for probabilistic models. For simple regression an optimal myopic policy is easily tractable. However, for other tasks and with more complex models, such as classification with…

Machine Learning · Statistics 2011-12-30 Neil Houlsby , Ferenc Huszár , Zoubin Ghahramani , Máté Lengyel

Adaptive, Robust and Scalable Bayesian Filtering for Online Learning

In this thesis, we introduce Bayesian filtering as a principled framework for tackling diverse sequential machine learning problems, including online (continual) learning, prequential (one-step-ahead) forecasting, and contextual bandits. To…

Machine Learning · Statistics 2025-05-13 Gerardo Duran-Martin

A Bayesian Theory of Change Detection in Statistically Periodic Random Processes

A new class of stochastic processes called independent and periodically identically distributed (i.p.i.d.) processes is defined to capture periodically varying statistical behavior. A novel Bayesian theory is developed for detecting a…

Signal Processing · Electrical Eng. & Systems 2019-04-09 Taposh Banerjee , Prudhvi Gurram , Gene Whipps

Fully probabilistic design for knowledge fusion between Bayesian filters under uniform disturbances

This paper considers the problem of Bayesian transfer learning-based knowledge fusion between linear state-space processes driven by uniform state and observation noise processes. The target task conditions on probabilistic state…

Machine Learning · Computer Science 2021-09-23 Lenka Kuklišová Pavelková , Ladislav Jirsa , Anthony Quinn

Bayesian preference elicitation for multiobjective combinatorial optimization

We introduce a new incremental preference elicitation procedure able to deal with noisy responses of a Decision Maker (DM). The originality of the contribution is to propose a Bayesian approach for determining a preferred solution in a…

Artificial Intelligence · Computer Science 2020-07-30 Nadjet Bourdache , Patrice Perny , Olivier Spanjaard

An Information-Theoretic Analysis of Nonstationary Bandit Learning

In nonstationary bandit learning problems, the decision-maker must continually gather information and adapt their action selection as the latent state of the environment evolves. In each time period, some latent optimal action maximizes…

Machine Learning · Computer Science 2023-12-27 Seungki Min , Daniel Russo