Related papers: SOBA: Session optimal MDP-based network friendly r…

Network Friendly Recommendations: Optimizing for Long Viewing Sessions

Caching algorithms try to predict content popularity, and place the content closer to the users. Additionally, nowadays requests are increasingly driven by recommendation systems (RS). These important trends point to the following:…

Networking and Internet Architecture · Computer Science 2021-10-05 Theodoros Giannakas, Pavlos Sermpezis, Thrasyvoulos Spyropoulos

Proactive Caching for Energy-Efficiency in Wireless Networks: A Markov Decision Process Approach

Content caching in wireless networks provides a substantial opportunity to trade off low cost memory storage with energy consumption, yet finding the optimal causal policy with low computational complexity remains a challenge. This paper…

Signal Processing · Electrical Eng. & Systems 2020-01-22 Zhijie Chen, Hoshyar Mohammed, Wei Chen

Service Provisioning and Profit Maximization in Network-assisted Adaptive HTTP Streaming

Adaptive HTTP streaming with centralized consideration of multiple streams has gained increasing interest. It poses a special challenge that the interests of both content provider and network operator need to be deliberately balanced. More…

Networking and Internet Architecture · Computer Science 2015-01-20 Zhisheng Yan, Cedric Westphal, Xin Wang, Chang Wen Chen

One-Shot Session Recommendation Systems with Combinatorial Items

In recent years, content recommendation systems in large websites (or content providers) capture an increased focus. While the type of content varies, e.g. movies, articles, music, advertisements, etc., the high level problem…

Machine Learning · Statistics 2016-07-06 Yahel David, Dotan Di Castro, Zohar Karnin

Constrained Reinforcement Learning for Short Video Recommendation

The wide popularity of short videos on social media poses new opportunities and challenges to optimize recommender systems on the video-sharing platforms. Users provide complex and multi-faceted responses towards recommendations, including…

Machine Learning · Computer Science 2022-05-27 Qingpeng Cai, Ruohan Zhan, Chi Zhang, Jie Zheng, Guangwei Ding, Pinghua Gong, Dong Zheng, Peng Jiang

Optimal Dynamic Multicast Scheduling for Cache-Enabled Content-Centric Wireless Networks

Caching and multicasting at base stations are two promising approaches to support massive content delivery over wireless networks. However, existing scheduling designs do not make full use of the advantages of the two approaches. In this…

Information Theory · Computer Science 2016-02-25 Bo Zhou, Ying Cui, Meixia Tao

A Markov Decision Model for Adaptive Scheduling of Stored Scalable Videos

We propose two scheduling algorithms that seek to optimize the quality of scalably coded videos that have been stored at a video server before transmission. The first scheduling algorithm is derived from a Markov Decision Process (MDP)…

Multimedia · Computer Science 2013-11-26 Chao Chen, Robert W. Heath, Alan C. Bovik, Gustavo de Veciana

The Order of Things: Position-Aware Network-friendly Recommendations in Long Viewing Sessions

Caching has recently attracted a lot of attention in the wireless communications community, as a means to cope with the increasing number of users consuming web content from mobile devices. Caching offers an opportunity for a win-win…

Networking and Internet Architecture · Computer Science 2019-05-14 Theodoros Giannakas, Thrasyvoulos Spyropoulos, Pavlos Sermpezis

Sufficient Markov Decision Processes with Alternating Deep Neural Networks

Advances in mobile computing technologies have made it possible to monitor and apply data-driven interventions across complex systems in real time. Markov decision processes (MDPs) are the primary model for sequential decision problems with…

Methodology · Statistics 2018-03-20 Longshaokan Wang, Eric B. Laber, Katie Witkiewitz

Towards Wi-Fi AP-Assisted Content Prefetching for On-Demand TV Series: A Reinforcement Learning Approach

The emergence of smart Wi-Fi APs (Access Point), which are equipped with huge storage space, opens a new research area on how to utilize these resources at the edge network to improve users' quality of experience (QoE) (e.g., a short…

Multimedia · Computer Science 2017-03-13 Wen Hu, Yichao Jin, Yonggang Wen, Zhi Wang, Lifeng Sun

Finite-Horizon Markov Decision Processes with Sequentially-Observed Transitions

Markov Decision Processes (MDPs) have been used to formulate many decision-making problems in science and engineering. The objective is to synthesize the best decision (action selection) policies to maximize expected rewards (or minimize…

Optimization and Control · Mathematics 2015-07-07 Mahmoud El Chamie, Behcet Acikmese

Parameterized MDPs and Reinforcement Learning Problems -- A Maximum Entropy Principle Based Framework

We present a framework to address a class of sequential decision making problems. Our framework features learning the optimal control policy with robustness to noisy data, determining the unknown state and action parameters, and performing…

Machine Learning · Computer Science 2022-01-20 Amber Srivastava, Srinivasa M Salapaka

Fast Reinforcement Learning for Energy-Efficient Wireless Communications

We consider the problem of energy-efficient point-to-point transmission of delay-sensitive data (e.g. multimedia data) over a fading channel. Existing research on this topic utilizes either physical-layer centric solutions, namely…

Machine Learning · Computer Science 2017-03-29 Nicholas Mastronarde, Mihaela van der Schaar

Maximizing Cumulative User Engagement in Sequential Recommendation: An Online Optimization Perspective

To maximize cumulative user engagement (e.g. cumulative clicks) in sequential recommendation, it is often needed to tradeoff two potentially conflicting objectives, that is, pursuing higher immediate user engagement (e.g., click-through…

Information Retrieval · Computer Science 2020-06-09 Yifei Zhao, Yu-Hang Zhou, Mingdong Ou, Huan Xu, Nan Li

A Systematic Framework for Dynamically Optimizing Multi-User Wireless Video Transmission

In this paper, we formulate the collaborative multi-user wireless video transmission problem as a multi-user Markov decision process (MUMDP) by explicitly considering the users' heterogeneous video traffic characteristics, time-varying…

Multimedia · Computer Science 2009-03-03 Fangwen Fu, Mihaela van der Schaar

Robust Batch Policy Learning in Markov Decision Processes

We study the offline data-driven sequential decision making problem in the framework of Markov decision process (MDP). In order to enhance the generalizability and adaptivity of the learned policy, we propose to evaluate each policy by a…

Statistics Theory · Mathematics 2021-11-11 Zhengling Qi, Peng Liao

Dynamic Service Migration in Mobile Edge Computing Based on Markov Decision Process

In mobile edge computing, local edge servers can host cloud-based services, which reduces network overhead and latency but requires service migrations as users move to new locations. It is challenging to make migration decisions optimally…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-05-10 Shiqiang Wang, Rahul Urgaonkar, Murtaza Zafer, Ting He, Kevin Chan, Kin K. Leung

Towards Multi-agent Reinforcement Learning for Wireless Network Protocol Synthesis

This paper proposes a multi-agent reinforcement learning based medium access framework for wireless networks. The access problem is formulated as a Markov Decision Process (MDP), and solved using reinforcement learning with every network…

Machine Learning · Computer Science 2021-04-30 Hrishikesh Dutta, Subir Biswas

SDM: Sequential Deep Matching Model for Online Large-scale Recommender System

Capturing users' precise preferences is a fundamental problem in large-scale recommender system. Currently, item-based Collaborative Filtering (CF) methods are common matching approaches in industry. However, they are not effective to model…

Information Retrieval · Computer Science 2020-01-01 Fuyu Lv, Taiwei Jin, Changlong Yu, Fei Sun, Quan Lin, Keping Yang, Wilfred Ng

Optimizing Quality of Experience of Dynamic Video Streaming over Fading Wireless Networks

We address the problem of video streaming packets from an Access Point (AP) to multiple clients over a shared wireless channel with fading. In such systems, each client maintains a buffer of packets from which to play the video, and an…

Networking and Internet Architecture · Computer Science 2021-04-27 Rahul Singh, P. R. Kumar