Related papers: Learning-based Dynamic Cache Management in a Cloud

A Hybrid Cache Architecture for Meeting Per-Tenant Performance Goals in a Private Cloud

The in-memory cache system is an important component in a cloud for the data access performance. As the tenants may have different performance goals for data access depending on the nature of their tasks, effectively managing the memory…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-06-05 Taejoon Kim, Yu Gu, Jinoh Kim

Elastic Provisioning of Cloud Caches: a Cost-aware TTL Approach

We consider elastic resource provisioning in the cloud, focusing on in-memory key-value stores used as caches. Our goal is to dynamically scale resources to the traffic pattern minimizing the overall cost, which includes not only the…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-02-14 Damiano Carra, Giovanni Neglia, Pietro Michiardi

Optimizing SSD Caches for Cloud Block Storage Systems Using Machine Learning Approaches

The growing demand for efficient cloud storage solutions has led to the widespread adoption of Solid-State Drives (SSDs) for caching in cloud block storage systems. The management of data writes to SSD caches plays a crucial role in…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-01-30 Chiyu Cheng, Chang Zhou, Yang Zhao, Jin Cao

Performance-Aware Management of Cloud Resources: A Taxonomy and Future Directions

Dynamic nature of the cloud environment has made distributed resource management process a challenge for cloud service providers. The importance of maintaining the quality of service in accordance with customer expectations as well as the…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-08-08 Sara Kardani-Moghaddam, Rajkumar Buyya, Kotagiri Ramamohanarao

A Dynamic Web Page Prediction Model Based on Access Patterns to Offer Better User Latency

The growth of the World Wide Web has emphasized the need for improvement in user latency. One of the techniques that are used for improving user latency is Caching and another is Web Prefetching. Approaches that bank solely on caching offer…

Networking and Internet Architecture · Computer Science 2011-02-04 Debajyoti Mukhopadhyay, Priyanka Mishra, Dwaipayan Saha, Young-Chon Kim

Dynamic Adaptation in Data Storage: Real-Time Machine Learning for Enhanced Prefetching

The exponential growth of data storage demands has necessitated the evolution of hierarchical storage management strategies [1]. This study explores the application of streaming machine learning [3] to revolutionize data prefetching within…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-01-30 Chiyu Cheng, Chang Zhou, Yang Zhao, Jin Cao

Self-Aware and Self-Adaptive Autoscaling for Cloud Based Services

Modern Internet services are increasingly leveraging on cloud computing for flexible, elastic and on-demand provision. Typically, Quality of Service (QoS) of cloud-based services can be tuned using different underlying cloud configurations…

Software Engineering · Computer Science 2016-08-16 Tao Chen

Online QoS Modeling in the Cloud: A Hybrid and Adaptive Multi-Learners Approach

Given the on-demand nature of cloud computing, managing cloud-based services requires accurate modeling for the correlation between their Quality of Service (QoS) and cloud configurations/resources. The resulted models need to cope with the…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-04-16 Tao Chen, Rami Bahsoon, Xin Yao

Model and Machine Learning based Caching and Routing Algorithms for Cache-enabled Networks

In-network caching is likely to become an integral part of various networked systems (e.g., 5G networks, LPWAN and IoT systems) in the near future. In this paper, we compare and contrast model-based and machine learning approaches for…

Networking and Internet Architecture · Computer Science 2020-04-16 Adita Kulkarni, Anand Seetharam

Optimized Dynamic Cache Instantiation and Accurate LRU Approximations under Time-varying Request Volume

Content-delivery applications can achieve scalability and reduce wide-area network traffic using geographically distributed caches. However, each deployed cache has an associated cost, and under time-varying request rates (e.g., a daily…

Networking and Internet Architecture · Computer Science 2021-12-30 Niklas Carlsson, Derek Eager

Learning-based Caching in Cloud-Aided Wireless Networks

This paper studies content caching in cloud-aided wireless networks where small cell base stations with limited storage are connected to the cloud via limited capacity fronthaul links. By formulating a utility (inverse of service delay)…

Networking and Internet Architecture · Computer Science 2017-10-03 Syed Tamoor-ul-Hassan, Sumudu Samarakoon, Mehdi Bennis, Matti Latva-aho, Choong-Seong Hong

Deep Reinforcement Learning for Adaptive Caching in Hierarchical Content Delivery Networks

Caching is envisioned to play a critical role in next-generation content delivery infrastructure, cellular networks, and Internet architectures. By smartly storing the most popular contents at the storage-enabled network entities during…

Information Theory · Computer Science 2019-07-12 Alireza Sadeghi, Gang Wang, Georgios B. Giannakis

Adaptive Cache Management for Complex Storage Systems Using CNN-LSTM-Based Spatiotemporal Prediction

This paper proposes an intelligent cache management strategy based on CNN-LSTM to improve the performance and cache hit rate of storage systems. Through comparative experiments with traditional algorithms (such as LRU and LFU) and other…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-11-20 Xiaoye Wang, Xuan Li, Linji Wang, Tingyi Ruan, Pochun Li

LQoCo: Learning to Optimize Cache Capacity Overloading in Storage Systems

Cache plays an important role to maintain high and stable performance (i.e. high throughput, low tail latency and throughput jitter) in storage systems. Existing rule-based cache management methods, coupled with engineers' manual…

Hardware Architecture · Computer Science 2022-03-28 Ji Zhang, Xijun Li, Xiyao Zhou, Mingxuan Yuan, Zhuo Cheng, Keji Huang, Yifan Li

Intelligent Resource Allocation Optimization for Cloud Computing via Machine Learning

With the rapid expansion of cloud computing applications, optimizing resource allocation has become crucial for improving system performance and cost efficiency. This paper proposes an intelligent resource allocation algorithm that…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-04-08 Yuqing Wang, Xiao Yang

Learning Forward Reuse Distance

Caching techniques are widely used in the era of cloud computing from applications, such as Web caches to infrastructures, Memcached and memory caches in computer architectures. Prediction of cached data can greatly help improve cache…

Machine Learning · Computer Science 2020-08-03 Pengcheng Li, Yongbin Gu

From Traditional Adaptive Data Caching to Adaptive Context Caching: A Survey

Context information is in demand more than ever with the rapid increase in the number of context-aware Internet of Things applications developed worldwide. Research in context and context-awareness is being conducted to broaden its…

Human-Computer Interaction · Computer Science 2023-02-10 Shakthi Weerasinghe, Arkady Zaslavsky, Seng W. Loke, Alireza Hassani, Amin Abken, Alexey Medvedev

A Learning-Based Caching Mechanism for Edge Content Delivery

With the advent of 5G networks and the rise of the Internet of Things (IoT), Content Delivery Networks (CDNs) are increasingly extending into the network edge. This shift introduces unique challenges, particularly due to the limited cache…

Networking and Internet Architecture · Computer Science 2024-04-05 Hoda Torabi, Hamzeh Khazaei, Marin Litoiu

DistCache: Provable Load Balancing for Large-Scale Storage Systems with Distributed Caching

Load balancing is critical for distributed storage to meet strict service-level objectives (SLOs). It has been shown that a fast cache can guarantee load balancing for a clustered storage system. However, when the system scales out to…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-02-18 Zaoxing Liu, Zhihao Bai, Zhenming Liu, Xiaozhou Li, Changhoon Kim, Vladimir Braverman, Xin Jin, Ion Stoica

Towards Fair and Firm Real-Time Scheduling in DNN Multi-Tenant Multi-Accelerator Systems via Reinforcement Learning

This paper addresses the critical challenge of managing Quality of Service (QoS) in cloud services, focusing on the nuances of individual tenant expectations and varying Service Level Indicators (SLIs). It introduces a novel approach…

Hardware Architecture · Computer Science 2024-03-05 Enrico Russo, Francesco Giulio Blanco, Maurizio Palesi, Giuseppe Ascia, Davide Patti, Vincenzo Catania