Related papers: Exploiting Data Skew for Improved Query Performanc…

VIP Hashing -- Adapting to Skew in Popularity of Data on the Fly (extended version)

All data is not equally popular. Often, some portion of data is more frequently accessed than the rest, which causes a skew in popularity of the data items. Adapting to this skew can improve performance, and this topic has been studied…

Databases · Computer Science 2022-06-27 Aarati Kakaraparthy , Jignesh M. Patel , Brian P. Kroth , Kwanghyun Park

Critical Database Size for Effective Caching

Replicating or caching popular content in memories distributed across the network is a technique to reduce peak network loads. Conventionally, the performance gain of caching was thought to result from making part of the requested data…

Information Theory · Computer Science 2015-01-13 Ajaykrishnan N. , Navya S. Prem , Vinod M. Prabhakaran , Rahul Vaze

Data Placement and Replica Selection for Improving Co-location in Distributed Environments

Increasing need for large-scale data analytics in a number of application domains has led to a dramatic rise in the number of distributed data management systems, both parallel relational databases, and systems that support alternative…

Databases · Computer Science 2013-02-19 K. Ashwin Kumar , Amol Deshpande , Samir Khuller

LocationSpark: In-memory Distributed Spatial Query Processing and Optimization

Due to the ubiquity of spatial data applications and the large amounts of spatial data that these applications generate and process, there is a pressing need for scalable spatial query processing. In this paper, we present new techniques…

Databases · Computer Science 2019-07-17 Mingjie Tang , Yongyang Yu , Walid G. Aref , Ahmed R. Mahmood , Qutaibah M. Malluhi , Mourad Ouzzani

Caching Stars in the Sky: A Semantic Caching Approach to Accelerate Skyline Queries

Multi-criteria decision making has been made possible with the advent of skyline queries. However, processing such queries for high dimensional datasets remains a time consuming task. Real-time applications are thus infeasible, especially…

Databases · Computer Science 2011-06-13 Arnab Bhattacharya , B. Palvali Teja , Sourav Dutta

Optimal Edge Caching For Individualized Demand Dynamics

The ever-growing end user data demands, and the simultaneous reductions in memory costs are fueling edge-caching deployments. Caching at the edge is substantially different from that at the core and needs to take into account the nature of…

Networking and Internet Architecture · Computer Science 2023-10-24 Guocong Quan , Atilla Eryilmaz , Ness Shroff

Skew in Parallel Query Processing

We study the problem of computing a conjunctive query q in parallel, using p of servers, on a large database. We consider algorithms with one round of communication, and study the complexity of the communication. We are especially…

Databases · Computer Science 2014-01-10 Paul Beame , Paraschos Koutris , Dan Suciu

Stochastic Database Cracking: Towards Robust Adaptive Indexing in Main-Memory Column-Stores

Modern business applications and scientific databases call for inherently dynamic data storage environments. Such environments are characterized by two challenging features: (a) they have little idle system time to devote on physical…

Databases · Computer Science 2012-03-02 Felix Halim , Stratos Idreos , Panagiotis Karras , Roland H. C. Yap

Predictive Edge Caching through Deep Mining of Sequential Patterns in User Content Retrievals

Edge caching plays an increasingly important role in boosting user content retrieval performance while reducing redundant network traffic. The effectiveness of caching ultimately hinges on the accuracy of predicting content popularity in…

Networking and Internet Architecture · Computer Science 2022-10-07 Chen Li , Xiaoyu Wang , Tongyu Zong , Houwei Cao , Yong Liu

Topical Result Caching in Web Search Engines

Caching search results is employed in information retrieval systems to expedite query processing and reduce back-end server workload. Motivated by the observation that queries belonging to different topics have different temporal-locality…

Information Retrieval · Computer Science 2020-01-10 Ida Mele , Nicola Tonellotto , Ophir Frieder , Raffaele Perego

Caching Historical Embeddings in Conversational Search

Rapid response, namely low latency, is fundamental in search applications; it is particularly so in interactive search sessions, such as those encountered in conversational settings. An observation with a potential to reduce latency asserts…

Information Retrieval · Computer Science 2022-11-28 Ophir Frieder , Ida Mele , Cristina Ioana Muntean , Franco Maria Nardini , Raffaele Perego , Nicola Tonellotto

Efficient analysis of caching strategies under dynamic content popularity

In this paper we develop a novel technique to analyze both isolated and interconnected caches operating under different caching strategies and realistic traffic conditions. The main strength of our approach is the ability to consider…

Performance · Computer Science 2014-11-27 Michele Garetto , Emilio Leonardi , Stefano Traverso

Reinforcement Learning for Caching with Space-Time Popularity Dynamics

With the tremendous growth of data traffic over wired and wireless networks along with the increasing number of rich-media applications, caching is envisioned to play a critical role in next-generation networks. To intelligently prefetch…

Information Theory · Computer Science 2020-05-20 Alireza Sadeghi , Georgios B. Giannakis , Gang Wang , Fatemeh Sheikholeslami

Access Trends of In-network Cache for Scientific Data

Scientific collaborations are increasingly relying on large volumes of data for their work and many of them employ tiered systems to replicate the data to their worldwide user communities. Each user in the community often selects a…

Networking and Internet Architecture · Computer Science 2022-05-12 Ruize Han , Alex Sim , Kesheng Wu , Inder Monga , Chin Guok , Frank Würthwein , Diego Davila , Justas Balcas , Harvey Newman

Web-enabling Cache Daemon for Complex Data

One of the most common basic techniques for improving the performance of web applications is caching frequently accessed data in fast data stores, colloquially known as cache daemons. In this paper we present a cache daemon suitable for…

Distributed, Parallel, and Cluster Computing · Computer Science 2009-10-02 Ivan Voras , Mario Zagar

Multi-Dimensional Data Compression and Query Processing in Array Databases

In recent times, the production of multidimensional data in various domains and their storage in array databases has witnessed a sharp increase; this rapid growth in data volumes necessitates compression in array databases. However,…

Databases · Computer Science 2022-11-14 Minsoo Kim , Hyubjin Lee , Yon Dohn Chung

Shift-Table: A Low-latency Learned Index for Range Queries using Model Correction

Indexing large-scale databases in main memory is still challenging today. Learned index structures -- in which the core components of classical indexes are replaced with machine learning models -- have recently been suggested to…

Databases · Computer Science 2021-01-27 Ali Hadian , Thomas Heinis

Sub-query Fragmentation for Query Analysis and Data Caching in the Distributed Environment

When data stores and users are distributed geographically, it is essential to organize distributed data cache points at ideal locations to minimize data transfers. To answer this, we are developing an adaptive distributed data caching…

Databases · Computer Science 2019-10-14 Santhilata Kuppili Venkata , Katarzyna Musial

Accelerating Large-scale Data Exploration through Data Diffusion

Data-intensive applications often require exploratory analysis of large datasets. If analysis is performed on distributed resources, data locality can be crucial to high throughput and performance. We propose a "data diffusion" approach…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-11-17 Ioan Raicu , Yong Zhao , Ian Foster , Alex Szalay

Joint Frequency Reuse and Cache Optimization in Backhaul-Limited Small-Cell Wireless Networks

Caching at base stations (BSs) is a promising approach for supporting the tremendous traffic growth of content delivery over future small-cell wireless networks with limited backhaul. This paper considers exploiting spatial caching…

Signal Processing · Electrical Eng. & Systems 2018-08-09 Wei Han , An Liu , Wei Yu , Vincent K. N. Lau