English
Related papers

Related papers: Optimizing Relational Queries over Array-Valued Da…

200 papers

Data processing systems roughly group into families such as relational, array, graph, and key-value. Many data processing tasks exceed the capabilities of any one family, require data stored across families, or run faster when partitioned…

Databases · Computer Science 2016-04-14 Dylan Hutchison , Bill Howe , Dan Suciu

Hybrid complex analytics workloads typically include (i) data management tasks (joins, selections, etc. ), easily expressed using relational algebra (RA)-based languages, and (ii) complex analytics tasks (regressions, matrix decompositions,…

Databases · Computer Science 2021-03-24 Rana Alotaibi , Bogdan Cautis , Alin Deutsch , Ioana Manolescu

Analytical queries often require a mixture of relational and linear algebra operations applied to the same data. This poses a challenge to analytic systems that must bridge the gap between relations and matrices. Previous work has mainly…

Databases · Computer Science 2020-04-14 Oksana Dolmatova , Nikolaus Augsten , Michael H. Boehlen

Modern Internet applications often produce a large volume of user activity records. Data analysts are interested in cohort analysis, or finding unusual user behavioral trends, in these large tables of activity records. In a traditional…

Databases · Computer Science 2016-05-05 Dawei Jiang , Qingchao Cai , Gang Chen , H. V. Jagadish , Beng Chin Ooi , Kian-Lee Tan , Anthony K. H. Tung

In modern data analytics, analysts frequently face the challenge of searching for desirable entities by evaluating, for each entity, a collection of its feature relations to derive key analytical properties. This search is challenging…

Databases · Computer Science 2025-07-25 Xi Wu , Eugene Wu , Zichen Zhu , Fengan Li , Jeffrey F. Naughton

Analytics tasks manipulate structured data with variants of relational algebra (RA) and quantitative data with variants of linear algebra (LA). The two computational models have overlapping expressiveness, motivating a common programming…

Databases · Computer Science 2017-05-16 Dylan Hutchison , Bill Howe , Dan Suciu

Recent advances with in-memory columnar database techniques have increased the performance of analytical queries on very large databases and data warehouses. At the same time, advances in artificial intelligence (AI) algorithms have…

Databases · Computer Science 2017-12-11 Brad Carlile , Akiko Marti , Guy Delamarter

Text analytical tasks like word embedding, phrase mining, and topic modeling, are placing increasing demands as well as challenges to existing database management systems. In this paper, we provide a novel algebraic approach based on…

Databases · Computer Science 2020-05-05 Xiuwen Zheng , Amarnath Gupta

The database community lacks a unified relational query language for subset selection and optimisation queries, limiting both user expression and query optimiser reasoning about such problems. Decades of research (latterly under the rubric…

Databases · Computer Science 2025-09-09 David Robert Pratten , Luke Mathieson , Fahimeh Ramezani

Relational data stored in RDBMS is foundational to many real-world applications across domains such as e-commerce, finance, and sociality. While deep neural networks (DNNs) have achieved strong performance on tabular data with a single…

Databases · Computer Science 2026-05-15 Lingze Zeng , Shaofeng Cai , Changshuo Liu , Zhongle Xie , Yuncheng Wu , Beng Chin Ooi

Data analysis often involves comparing subsets of data across many dimensions for finding unusual trends and patterns. While the comparison between subsets of data can be expressed using SQL, they tend to be complex to write, and suffer…

Databases · Computer Science 2021-07-28 Tarique Siddiqui , Surajit Chaudhuri , Vivek Narasayya

Aggregation in relational databases is accomplished through hashing and sorting interval data, which is computationally expensive and scales poorly as the data volumes grow. In this paper, we show how quantitative interval and time-series…

Databases · Computer Science 2022-11-14 Derek Colley , Md Asaduzzaman

Analytical database systems are typically designed to use a column-first data layout to access only the desired fields. On the other hand, storing data row-first works great for accessing, inserting, or updating entire rows. Transforming…

As the complexity of modern workloads and hardware increasingly outpaces human research and engineering capacity, existing methods for database performance optimization struggle to keep pace. To address this gap, a new class of techniques,…

Databases · Computer Science 2026-04-09 Audrey Cheng , Harald Ng , Aaron Kabcenell , Peter Bailis , Matei Zaharia , Lin Ma , Xiao Shi , Ion Stoica

The need for Knowledge and Data Discovery Management Systems (KDDMS) that support ad hoc data mining queries has been long recognized. A significant amount of research has gone into building tightly coupled systems that integrate…

Databases · Computer Science 2007-05-23 Raj P. Gopalan , Tariq Nuruddin , Yudho Giri Sucahyo

Recent advances in query optimization have shifted from traditional rule-based and cost-based techniques towards machine learning-driven approaches. Among these, reinforcement learning (RL) has attracted significant attention due to its…

Databases · Computer Science 2026-04-17 Seokwon Lee , Jaeyoung Sim , Sihyun Kim , Yuhsing Li , Yiwen Zhu , Kwanghyun Park

Linear algebraic expressions are the essence of many computationally intensive problems, including scientific simulations and machine learning applications. However, translating high-level formulations of these expressions to efficient…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-03-22 Dániel Berényi , András Leitereg , Gábor Lehel

The enormous quantity of data produced every day together with advances in data analytics has led to a proliferation of data management and analysis systems. Typically, these systems are built around highly specialized monolithic operators…

Databases · Computer Science 2021-09-30 Dimitrios Koutsoukos , Ingo Müller , Renato Marroquín , Ana Klimovic , Gustavo Alonso

The use of large-scale machine learning methods is becoming ubiquitous in many applications ranging from business intelligence to self-driving cars. These methods require a complex computation pipeline consisting of various types of…

Databases · Computer Science 2021-11-10 Yongyang Yu , Mingjie Tang , Walid G. Aref

Machine learning algorithms are commonly specified in linear algebra (LA). LA expressions can be rewritten into more efficient forms, by taking advantage of input properties such as sparsity, as well as program properties such as common…

Databases · Computer Science 2020-12-24 Yisu Remy Wang , Shana Hutchison , Jonathan Leang , Bill Howe , Dan Suciu
‹ Prev 1 2 3 10 Next ›