Related papers: Exact Selectivity Computation for Modern In-Memory…

Novel Selectivity Estimation Strategy for Modern DBMS

Selectivity estimation is important in query optimization, however accurate estimation is difficult when predicates are complex. Instead of existing database synopses and statistics not helpful for such cases, we introduce a new approach to…

Databases · Computer Science 2018-06-25 Jun Hyung Shin

Accelerating Exact and Approximate Inference for (Distributed) Discrete Optimization with GPUs

Discrete optimization is a central problem in artificial intelligence. The optimization of the aggregated cost of a network of cost functions arises in a variety of problems including (W)CSP, DCOP, as well as optimization in stochastic…

Artificial Intelligence · Computer Science 2018-01-12 Ferdinando Fioretto , Enrico Pontelli , William Yeoh , Rina Dechter

Scalable Cross-Entropy Loss for Sequential Recommendations with Large Item Catalogs

Scalability issue plays a crucial role in productionizing modern recommender systems. Even lightweight architectures may suffer from high computational overload due to intermediate calculations, limiting their practicality in real-world…

Information Retrieval · Computer Science 2024-12-03 Gleb Mezentsev , Danil Gusak , Ivan Oseledets , Evgeny Frolov

Online Sketch-based Query Optimization

Cost-based query optimization remains a critical task in relational databases even after decades of research and industrial development. Query optimizers rely on a large range of statistical synopses -- including attribute-level histograms…

Databases · Computer Science 2021-02-05 Yesdaulet Izenov , Asoke Datta , Florin Rusu , Jun Hyung Shin

Optimizing Differentially-Maintained Recursive Queries on Dynamic Graphs

Differential computation (DC) is a highly general incremental computation/view maintenance technique that can maintain the output of an arbitrary and possibly recursive dataflow computation upon changes to its base inputs. As such, it is a…

Databases · Computer Science 2022-08-02 Khaled Ammar , Siddhartha Sahu , Semih Salihoglu , M. Tamer Ozsu

Consistent and Flexible Selectivity Estimation for High-Dimensional Data

Selectivity estimation aims at estimating the number of database objects that satisfy a selection criterion. Answering this problem accurately and efficiently is essential to many applications, such as density estimation, outlier detection,…

Databases · Computer Science 2021-05-28 Yaoshu Wang , Chuan Xiao , Jianbin Qin , Rui Mao , Onizuka Makoto , Wei Wang , Rui Zhang , Yoshiharu Ishikawa

Cache-based Multi-query Optimization for Data-intensive Scalable Computing Frameworks

In modern large-scale distributed systems, analytics jobs submitted by various users often share similar work, for example scanning and processing the same subset of data. Instead of optimizing jobs independently, which may result in…

Databases · Computer Science 2018-05-23 Pietro Michiardi , Damiano Carra , Sara Migliorini

Multi-Attribute Selectivity Estimation Using Deep Learning

Selectivity estimation - the problem of estimating the result size of queries - is a fundamental problem in databases. Accurate estimation of query selectivity involving multiple correlated attributes is especially challenging. Poor…

Databases · Computer Science 2019-06-19 Shohedul Hasan , Saravanan Thirumuruganathan , Jees Augustine , Nick Koudas , Gautam Das

Selectivity Estimation of Inequality Joins In Databases

Selectivity estimation refers to the ability of the SQL query optimizer to estimate the size of the results of a predicate in the query. It is the main calculation, based on which the optimizer can select the cheapest plan to execute. While…

Databases · Computer Science 2022-06-16 Diogo Repas , Zhicheng Luo , Maxime Schoemans , Mahmoud Sakr

Batch Sequential Adaptive Designs for Global Optimization

Compared with the fixed-run designs, the sequential adaptive designs (SAD) are thought to be more efficient and effective. Efficient global optimization (EGO) is one of the most popular SAD methods for expensive black-box optimization…

Machine Learning · Computer Science 2020-10-22 Jianhui Ning , Yao Xiao , Zikang Xiong

QuickSel: Quick Selectivity Learning with Mixture Models

Estimating the selectivity of a query is a key step in almost any cost-based query optimizer. Most of today's databases rely on histograms or samples that are periodically refreshed by re-scanning the data as the underlying data changes.…

Databases · Computer Science 2020-04-14 Yongjoo Park , Shucheng Zhong , Barzan Mozafari

Redefining The Query Optimization Process

Traditionally, query optimizers have been designed for computer systems that share a common architecture, consisting of a CPU, main memory and disk subsystem. The efficiency of query optimizers and their successful employment relied on the…

Databases · Computer Science 2022-03-03 K. F. D. Rietveld , H. A. G. Wijshoff

Learned Query Superoptimization

Traditional query optimizers are designed to be fast and stateless: each query is quickly optimized using approximate statistics, sent off to the execution engine, and promptly forgotten. Recent work on learned query optimization have shown…

Databases · Computer Science 2023-07-12 Ryan Marcus

Efficient Exascale Discretizations: High-Order Finite Element Methods

Efficient exploitation of exascale architectures requires rethinking of the numerical algorithms used in many large-scale applications. These architectures favor algorithms that expose ultra fine-grain parallelism and maximize the ratio of…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-09-13 Tzanio Kolev , Paul Fischer , Misun Min , Jack Dongarra , Jed Brown , Veselin Dobrev , Tim Warburton , Stanimire Tomov , Mark S. Shephard , Ahmad Abdelfattah , Valeria Barra , Natalie Beams , Jean-Sylvain Camier , Noel Chalmers , Yohann Dudouit , Ali Karakus , Ian Karlin , Stefan Kerkemeier , Yu-Hsiang Lan , David Medina , Elia Merzari , Aleksandr Obabko , Will Pazner , Thilina Rathnayake , Cameron W. Smith , Lukas Spies , Kasia Swirydowicz , Jeremy Thompson , Ananias Tomboulides , Vladimir Tomov

The Selection Problem in Multi-Query Optimization: a Comprehensive Survey

View materialization, index selection, and plan caching are well-known techniques for optimization of query processing in database systems. The essence of these tasks is to select and save a subset of the most useful candidates…

Databases · Computer Science 2025-01-28 Sergey Zinchenko , Denis Ponomaryov

Entropy-Based Data Selection for Language Models

Modern language models (LMs) increasingly require two critical resources: computational resources and data resources. Data selection techniques can effectively reduce the amount of training data required for fine-tuning LMs. However, their…

Computation and Language · Computer Science 2026-02-20 Hongming Li , Yang Liu , Chao Huang

Effectiveness and Efficiency Trade-off in Selective Query Processing

Query processing in search engines can be optimized for use for all queries. For this, system component parameters such as the weighting function or the automatic query expansion model can be optimized or learned from past queries. However,…

Information Retrieval · Computer Science 2023-02-23 Josiane Mothe , Md Zia Ullah

Adaptive Search Optimization: Dynamic Algorithm Selection and Caching for Enhanced Database Performance

Efficient search operations in databases are paramount for timely retrieval of information various applications. This research introduces a novel approach, combining dynamicalgorithm1 selection and caching2 strategies, to optimize search…

Databases · Computer Science 2023-11-15 Hakikat Singh

Predictive Entropy Search for Efficient Global Optimization of Black-box Functions

We propose a novel information-theoretic approach for Bayesian optimization called Predictive Entropy Search (PES). At each iteration, PES selects the next evaluation point that maximizes the expected information gained with respect to the…

Machine Learning · Statistics 2014-06-11 José Miguel Hernández-Lobato , Matthew W. Hoffman , Zoubin Ghahramani

Fast OLAP Query Execution in Main Memory on Large Data in a Cluster

Main memory column-stores have proven to be efficient for processing analytical queries. Still, there has been much less work in the context of clusters. Using only a single machine poses several restrictions: Processing power and data…

Databases · Computer Science 2017-09-18 Demian Hespe , Martin Weidner , Jonathan Dees , Peter Sanders