English
Related papers

Related papers: Accelerating Approximate Aggregation Queries with …

200 papers

Given a dataset $\mathcal{D}$, we are interested in computing the mean of a subset of $\mathcal{D}$ which matches a predicate. ABae leverages stratified sampling and proxy models to efficiently compute this statistic given a sampling budget…

Statistics Theory · Mathematics 2021-07-30 Daniel Kang , John Guibas , Peter Bailis , Tatsunori Hashimoto , Yi Sun , Matei Zaharia

The question of answering queries over ML predictions has been gaining attention in the database community. This question is challenging because the cost of finding high quality answers corresponds to invoking an oracle such as a human…

Databases · Computer Science 2022-11-18 Dujian Ding , Sihem Amer-Yahia , Laks VS Lakshmanan

Due to the falling costs of data acquisition and storage, researchers and industry analysts often want to find all instances of rare events in large datasets. For instance, scientists can cheaply capture thousands of hours of video, but are…

Databases · Computer Science 2022-01-05 Daniel Kang , Edward Gan , Peter Bailis , Tatsunori Hashimoto , Matei Zaharia

Recently, predictor-based algorithms emerged as a promising approach for neural architecture search (NAS). For NAS, we typically have to calculate the validation accuracy of a large number of Deep Neural Networks (DNNs), what is…

Analysts and scientists are interested in querying streams of video, audio, and text to extract quantitative insights. For example, an urban planner may wish to measure congestion by querying the live feed from a traffic camera. Prior work…

Databases · Computer Science 2023-08-21 Matthew Russo , Tatsunori Hashimoto , Daniel Kang , Yi Sun , Matei Zaharia

We study Aggregation Queries over Nearest Neighbors (AQNN), which compute aggregates over the learned representations of the neighborhood of a designated query object. For example, a medical professional may be interested in the average…

Data Structures and Algorithms · Computer Science 2026-01-06 Carrie Wang , Sihem Amer-Yahia , Laks V. S. Lakshmanan , Reynold Cheng

Over the past a few years, research and development has made significant progresses on big data analytics. A fundamental issue for big data analytics is the efficiency. If the optimal solution is unable to attain or not required or has a…

Databases · Computer Science 2019-01-03 Shuai Ma , Jinpeng Huai

Deep Neural Networks (DNNs) are very popular because of their high performance in various cognitive tasks in Machine Learning (ML). Recent advancements in DNNs have brought beyond human accuracy in many tasks, but at the cost of high…

Hardware Architecture · Computer Science 2022-03-18 Giorgos Armeniakos , Georgios Zervakis , Dimitrios Soudris , Jörg Henkel

Constrained optimization problems arise in various engineering systems such as inventory management and power grids. Standard deep neural network (DNN) based machine learning proxies are ineffective in practical settings where labeled data…

Machine Learning · Computer Science 2025-06-09 Parikshit Pareek , Abhijith Jayakumar , Kaarthik Sundar , Deepjyoti Deka , Sidhant Misra

Due to the recent advances on Neural Architecture Search (NAS), it gains popularity in designing best networks for specific tasks. Although it shows promising results on many benchmarks and competitions, NAS still suffers from its demanding…

Machine Learning · Computer Science 2019-11-22 Minje Park

Deep neural networks (DNNs) are becoming progressively large and costly to train. This paper aims to reduce DNN training costs by leveraging preemptible instances on modern clouds, which can be allocated at a much lower price when idle but…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-03-22 Jiangfei Duan , Ziang Song , Xupeng Miao , Xiaoli Xi , Dahua Lin , Harry Xu , Minjia Zhang , Zhihao Jia

Sample-based approximate query processing (AQP) suffers from many pitfalls such as the inability to answer very selective queries and unreliable confidence intervals when sample sizes are small. Recent research presented an intriguing…

Databases · Computer Science 2021-03-31 Xi Liang , Stavros Sintos , Zechao Shang , Sanjay Krishnan

Recent advances in Deep Neural Networks (DNNs) have demonstrated outstanding performance across various domains. However, their large size is a challenge for deployment on resource-constrained devices such as mobile, edge, and IoT…

Machine Learning · Computer Science 2024-10-10 Divya Jyoti Bajpai , Manjesh Kumar Hanawal

Approximate computing offers promising energy efficiency benefits for error-tolerant applications, but discovering optimal approximations requires extensive design space exploration (DSE). Predicting the accuracy of circuits composed of…

Hardware Architecture · Computer Science 2026-03-20 Ondrej Vlcek , Vojtech Mrazek

Approximate k-Nearest Neighbor (AKNN) search is widely used in vector databases. When vectors carry additional attributes (e.g., labels or numerical values), filtered AKNN search retrieves the nearest vectors to a query vector under…

Databases · Computer Science 2026-05-29 Wenxuan Xia , Mingyu Yang , Wentao Li , Wei Wang

Text analytics has become an important part of business intelligence as enterprises increasingly seek to extract insights for decision making from text data sets. Processing large text data sets can be computationally expensive, however,…

Databases · Computer Science 2020-01-14 Guangyan Hu , Yongfeng Zhang , Sandro Rigo , Thu D. Nguyen

Deep Neural Networks (DNNs) have drawn attention because of their outstanding performance on various tasks. However, deploying full-fledged DNNs in resource-constrained devices (edge, mobile, IoT) is difficult due to their large size. To…

Machine Learning · Computer Science 2023-09-19 Divya J. Bajpai , Vivek K. Trivedi , Sohan L. Yadav , Manjesh K. Hanawal

Data is generated at an unprecedented rate surpassing our ability to analyze them. The database community has pioneered many novel techniques for Approximate Query Processing (AQP) that could give approximate results in a fraction of time…

Databases · Computer Science 2019-11-20 Saravanan Thirumuruganathan , Shohedul Hasan , Nick Koudas , Gautam Das

Vector quantization-based approaches are successful to solve Approximate Nearest Neighbor (ANN) problems which are critical to many applications. The idea is to generate effective encodings to allow fast distance approximation. We propose…

Computer Vision and Pattern Recognition · Computer Science 2015-09-18 Shicong Liu , Junru Shao , Hongtao Lu

Sampling is an important process in many GNN structures in order to train larger datasets with a smaller computational complexity. However, compared to other processes in GNN (such as aggregate, backward propagation), the sampling process…

Machine Learning · Computer Science 2022-09-08 Yuchen Gui , Boyi Wei , Wei Yuan , Xi Jin
‹ Prev 1 2 3 10 Next ›