Related papers: Approximate Selection with Guarantees using Proxie…

On Efficient Approximate Queries over Machine Learning Models

The question of answering queries over ML predictions has been gaining attention in the database community. This question is challenging because the cost of finding high quality answers corresponds to invoking an oracle such as a human…

Databases · Computer Science 2022-11-18 Dujian Ding , Sihem Amer-Yahia , Laks VS Lakshmanan

Two Losses Are Better Than One: Faster Optimization Using a Cheaper Proxy

We present an algorithm for minimizing an objective with hard-to-compute gradients by using a related, easier-to-access function as a proxy. Our algorithm is based on approximate proximal point iterations on the proxy combined with…

Machine Learning · Computer Science 2023-06-08 Blake Woodworth , Konstantin Mishchenko , Francis Bach

Accelerating Approximate Aggregation Queries with Expensive Predicates

Researchers and industry analysts are increasingly interested in computing aggregation queries over large, unstructured datasets with selective predicates that are computed using expensive deep neural networks (DNNs). As these DNNs are…

Databases · Computer Science 2021-08-16 Daniel Kang , John Guibas , Peter Bailis , Tatsunori Hashimoto , Yi Sun , Matei Zaharia

Approximate Queries and Representations for Large Data Sequences

Many new database application domains such as experimental sciences and medicine are characterized by large sequences as their main form of data. Using approximate representation can significantly reduce the required storage and search…

Databases · Computer Science 2019-04-22 Hagit Shatkay , Stanley B. Zdonik

Approximate Computation for Big Data Analytics

Over the past a few years, research and development has made significant progresses on big data analytics. A fundamental issue for big data analytics is the efficiency. If the optimal solution is unable to attain or not required or has a…

Databases · Computer Science 2019-01-03 Shuai Ma , Jinpeng Huai

Using Small Proxy Datasets to Accelerate Hyperparameter Search

One of the biggest bottlenecks in a machine learning workflow is waiting for models to train. Depending on the available computing resources, it can take days to weeks to train a neural network on a large dataset with many classes such as…

Machine Learning · Computer Science 2019-06-13 Sam Shleifer , Eric Prokop

Probably Approximately Optimal Query Optimization

Evaluating query predicates on data samples is the only way to estimate their selectivity in certain scenarios. Finding a guaranteed optimal query plan is not a reasonable optimization goal in those cases as it might require an infinite…

Databases · Computer Science 2015-11-06 Immanuel Trummer , Christoph Koch

Selection via Proxy: Efficient Data Selection for Deep Learning

Data selection methods, such as active learning and core-set selection, are useful tools for machine learning on large datasets. However, they can be prohibitively expensive to apply in deep learning because they depend on feature…

Machine Learning · Computer Science 2020-10-28 Cody Coleman , Christopher Yeh , Stephen Mussmann , Baharan Mirzasoleiman , Peter Bailis , Percy Liang , Jure Leskovec , Matei Zaharia

Policy Optimization Through Approximate Importance Sampling

Recent policy optimization approaches (Schulman et al., 2015a; 2017) have achieved substantial empirical successes by constructing new proxy optimization objectives. These proxy objectives allow stable and low variance policy learning, but…

Machine Learning · Computer Science 2020-02-24 Marcin B. Tomczak , Dongho Kim , Peter Vrancx , Kee-Eung Kim

Analytical low-rank compression via proxy point selection

It has been known in potential theory that, for some kernels matrices corresponding to well-separated point sets, fast analytical low-rank approximation can be achieved via the use of proxy points. This proxy point method gives a…

Numerical Analysis · Mathematics 2019-03-22 Xin Ye , Jianlin Xia , Lexing Ying

100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models

Several data warehouse and database providers have recently introduced extensions to SQL called AI Queries, enabling users to specify functions and conditions in SQL that are evaluated by LLMs, thereby broadening significantly the kinds of…

Databases · Computer Science 2026-04-16 Yeounoh Chung , Rushabh Desai , Jian He , Yu Xiao , Thibaud Hottelier , Yves-Laurent Kom Samo , Pushkar Khadilkar , Xianshun Chen , Sam Idicula , Fatma Özcan , Alon Halevy , Yannis Papakonstantinou

Predicting with Proxies: Transfer Learning in High Dimension

Predictive analytics is increasingly used to guide decision-making in many applications. However, in practice, we often have limited data on the true predictive task of interest, and must instead rely on more abundant data on a…

Machine Learning · Statistics 2020-05-07 Hamsa Bastani

The Case for Distance-Bounded Spatial Approximations

Spatial approximations have been traditionally used in spatial databases to accelerate the processing of complex geometric operations. However, approximations are typically only used in a first filtering step to determine a set of candidate…

Databases · Computer Science 2021-01-22 Eleni Tzirita Zacharatou , Andreas Kipf , Ibrahim Sabek , Varun Pandey , Harish Doraiswamy , Volker Markl

Efficient Approximation Algorithms for Optimal Large-scale Network Monitoring

The growing amount of applications that generate vast amount of data in short time scales render the problem of partial monitoring, coupled with prediction, a rather fundamental one. We study the aforementioned canonical problem under the…

Data Structures and Algorithms · Computer Science 2016-08-02 Michalis Kallitsis , Stilian Stoev , George Michailidis

ExSample: Efficient Searches on Video Repositories through Adaptive Sampling

Capturing and processing video is increasingly common as cameras become cheaper to deploy. At the same time, rich video understanding methods have progressed greatly in the last decade. As a result, many organizations now have massive…

Databases · Computer Science 2022-08-16 Oscar Moll , Favyen Bastani , Sam Madden , Mike Stonebraker , Vijay Gadepally , Tim Kraska

Sufficient Decision Proxies for Decision-Focused Learning

When solving optimization problems under uncertainty with contextual data, utilizing machine learning to predict the uncertain parameters' values is a popular and effective approach. Decision-focused learning (DFL) aims at learning a…

Machine Learning · Computer Science 2026-01-29 Noah Schutte , Grigorii Veviurko , Krzysztof Postek , Neil Yorke-Smith

Leveraging Approximate Caching for Faster Retrieval-Augmented Generation

Retrieval-augmented generation (RAG) improves the reliability of large language model (LLM) answers by integrating external knowledge. However, RAG increases the end-to-end inference time since looking for relevant documents from large…

Databases · Computer Science 2025-10-28 Shai Bergman , Anne-Marie Kermarrec , Diana Petrescu , Rafael Pires , Mathis Randl , Martijn de Vos , Ji Zhang

Approximating a Target Distribution using Weight Queries

We consider a novel challenge: approximating a distribution without the ability to randomly sample from that distribution. We study how such an approximation can be obtained using *weight queries*. Given some data set of examples, a weight…

Machine Learning · Computer Science 2021-07-15 Nadav Barak , Sivan Sabato

Approximation Algorithms for Combinatorial Optimization with Predictions

We initiate a systematic study of utilizing predictions to improve over approximation guarantees of classic algorithms, without increasing the running time. We propose a systematic method for a wide class of optimization problems that ask…

Data Structures and Algorithms · Computer Science 2024-11-26 Antonios Antoniadis , Marek Eliáš , Adam Polak , Moritz Venzin

Accuracy Analysis of the Proxy Point Method with Applications to Some Toeplitz Matrices

For some kernel matrices, low-rank approximations can be quickly obtained via analytic techniques. One important class of analytic methods that has received attention in recent years is based on the use of proxy points. Accuracy analysis…

Numerical Analysis · Mathematics 2026-05-26 Mikhail Lepilov , Jianlin Xia