Related papers: Fine-Tuning Data Structures for Analytical Query P…

Cost-of-Pass: An Economic Framework for Evaluating Language Models

Widespread adoption of AI systems hinges on their ability to generate economic value that outweighs their inference costs. Evaluating this tradeoff requires metrics accounting for both performance and costs. Building on production theory,…

Artificial Intelligence · Computer Science 2026-02-27 Mehmet Hamza Erol , Batu El , Mirac Suzgun , Mert Yuksekgonul , James Zou

Cobra: A Framework for Cost Based Rewriting of Database Applications

Database applications are typically written using a mixture of imperative languages and declarative frameworks for data processing. Application logic gets distributed across the declarative and imperative parts of a program. Often, there is…

Databases · Computer Science 2018-02-27 K. Venkatesh Emani , S. Sudarshan

Enabling Incremental Query Re-Optimization

As declarative query processing techniques expand in scope --- to the Web, data streams, network routers, and cloud platforms --- there is an increasing need for adaptive query processing techniques that can re-plan in the presence of…

Databases · Computer Science 2014-09-23 Mengmeng Liu , Zachary G. Ives , Boon Thau Loo

100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models

Several data warehouse and database providers have recently introduced extensions to SQL called AI Queries, enabling users to specify functions and conditions in SQL that are evaluated by LLMs, thereby broadening significantly the kinds of…

Databases · Computer Science 2026-04-16 Yeounoh Chung , Rushabh Desai , Jian He , Yu Xiao , Thibaud Hottelier , Yves-Laurent Kom Samo , Pushkar Khadilkar , Xianshun Chen , Sam Idicula , Fatma Özcan , Alon Halevy , Yannis Papakonstantinou

MetaTune: Meta-Learning Based Cost Model for Fast and Efficient Auto-tuning Frameworks

Deep learning compiler frameworks are gaining ground as a more portable back-end for deep learning applications on increasingly diverse hardware. However, they face the daunting challenge of matching performance offered by hand-tuned…

Machine Learning · Computer Science 2021-02-10 Jaehun Ryu , Hyojin Sung

Costing Generated Runtime Execution Plans for Large-Scale Machine Learning Programs

Declarative large-scale machine learning (ML) aims at the specification of ML algorithms in a high-level language and automatic generation of hybrid runtime execution plans ranging from single node, in-memory computations to distributed…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-03-24 Matthias Boehm

Cost Models for Big Data Query Processing: Learning, Retrofitting, and Our Findings

Query processing over big data is ubiquitous in modern clouds, where the system takes care of picking both the physical query execution plans and the resources needed to run those plans, using a cost-based query optimizer. A good cost…

Databases · Computer Science 2020-03-02 Tarique Siddiqui , Alekh Jindal , Shi Qiao , Hiren Patel , Wangchao le

Data Complexity-aware Deep Model Performance Forecasting

Deep learning models are widely used across computer vision and other domains. When working on the model induction, selecting the right architecture for a given dataset often relies on repetitive trial-and-error procedures. This procedure…

Machine Learning · Computer Science 2026-01-06 Yen-Chia Chen , Hsing-Kuo Pao , Hanjuan Huang

A Learned Cost Model-based Cross-engine Optimizer for SQL Workloads

Lakehouse systems enable the same data to be queried with multiple execution engines. However, selecting the engine best suited to run a SQL query still requires a priori knowledge of the query computational requirements and an engine…

Databases · Computer Science 2025-06-04 András Strausz , Niels Pardon , Ioana Giurgiu

Structured Analysis Dictionary Learning for Image Classification

We propose a computationally efficient and high-performance classification algorithm by incorporating class structural information in analysis dictionary learning. To achieve more consistent classification, we associate a class…

Computer Vision and Pattern Recognition · Computer Science 2018-05-03 Wen Tang , Ashkan Panahi , Hamid Krim , Liyi Dai

Data Wrangling Task Automation Using Code-Generating Language Models

Ensuring data quality in large tabular datasets is a critical challenge, typically addressed through data wrangling tasks. Traditional statistical methods, though efficient, cannot often understand the semantic context and deep learning…

Machine Learning · Computer Science 2025-02-25 Ashlesha Akella , Krishnasuri Narayanam

Enhancing Cost Efficiency in Active Learning with Candidate Set Query

This paper introduces a cost-efficient active learning (AL) framework for classification, featuring a novel query design called candidate set query. Unlike traditional AL queries requiring the oracle to examine all possible classes, our…

Machine Learning · Computer Science 2025-08-20 Yeho Gwon , Sehyun Hwang , Hoyoung Kim , Jungseul Ok , Suha Kwak

Deep-n-Cheap: An Automated Search Framework for Low Complexity Deep Learning

We present Deep-n-Cheap -- an open-source AutoML framework to search for deep learning models. This search includes both architecture and training hyperparameters, and supports convolutional neural networks and multi-layer perceptrons. Our…

Machine Learning · Computer Science 2020-09-08 Sourya Dey , Saikrishna C. Kanala , Keith M. Chugg , Peter A. Beerel

Deep Boosting: Joint Feature Selection and Analysis Dictionary Learning in Hierarchy

This work investigates how the traditional image classification pipelines can be extended into a deep architecture, inspired by recent successes of deep neural networks. We propose a deep boosting framework based on layer-by-layer joint…

Computer Vision and Pattern Recognition · Computer Science 2015-08-12 Zhanglin Peng , Ya Li , Zhaoquan Cai , Liang Lin

Adaptive Cost Model for Query Optimization

The principal component of conventional database query optimizers is a cost model that is used to estimate expected performance of query plans. The accuracy of the cost model has direct impact on the optimality of execution plans selected…

Databases · Computer Science 2024-09-26 Nikita Vasilenko , Alexander Demin , Denis Ponomaryov

An End-to-End Learning-based Cost Estimator

Cost and cardinality estimation is vital to query optimizer, which can guide the plan selection. However traditional empirical cost and cardinality estimation techniques cannot provide high-quality estimation, because they cannot capture…

Databases · Computer Science 2019-06-07 Ji Sun , Guoliang Li

What Programs Want: Automatic Inference of Input Data Specifications

Nowadays, as machine-learned software quickly permeates our society, we are becoming increasingly vulnerable to programming errors in the data pre-processing or training software, as well as errors in the data itself. In this paper, we…

Programming Languages · Computer Science 2020-07-22 Caterina Urban

Output Space Search for Structured Prediction

We consider a framework for structured prediction based on search in the space of complete structured outputs. Given a structured input, an output is produced by running a time-bounded search procedure guided by a learned cost function, and…

Machine Learning · Computer Science 2012-07-03 Janardhan Rao Doppa , Alan Fern , Prasad Tadepalli

Adaptive Learning of Aggregate Analytics under Dynamic Workloads

Large organizations have seamlessly incorporated data-driven decision making in their operations. However, as data volumes increase, expensive big data infrastructures are called to rescue. In this setting, analytics tasks become very…

Databases · Computer Science 2020-03-17 Fotis Savva , Christos Anagnostopoulos , Peter Triantafillou

Framework for Progressive Knowledge Fusion in Large Language Models Through Structured Conceptual Redundancy Analysis

The organization of latent knowledge within large-scale models poses unique challenges when addressing overlapping representations and optimizing contextual accuracy. Conceptual redundancies embedded across layers often result in…

Computation and Language · Computer Science 2025-03-26 Joseph Sakau , Evander Kozlowski , Roderick Thistledown , Basil Steinberger