Related papers: Costing Generated Runtime Execution Plans for Larg…

A Cost-based Optimizer for Gradient Descent Optimization

As the use of machine learning (ML) permeates into diverse application domains, there is an urgent need to support a declarative framework for ML. Ideally, a user will specify an ML task in a high-level and easy-to-use language and the…

Databases · Computer Science 2017-03-28 Zoi Kaoudi , Jorge-Arnulfo Quiané-Ruiz , Saravanan Thirumuruganathan , Sanjay Chawla , Divy Agrawal

ML-driven Hardware Cost Model for MLIR

During early optimization passes, compilers must make predictions for machine-dependent characteristics such as execution unit utilization, number of register spills, latency, throughput etc. to generate better code. Often a hand-written…

Machine Learning · Computer Science 2023-02-23 Dibyendu Das , Sandya Mannarswamy

A Survey on Large-scale Machine Learning

Machine learning can provide deep insights into data, allowing machines to make high-quality predictions and having been widely used in real-world applications, such as text mining, visual classification, and recommender systems. However,…

Machine Learning · Computer Science 2020-08-11 Meng Wang , Weijie Fu , Xiangnan He , Shijie Hao , Xindong Wu

Scaling Datalog for Machine Learning on Big Data

In this paper, we present the case for a declarative foundation for data-intensive machine learning systems. Instead of creating a new system for each specific flavor of machine learning task, or hardcoding new optimizations, we argue for…

Databases · Computer Science 2012-03-05 Yingyi Bu , Vinayak Borkar , Michael J. Carey , Joshua Rosen , Neoklis Polyzotis , Tyson Condie , Markus Weimer , Raghu Ramakrishnan

Redefining Cost Estimation in Database Systems: The Role of Execution Plan Features and Machine Learning

Accurate query runtime prediction is a critical component of effective query optimization in modern database systems. Traditional cost models, such as those used in PostgreSQL, rely on static heuristics that often fail to reflect actual…

Databases · Computer Science 2025-10-08 Utsav Pathak , Amit Mankodi

The Costly Dilemma: Generalization, Evaluation and Cost-Optimal Deployment of Large Language Models

When deploying machine learning models in production for any product/application, there are three properties that are commonly desired. First, the models should be generalizable, in that we can extend it to further use cases as our…

Computation and Language · Computer Science 2023-08-17 Abi Aryan , Aakash Kumar Nain , Andrew McMahon , Lucas Augusto Meyer , Harpreet Singh Sahota

Integrated Offline and Online Learning to Solve a Large Class of Scheduling Problems

In this paper, we develop a unified machine learning (ML) approach to predict high-quality solutions for single-machine scheduling problems with a non-decreasing min-sum objective function with or without release times. Our ML approach is…

Optimization and Control · Mathematics 2025-01-09 Anbang Liu , Zhi-Long Chen , Jinyang Jiang , Xi Chen

CATP-LLM: Empowering Large Language Models for Cost-Aware Tool Planning

Utilizing large language models (LLMs) for tool planning has emerged as a promising avenue for developing general AI systems, where LLMs automatically schedule external tools (e.g., vision models) to tackle complex tasks based on task…

Artificial Intelligence · Computer Science 2025-07-15 Duo Wu , Jinghe Wang , Yuan Meng , Yanning Zhang , Le Sun , Zhi Wang

Declarative Machine Learning - A Classification of Basic Properties and Types

Declarative machine learning (ML) aims at the high-level specification of ML tasks or algorithms, and automatic generation of optimized execution plans from these specifications. The fundamental goal is to simplify the usage and/or…

Databases · Computer Science 2016-05-20 Matthias Boehm , Alexandre V. Evfimievski , Niketan Pansare , Berthold Reinwald

A Markov Chain Monte Carlo Approach to Cost Matrix Generation for Scheduling Performance Evaluation

In high performance computing, scheduling of tasks and allocation to machines is very critical especially when we are dealing with heterogeneous execution costs. Simulations can be performed with a large variety of environments and…

Performance · Computer Science 2018-03-23 Louis-Claude Canon , Mohamad El Sayah , Pierre-Cyrille Héam

vTrain: A Simulation Framework for Evaluating Cost-effective and Compute-optimal Large Language Model Training

As large language models (LLMs) become widespread in various application domains, a critical challenge the AI community is facing is how to train these large AI models in a cost-effective manner. Existing LLM training plans typically employ…

Machine Learning · Computer Science 2024-09-11 Jehyeon Bang , Yujeong Choi , Myeongwoo Kim , Yongdeok Kim , Minsoo Rhu

Cost-Aware Model Orchestration for LLM-based Systems

As modern artificial intelligence (AI) systems become more advanced and capable, they can leverage a wide range of tools and models to perform complex tasks. The task of orchestrating these models is increasingly performed by Large Language…

Artificial Intelligence · Computer Science 2026-04-20 Daria Smirnova , Hamid Nasiri , Marta Adamska , Zhengxin Yu , Peter Garraghan

Fine-Tuning Data Structures for Analytical Query Processing

We introduce a framework for automatically choosing data structures to support efficient computation of analytical workloads. Our contributions are twofold. First, we introduce a novel low-level intermediate language that can express the…

Databases · Computer Science 2021-12-28 Amir Shaikhha , Marios Kelepeshis , Mahdi Ghorbani

Cost Models for Big Data Query Processing: Learning, Retrofitting, and Our Findings

Query processing over big data is ubiquitous in modern clouds, where the system takes care of picking both the physical query execution plans and the resources needed to run those plans, using a cost-based query optimizer. A good cost…

Databases · Computer Science 2020-03-02 Tarique Siddiqui , Alekh Jindal , Shi Qiao , Hiren Patel , Wangchao le

Run Time Prediction for Big Data Iterative ML Algorithms: a KMeans case study

Data science and machine learning algorithms running on big data infrastructure are increasingly important in activities ranging from business intelligence and analytics to cybersecurity, smart city management, and many fields of science…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-10-10 Eduardo Rodrigues , Ricardo Morla

Evaluating Large Language Models for Workload Mapping and Scheduling in Heterogeneous HPC Systems

Large language models (LLMs) are increasingly explored for their reasoning capabilities, yet their ability to perform structured, constraint-based optimization from natural language remains insufficiently understood. This study evaluates…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-11-18 Aasish Kumar Sharma , Julian Kunkel

Predictive Scheduling for Efficient Inference-Time Reasoning in Large Language Models

Large language models (LLMs) achieve state-of-the-art accuracy on complex reasoning tasks by generating multiple chain-of-thought (CoT) traces, but using a fixed token budget per query leads to over-computation on easy inputs and…

Artificial Intelligence · Computer Science 2026-02-03 Katrina Brown , Aneesh Muppidi , Rana Shahout

Scaling ML Products At Startups: A Practitioner's Guide

How do you scale a machine learning product at a startup? In particular, how do you serve a greater volume, velocity, and variety of queries cost-effectively? We break down costs into variable costs-the cost of serving the model and…

Machine Learning · Computer Science 2023-04-24 Atul Dhingra , Gaurav Sood

Power Hungry Processing: Watts Driving the Cost of AI Deployment?

Recent years have seen a surge in the popularity of commercial AI products based on generative, multi-purpose AI systems promising a unified approach to building machine learning (ML) models into technology. However, this ambition of…

Machine Learning · Computer Science 2024-10-17 Alexandra Sasha Luccioni , Yacine Jernite , Emma Strubell

Towards Optimizing the Costs of LLM Usage

Generative AI and LLMs in particular are heavily used nowadays for various document processing tasks such as question answering and summarization. However, different LLMs come with different capabilities for different tasks as well as with…

Computation and Language · Computer Science 2024-02-06 Shivanshu Shekhar , Tanishq Dubey , Koyel Mukherjee , Apoorv Saxena , Atharv Tyagi , Nishanth Kotla