Related papers: IDEBench: A Benchmark for Interactive Data Explora…

Conception d'un banc d'essais décisionnel (Design of a decision-support benchmark)

We present in this paper a new benchmark for evaluating the performance of data warehouses. Benchmarking is useful either to system users for comparing the performance of different systems, or to system engineers for testing the effect of…

Databases · Computer Science · 2007-05-23 · Jérôme Darmont, Fadila Bentayeb, Omar Boussaïd

Benchmarking data warehouses

Data warehouse architectural choices and optimization techniques are critical to decision support query performance. To facilitate these choices, the performance of the designed data warehouse must be assessed, usually with benchmarks.…

Databases · Computer Science · 2017-01-03 · Jérôme Darmont, Fadila Bentayeb, Omar Boussaïd

Create Benchmarks for Data Lakes

Data lakes have emerged as a flexible and scalable solution for storing and analyzing large volumes of heterogeneous data, including structured, semi-structured, and unstructured formats. Despite their growing adoption in both industry and…

Databases · Computer Science · 2026-01-28 · Yi Lyu, Pei-Chieh Lo, Natan Lidukhover

PDEBENCH: An Extensive Benchmark for Scientific Machine Learning

Machine learning-based modeling of physical systems has experienced increased interest in recent years. Despite some impressive progress, there is still a lack of benchmarks for Scientific ML that are easy to use but still challenging and…

Machine Learning · Computer Science · 2024-08-27 · Makoto Takamoto, Timothy Praditia, Raphael Leiteritz, Dan MacKinlay, Francesco Alesiani, Dirk Pflüger, Mathias Niepert

AIDABench: AI Data Analytics Benchmark

As AI-driven document understanding and processing tools become increasingly prevalent in real-world applications, the need for rigorous evaluation standards has grown increasingly urgent. Existing benchmarks and evaluations often focus on…

Artificial Intelligence · Computer Science · 2026-03-30 · Yibo Yang, Fei Lei, Yixuan Sun, Yantao Zeng, Chengguang Lv, Jiancao Hong, Jiaojiao Tian, Tianyu Qiu, Xin Wang, Yanbing Chen, Yanjie Li, Zheng Pan, Xiaochen Zhou, Guanzhou Chen, Haoran Lv, Yuning Xu, Yue Ou, Haodong Liu, Shiqi He, Anya Jia, Yulei Xin, Huan Wu, Liang Liu, Jiaye Ge, Jianxin Dong, Dahua Lin, Wenxiu Sun

BigDataBench: a Big Data Benchmark Suite from Internet Services

As architecture, systems, and data management communities pay greater attention to innovative big data systems and architectures, the pressure of benchmarking and evaluating these systems rises. Considering the broad use of big data…

Databases · Computer Science · 2016-11-17 · Lei Wang, Jianfeng Zhan, Chunjie Luo, Yuqing Zhu, Qiang Yang, Yongqiang He, Wanling Gao, Zhen Jia, Yingjie Shi, Shujie Zhang, Chen Zheng, Gang Lu, Kent Zhan, Xiaona Li, Bizhu Qiu

PBench: Workload Synthesizer with Real Statistics for Cloud Analytics Benchmarking

Cloud service providers commonly use standard benchmarks like TPC-H and TPC-DS to evaluate and optimize cloud data analytics systems. However, these benchmarks rely on fixed query patterns and fail to capture the real execution statistics…

Databases · Computer Science · 2025-06-23 · Yan Zhou, Chunwei Liu, Bhuvan Urgaonkar, Zhengle Wang, Magnus Mueller, Chao Zhang, Songyue Zhang, Pascal Pfeil, Dominik Horn, Zhengchun Liu, Davide Pagano, Tim Kraska, Samuel Madden, Ju Fan

Data Warehouse Benchmarking with DWEB

Performance evaluation is a key issue for designers and users of Database Management Systems (DBMSs). Performance is generally assessed with software benchmarks that help, e.g., test architectural choices, compare different technologies or…

Databases · Computer Science · 2017-01-30 · Jérôme Darmont

DWEB: A Data Warehouse Engineering Benchmark

Data warehouse architectural choices and optimization techniques are critical to decision support query performance. To facilitate these choices, the performance of the designed data warehouse must be assessed. This is usually done with the…

Databases · Computer Science · 2007-05-23 · Jérôme Darmont, Fadila Bentayeb, Omar Boussaïd

FDABench: A Benchmark for Data Agents on Analytical Queries over Heterogeneous Data

The growing demand for data-driven decision-making has created an urgent need for data agents that can integrate structured and unstructured data for analysis. While data agents show promise for enabling users to perform complex analytics…

Databases · Computer Science · 2025-09-03 · Ziting Wang, Shize Zhang, Haitao Yuan, Jinwei Zhu, Shifu Li, Wei Dong, Gao Cong

IDE-Bench: Evaluating Large Language Models as IDE Agents on Real-World Software Engineering Tasks

IDE-Bench is a comprehensive framework for evaluating AI IDE agents on real-world software engineering tasks through an IDE-native tool interface. We present a Dockerized test harness that goes beyond raw terminal execution, granting models…

Software Engineering · Computer Science · 2026-02-02 · Spencer Mateega, Jeff Yang, Tiana Costello, Shaurya Jadhav, Nicole Tian, Agustin Garcinuño

Object Database Benchmarks

The need for performance measurement tools appeared soon after the emergence of the first Object-Oriented Database Management Systems (OODBMSs), and proved important for both designers and users (Atkinson & Maier, 1990). Performance…

Databases · Computer Science · 2017-01-27 · Jerome Darmont

Benchmark Framework with Skewed Workloads

In this work, we present a new benchmarking suite with new real-life inspired skewed workloads to test the performance of concurrent index data structures. We started this project to prepare workloads specifically for self-adjusting data…

Distributed, Parallel, and Cluster Computing · Computer Science · 2023-05-19 · Vitaly Aksenov, Dmitry Ivanov, Ravil Galiev

An Adaptive Benchmark for Modeling User Exploration of Large Datasets

In this paper, we present a new DBMS performance benchmark that can simulate user exploration with any specified dashboard design made of standard visualization and interaction components. The distinguishing feature of our SImulation-BAsed…

Human-Computer Interaction · Computer Science · 2025-01-14 · Joanna Purich, Anthony Wise, Leilani Battle

BatchBench: Toward a Workload-Aware Benchmark for Autoscaling Policies in Big Data Batch Processing -- A Proposed Framework

Autoscaling has become a baseline expectation for cloud-native big data processing, and the design space has expanded beyond rule-based heuristics to include learned controllers and, most recently, large language model (LLM) agents. Yet…

Information Retrieval · Computer Science · 2026-05-13 · Venkata Krishna Prasanth Budigi, Siri Chandana Sirigiri

IDRBench: Interactive Deep Research Benchmark

Deep research agents powered by Large Language Models (LLMs) can perform multi-step reasoning, web exploration, and long-form report generation. However, most existing systems operate in an autonomous manner, assuming fully specified user…

Computation and Language · Computer Science · 2026-01-13 · Yingchaojie Feng, Qiang Huang, Xiaoya Xie, Zhaorui Yang, Jun Yu, Wei Chen, Anthony K. H. Tung

Characterizing BigBench queries, Hive, and Spark in multi-cloud environments

BigBench is the new standard (TPCx-BB) for benchmarking and testing Big Data systems. The TPCx-BB specification describes several business use cases (queries) that require a broad combination of data extraction techniques including…

Distributed, Parallel, and Cluster Computing · Computer Science · 2020-07-07 · Nicolas Poggi, Alejandro Montero, David Carrera

DRBench: A Realistic Benchmark for Enterprise Deep Research

We introduce DRBench, a benchmark for evaluating AI agents on complex, open-ended deep research tasks in enterprise settings. Unlike prior benchmarks that focus on simple questions or web-only queries, DRBench evaluates agents on multi-step…

Computation and Language · Computer Science · 2026-03-11 · Amirhossein Abaskohi, Tianyi Chen, Miguel Muñoz-Mármol, Curtis Fox, Amrutha Varshini Ramesh, Étienne Marcotte, Xing Han Lù, Nicolas Chapados, Spandana Gella, Peter West, Giuseppe Carenini, Christopher Pal, Alexandre Drouin, Issam H. Laradji

IDA-Bench: Evaluating LLMs on Interactive Guided Data Analysis

Large Language Models (LLMs) show promise as data analysis agents, but existing benchmarks overlook the iterative nature of the field, where experts' decisions evolve with deeper insights of the dataset. To address this, we introduce…

Computation and Language · Computer Science · 2025-06-09 · Hanyu Li, Haoyu Liu, Tingyu Zhu, Tianyu Guo, Zeyu Zheng, Xiaotie Deng, Michael I. Jordan

LakeBench: Benchmarks for Data Discovery over Data Lakes

Within enterprises, there is a growing need to intelligently navigate data lakes, specifically focusing on data discovery. Of particular importance to enterprises is the ability to find related tables in data repositories. These tables can…

Databases · Computer Science · 2023-07-11 · Kavitha Srinivas, Julian Dolby, Ibrahim Abdelaziz, Oktie Hassanzadeh, Harsha Kokel, Aamod Khatiwada, Tejaswini Pedapati, Subhajit Chaudhury, Horst Samulowitz