Related papers: LakeMLB: Data Lake Machine Learning Benchmark

Create Benchmarks for Data Lakes

Data lakes have emerged as a flexible and scalable solution for storing and analyzing large volumes of heterogeneous data, including structured, semi-structured, and unstructured formats. Despite their growing adoption in both industry and…

Databases · Computer Science 2026-01-28 Yi Lyu , Pei-Chieh Lo , Natan Lidukhover

Benchmarking Data Lakes Featuring Structured and Unstructured Data with DLBench

In the last few years, the concept of data lake has become trendy for data storage and analysis. Thus, several design alternatives have been proposed to build data lake systems. However, these proposals are difficult to evaluate as there…

Databases · Computer Science 2021-10-05 Pegdwendé Sawadogo , Jérôme Darmont

LakeBench: Benchmarks for Data Discovery over Data Lakes

Within enterprises, there is a growing need to intelligently navigate data lakes, specifically focusing on data discovery. Of particular importance to enterprises is the ability to find related tables in data repositories. These tables can…

Databases · Computer Science 2023-07-11 Kavitha Srinivas , Julian Dolby , Ibrahim Abdelaziz , Oktie Hassanzadeh , Harsha Kokel , Aamod Khatiwada , Tejaswini Pedapati , Subhajit Chaudhury , Horst Samulowitz

A Comprehensive Benchmark of Machine and Deep Learning Across Diverse Tabular Datasets

The analysis of tabular datasets is highly prevalent both in scientific research and real-world applications of Machine Learning (ML). Unlike many other ML tasks, Deep Learning (DL) models often do not outperform traditional methods in this…

Machine Learning · Computer Science 2024-08-28 Assaf Shmuel , Oren Glickman , Teddy Lazebnik

LAKEGEN: A LLM-based Tabular Corpus Generator for Evaluating Dataset Discovery in Data Lakes

How to generate a large, realistic set of tables along with joinability relationships, to stress-test dataset discovery methods? Dataset discovery methods aim to automatically identify related data assets in a data lake. The development and…

Databases · Computer Science 2025-07-09 Zhenwei Dai , Chuan Lei , Asterios Katsifodimos , Xiao Qin , Christos Faloutsos , Huzefa Rangwala

Retrieve, Merge, Predict: Augmenting Tables with Data Lakes

Machine-learning from a disparate set of tables, a data lake, requires assembling features by merging and aggregating tables. Data discovery can extend autoML to data tables by automating these steps. We present an in-depth analysis of such…

Databases · Computer Science 2025-05-20 Riccardo Cappuzzo , Aimee Coelho , Felix Lefebvre , Paolo Papotti , Gael Varoquaux

PMLB: A Large Benchmark Suite for Machine Learning Evaluation and Comparison

The selection, development, or comparison of machine learning methods in data mining can be a difficult task based on the target problem and goals of a particular study. Numerous publicly available real-world and simulated benchmark…

Machine Learning · Computer Science 2017-03-03 Randal S. Olson , William La Cava , Patryk Orzechowski , Ryan J. Urbanowicz , Jason H. Moore

BenchHub: A Unified Benchmark Suite for Holistic and Customizable LLM Evaluation

As large language models (LLMs) continue to advance, the need for up-to-date and well-organized benchmarks becomes increasingly critical. However, many existing datasets are scattered, difficult to manage, and make it challenging to perform…

Machine Learning · Computer Science 2025-06-03 Eunsu Kim , Haneul Yoo , Guijin Son , Hitesh Patel , Amit Agarwal , Alice Oh

TableVault: Managing Dynamic Data Collections for LLM-Augmented Workflows

Large Language Models (LLMs) have emerged as powerful tools for automating and executing complex data tasks. However, their integration into more complex data workflows introduces significant management challenges. In response, we present…

Databases · Computer Science 2025-06-24 Jinjin Zhao , Sanjay Krishnan

RDBench: ML Benchmark for Relational Databases

Benefiting from high-quality datasets and standardized evaluation metrics, machine learning (ML) has achieved sustained progress and widespread applications. However, while applying machine learning to relational databases (RDBs), the…

Machine Learning · Computer Science 2023-10-31 Zizhao Zhang , Yi Yang , Lutong Zou , He Wen , Tao Feng , Jiaxuan You

Model Lake: a New Alternative for Machine Learning Models Management and Governance

The rise of artificial intelligence and data science across industries underscores the pressing need for effective management and governance of machine learning (ML) models. Traditional approaches to ML models management often involve…

Machine Learning · Computer Science 2025-04-01 Moncef Garouani , Franck Ravat , Nathalie Valles-Parlangeau

RumbleML: program the lakehouse with JSONiq

Lakehouse systems have reached in the past few years unprecedented size and heterogeneity and have been embraced by many industry players. However, they are often difficult to use as they lack the declarative language and optimization…

Databases · Computer Science 2021-12-24 Ghislain Fourny , David Dao , Can Berker Cikis , Ce Zhang , Gustavo Alonso

Tackling prediction tasks in relational databases with LLMs

Though large language models (LLMs) have demonstrated exceptional performance across numerous problems, their application to predictive tasks in relational databases remains largely unexplored. In this work, we address the notion that LLMs…

Machine Learning · Computer Science 2024-11-19 Marek Wydmuch , Łukasz Borchmann , Filip Graliński

Deep Lake: a Lakehouse for Deep Learning

Traditional data lakes provide critical data infrastructure for analytical workloads by enabling time travel, running SQL queries, ingesting data with ACID transactions, and visualizing petabyte-scale datasets on cloud storage. They allow…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-12-15 Sasun Hambardzumyan , Abhinav Tuli , Levon Ghukasyan , Fariz Rahman , Hrant Topchyan , David Isayan , Mark McQuade , Mikayel Harutyunyan , Tatevik Hakobyan , Ivo Stranic , Davit Buniatyan

ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities

Forecasts of future events are essential inputs into informed decision-making. Machine learning (ML) systems have the potential to deliver forecasts at scale, but there is no framework for evaluating the accuracy of ML systems on a…

Machine Learning · Computer Science 2025-03-03 Ezra Karger , Houtan Bastani , Chen Yueh-Han , Zachary Jacobs , Danny Halawi , Fred Zhang , Philip E. Tetlock

BatchBench: Toward a Workload-Aware Benchmark for Autoscaling Policies in Big Data Batch Processing -- A Proposed Framework

Autoscaling has become a baseline expectation for cloud-native big data processing, and the design space has expanded beyond rule-based heuristics to include learned controllers and, most recently, large language model (LLM) agents. Yet…

Information Retrieval · Computer Science 2026-05-13 Venkata Krishna Prasanth Budigi , Siri Chandana Sirigiri

RealHiTBench: A Comprehensive Realistic Hierarchical Table Benchmark for Evaluating LLM-Based Table Analysis

With the rapid advancement of Large Language Models (LLMs), there is an increasing need for challenging benchmarks to evaluate their capabilities in handling complex tabular data. However, existing benchmarks are either based on outdated…

Computation and Language · Computer Science 2025-12-16 Pengzuo Wu , Yuhang Yang , Guangcheng Zhu , Chao Ye , Hong Gu , Xu Lu , Ruixuan Xiao , Bowen Bao , Yijing He , Liangyu Zha , Wentao Ye , Junbo Zhao , Haobo Wang

Something's Fishy In The Data Lake: A Critical Re-evaluation of Table Union Search Benchmarks

Recent table representation learning and data discovery methods tackle table union search (TUS) within data lakes, which involves identifying tables that can be unioned with a given query table to enrich its content. These methods are…

Information Retrieval · Computer Science 2025-05-29 Allaa Boutaleb , Bernd Amann , Hubert Naacke , Rafael Angarita

ModelTables: A Corpus of Tables about Models

We present ModelTables, a benchmark of tables in Model Lakes that captures the structured semantics of performance and configuration tables often overlooked by text only retrieval. The corpus is built from Hugging Face model cards, GitHub…

Databases · Computer Science 2025-12-19 Zhengyuan Dong , Victor Zhong , Renée J. Miller

Benchmarking Table Comprehension In The Wild

Large Language Models (LLMs), while being increasingly dominant on a myriad of knowledge-intensive activities, have only had limited success understanding lengthy table-text mixtures, such as academic papers and financial reports. Recent…

Computation and Language · Computer Science 2024-12-16 Yikang Pan , Yi Zhu , Rand Xie , Yizhi Liu