Related papers: Extensible Database Simulator for Fast Prototyping…

SAVIME: A Multidimensional System for the Analysis and Visualization of Simulation Data

Scientific applications produce a huge amount of data, which imposes serious management and analysis challenges. In particular, limitations in current database management systems prevent their adoption in simulation applications, in which…

Databases · Computer Science 2019-03-18 Hermano Lustosa , Fabio Porto

A Unified System for Data Analytics and In Situ Query Processing

In today's world data is being generated at a high rate due to which it has become inevitable to analyze and quickly get results from this data. Most of the relational databases primarily support SQL querying with a limited support for…

Databases · Computer Science 2021-04-08 Alex Watson , Suvam Kumar Das , Suprio Ray

Educational Database Prototype: the Simplest of All

Database Management System (DBMS) is designed to help store and process large collections of data, and is incredibly flexible to perform various kinds of optimizations as long as it achieves serializability with a high-level interface…

Databases · Computer Science 2026-01-28 Yi Lyu , Yiyin Shen , Takashi Matsuzawa

FabSim: facilitating computational research through automation on large-scale and distributed e-infrastructures

We present FabSim, a toolkit developed to simplify a range of computational tasks for researchers in diverse disciplines. FabSim is flexible, adaptable, and allows users to perform a wide range of tasks with ease. It also provides a…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-09-21 Derek Groen , Agastya Bhati , James Suter , James Hetherington , Stefan Zasada , Peter Coveney

BigDataSDNSim: A Simulator for Analyzing Big Data Applications in Software-Defined Cloud Data Centers

Emerging paradigms of big data and Software-Defined Networking (SDN) in cloud data centers have gained significant attention from industry and academia. The integration and coordination of big data and SDN are required to improve the…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-10-11 Khaled Alwasel , Rodrigo N. Calheiros , Saurabh Garg , Rajkumar Buyya , Rajiv Ranjan

DSGym: A Holistic Framework for Evaluating and Training Data Science Agents

Data science agents promise to accelerate discovery and insight-generation by turning data into executable analyses and findings. Yet existing data science benchmarks fall short due to fragmented evaluation interfaces that make…

Artificial Intelligence · Computer Science 2026-01-26 Fan Nie , Junlin Wang , Harper Hua , Federico Bianchi , Yongchan Kwon , Zhenting Qi , Owen Queen , Shang Zhu , James Zou

High-concurrency Custom-build Relational Database System's design and SQL parser design based on Turing-complete automata

Database system is an indispensable part of software projects. It plays an important role in data organization and storage. Its performance and efficiency are directly related to the performance of software. Nowadays, we have many general…

Databases · Computer Science 2020-08-12 WanHong Huang

Serving Deep Learning Models with Deduplication from Relational Databases

There are significant benefits to serve deep learning models from relational databases. First, features extracted from databases do not need to be transferred to any decoupled deep learning systems for inferences, and thus the system…

Databases · Computer Science 2022-10-24 Lixi Zhou , Jiaqing Chen , Amitabh Das , Hong Min , Lei Yu , Ming Zhao , Jia Zou

Trustworthy and Efficient LLMs Meet Databases

In the rapidly evolving AI era with large language models (LLMs) at the core, making LLMs more trustworthy and efficient, especially in output generation (inference), has gained significant attention. This is to reduce plausible but faulty…

Databases · Computer Science 2024-12-25 Kyoungmin Kim , Anastasia Ailamaki

DSNS: The Deep Space Network Simulator

Simulation tools are commonly used in the development and testing of new protocols or new networks. However, as satellite networks start to grow to encompass thousands of nodes, and as companies and space agencies begin to realize the…

Networking and Internet Architecture · Computer Science 2025-10-30 Joshua Smailes , Filip Futera , Sebastian Köhler , Simon Birnbach , Martin Strohmeier , Ivan Martinovic

Relational Database Augmented Large Language Model

Large language models (LLMs) excel in many natural language processing (NLP) tasks. However, since LLMs can only incorporate new knowledge through training or supervised fine-tuning processes, they are unsuitable for applications that…

Databases · Computer Science 2024-07-23 Zongyue Qin , Chen Luo , Zhengyang Wang , Haoming Jiang , Yizhou Sun

D4M: Bringing Associative Arrays to Database Engines

The ability to collect and analyze large amounts of data is a growing problem within the scientific community. The growing gap between data and users calls for innovative tools that address the challenges faced by big data volume, velocity…

Databases · Computer Science 2017-01-25 Vijay Gadepally , Jeremy Kepner , William Arcand , David Bestor , Bill Bergeron , Chansup Byun , Lauren Edwards , Matthew Hubbell , Peter Michaleas , Julie Mullen , Andrew Prout , Antonio Rosa , Charles Yee , Albert Reuther

Graywulf: A platform for federated scientific databases and services

Many fields of science rely on relational database management systems to analyze, publish and share data. Since RDBMS are originally designed for, and their development directions are primarily driven by, business use cases they often lack…

Databases · Computer Science 2013-08-08 László Dobos , Alexander S. Szalay , Tamás Budavári , István Csabai , Nolan Li

dynsight: an Open Python Platform for Simulation and Experimental Trajectory Data Analysis

The study of complex many-body systems via analysis of the trajectories of the units that dynamically move and interact within them is a non-trivial task. The workflow for extracting meaningful information from the raw trajectory data is…

Materials Science · Physics 2025-10-31 Simone Martino , Matteo Becchi , Andrew Tarzia , Daniele Rapetti , Giovanni M. Pavan

Enabling Relational Database Analytical Processing in Bulk-Bitwise Processing-In-Memory

Bulk-bitwise processing-in-memory (PIM), an emerging computational paradigm utilizing memory arrays as computational units, has been shown to benefit database applications. This paper demonstrates how GROUP-BY and JOIN, database operations…

Hardware Architecture · Computer Science 2023-11-03 Ben Perach , Ronny Ronen , Shahar Kvatinsky

CGSim: A Simulation Framework for Large Scale Distributed Computing Environment

Large-scale distributed computing infrastructures such as the Worldwide LHC Computing Grid (WLCG) require comprehensive simulation tools for evaluating performance, testing new algorithms, and optimizing resource allocation strategies.…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-10-02 Sairam Sri Vatsavai , Raees Khan , Kuan-Chieh Hsu , Ozgur O. Kilic , Paul Nilsson , Tatiana Korchuganova , David K. Park , Sankha Dutta , Yihui Ren , Joseph Boudreau , Tasnuva Chowdhury , Shengyu Feng , Jaehyung Kim , Scott Klasky , Tadashi Maeno , Verena Ingrid Martinez , Norbert Podhorszki , Frédéric Suter , Wei Yang , Yiming Yang , Shinjae Yoo , Alexei Klimentov , Adolfy Hoisie

Relational Database Distillation: From Structured Tables to Condensed Graph Data

Relational databases (RDBs) underpin the majority of global data management systems, where information is structured into multiple interdependent tables. To effectively use the knowledge within RDBs for predictive tasks, recent advances…

Databases · Computer Science 2026-01-21 Xinyi Gao , Jingxi Zhang , Lijian Chen , Tong Chen , Lizhen Cui , Hongzhi Yin

COMPARE: Accelerating Groupwise Comparison in Relational Databases for Data Analytics

Data analysis often involves comparing subsets of data across many dimensions for finding unusual trends and patterns. While the comparison between subsets of data can be expressed using SQL, they tend to be complex to write, and suffer…

Databases · Computer Science 2021-07-28 Tarique Siddiqui , Surajit Chaudhuri , Vivek Narasayya

TensorDIMM: A Practical Near-Memory Processing Architecture for Embeddings and Tensor Operations in Deep Learning

Recent studies from several hyperscalars pinpoint to embedding layers as the most memory-intensive deep learning (DL) algorithm being deployed in today's datacenters. This paper addresses the memory capacity and bandwidth challenges of…

Machine Learning · Computer Science 2019-08-27 Youngeun Kwon , Yunjae Lee , Minsoo Rhu

DB-GPT: Empowering Database Interactions with Private Large Language Models

The recent breakthroughs in large language models (LLMs) are positioned to transition many areas of software. Database technologies particularly have an important entanglement with LLMs as efficient and intuitive database interactions are…

Databases · Computer Science 2024-01-04 Siqiao Xue , Caigao Jiang , Wenhui Shi , Fangyin Cheng , Keting Chen , Hongjun Yang , Zhiping Zhang , Jianshan He , Hongyang Zhang , Ganglin Wei , Wang Zhao , Fan Zhou , Danrui Qi , Hong Yi , Shaodong Liu , Faqiang Chen