Related papers: Array Requirements for Scientific Applications and…

SQL Query Completion for Data Exploration

Within the big data tsunami, relational databases and SQL are still there and remain mandatory in most of cases for accessing data. On the one hand, SQL is easy-to-use by non specialists and allows to identify pertinent initial data at the…

Databases · Computer Science 2018-02-09 Marie Le Guilly , Jean-Marc Petit , Vasile-Marian Scuturici

Facilitating SQL Query Composition and Analysis

Formulating efficient SQL queries requires several cycles of tuning and execution, particularly for inexperienced users. We examine methods that can accelerate and improve this interaction by providing insights about SQL queries prior to…

Databases · Computer Science 2020-02-24 Zainab Zolaktaf , Mostafa Milani , Rachel Pottinger

SciServer: a Science Platform for Astronomy and Beyond

We present SciServer, a science platform built and supported by the Institute for Data Intensive Engineering and Science at the Johns Hopkins University. SciServer builds upon and extends the SkyServer system of server-side tools that…

Instrumentation and Methods for Astrophysics · Physics 2020-09-07 Manuchehr Taghizadeh-Popp , Jai Won Kim , Gerard Lemson , Dmitry Medvedev , M. Jordan Raddick , Alexander S. Szalay , Aniruddha R. Thakar , Joseph Booker , Camy Chhetri , Laszlo Dobos , Michael Rippin

Graywulf: A platform for federated scientific databases and services

Many fields of science rely on relational database management systems to analyze, publish and share data. Since RDBMS are originally designed for, and their development directions are primarily driven by, business use cases they often lack…

Databases · Computer Science 2013-08-08 László Dobos , Alexander S. Szalay , Tamás Budavári , István Csabai , Nolan Li

SAVIME: A Multidimensional System for the Analysis and Visualization of Simulation Data

Scientific applications produce a huge amount of data, which imposes serious management and analysis challenges. In particular, limitations in current database management systems prevent their adoption in simulation applications, in which…

Databases · Computer Science 2019-03-18 Hermano Lustosa , Fabio Porto

Scientific Data Management in the Coming Decade

This is a thought piece on data-intensive science requirements for databases and science centers. It argues that peta-scale datasets will be housed by science centers that provide substantial storage and processing for scientists who access…

Databases · Computer Science 2007-05-23 Jim Gray , David T. Liu , Maria Nieto-Santisteban , Alexander S. Szalay , David DeWitt , Gerd Heber

Multi-Dimensional Data Compression and Query Processing in Array Databases

In recent times, the production of multidimensional data in various domains and their storage in array databases has witnessed a sharp increase; this rapid growth in data volumes necessitates compression in array databases. However,…

Databases · Computer Science 2022-11-14 Minsoo Kim , Hyubjin Lee , Yon Dohn Chung

SODA: Generating SQL for Business Users

The purpose of data warehouses is to enable business analysts to make better decisions. Over the years the technology has matured and data warehouses have become extremely successful. As a consequence, more and more data has been added to…

Databases · Computer Science 2012-07-03 Lukas Blunschi , Claudio Jossen , Donald Kossman , Magdalini Mori , Kurt Stockinger

A Typeful Integration of SQL into Curry

We present an extension of the declarative programming language Curry to support the access to data stored in relational databases via SQL. Since Curry is statically typed, our emphasis on this SQL integration is on type safety. Our…

Programming Languages · Computer Science 2017-01-04 Michael Hanus , Julia Krone

Efficient Iterative Processing in the SciDB Parallel Array Engine

Many scientific data-intensive applications perform iterative computations on array data. There exist multiple engines specialized for array processing. These engines efficiently support various types of operations, but none includes native…

Databases · Computer Science 2015-06-02 Emad Soroush , Magdalena Balazinska , Simon Krughoff , Andrew Connolly

Adding HL7 version 3 data types to PostgreSQL

The HL7 standard is widely used to exchange medical information electronically. As a part of the standard, HL7 defines scalar communication data types like physical quantity, point in time and concept descriptor but also complex types such…

Databases · Computer Science 2010-03-18 Yeb Havinga , Willem Dijkstra , Ander de Keijzer

Subset Queries in Relational Databases

In this paper, we motivated the need for relational database systems to support subset query processing. We defined new operators in relational algebra, and new constructs in SQL for expressing subset queries. We also illustrated the…

Databases · Computer Science 2007-05-23 Satyanarayana R Valluri , Kamalakar Karlapalem

Experiments \& Analysis of Privacy-Preserving SQL Query Sanitization Systems

Analytical SQL queries are essential for extracting insights from relational databases but concurrently introduce significant privacy risks by potentially exposing sensitive information. To mitigate these risks, numerous query sanitization…

Databases · Computer Science 2025-10-16 Loïs Ecoffet , Veronika Rehn-Sonigo , Jean-François Couchot , Catuscia Palamidessi

Perspects in astrophysical databases

Astrophysics has become a domain extremely rich of scientific data. Data mining tools are needed for information extraction from such large datasets. This asks for an approach to data management emphasizing the efficiency and simplicity of…

Databases · Computer Science 2009-11-10 M. Frailis , A. De Angelis , V. Roberto

A Survey on Array Storage, Query Languages, and Systems

Since scientific investigation is one of the most important providers of massive amounts of ordered data, there is a renewed interest in array data processing in the context of Big Data. To the best of our knowledge, a unified resource that…

Databases · Computer Science 2013-02-20 Florin Rusu , Yu Cheng

Sea: A lightweight data-placement library for Big Data scientific computing

The recent influx of open scientific data has contributed to the transitioning of scientific computing from compute intensive to data intensive. Whereas many Big Data frameworks exist that minimize the cost of data transfers, few scientific…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-07-06 Valérie Hayot-Sasson , Mathieu Dugré , Tristan Glatard

Associative Array Model of SQL, NoSQL, and NewSQL Databases

The success of SQL, NoSQL, and NewSQL databases is a reflection of their ability to provide significant functionality and performance benefits for specific domains, such as financial transactions, internet search, and data analysis. The…

Databases · Computer Science 2016-12-13 Jeremy Kepner , Vijay Gadepally , Dylan Hutchison , Hayden Jananthan , Timothy Mattson , Siddharth Samsi , Albert Reuther

Detecting coherent explorations in SQL workloads

This paper presents a proposal aiming at better understanding a workload of SQL queries and detecting coherent explorations hidden within the workload. In particular, our work investigates SQLShare [11], a database-as-a-service platform…

Databases · Computer Science 2019-07-15 Veronika Peralta , Patrick Marcel , Willeme Verdeaux , Aboubakar Sidikhy Diakhaby

Integration of Skyline Queries into Spark SQL

Skyline queries are frequently used in data analytics and multi-criteria decision support applications to filter relevant information from big amounts of data. Apache Spark is a popular framework for processing big, distributed data. The…

Databases · Computer Science 2022-10-10 Lukas Grasmann , Reinhard Pichler , Alexander Selzer

ArrayBridge: Interweaving declarative array processing with high-performance computing

Scientists are increasingly turning to datacenter-scale computers to produce and analyze massive arrays. Despite decades of database research that extols the virtues of declarative query processing, scientists still write, debug and…

Databases · Computer Science 2017-02-28 Haoyuan Xing , Sofoklis Floratos , Spyros Blanas , Suren Byna , Prabhat , Kesheng Wu , Paul Brown