Related papers: Efficient Iterative Processing in the SciDB Parall…

Spinning Fast Iterative Data Flows

Parallel dataflow systems are a central part of most analytic pipelines for big data. The iterative nature of many analysis and machine learning algorithms, however, is still a challenge for current systems. While certain types of bulk…

Databases · Computer Science 2012-08-02 Stephan Ewen , Kostas Tzoumas , Moritz Kaufmann , Volker Markl

ArrayBridge: Interweaving declarative array processing with high-performance computing

Scientists are increasingly turning to datacenter-scale computers to produce and analyze massive arrays. Despite decades of database research that extols the virtues of declarative query processing, scientists still write, debug and…

Databases · Computer Science 2017-02-28 Haoyuan Xing , Sofoklis Floratos , Spyros Blanas , Suren Byna , Prabhat , Kesheng Wu , Paul Brown

Parallel Astronomical Data Processing with Python: Recipes for multicore machines

High performance computing has been used in various fields of astrophysical research. But most of it is implemented on massively parallel systems (supercomputers) or graphical processing unit clusters. With the advent of multicore…

Instrumentation and Methods for Astrophysics · Physics 2013-07-30 Navtej Singh , Lisa-Marie Browne , Ray Butler

An iterative method for classification of binary data

In today's data driven world, storing, processing, and gleaning insights from large-scale data are major challenges. Data compression is often required in order to store large amounts of high-dimensional data, and thus, efficient inference…

Machine Learning · Statistics 2018-09-11 Denali Molitor , Deanna Needell

Leveraging Error Resilience of Iterative Algorithms for Energy Efficiency: from Concept to Implementation

Iterative algorithms are widely used in digital signal processing applications. With the case study of radio astronomy calibration processing, this work contributes towards revealing and exploiting the intrinsic error resilience of…

Signal Processing · Electrical Eng. & Systems 2025-02-21 G. A. Gillani , A. Krapukhin , A. B. J. Kokkeler

A Quick Introduction to Functional Verification of Array-Intensive Programs

Array-intensive programs are often amenable to parallelization across many cores on a single machine as well as scaling across multiple machines and hence are well explored, especially in the domain of high-performance computing. These…

Programming Languages · Computer Science 2019-05-23 Kunal Banerjee , Chandan Karfa

Benchmarking SciDB Data Import on HPC Systems

SciDB is a scalable, computational database management system that uses an array model for data storage. The array data model of SciDB makes it ideally suited for storing and managing large amounts of imaging data. SciDB is designed to…

Databases · Computer Science 2016-12-13 Siddharth Samsi , Laura Brattain , William Arcand , David Bestor , Bill Bergeron , Chansup Byun , Vijay Gadepally , Michael Houle , Matthew Hubbell , Michael Jones , Anna Klein , Peter Michaleas , Lauren Milechin , Julie Mullen , Andrew Prout , Antonio Rosa , Charles Yee , Jeremy Kepner , Albert Reuther

Distributed Caching for Complex Querying of Raw Arrays

As applications continue to generate multi-dimensional data at exponentially increasing rates, fast analytics to extract meaningful results is becoming extremely important. The database community has developed array databases that alleviate…

Databases · Computer Science 2018-03-19 Weijie Zhao , Florin Rusu , Bin Dong , Kesheng Wu , Anna Y. Q. Ho , Peter Nugent

Mathematical Computation on High-dimensional Data via Array Programming and Parallel Acceleration

While deep learning excels in natural image and language processing, its application to high-dimensional data faces computational challenges due to the dimensionality curse. Current large-scale data tools focus on business-oriented…

Machine Learning · Computer Science 2025-07-01 Chen Zhang

A Survey on Intelligent Iterative Methods for Solving Sparse Linear Algebraic Equations

Efficiently solving sparse linear algebraic equations is an important research topic of numerical simulation. Commonly used approaches include direct methods and iterative methods. Compared with the direct methods, the iterative methods…

Numerical Analysis · Mathematics 2023-10-11 Haifeng Zou , Xiaowen Xu , Chen-Song Zhang

Multi-Dimensional Data Compression and Query Processing in Array Databases

In recent times, the production of multidimensional data in various domains and their storage in array databases has witnessed a sharp increase; this rapid growth in data volumes necessitates compression in array databases. However,…

Databases · Computer Science 2022-11-14 Minsoo Kim , Hyubjin Lee , Yon Dohn Chung

A Novel Approach to Translate Structural Aggregation Queries to MapReduce Code

Data management applications are growing and require more attention, especially in the "big data" era. Thus, supporting such applications with novel and efficient algorithms that achieve higher performance is critical. Array database…

Databases · Computer Science 2025-02-04 Ahmed M. Abdelmoniem , Sameh Abdulah , Walid Atwa

Optimizing I/O for Big Array Analytics

Big array analytics is becoming indispensable in answering important scientific and business questions. Most analysis tasks consist of multiple steps, each making one or multiple passes over the arrays to be analyzed and generating…

Databases · Computer Science 2012-04-30 Yi Zhang , Jun Yang

A Scientific Data Management System for Irregular Applications

Many scientific applications are I/O intensive and generate or access large data sets, spanning hundreds or thousands of "files." Management, storage, efficient access, and analysis of this data present an extremely challenging task. We…

Distributed, Parallel, and Cluster Computing · Computer Science 2007-05-23 Jaechun No , Rajeev Thakur , Dinesh Kaushik , Lori Freitag , Alok Choudhary

Memory-efficient array redistribution through portable collective communication

Modern large-scale deep learning workloads highlight the need for parallel execution across many devices in order to fit model data into hardware accelerator memories. In these settings, array redistribution may be required during a…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-11-29 Norman A. Rink , Adam Paszke , Dimitrios Vytiniotis , Georg Stefan Schmid

Projection-Based Iterative Mode Scheduling for Switched Systems

This paper describes a method for scheduling the events of a switched system to achieve an optimal performance. The approach has guarantees on convergence and computational complexity that parallel derivative-based iterative optimization…

Optimization and Control · Mathematics 2017-09-11 Timothy Caldwell , Todd Murphey

Learning Iterative Reasoning through Energy Minimization

Deep learning has excelled on complex pattern recognition tasks such as image classification and object recognition. However, it struggles with tasks requiring nontrivial reasoning, such as algorithmic computation. Humans are able to solve…

Machine Learning · Computer Science 2022-07-01 Yilun Du , Shuang Li , Joshua B. Tenenbaum , Igor Mordatch

LifeRaft: Data-Driven, Batch Processing for the Exploration of Scientific Databases

Workloads that comb through vast amounts of data are gaining importance in the sciences. These workloads consist of "needle in a haystack" queries that are long running and data intensive so that query throughput limits performance. To…

Databases · Computer Science 2009-09-15 Xiaodan Wang , Randal Burns , Tanu Malik

Array Requirements for Scientific Applications and an Implementation for Microsoft SQL Server

This paper outlines certain scenarios from the fields of astrophysics and fluid dynamics simulations which require high performance data warehouses that support array data type. A common feature of all these use cases is that subsetting and…

Databases · Computer Science 2011-10-11 László Dobos , Alexander Szalay , José Blakeley , Tamás Budavári , István Csabai , Dragan Tomic , Milos Milovanovic , Marko Tintor , Andrija Jovanovic

Iterative Domain Optimization

In this paper we study a new approach in optimization that aims to search a large domain D where a given function takes large, small or specific values via an iterative optimization algorithm based on the gradient. We show that the…

Optimization and Control · Mathematics 2020-05-21 Raian Noufel Lefgoum