English
Related papers

Related papers: Supercharging Distributed Computing Environments F…

200 papers

The data science community today has embraced the concept of Dataframes as the de facto standard for data representation and manipulation. Ease of use, massive operator coverage, and popularization of R and Python languages have heavily…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-07-06 Niranda Perera , Supun Kamburugamuve , Chathura Widanage , Vibhatha Abeykoon , Ahmet Uyar , Kaiying Shan , Hasara Maithree , Damitha Lenadora , Thejaka Amila Kanewala , Geoffrey Fox

The amazing advances being made in the fields of machine and deep learning are a highlight of the Big Data era for both enterprise and research communities. Modern applications require resources beyond a single node's ability to provide.…

The Data Science domain has expanded monumentally in both research and industry communities during the past decade, predominantly owing to the Big Data revolution. Artificial Intelligence (AI) and Machine Learning (ML) are bringing more…

Applications are increasingly written as dynamic workflows underpinned by an execution framework that manages asynchronous computations across distributed hardware. However, execution frameworks typically offer one-size-fits-all solutions…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-10-18 J. Gregory Pauloski , Klaudiusz Rydzy , Valerie Hayot-Sasson , Ian Foster , Kyle Chard

In the current era of Big Data, data engineering has transformed into an essential field of study across many branches of science. Advancements in Artificial Intelligence (AI) have broadened the scope of data engineering and opened up new…

We present DataFlow, a computational framework for building, testing, and deploying high-performance machine learning systems on unbounded time-series data. Traditional data science workflows assume finite datasets and require substantial…

Machine Learning · Computer Science 2026-01-01 Giacinto Paolo Saggese , Paul Smith

Over the past decade, machine learning model complexity has grown at an extraordinary rate, as has the scale of the systems training such large models. However there is an alarmingly low hardware utilization (5-20%) in large scale AI…

Hardware Architecture · Computer Science 2022-11-14 Newsha Ardalani , Saptadeep Pal , Puneet Gupta

Data pre-processing is a fundamental component in any data-driven application. With the increasing complexity of data processing operations and volume of data, Cylon, a distributed dataframe system, is developed to facilitate data…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-01-02 Kaiying Shan , Niranda Perera , Damitha Lenadora , Tianle Zhong , Arup Sarker , Supun Kamburugamuve , Thejaka Amila Kanewela , Chathura Widanage , Geoffrey Fox

Recent deep learning workloads increasingly push computational demand beyond what current memory systems can sustain, with many kernels stalling on data movement rather than computation. While modern dataflow accelerators incorporate…

Programming Languages · Computer Science 2025-09-09 Shihan Fang , Hongzheng Chen , Niansong Zhang , Jiajie Li , Han Meng , Adrian Liu , Zhiru Zhang

Shared memory multiprocessors come back to popularity thanks to rapid spreading of commodity multi-core architectures. As ever, shared memory programs are fairly easy to write and quite hard to optimise; providing multi-core programmers…

Distributed, Parallel, and Cluster Computing · Computer Science 2009-09-10 Marco Aldinucci , Massimo Torquati , Massimiliano Meneghin

Researchers and practitioners in the field of reinforcement learning (RL) frequently leverage parallel computation, which has led to a plethora of new algorithms and systems in the last few years. In this paper, we re-examine the challenges…

Machine Learning · Computer Science 2021-11-01 Eric Liang , Zhanghao Wu , Michael Luo , Sven Mika , Joseph E. Gonzalez , Ion Stoica

Reinforcement learning (RL) has become the pivotal post-training technique for large language model (LLM). Effectively scaling reinforcement learning is now the key to unlocking advanced reasoning capabilities and ensuring safe,…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-09-10 Zhixin Wang , Tianyi Zhou , Liming Liu , Ao Li , Jiarui Hu , Dian Yang , Yinhui Lu , Jinlong Hou , Siyuan Feng , Yuan Cheng , Yuan Qi

Significant obstacles exist in scientific domains including genetics, climate modeling, and astronomy due to the management, preprocess, and training on complicated data for deep learning. Even while several large-scale solutions offer…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-04-23 Arup Kumar Sarker , Aymen Alsaadi , Alexander James Halpern , Prabhath Tangella , Mikhail Titov , Niranda Perera , Mills Staylor , Gregor von Laszewski , Shantenu Jha , Geoffrey Fox

Data is found everywhere, from health and human infrastructure to the surge of sensors and the proliferation of internet-connected devices. To meet this challenge, the data engineering field has expanded significantly in recent years in…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-03-06 Mills Staylor , Arup Kumar Sarker , Gregor von Laszewski , Geoffrey Fox , Yue Cheng , Judy Fox

Managing and preparing complex data for deep learning, a prevalent approach in large-scale data science can be challenging. Data transfer for model training also presents difficulties, impacting scientific fields like genomics, climate…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-04-09 Arup Kumar Sarker , Aymen Alsaadi , Niranda Perera , Mills Staylor , Gregor von Laszewski , Matteo Turilli , Ozgur Ozan Kilic , Mikhail Titov , Andre Merzky , Shantenu Jha , Geoffrey Fox

The advances in data, computing and networking over the last two decades led to a shift in many application domains that includes machine learning on big data as a part of the scientific process, requiring new capabilities for integrated…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-03-19 Ilkay Altintas , Kyle Marcus , Isaac Nealey , Scott L. Sellars , John Graham , Dima Mishin , Joel Polizzi , Daniel Crawl , Thomas DeFanti , Larry Smarr

Massive upgrades to science infrastructure are driving data velocities upwards while stimulating adoption of increasingly data-intensive analytics. While next-generation exascale supercomputers promise strong support for I/O-intensive…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-07-06 Michael Salim , Thomas Uram , J. Taylor Childers , Venkat Vishwanath , Michael E. Papka

Many recent machine learning models rely on fine-grained dynamic control flow for training and inference. In particular, models based on recurrent neural networks and on reinforcement learning depend on recurrence relations, data-dependent…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-05-09 Yuan Yu , Martín Abadi , Paul Barham , Eugene Brevdo , Mike Burrows , Andy Davis , Jeff Dean , Sanjay Ghemawat , Tim Harley , Peter Hawkins , Michael Isard , Manjunath Kudlur , Rajat Monga , Derek Murray , Xiaoqiang Zheng

Apart from forming the backbone of compiler optimization, static dataflow analysis has been widely applied in a vast variety of applications, such as bug detection, privacy analysis, program comprehension, etc. Despite its importance,…

Programming Languages · Computer Science 2024-12-18 Zewen Sun , Yujin Zhang , Duanchen Xu , Yiyu Zhang , Yun Qi , Yueyang Wang , Yi Li , Zhaokang Wang , Yue Li , Xuandong Li , Zhiqiang Zuo , Qingda Lu , Wenwen Peng , Shengjian Guo

As deep learning models scale, sparse computation and specialized dataflow hardware have emerged as powerful solutions to address efficiency. We propose FuseFlow, a compiler that converts sparse machine learning models written in PyTorch to…

Machine Learning · Computer Science 2026-01-27 Rubens Lacouture , Nathan Zhang , Ritvik Sharma , Marco Siracusa , Fredrik Kjolstad , Kunle Olukotun , Olivia Hsu
‹ Prev 1 2 3 10 Next ›