Related papers: Learning Everywhere: Pervasive Machine Learning fo…

200 papers

We recently outlined the vision of "Learning Everywhere" which captures the possibility and impact of how learning methods and traditional HPC methods can be coupled together. A primary driver of such coupling is the promise that Machine…

Machine Learning · Computer Science 2019-09-16 Geoffrey Fox, Shantenu Jha

Traditional simulations on High-Performance Computing (HPC) systems typically involve modeling very large domains and/or very complex equations. HPC systems allow running large models, but limits in performance increase that have become…

Machine learning-based performance models are increasingly being used to build critical job scheduling and application optimization decisions. Traditionally, these models assume that data distribution does not change as more samples are…

Machine Learning · Computer Science 2023-10-27 Ray A. O. Sinurat, Anurag Daram, Haryadi S. Gunawi, Robert B. Ross, Sandeep Madireddy

We explore the idea of integrating machine learning (ML) with high performance computing (HPC)-driven simulations to address challenges in using simulations to teach computational science and engineering courses. We demonstrate that a ML…

Physics Education · Physics 2020-09-01 Vikram Jadhao, JCS Kadupitiya

Characterizing and predicting the training performance of modern machine learning (ML) workloads on compute systems with compute and communication spread between CPUs, GPUs, and network devices is not only the key to optimization and…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-11-27 Zhongyi Lin, Ning Sun, Pallab Bhattacharya, Xizhou Feng, Louis Feng, John D. Owens

The growing demands of the worldwide IT infrastructure stress the need for reduced power consumption, which is addressed in so-called transprecision computing by improving energy efficiency at the expense of precision. For example, reducing…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-07-30 Andrea Borghesi, Giuseppe Tagliavini, Michele Lombardi, Luca Benini, Michela Milano

With the growing complexity of computational and experimental facilities, many scientific researchers are turning to machine learning (ML) techniques to analyze large scale ensemble data. With complexities such as multi-component workflows,…

Growing interest in Artificial Intelligence (AI) has resulted in a surge in demand for faster methods of Machine Learning (ML) model training and inference. This demand for speed has prompted the use of high performance computing (HPC)…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-08-26 Noah Lewis, Jean Luca Bez, Surendra Byna

As a broader set of applications from simulations to data analysis and machine learning require more parallel computational capability, the demand for interactive and urgent high performance computing (HPC) continues to increase. This paper…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-01-29 Albert Reuther, Nick Brown, William Arndt, Johannes Blaschke, Christian Boehme, Antony Chazapis, Bjoern Enders, Robert Henschel, Julian Kunkel, Maxime Martinasso

Hardware support for high-performance computing (HPC) has so far been subject to significant advances. The pervasiveness of HPC systems, mainly made up with parallel computing units, makes it crucial to spread and vivify effective HPC…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-02-06 Claude Tadonki

High-performance computing (HPC) has evolved over decades through multiple architectural transitions, from vector supercomputers to massively parallel CPU clusters and GPU-accelerated systems, continuously expanding the frontier of…

Quantum Physics · Physics 2026-04-23 Suman Raj, Siva Sai, Yogesh Simmhan, Kyle Chard, Rajkumar Buyya

Increasingly, scientific discovery requires sophisticated and scalable workflows. Workflows have become the "new applications," wherein multi-scale computing campaigns comprise multiple and heterogeneous executable tasks. In particular,…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-08-26 Shantenu Jha, Vincent R. Pascuzzi, Matteo Turilli

High Performance Distributed Computing is essential to boost scientific progress in many areas of science and to efficiently deploy a number of complex scientific applications. These applications have different characteristics that require…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-12-04 Mariza Ferro, Antonio R. Mury, Laion F. Manfroi, Bruno Schlze

Recently, language models (LMs), especially large language models (LLMs), have revolutionized the field of deep learning. Both encoder-decoder models and prompt-based techniques have shown immense potential for natural language processing…

Nowadays, we are to find out solutions to huge computing problems very rapidly. It brings the idea of parallel computing in which several machines or processors work cooperatively for computational tasks. In the past decades, there are a…

Programming Languages · Computer Science 2014-02-07 Brijender Kahanwal

Can cloud computing infrastructures provide HPC-competitive performance for scientific applications broadly? Despite prolific related literature, this question remains open. Answers are crucial for designing future systems and democratizing…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-03-09 Giulia Guidi, Marquita Ellis, Aydin Buluc, Katherine Yelick, David Culler

Let's HPC (www.letshpc.org) is an open-access online platform to supplement conventional classroom oriented High Performance Computing (HPC) and Parallel & Distributed Computing (PDC) education. The web based platform provides online…

Computers and Society · Computer Science 2017-01-24 Akshar Varma, Yashwant Keswani, Yashodhan Bhatnagar, Bhaskar Chaudhury

High Performance Computing (HPC), Artificial Intelligence (AI)/Machine Learning (ML), and Quantum Computing (QC) and communications offer immense opportunities for innovation and impact on society. Researchers in these areas depend on…

Computers and Society · Computer Science 2020-12-18 William Gropp, Sujata Banerjee, Ian Foster

Heterogeneous scientific workflows consist of numerous types of tasks that require executing on heterogeneous resources. Asynchronous execution of those tasks is crucial to improve resource utilization, task throughput and reduce workflows'…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-06-28 Vincent R. Pascuzzi, Ozgur O. Kilic, Matteo Turilli, Shantenu Jha

In this work, system monitoring and analysis are discussed in terms of their significance and benefits for operations and research in the field of high-performance computing (HPC). HPC systems deliver unique insights to computational…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-07-10 Florina M. Ciorba