Related papers: A Note on Parallel Algorithmic Speedup Bounds

A generalization of Amdahl's law and relative conditions of parallelism

In this work I present a generalization of Amdahl's law on the limits of a parallel implementation with many processors. In particular I establish some mathematical relations involving the number of processors and the dimension of the…

Distributed, Parallel, and Cluster Computing · Computer Science 2007-05-23 Gianluca Argentini

A figure of merit for describing the performance of scaling of parallelization

With the spread of multi- and many-core processors more and more typical task is to re-implement some source code written originally for a single processor to run on more than one cores. Since it is a serious investment, it is important to…

Performance · Computer Science 2016-07-25 János Végh , Péter Molnár , József Vásárhelyi

Parallel Stochastic Asynchronous Coordinate Descent: Tight Bounds on the Possible Parallelism

Several works have shown linear speedup is achieved by an asynchronous parallel implementation of stochastic coordinate descent so long as there is not too much parallelism. More specifically, it is known that if all updates are of similar…

Optimization and Control · Mathematics 2020-11-23 Yun Kuen Cheung , Richard Cole , Yixin Tao

When parallel speedups hit the memory wall

After Amdahl's trailblazing work, many other authors proposed analytical speedup models but none have considered the limiting effect of the memory wall. These models exploited aspects such as problem-size variation, memory size,…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-05-11 Alex F. A. Furtunato , Kyriakos Georgiou , Kerstin Eder , Samuel Xavier-de-Souza

Fully Asynchronous Stochastic Coordinate Descent: A Tight Lower Bound on the Parallelism Achieving Linear Speedup

We seek tight bounds on the viable parallelism in asynchronous implementations of coordinate descent that achieves linear speedup. We focus on asynchronous coordinate descent (ACD) algorithms on convex functions which consist of the sum of…

Optimization and Control · Mathematics 2020-08-04 Yun Kuen Cheung , Richard Cole , Yixin Tao

On Extending Amdahl's law to Learn Computer Performance

The problem of learning parallel computer performance is investigated in the context of multicore processors. Given a fixed workload, the effect of varying system configuration on performance is sought. Conventionally, the performance…

Machine Learning · Computer Science 2022-09-28 Chaitanya Poolla , Rahul Saxena

Speedup and efficiency of computational parallelization: A unifying approach and asymptotic analysis

In high performance computing environments, we observe an ongoing increase in the available numbers of cores. This development calls for re-emphasizing performance (scalability) analysis and speedup laws as suggested in the literature…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-11-16 Guido Schryen

Author's approach to the topological modeling of parallel computing systems

The author's research of topologies of parallel computing systems and the tasks solved with them, including the corresponding tools of their modeling, is summarized in the present paper. The original topological model of such systems is…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-10-07 Victor A. Melent'ev

Performance Evaluation of Parallel Algorithms

Evaluating how well a whole system or set of subsystems performs is one of the primary objectives of performance testing. We can tell via performance assessment if the architecture implementation meets the design objectives. Performance…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-09-15 Donald Ene Vincent Ike Anireh

Parallel parametric linear programming solving, and application to polyhedral computations

Parametric linear programming is central in polyhedral computations and in certain control applications.We propose a task-based scheme for parallelizing it, with quasi-linear speedup over large problems.

Distributed, Parallel, and Cluster Computing · Computer Science 2019-04-15 Camille Coti , David Monniaux , Hang Yu

A task-based approach to parallel parametric linear programming solving, and application to polyhedral computations

Parametric linear programming is a central operation for polyhedral computations, as well as in certain control applications.Here we propose a task-based scheme for parallelizing it, with quasi-linear speedup over large problems.This type…

Computational Geometry · Computer Science 2020-10-01 Camille Coti , David Monniaux , Hang Yu

Parallel Path Progression DAG Scheduling

To satisfy the increasing performance needs of modern cyber-physical systems, multiprocessor architectures are increasingly utilized. To efficiently exploit their potential parallelism in hard real-time systems, appropriate task models and…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-08-26 Niklas Ueter , Mario Günzel , Georg von der Brüggen , Jian-Jia Chen

The performance wall of parallelized sequential computing: the dark performance and the roofline of performance gain

The computing performance today is developing mainly using parallelized sequential computing, in many forms. The paper scrutinizes whether the performance of that type of computing has an upper limit. The simple considerations point out…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-08-07 János Végh

Concurrent Scheduling of High-Level Parallel Programs on Multi-GPU Systems

Parallel programming models can encourage performance portability by moving the responsibility for work assignment and data distribution from the programmer to a runtime system. However, analyzing the resulting implicit memory allocations,…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-03-14 Fabian Knorr , Philip Salzmann , Peter Thoman , Thomas Fahringer

Method for determining the acceleration of a parallel specialised computer system based on Amdahl's law

The modification of Amdahl's law for the case of increment of processor elements in a computer system is considered. The coefficient $k$ linking accelerations of parallel and parallel specialized computer systems is determined. The limiting…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-01-31 Aleksandr S. Filipchenko

Limitations of performance of Exascale Applications and supercomputers they are running on

The paper highlights that the cooperation of the components of the computing systems receives even more focus in the coming age of exascale computing. It discovers that inherent performance limitations exist and identifies the major…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-08-17 János Végh

A sufficient condition for a linear speedup in competitive parallel computing

In competitive parallel computing, the identical copies of a code in a phase of a sequential program are assigned to processor cores and the result of the fastest core is adopted. In the literature, it is reported that a superlinear speedup…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-08-22 Naoki Yonezawa

The Effect of Communication and Synchronization on Amdahl Law in Multicore Systems

This work analyses the effects of sequential-to-parallel synchronization and inter-core communication on multicore performance, speedup and scaling. A modification of Amdahl law is formulated, to reflect the finding that parallel speedup is…

Hardware Architecture · Computer Science 2013-06-17 Leonid Yavits , Amir Morad , Ran Ginosar

Modernizing Amdahl's Law: How AI Scaling Laws Shape Computer Architecture

Classical Amdahl's Law conceptualized the limit of speedup for an era of fixed serial-parallel decomposition and homogeneous replication. Modern heterogeneous systems need a different conceptual framework: constrained resources must be…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-04-09 Chien-Ping Lu

On Unbounded Delays in Asynchronous Parallel Fixed-Point Algorithms

The need for scalable numerical solutions has motivated the development of asynchronous parallel algorithms, where a set of nodes run in parallel with little or no synchronization, thus computing with delayed information. This paper studies…

Optimization and Control · Mathematics 2017-08-18 Robert Hannah , Wotao Yin