Jack Dongarra — Scifaro

Analysis of Floating-Point Matrix Multiplication Computed via Integer Arithmetic

Ootomo, Ozaki, and Yokota [Int. J. High Perform. Comput. Appl., 38 (2024), pp. 297-313] have proposed a strategy to recast a floating-point matrix multiplication in terms of integer matrix products. The factors A and B are split into integer…

Numerical Analysis · Mathematics 2026-05-11 Ahmad Abdelfattah , Jack Dongarra , Massimiliano Fasi , Mantas Mikaitis , Françoise Tisseur

HPL-MxP Benchmark: Mixed-Precision Algorithms, Iterative Refinement, and Scalable Data Generation

We present a mixed-precision benchmark called HPL-MxP that uses both a lower-precision LU factorization and a non-stationary iterative refinement based on GMRES. We evaluate the numerical stability of one of the methods of generating the…

Numerical Analysis · Mathematics 2025-09-25 Jack Dongarra , Piotr Luszczek

The Stability of Block Eliminations and Additive Modifications

The block elimination with additive modifications (BEAM) method was recently proposed as an alternative to LU with partial pivoting requiring less communication. Because of the novelty of BEAM, the existing theoretical analysis is lacking.…

Numerical Analysis · Mathematics 2025-09-10 Neil Lindquist , Piotr Luszczek , Jack Dongarra

Hardware Trends Impacting Floating-Point Computations In Scientific Applications

The evolution of floating-point computation has been shaped by algorithmic advancements, architectural innovations, and the increasing computational demands of modern technologies, such as artificial intelligence (AI) and high-performance…

Numerical Analysis · Mathematics 2024-12-23 Jack Dongarra , John Gunnels , Harun Bayraktar , Azzam Haidar , Dan Ernst

Generalizing Random Butterfly Transforms to Arbitrary Matrix Sizes

Parker and Lê introduced random butterfly transforms (RBTs) as a preprocessing technique to replace pivoting in dense LU factorization. Unfortunately, their FFT-like recursive structure restricts the dimensions of the matrix. Furthermore,…

Numerical Analysis · Mathematics 2024-10-14 Neil Lindquist , Piotr Luszczek , Jack Dongarra

XaaS: Acceleration as a Service to Enable Productive High-Performance Cloud Computing

HPC and Cloud have evolved independently, specializing their innovations into performance or productivity. Acceleration as a Service (XaaS) is a recipe to empower both fields with a shared execution platform that provides transparent access…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-01-10 Torsten Hoefler , Marcin Copik , Pete Beckman , Andrew Jones , Ian Foster , Manish Parashar , Daniel Reed , Matthias Troyer , Thomas Schulthess , Dan Ernst , Jack Dongarra

Randomized Numerical Linear Algebra: A Perspective on the Field With an Eye to Software

Randomized numerical linear algebra - RandNLA, for short - concerns the use of randomization as a resource to develop improved algorithms for large-scale linear algebra computations. The origins of contemporary RandNLA lay in theoretical…

Numerical Analysis · Mathematics 2023-04-14 Riley Murray , James Demmel , Michael W. Mahoney , N. Benjamin Erichson , Maksim Melnichenko , Osman Asif Malik , Laura Grigori , Piotr Luszczek , Michał Dereziński , Miles E. Lopes , Tianyu Liang , Hengrui Luo , Jack Dongarra

Proposed Consistent Exception Handling for the BLAS and LAPACK

Numerical exceptions, which may be caused by overflow, operations like division by 0 or sqrt(-1), or convergence failures, are unavoidable in many cases, in particular when software is used on unforeseen and difficult inputs. As more…

Mathematical Software · Computer Science 2022-07-20 James Demmel , Jack Dongarra , Mark Gates , Greg Henry , Julien Langou , Xiaoye Li , Piotr Luszczek , Weslley Pereira , Jason Riedy , Cindy Rubio-González

Reinventing High Performance Computing: Challenges and Opportunities

The world of computing is in rapid transition, now dominated by a world of smartphones and cloud services, with profound implications for the future of advanced scientific computing. Simply put, high-performance computing (HPC) is at an…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-03-08 Daniel Reed , Dennis Gannon , Jack Dongarra

Efficient Exascale Discretizations: High-Order Finite Element Methods

Efficient exploitation of exascale architectures requires rethinking of the numerical algorithms used in many large-scale applications. These architectures favor algorithms that expose ultra fine-grain parallelism and maximize the ratio of…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-09-13 Tzanio Kolev , Paul Fischer , Misun Min , Jack Dongarra , Jed Brown , Veselin Dobrev , Tim Warburton , Stanimire Tomov , Mark S. Shephard , Ahmad Abdelfattah , Valeria Barra , Natalie Beams , Jean-Sylvain Camier , Noel Chalmers , Yohann Dudouit , Ali Karakus , Ian Karlin , Stefan Kerkemeier , Yu-Hsiang Lan , David Medina , Elia Merzari , Aleksandr Obabko , Will Pazner , Thilina Rathnayake , Cameron W. Smith , Lukas Spies , Kasia Swirydowicz , Jeremy Thompson , Ananias Tomboulides , Vladimir Tomov

Integrating Deep Learning in Domain Sciences at Exascale

This paper presents some of the current challenges in designing deep learning artificial intelligence (AI) and integrating it with traditional high-performance computing (HPC) simulations. We evaluate existing packages for their ability to…

Machine Learning · Computer Science 2020-11-24 Rick Archibald , Edmond Chow , Eduardo D'Azevedo , Jack Dongarra , Markus Eisenbach , Rocco Febbo , Florent Lopez , Daniel Nichols , Stanimire Tomov , Kwai Wong , Junqi Yin

Improving the Performance of the GMRES Method using Mixed-Precision Techniques

The GMRES method is used to solve sparse, non-symmetric systems of linear equations arising from many scientific applications. The solver performance within a single node is memory bound, due to the low arithmetic intensity of its…

Numerical Analysis · Mathematics 2020-11-04 Neil Lindquist , Piotr Luszczek , Jack Dongarra

A Survey of Numerical Methods Utilizing Mixed Precision Arithmetic

Within the past years, hardware vendors have started designing low precision special function units in response to the demand of the Machine Learning community and their demand for high compute power in low precision formats. Also the…

Mathematical Software · Computer Science 2020-07-15 Ahmad Abdelfattah , Hartwig Anzt , Erik G. Boman , Erin Carson , Terry Cojean , Jack Dongarra , Mark Gates , Thomas Grützmacher , Nicholas J. Higham , Sherry Li , Neil Lindquist , Yang Liu , Jennifer Loe , Piotr Luszczek , Pratik Nayak , Sri Pranesh , Siva Rajamanickam , Tobias Ribizel , Barry Smith , Kasia Swirydowicz , Stephen Thomas , Stanimire Tomov , Yaohung M. Tsai , Ichitaro Yamazaki , Ulrike Meier Yang

Bidiagonalization with Parallel Tiled Algorithms

We consider algorithms for going from a "full" matrix to a condensed "band bidiagonal" form using orthogonal transformations. We use the framework of "algorithms by tiles". Within this framework, we study: (i) the tiled bidiagonalization…

Mathematical Software · Computer Science 2016-11-23 Mathieu Faverge , Julien Langou , Yves Robert , Jack Dongarra

QR Factorization of Tall and Skinny Matrices in a Grid Computing Environment

Previous studies have reported that common dense linear algebra operations do not achieve speed up by using multiple geographical sites of a computational grid. Because such operations are the building blocks of most scientific…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-11-15 Emmanuel Agullo , Camille Coti , Jack Dongarra , Thomas Herault , Julien Langou

Standards for Graph Algorithm Primitives

It is our view that the state of the art in constructing a large collection of graph algorithms in terms of linear algebraic operations is mature enough to support the emergence of a standard set of primitive building blocks. This paper is…

Mathematical Software · Computer Science 2015-05-26 Tim Mattson , David Bader , Jon Berry , Aydin Buluc , Jack Dongarra , Christos Faloutsos , John Feo , John Gilbert , Joseph Gonzalez , Bruce Hendrickson , Jeremy Kepner , Charles Leiserson , Andrew Lumsdaine , David Padua , Stephen Poole , Steve Reinhardt , Mike Stonebraker , Steve Wallach , Andrew Yoo

Accelerating Scientific Computations with Mixed Precision Algorithms

On modern architectures, the performance of 32-bit operations is often at least twice as fast as the performance of 64-bit operations. By using a combination of 32-bit and 64-bit floating point arithmetic, the performance of many dense and…

Mathematical Software · Computer Science 2015-05-13 Marc Baboulin , Alfredo Buttari , Jack Dongarra , Jakub Kurzak , Julie Langou , Julien Langou , Piotr Luszczek , Stanimire Tomov

Designing LU-QR hybrid solvers for performance and stability

This paper introduces hybrid LU-QR algorithms for solving dense linear systems of the form Ax = b. Throughout a matrix factorization, these algorithms dynamically alternate LU with local pivoting and QR elimination steps, based upon…

Numerical Analysis · Mathematics 2014-01-23 Mathieu Faverge , Julien Herrmann , Julien Langou , Bradley Lowery , Yves Robert , Jack Dongarra

Optimal Checkpointing Period: Time vs. Energy

This short paper deals with parallel scientific applications using non-blocking and periodic coordinated checkpointing to enforce resilience. We provide a model and detailed formulas for total execution time and consumed energy. We…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-11-01 Guillaume Aupy , Anne Benoit , Thomas Hérault , Yves Robert , Jack Dongarra

Hierarchical QR factorization algorithms for multi-core cluster systems

This paper describes a new QR factorization algorithm which is especially designed for massively parallel platforms combining parallel distributed multi-core nodes. These platforms make the present and the foreseeable future of…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-08-27 Jack Dongarra , Mathieu Faverge , Thomas Herault , Julien Langou , Yves Robert