Related papers

Related papers: Accelerated Multiple Precision Direct Method and M…

200 papers

In this paper, we report the results obtained from the acceleration of multi-binary64-type multiple precision matrix multiplication with AVX2. We target double-double (DD), triple-double (TD), and quad-double (QD) precision arithmetic…

Numerical Analysis · Mathematics 2021-09-14 Tomonori Kouya

Numerical codes that require arbitrary precision floating point (APFP) numbers for their core computation are dominated by elementary arithmetic operations due to the super-linear complexity of multiplication in the number of mantissa bits.…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-04-14 Johannes de Fine Licht, Christopher A. Pattison, Alexandros Nikolaos Ziogas, David Simmons-Duffin, Torsten Hoefler

Low precision arithmetic, in particular half precision floating point arithmetic, is now available in commercial hardware. Using lower precision can offer significant savings in computation and communication costs with proportional savings…

Numerical Analysis · Mathematics 2021-11-16 Eda Oktay, Erin Carson

The direct method is one of the most important algorithms for solving linear systems of equations, with LU decomposition comprising a significant portion of its computation time. This study explores strategies to accelerate complex LU…

Numerical Analysis · Mathematics 2024-08-21 Tomonori Kouya

Additive Runge-Kutta methods designed for preserving highly accurate solutions in mixed-precision computation were proposed and analyzed in [8]. These specially designed methods use reduced precision for the implicit computations and full…

Numerical Analysis · Mathematics 2021-07-08 Ben Burnett, Sigal Gottlieb, Zachary J. Grant, Alfa Heryudono

We present algorithms for real and complex dot product and matrix multiplication in arbitrary-precision floating-point and ball arithmetic. A low-overhead dot product is implemented on the level of GMP limb arrays; it is about twice as fast…

Mathematical Software · Computer Science 2024-12-20 Fredrik Johansson

MP is a package of ANSI Standard Fortran (ANS X3.9-1966) subroutines for performing multiple-precision floating-point arithmetic and evaluating elementary and special functions. The subroutines are machine independent and the precision is…

Mathematical Software · Computer Science 2010-04-21 Richard P. Brent

The machine learning explosion has created a prominent trend in modern computer hardware towards low precision floating-point operations. In response, there have been growing efforts to use low and mixed precision in general scientific…

Numerical Analysis · Mathematics 2024-03-19 Cody J. Balos, Steven Roberts, David J. Gardner

Integer programming (IP) is an important and challenging problem. Approximate methods have shown promising performance on both effectiveness and efficiency for solving the IP problem. However, we observed that a large fraction of variables…

Discrete Mathematics · Computer Science 2022-07-06 Longkang Li, Baoyuan Wu

We describe a new implementation of the elementary transcendental functions exp, sin, cos, log and atan for variable precision up to approximately 4096 bits. Compared to the MPFR library, we achieve a maximum speedup ranging from a factor 3…

Mathematical Software · Computer Science 2015-06-10 Fredrik Johansson

We study the multiple-precision addition of two positive floating-point numbers in base 2, with exact rounding, as specified in the MPFR library, i.e. where each number has its own precision. We show how the best possible complexity (up to…

Data Structures and Algorithms · Computer Science 2016-08-16 Vincent Lefèvre

This article studies a combination of the two state-of-the-art algorithms for the exact solution of linear programs (LPs) over the rational numbers, i.e., without any roundoff errors or numerical tolerances. By integrating the method of…

Optimization and Control · Mathematics 2023-11-15 Leon Eifler, Jules Nicolas-Thouvenin, Ambros Gleixner

In this paper, we propose a mixed-precision convolution unit architecture which supports different integer and floating point (FP) precisions. The proposed architecture is based on low-bit inner product units and realizes higher precision…

Hardware Architecture · Computer Science 2021-01-29 Hamzah Abdel-Aziz, Ali Shafiee, Jong Hoon Shin, Ardavan Pedram, Joseph H. Hassoun

Mixed precision training (MPT) is becoming a practical technique to improve the speed and energy efficiency of training deep neural networks by leveraging the fast hardware support for IEEE half-precision floating point that is available in…

Machine Learning · Computer Science 2019-10-29 Ruizhe Zhao, Brian Vogel, Tanvir Ahmed

Traditional optimization methods rely on the use of single-precision floating point arithmetic, which can be costly in terms of memory size and computing power. However, mixed precision optimization techniques leverage the use of both…

Machine Learning · Computer Science 2023-09-25 Basile Lewandowski, Atli Kosson

Graph analytics techniques based on spectral methods process extremely large sparse matrices with millions or even billions of non-zero values. Behind these algorithms lies the Top-K sparse eigenproblem, the computation of the largest…

Hardware Architecture · Computer Science 2022-01-20 Francesco Sgherzi, Alberto Parravicini, Marco Domenico Santambrogio

Training models for Natural Language Processing (NLP) requires substantial computational resources and time, posing significant challenges, especially for NLP development in Bangla, where access to high-end hardware is often limited. In…

Computation and Language · Computer Science 2025-12-02 Md Mehrab Hossain Opi, Sumaiya Khan, Moshammad Farzana Rahman

The rapid updates in error-resilient applications along with their quest for high throughput have motivated designing fast approximate functional units for Field-Programmable Gate Arrays (FPGAs). Studies that proposed imprecise functional…

Hardware Architecture · Computer Science 2022-06-29 Zahra Ebrahimi, Muhammad Zaid, Mark Wijtvliet, Akash Kumar

Although reliable long precision floating-point arithmetic libraries such as QD and MPFR/GMP are necessary to solve ill-conditioned problems in numerical simulation, long precision BLAS-level computation such as matrix multiplication has…

Mathematical Software · Computer Science 2017-10-06 Tomonori Kouya

We evaluate the performance of the Krylov subspace method by using highly efficient multiple precision sparse matrix-vector multiplication (SpMV). BNCpack is our multiple precision numerical computation library based on MPFR/GMP, which is…

Numerical Analysis · Mathematics 2014-11-11 Tomonori Kouya