English
Related papers

Related papers: An efficient hybrid tridiagonal divide-and-conquer…

200 papers

In this paper, a parallel structured divide-and-conquer (PSDC) eigensolver is proposed for symmetric tridiagonal matrices based on ScaLAPACK and a parallel structured matrix multiplication algorithm, called PSMMA. Computing the eigenvectors…

Mathematical Software · Computer Science 2020-12-24 Xia Liao , Shengguo Li , Yutong Lu , Jose E. Roman

In this paper, two accelerated divide-and-conquer algorithms are proposed for the symmetric tridiagonal eigenvalue problem, which cost $O(N^2r)$ {flops} in the worst case, where $N$ is the dimension of the matrix and $r$ is a modest number…

Numerical Analysis · Computer Science 2015-10-16 Shengguo Li , Xiangke Liao , Jie Liu , Hao Jiang

We present a distributed-memory library for computations with dense structured matrices. A matrix is considered structured if its off-diagonal blocks can be approximated by a rank-deficient matrix with low numerical rank. Here, we use…

Mathematical Software · Computer Science 2015-06-29 François-Henry Rouet , Xiaoye S. Li , Pieter Ghysels , Artem Napov

For dense Hermitian matrices with small off-diagonal (numerical) ranks and in a hierarchically semiseparable form, we give a stable divide-and-conquer eigendecomposition method with nearly linear complexity (called SuperDC) that…

Numerical Analysis · Mathematics 2021-08-10 Xiaofeng Ou , Jianlin Xia

Matrix Factorization (MF) has been widely applied in machine learning and data mining. A large number of algorithms have been studied to factorize matrices. Among them, stochastic gradient descent (SGD) is a commonly used method.…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-06-30 Yuanhang Yu , Dong Wen , Ying Zhang , Xiaoyang Wang , Wenjie Zhang , Xuemin Lin

We present a sparse linear system solver that is based on a multifrontal variant of Gaussian elimination, and exploits low-rank approximation of the resulting dense frontal matrices. We use hierarchically semiseparable (HSS) matrices, which…

Mathematical Software · Computer Science 2015-02-27 Pieter Ghysels , Xiaoye S. Li , Francois-Henry Rouet , Samuel Williams , Artem Napov

We provide a flexible, open-source framework for hardware acceleration, namely massively-parallel execution on general-purpose graphics processing units (GPUs), applied to the hierarchical Poincar\'e--Steklov (HPS) family of algorithms for…

Numerical Analysis · Mathematics 2025-11-17 Owen Melia , Daniel Fortunato , Jeremy Hoskins , Rebecca Willett

Divide-and-conquer-based (DC-based) evolutionary algorithms (EAs) have achieved notable success in dealing with large-scale optimization problems (LSOPs). However, the appealing performance of this type of algorithms generally requires a…

Neural and Evolutionary Computing · Computer Science 2020-04-07 Zhigang Ren , Yongsheng Liang , Muyi Wang , Yang Yang , An Chen

In this paper we consider the problem of identifying intersections between two sets of d-dimensional axis-parallel rectangles. This is a common problem that arises in many agent-based simulation studies, and is of central importance in the…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-08-08 Moreno Marzolla , Gabriele D'Angelo

As programmers turn to software-defined hardware (SDH) to maintain a high level of productivity while programming hardware to run complex algorithms, heavy-lifting must be done by the compiler to automatically partition on-chip arrays. In…

Hardware Architecture · Computer Science 2022-03-31 Matthew Feldman , Tian Zhao , Kunle Olukotun

The factorization of skew-symmetric matrices is a critically understudied area of dense linear algebra, particularly in comparison to that of general and symmetric matrices. While some algorithms can be adapted from the symmetric case, the…

Mathematical Software · Computer Science 2026-05-06 Ishna Satyarth , Chao Yin , Devin A. Matthews , Maggie Myers , Robert van de Geijn , RuQing G. Xu

Heterogeneous MPSoCs comprise diverse processing units of varying compute capabilities. To date, the mapping strategies of neural networks (NNs) onto such systems are yet to exploit the full potential of processing parallelism, made…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-02-28 Halima Bouzidi , Mohanad Odema , Hamza Ouarnoughi , Smail Niar , Mohammad Abdullah Al Faruque

Divide and Conquer (DC) is conceptually well suited to high-dimensional optimization by decomposing a problem into multiple small-scale sub-problems. However, appealing performance can be seldom observed when the sub-problems are…

Artificial Intelligence · Computer Science 2018-07-12 Peng Yang , Ke Tang , Xin Yao

The implementation difficulties of combining distribution matching (DM) and dematching (invDM) for probabilistic shaping (PS) with soft-decision forward error correction (FEC) coding can be relaxed by reverse concatenation, for which the…

Signal Processing · Electrical Eng. & Systems 2024-01-25 Tsuyoshi Yoshida , Magnus Karlsson , Erik Agrell

We consider the design of efficient algorithms for a multicore computing environment with a global shared memory and p cores, each having a cache of size M, and with data organized in blocks of size B. We characterize the class of…

Distributed, Parallel, and Cluster Computing · Computer Science 2011-03-22 Richard Cole , Vijaya Ramachandran

Various numerical methods used for solving partial differential equations (PDE) result in tridiagonal systems. Solving tridiagonal systems on distributed-memory environments is not straightforward, and often requires significant amount of…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-07-22 Semih Akkurt , Sébastien Lemaire , Paul Bartholomew , Sylvain Laizet

We present a fast and memory-efficient algorithm for transient, space-time-domain, and elastodynamic boundary-integral analysis. Associated data-sparse approximations and operations are named fast domain partitioning hierarchical matrices…

Computational Physics · Physics 2026-03-20 Dye SK Sato , Ryosuke Ando

The problem of identifying intersections between two sets of d-dimensional axis-parallel rectangles appears frequently in the context of agent-based simulation studies. For this reason, the High Level Architecture (HLA) specification -- a…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-02-28 Moreno Marzolla , Gabriele D'Angelo

This article describes a geometric partitioning software that can be used for quick computation of data partitions on many-core HPC machines. It is most suited for dynamic applications with load distributions that vary with time.…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-08-19 Aparna Sasidharan

Structured dense matrices result from boundary integral problems in electrostatics and geostatistics, and also Schur complements in sparse preconditioners such as multi-frontal methods. Exploiting the structure of such matrices can reduce…

Numerical Analysis · Mathematics 2023-11-03 Sameer Deshmukh , Qinxiang Ma , Rio Yokota , George Bosilca
‹ Prev 1 2 3 10 Next ›