Related papers: On the Mesh Array for Matrix Multiplication

Efficiency of Matrix Multiplication on the Cross-Wired Mesh Array

This note looks at the efficiency of the cross-wired mesh array in the context of matrix multiplication. It is shown that in case of repeated operations, the average number of steps to multiply sets of nxn matrices on a 2D cross-wired mesh…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-11-13 Subhash Kak

Fine-Grained Computation in 3-Space: Matrix Multiplication and Graph Problems

Obeying constraints imposed by classical physics, we give optimal fine-grained algorithms for matrix multiplication and problems involving graphs and mazes, where all calculations are done in 3-dimensional space. We assume that whatever the…

Data Structures and Algorithms · Computer Science 2024-12-20 Quentin F. Stout

New Results on Scrambling Using the Mesh Array

This paper presents new results on randomization using Kak's Mesh Array for matrix multiplication. These results include the periods of the longest cycles when the the array is used for scrambling and the autocorrelation function of the…

Cryptography and Security · Computer Science 2011-05-18 Sandhya Rangineni

Multiplying matrices using n arithmetic operations

It is widely known that the lower bound for the algorithmic complexity of square matrix multiplication resorts to at least $n^2$ arithmetic operations. The justification builds upon the following reasoning: given that there are $2 n^2$…

Data Structures and Algorithms · Computer Science 2023-11-13 Hugo Daniel Macedo

A Framework for Practical Parallel Fast Matrix Multiplication

Matrix multiplication is a fundamental computation in many scientific disciplines. In this paper, we show that novel fast matrix multiplication algorithms can significantly outperform vendor implementations of the classical algorithm and…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-01-08 Austin R. Benson , Grey Ballard

Matrix Multiplication Using Only Addition

Matrix multiplication consumes a large fraction of the time taken in many machine-learning algorithms. Thus, accelerator chips that perform matrix multiplication faster than conventional processors or even GPU's are of increasing interest.…

Data Structures and Algorithms · Computer Science 2023-07-06 Daniel Cussen , Jeffrey D. Ullman

Fair and Square: Replacing One Real Multiplication with a Single Square and One Complex Multiplication with Three Squares When Performing Matrix Multiplication and Convolutions

This paper shows that, for matrix multiplications and convolutions, it is possible to asymptotically replace each real multiplication with a single squaring operation. Similarly, a single complex multiplication can be replaced with 3…

Hardware Architecture · Computer Science 2026-03-11 Vincenzo Liguori

Towards a Multi-array Architecture for Accelerating Large-scale Matrix Multiplication on FPGAs

Large-scale floating-point matrix multiplication is a fundamental kernel in many scientific and engineering applications. Most existing work only focus on accelerating matrix multiplication on FPGA by adopting a linear systolic array. This…

Hardware Architecture · Computer Science 2018-03-13 Junzhong Shen , Yuran Qiao , You Huang , Mei Wen , Chunyuan Zhang

High Performance Matrix Multiplication

Matrix multiplication is the foundation from much of the success from high performance technologies like deep learning, scientific simulations, and video graphics. High level programming languages like Python and R rely on highly optimized…

Performance · Computer Science 2025-09-08 Ethan Davis

Hyper-Systolic Matrix Multiplication

A novel parallel algorithm for matrix multiplication is presented. The hyper-systolic algorithm makes use of a one-dimensional processor abstraction. The procedure can be implemented on all types of parallel systems. It can handle…

Mathematical Software · Computer Science 2007-05-23 Thomas Lippert , Nikolay Petkov , Paolo Palazzari , Klaus Schilling

Multiplying Matrices Without Multiplying

Multiplying matrices is among the most fundamental and compute-intensive operations in machine learning. Consequently, there has been significant work on efficiently approximating matrix multiplies. We introduce a learning-based algorithm…

Machine Learning · Computer Science 2021-08-17 Davis Blalock , John Guttag

Strassen Multisystolic Array Hardware Architectures

While Strassen's matrix multiplication algorithm reduces the complexity of naive matrix multiplication, general-purpose hardware is not suitable for achieving the algorithm's promised theoretical speedups. This leaves the question of if it…

Hardware Architecture · Computer Science 2025-02-17 Trevor E. Pogue , Nicola Nicolici

Fast Matrix Multiplication with Sketching

We present an approximate algorithm for matrix multiplication based on matrix sketching techniques. First one of the matrix is chosen and sparsified using the online matrix sketching algorithm, and then the matrix product is calculated…

Numerical Analysis · Computer Science 2014-06-12 Huan Wang , Christos Boutsidis , Edo Liberty , Daniel Hsu

Flip Graphs with Symmetry and New Matrix Multiplication Schemes

The flip graph algorithm is a method for discovering new matrix multiplication schemes by following random walks on a graph. We introduce a version of the flip graph algorithm for matrix multiplication schemes that admit certain symmetries.…

Symbolic Computation · Computer Science 2025-02-10 Jakob Moosbauer , Michael Poole

New ways to multiply 3 x 3-matrices

It is known since the 1970s that no more than 23 multiplications are required for computing the product of two 3 x 3-matrices. It is not known whether this can also be done with fewer multiplications. However, there are several mutually…

Symbolic Computation · Computer Science 2019-05-27 Marijn J. H. Heule , Manuel Kauers , Martina Seidl

Photonic systolic array for all-optical matrix-matrix multiplication

Systolic arrays have proven to be highly efficient for parallelized matrix-matrix multiplication (MMM), utilizing synchronized, heartbeat-like data flows across an array of processing elements. While optical structures such as waveguide…

Optics · Physics 2025-11-27 Jungmin Kim , Qingyi Zhou , Zongfu Yu

The NumPy array: a structure for efficient numerical computation

In the Python world, NumPy arrays are the standard representation for numerical data. Here, we show how these arrays enable efficient implementation of numerical computations in a high-level language. Overall, three techniques are applied…

Mathematical Software · Computer Science 2011-03-14 Stefan Van Der Walt , S. Chris Colbert , Gaël Varoquaux

A Normal Form for Matrix Multiplication Schemes

Schemes for exact multiplication of small matrices have a large symmetry group. This group defines an equivalence relation on the set of multiplication schemes. There are algorithms to decide whether two schemes are equivalent. However, for…

Computational Complexity · Computer Science 2022-06-02 Manuel Kauers , Jakob Moosbauer

Fast Matrix Multiplication Without Tears: A Constraint Programming Approach

It is known that the multiplication of an $N \times M$ matrix with an $M \times P$ matrix can be performed using fewer multiplications than what the naive $NMP$ approach suggests. The most famous instance of this is Strassen's algorithm for…

Artificial Intelligence · Computer Science 2023-07-18 Arnaud Deza , Chang Liu , Pashootan Vaezipoor , Elias B. Khalil

On the best approximation of the hierarchical matrix product

The multiplication of matrices is an important arithmetic operation in computational mathematics. In the context of hierarchical matrices, this operation can be realized by the multiplication of structured block-wise low-rank matrices,…

Numerical Analysis · Mathematics 2018-05-24 Jürgen Dölz , Helmut Harbrecht , Michael D. Multerer