Related papers: Modular Multiplication without Carry Propagation (…

Matrix Multiplication Using Only Addition

Matrix multiplication consumes a large fraction of the time taken in many machine-learning algorithms. Thus, accelerator chips that perform matrix multiplication faster than conventional processors or even GPU's are of increasing interest.…

Data Structures and Algorithms · Computer Science 2023-07-06 Daniel Cussen , Jeffrey D. Ullman

Compressed Modular Matrix Multiplication

We propose to store several integers modulo a small prime into a single machine word. Modular addition is performed by addition and possibly subtraction of a word containing several times the modulo. Modular Multiplication is not directly…

Symbolic Computation · Computer Science 2008-12-18 Jean-Guillaume Dumas , Laurent Fousse , Bruno Salvy

Operand Folding Hardware Multipliers

This paper describes a new accumulate-and-add multiplication algorithm. The method partitions one of the operands and re-combines the results of computations done with each of the partitions. The resulting design turns-out to be both…

Mathematical Software · Computer Science 2011-04-11 Byungchun Chung , Sandra Marcello , Amir-Pasha Mirbaha , David Naccache , Karim Sabeg

New modular multiplication and division algorithms based on continued fraction expansion

In this paper, we apply results on number systems based on continued fraction expansions to modular arithmetic. We provide two new algorithms in order to compute modular multiplication and modular division. The presented algorithms are…

Data Structures and Algorithms · Computer Science 2013-03-15 Mourad Gouicem

Carrying over algorithm in transformers

Addition is perhaps one of the simplest arithmetic tasks one can think of and is usually performed using the carrying over algorithm. This algorithm consists of two tasks: adding digits in the same position and carrying over a one whenever…

Machine Learning · Computer Science 2024-01-18 Jorrit Kruthoff

New Residue Arithmetic Based Barrett Algorithms, Part II: Modular Polynomial Computations

In this paper, we derive a new computational algorithm for Barrett technique for modular polynomial multiplication, termed BA-P. BA-P is then applied to a new residue arithmetic based Barrett algorithm for modular polynomial multiplication…

Number Theory · Mathematics 2016-02-05 Hari K Garg , Hanshen Xiao

Carries and the arithmetic progression structure of sets

If we want to represent integers in base $m$, we need a set $A$ of digits, which needs to be a complete set of residues modulo $m$. When adding two integers with last digits $a_1, a_2 \in A$, we find the unique $a \in A$ such that $a_1 +…

Number Theory · Mathematics 2015-07-01 Francesco Monopoli , Imre Z. Ruzsa

Multiplierless and Sparse Machine Learning based on Margin Propagation Networks

The new generation of machine learning processors have evolved from multi-core and parallel architectures that were designed to efficiently implement matrix-vector-multiplications (MVMs). This is because at the fundamental level, neural…

Machine Learning · Computer Science 2020-11-06 Nazreen P. M. , Shantanu Chakrabartty , Chetan Singh Thakur

Factoring Odd Integers without Multiplication and Division

A method of determining two factors of an odd integer without need of multiplication or division operation in iterative portion of computation is presented. It is feasible for an implementing algorithm to use only integer addition and…

Discrete Mathematics · Computer Science 2017-03-02 Charles Sauerbier

An Efficient Algorithm for Modulus Operation and Its Hardware Implementation in Prime Number Calculation

This paper presents a novel algorithm for the modulus operation for FPGA implementation. The proposed algorithm use only addition, subtraction, logical, and bit shift operations, avoiding the complexities and hardware costs associated with…

Cryptography and Security · Computer Science 2025-01-10 W. A. Susantha Wijesinghe

Memoryless computation: new results, constructions, and extensions

In this paper, we are interested in memoryless computation, a modern paradigm to compute functions which generalises the famous XOR swap algorithm to exchange the contents of two variables without using a buffer. This uses a combinatorial…

Computational Complexity · Computer Science 2015-03-19 Maximilien Gadouleau , Soren Riis

A non-commutative algorithm for multiplying (7 $\times$ 7) matrices using 250 multiplications

We present a non-commutative algorithm for multiplying (7x7) matrices using 250 multiplications and a non-commutative algorithm for multiplying (9x9) matrices using 520 multiplications. These algorithms are obtained using the same…

Symbolic Computation · Computer Science 2017-12-22 Alexandre Sedoglavic

Learning Multiplication-free Linear Transformations

In this paper, we propose several dictionary learning algorithms for sparse representations that also impose specific structures on the learned dictionaries such that they are numerically efficient to use: reduced number of…

Machine Learning · Computer Science 2020-12-08 Cristian Rusu

Compensating Interpolation Distortion by Using New Optimized Modular Method

A modular method was suggested before to recover a band limited signal from the sample and hold and linearly interpolated (or, in general, an nth-order-hold) version of the regular samples. In this paper a novel approach for compensating…

Computer Vision and Pattern Recognition · Computer Science 2012-05-15 Mohammad Tofighi , Ali Ayremlou , Farokh Marvasti

A new lightweight additive homomorphic encryption algorithm

This article describes a lightweight additive homomorphic algorithm with the same encryption and decryption keys. Compared to standard additive homomorphic algorithms like Paillier, this algorithm reduces the computational cost of…

Cryptography and Security · Computer Science 2024-04-03 Wuqiong Pan , Hongliang Gu

Insight into Primal Augmented Lagrangian Multilplier Method

We provide a simplified form of Primal Augmented Lagrange Multiplier algorithm. We intend to fill the gap in the steps involved in the mathematical derivations of the algorithm so that an insight into the algorithm is made. The experiment…

Numerical Analysis · Computer Science 2014-01-02 B. Premjith , S. Sachin Kumar , Akhil Manikkoth , T V Bijeesh , K P Soman

Multipliers: comparison of Fourier transformation based method and Synopsys design technique for up to 32 bits inputs in regular and saturation arithmetics

The technique for hardware multiplication based upon Fourier transformation has been introduced. The technique has the highest efficiency on multiplication units with up to 8 bit range. Each multiplication unit is realized on base of the…

Hardware Architecture · Computer Science 2016-11-17 Danila Gorodecky

Multiplying Matrices Without Multiplying

Multiplying matrices is among the most fundamental and compute-intensive operations in machine learning. Consequently, there has been significant work on efficiently approximating matrix multiplies. We introduce a learning-based algorithm…

Machine Learning · Computer Science 2021-08-17 Davis Blalock , John Guttag

Multiplication of polynomials over the binary field

Additive Fourier Transform is sdudied. A fast multiplication algorithm for polynomials over the binary field is given. The bit complexity of the algorithm is $O(n(log n)(\log\log n)^2)$.

Number Theory · Mathematics 2025-05-15 Chunlei Liu

Fast Integer Multiplication using Modular Arithmetic

We give an $O(N\cdot \log N\cdot 2^{O(\log^*N)})$ algorithm for multiplying two $N$-bit integers that improves the $O(N\cdot \log N\cdot \log\log N)$ algorithm by Sch\"{o}nhage-Strassen. Both these algorithms use modular arithmetic.…

Symbolic Computation · Computer Science 2008-09-19 Anindya De , Piyush P Kurur , Chandan Saha , Ramprasad Saptharishi