Related papers: An Improved GEF Fast Addition Algorithm

Fast Prefix Adders for Non-Uniform Input Arrival Times

We consider the problem of constructing fast and small parallel prefix adders for non-uniform input arrival times. This problem arises whenever the adder is embedded into a more complex circuit, e.g., a multiplier. Most previous results are…

Hardware Architecture · Computer Science 2014-11-12 Stephan Held, Sophie Spirkl

Fast Bipartitioned Hybrid Adder Utilizing Carry Select and Carry Lookahead Logic

We present a novel fast bipartitioned hybrid adder (FBHA) that utilizes carry-select and carry-lookahead logic. The proposed FBHA is an accurate adder with a significant part and a less significant part joined together by a carry signal. In…

Hardware Architecture · Computer Science 2024-12-03 Padmanabhan Balasubramanian, Douglas L. Maskell

Aggregating Funnels for Faster Fetch&Add and Queues

Many concurrent algorithms require processes to perform fetch-and-add operations on a single memory location, which can be a hot spot of contention. We present a novel algorithm called Aggregating Funnels that reduces this contention by…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-03-04 Younghun Roh, Yuanhao Wei, Eric Ruppert, Panagiota Fatourou, Siddhartha Jayanti, Julian Shun

An Approximate Carry Estimating Simultaneous Adder with Rectification

Approximate computing has in recent times found significant applications towards lowering power, area, and time requirements for arithmetic operations. Several works done in recent years have furthered approximate computing along these…

Hardware Architecture · Computer Science 2020-09-01 Rajat Bhattacharjya, Vishesh Mishra, Saurabh Singh, Kaustav Goswami, Dip Sankar Banerjee

Operand Folding Hardware Multipliers

This paper describes a new accumulate-and-add multiplication algorithm. The method partitions one of the operands and re-combines the results of computations done with each of the partitions. The resulting design turns out to be both…

Mathematical Software · Computer Science 2011-04-11 Byungchun Chung, Sandra Marcello, Amir-Pasha Mirbaha, David Naccache, Karim Sabeg

Fast Parallel Integer Adder in Binary Representation

An integer adder for integers in the binary representation is one of the basic operations of any digital processor. For adding two integers of N bits each, the serial adder takes as many clock ticks. For achieving higher speeds, parallel…

Hardware Architecture · Computer Science 2019-03-26 Duggirala Meher Krishna, Duggirala Ravi

Combined Sieve Algorithm for Prime Gaps

A new Combined Sieve algorithm is presented with cost proportional to the number of enumerated factors over a series of intervals. This algorithm achieves a significant speedup, over a traditional sieve, when handling many ([10^4, 10^7])…

Number Theory · Mathematics 2020-12-08 Seth Troisi

Accelerator-driven Data Arrangement to Minimize Transformers Run-time on Multi-core Architectures

The increasing complexity of transformer models in artificial intelligence expands their computational costs, memory usage, and energy consumption. Hardware acceleration tackles the ensuing challenges by designing processors and…

Hardware Architecture · Computer Science 2023-12-21 Alireza Amirshahi, Giovanni Ansaloni, David Atienza

Design of Reconfigurable Multi-Operand Adder for Massively Parallel Processing

The paper presents a systematic study and implementation of a reconfigurable combinatorial multi-operand adder for use in Deep Learning systems. The size of carry changes with the number of operands and hence a reliable algorithm to…

Hardware Architecture · Computer Science 2020-08-10 Shilpa Mayannavar, Uday Wali

Faster Radix Sort via Virtual Memory and Write-Combining

Sorting algorithms are the deciding factor for the performance of common operations such as removal of duplicates or database sort-merge joins. This work focuses on 32-bit integer keys, optionally paired with a 32-bit value. We present a…

Data Structures and Algorithms · Computer Science 2010-09-07 Jan Wassenberg, Peter Sanders

Efficient Additions and Montgomery Reductions of Large Integers for SIMD

This paper presents efficient algorithms, designed to leverage SIMD for performing Montgomery reductions and additions on integers larger than 512 bits. The existing algorithms encounter inefficiencies when parallelized using SIMD due to…

Cryptography and Security · Computer Science 2023-09-01 Pengchang Ren, Reiji Suda, Vorapong Suppakitpaisarn

Fast Sequential Summation Algorithms Using Augmented Data Structures

This paper provides an introduction to the design of augmented data structures that offer an efficient representation of a mathematical sequence and fast sequential summation algorithms, which guarantee both logarithmic running time and…

Data Structures and Algorithms · Computer Science 2014-04-08 Vadim Stadnik

ARC Sort: Enhanced and Time Efficient Sorting Algorithm

This paper discusses a sorting algorithm that uses the concept of buckets, where each bucket represents a certain number of digits. A two-dimensional data structure is used where one dimension represents buckets, i.e., number of digits…

Data Structures and Algorithms · Computer Science 2014-06-10 Ankit Chadha, Rishikesh Misal, Tanaya Mokashi, Aman Chadha

4-Bit High-Speed Binary Ling Adder

Binary addition is one of the most primitive and most commonly used applications in computer arithmetic. A large variety of algorithms and implementations have been proposed for binary addition. Huey Ling proposed a simpler form of CLA…

Hardware Architecture · Computer Science 2019-08-28 Projjal Gupta

Multiplying boolean Polynomials with Frobenius Partitions in Additive Fast Fourier Transform

We show a new algorithm and its implementation for multiplying bit-polynomials of large degrees. The algorithm is based on evaluating polynomials at a specific set comprising a natural set for evaluation with additive FFT and a high order…

Symbolic Computation · Computer Science 2018-04-02 Ming-Shing Chen, Chen-Mou Cheng, Po-Chun Kuo, Wen-Ding Li, Bo-Yin Yang

Fast geometric trim fitting using partial incremental sorting and accumulation

We present an algorithmic contribution to improve the efficiency of robust trim-fitting in outlier-affected geometric regression problems. The method relies heavily on the quicksort algorithm, and we present two important insights. First,…

Computer Vision and Pattern Recognition · Computer Science 2022-09-07 Min Li, Laurent Kneip

Speeding up the Grover algorithm using auxiliary solutions

It may be possible to extend the Grover search algorithm by taking a divide-and-conquer approach using auxiliary solutions to achieve an exponential speed-up.

General Physics · Physics 2009-07-13 A. Y. Shiekh

High-Speed (7,2) Compressor Using A Fast Carry-Generation Logic based on Sorting Network

Fast binary compressors are the main components of many basic digital calculation units. In this paper, a high-speed (7,2) compressor with a fast carry-generation logic is proposed. The carry-generation logic is based on the sorting…

Hardware Architecture · Computer Science 2023-09-08 Wenbo Guo

Fast Iterative Graph Computing with Updated Neighbor States

Enhancing the efficiency of iterative computation on graphs has garnered considerable attention in both industry and academia. Nonetheless, the majority of efforts focus on expediting iterative computation by minimizing the running time per…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-07-23 Yijie Zhou, Shufeng Gong, Feng Yao, Hanzhang Chen, Song Yu, Pengxi Liu, Yanfeng Zhang, Ge Yu, Jeffrey Xu Yu

Fast algorithm for generating ascending compositions

In this paper we give a fast algorithm to generate all partitions of a positive integer $n$. Integer partitions may be encoded as either ascending or descending compositions for the purposes of systematic generation. It is known that the…

Combinatorics · Mathematics 2019-03-27 Mircea Merca