Related papers: FLInt: Exploiting Floating Point Enabled Integer A…

Reducing numerical precision preserves classification accuracy in Mondrian Forests

Mondrian Forests are a powerful data stream classification method, but their large memory footprint makes them ill-suited for low-resource platforms such as connected objects. We explored using reduced-precision floating-point…

Machine Learning · Computer Science 2021-06-29 Marc Vicuna , Martin Khannouz , Gregory Kiar , Yohan Chatelain , Tristan Glatard

FLiMS: a Fast Lightweight 2-way Merger for Sorting

In this paper, we present FLiMS, a highly-efficient and simple parallel algorithm for merging two sorted lists residing in banked and/or wide memory. On FPGAs, its implementation uses fewer hardware resources than the state-of-the-art…

Hardware Architecture · Computer Science 2022-03-08 Philippos Papaphilippou , Wayne Luk , Chris Brooks

NEAT: A Framework for Automated Exploration of Floating Point Approximations

Much recent research is devoted to exploring tradeoffs between computational accuracy and energy efficiency at different levels of the system stack. Approximation at the floating point unit (FPU) allows saving energy by simply reducing the…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-02-18 Saeid Barati , Lee Ehudin , Hank Hoffmann

FPIRM: Floating-point Processing in Racetrack Memories

Convolutional neural networks (CNN) have become a ubiquitous algorithm with growing applications in mobile and edge settings. We describe a compute-in-memory (CIM) technique called FPIRM using Racetrack Memory (RM) to accelerate CNNs for…

Emerging Technologies · Computer Science 2022-08-02 Sébastien Ollivier , Xinyi Zhang , Yue Tang , Chayanika Choudhuri , Jingtong Hu , Alex K. Jones

Efficient sequential and parallel algorithms for multistage stochastic integer programming using proximity

We consider the problem of solving integer programs of the form $\min \{\,c^\intercal x\ \colon\ Ax=b, x\geq 0\}$, where $A$ is a multistage stochastic matrix in the following sense: the primal treedepth of $A$ is bounded by a parameter…

Data Structures and Algorithms · Computer Science 2020-12-23 Jana Cslovjecsek , Friedrich Eisenbrand , Michał Pilipczuk , Moritz Venzin , Robert Weismantel

Searching input values hitting suspicious Intervals in programs with floating-point operations

Programs with floating-point computations are often derived from mathematical models or designed with the semantics of the real numbers in mind. However, for a given input, the computed path with floating-point numbers may differ from the…

Programming Languages · Computer Science 2016-08-08 Hélène Collavizza , Claude Michel , Michel Rueher

FLINT: Learning-based Flow Estimation and Temporal Interpolation for Scientific Ensemble Visualization

We present FLINT (learning-based FLow estimation and temporal INTerpolation), a novel deep learning-based approach to estimate flow fields for 2D+time and 3D+time scientific ensemble data. FLINT can flexibly handle different types of…

Computer Vision and Pattern Recognition · Computer Science 2025-02-24 Hamid Gadirov , Jos B. T. M. Roerdink , Steffen Frey

Rethinking Floating Point Overheads for Mixed Precision DNN Accelerators

In this paper, we propose a mixed-precision convolution unit architecture which supports different integer and floating point (FP) precisions. The proposed architecture is based on low-bit inner product units and realizes higher precision…

Hardware Architecture · Computer Science 2021-01-29 Hamzah Abdel-Aziz , Ali Shafiee , Jong Hoon Shin , Ardavan Pedram , Joseph H. Hassoun

Efficient, arbitrarily high precision hardware logarithmic arithmetic for linear algebra

The logarithmic number system (LNS) is arguably not broadly used due to exponential circuit overheads for summation tables relative to arithmetic precision. Methods to reduce this overhead have been proposed, yet still yield designs with…

Numerical Analysis · Mathematics 2020-05-15 Jeff Johnson

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Modern AI hardware, such as Nvidia's Blackwell architecture, is increasingly embracing low-precision floating-point (FP) formats to handle the pervasive activation outliers in Large Language Models (LLMs). Despite this industry trend, a…

Machine Learning · Computer Science 2025-10-30 Mengzhao Chen , Meng Wu , Hui Jin , Zhihang Yuan , Jing Liu , Chaoyi Zhang , Yunshui Li , Jie Huang , Jin Ma , Zeyue Xue , Zhiheng Liu , Xingyan Bin , Ping Luo

Green Accelerated Hoeffding Tree

State-of-the-art machine learning solutions mainly focus on creating highly accurate models without constraints on hardware resources. Stream mining algorithms are designed to run on resource-constrained devices, thus a focus on low power…

Machine Learning · Computer Science 2022-05-09 Eva Garcia-Martin , Albert Bifet , Niklas Lavesson , Rikard König , Henrik Linusson

Deep Neural Network inference with reduced word length

Deep neural networks (DNN) are powerful models for many pattern recognition tasks, yet their high computational complexity and memory requirement limit them to applications on high-performance computing platforms. In this paper, we propose…

Machine Learning · Computer Science 2018-10-24 Lukas Mauch , Bin Yang

Finding Root Causes of Floating Point Error with Herbgrind

Floating-point arithmetic plays a central role in science, engineering, and finance by enabling developers to approximate real arithmetic. To address numerical issues in large floating-point applications, developers must identify root…

Programming Languages · Computer Science 2018-07-02 Alex Sanchez-Stern , Pavel Panchekha , Sorin Lerner , Zachary Tatlock

Improving the Accuracy-Memory Trade-Off of Random Forests Via Leaf-Refinement

Random Forests (RF) are among the state-of-the-art in many machine learning applications. With the ongoing integration of ML models into everyday life, the deployment and continuous application of models becomes more and more an important…

Machine Learning · Computer Science 2021-10-20 Sebastian Buschjäger , Katharina Morik

Addressing Resiliency of In-Memory Floating Point Computation

In-memory computing (IMC) can eliminate the data movement between processor and memory which is a barrier to the energy-efficiency and performance in Von-Neumann computing. Resistive RAM (RRAM) is one of the promising devices for IMC…

Hardware Architecture · Computer Science 2020-11-03 Sina Sayyah Ensan , Swaroop Ghosh , Seyedhamidreza Motaman , Derek Weast

Medoid splits for efficient random forests in metric spaces

This paper revisits an adaptation of the random forest algorithm for Fr\'echet regression, addressing the challenge of regression in the context of random objects in metric spaces. Recognizing the limitations of previous approaches, we…

Methodology · Statistics 2023-06-30 Matthieu Bulté , Helle Sørensen

MABSplit: Faster Forest Training Using Multi-Armed Bandits

Random forests are some of the most widely used machine learning models today, especially in domains that necessitate interpretability. We present an algorithm that accelerates the training of random forests and other popular tree-based…

Machine Learning · Computer Science 2022-12-16 Mo Tiwari , Ryan Kang , Je-Yong Lee , Sebastian Thrun , Chris Piech , Ilan Shomorony , Martin Jinye Zhang

Floating-floating point: a highly accurate number representation with flexible Counting ranges

Efficient number representation is essential for federated learning, natural language processing, and network measurement solutions. Due to timing, area, and power constraints, such applications use narrow bit-width (e.g., 8-bit) number…

Networking and Internet Architecture · Computer Science 2024-10-08 Itamar Cohen , Gil Einziger

Efficient and Mathematically Robust Operations for Certified Neural Networks Inference

In recent years, machine learning (ML) and neural networks (NNs) have gained widespread use and attention across various domains, particularly in transportation for achieving autonomy, including the emergence of flying taxis for urban air…

Machine Learning · Computer Science 2024-01-17 Fabien Geyer , Johannes Freitag , Tobias Schulz , Sascha Uhrig

Fast Inference of Tree Ensembles on ARM Devices

With the ongoing integration of Machine Learning models into everyday life, e.g. in the form of the Internet of Things (IoT), the evaluation of learned models becomes more and more an important issue. Tree ensembles are one of the best…

Machine Learning · Computer Science 2023-05-16 Simon Koschel , Sebastian Buschjäger , Claudio Lucchese , Katharina Morik