Ian Colbert — Scifaro

Pushing the Limits of Block Rotations in Post-Training Quantization

Recent post-training quantization (PTQ) methods have adopted block rotations to diffuse outliers prior to rounding. While this reduces the overhead of online full-vector rotations, the effect of block structure on outlier suppression…

Machine Learning · Computer Science 2026-05-29 Sai Sanjeet, Ian Colbert, Pablo Monteagudo-Lago, Giuseppe Franco, Yaman Umuroglu, Nicholas J. Fraser

GPU Acceleration of Sparse Fully Homomorphic Encrypted DNNs

Fully homomorphic encryption (FHE) has recently attracted significant attention as both a cryptographic primitive and a systems challenge. Given the latest advances in accelerated computing, FHE presents a promising opportunity for…

Cryptography and Security · Computer Science 2026-04-14 Lara D'Agata, Carlos Agulló-Domingo, Óscar Vera-López, Kaustubh Shivdikar, Ardhi W. B. Yudha, Ferhat Yaman, David Kaeli, José L. Abellán, Ian Colbert, José Cano

Provable Post-Training Quantization: Theoretical Analysis of OPTQ and Qronos

Post-training quantization (PTQ) has become a crucial tool for reducing the memory and compute costs of modern deep neural networks, including large language models (LLMs). Among PTQ algorithms, the OPTQ framework, also known as GPTQ, has…

Machine Learning · Computer Science 2026-04-13 Haoyu Zhang, Shihao Zhang, Ian Colbert, Rayan Saab

Qronos: Correcting the Past by Shaping the Future... in Post-Training Quantization

We introduce Qronos, a new state-of-the-art post-training quantization algorithm that sequentially rounds and updates neural network weights. Qronos not only explicitly corrects errors due to both weight and activation quantization, but…

Machine Learning · Computer Science 2026-02-18 Shihao Zhang, Haoyu Zhang, Ian Colbert, Rayan Saab

Combining Reinforcement Learning and Behavior Trees for NPCs in Video Games with AMD Schola

While the rapid advancements in the reinforcement learning (RL) research community have been remarkable, the adoption in commercial video games remains slow. In this paper, we outline common challenges the Game AI community faces when using…

Artificial Intelligence · Computer Science 2025-10-17 Tian Liu, Alex Cann, Ian Colbert, Mehdi Saeedi

SIRA: Scaled-Integer Range Analysis for Optimizing FPGA Dataflow Neural Network Accelerators

While neural network quantization effectively reduces the cost of matrix multiplications, aggressive quantization can expose non-matrix-multiply operations as significant performance and resource bottlenecks on embedded systems. Addressing…

Hardware Architecture · Computer Science 2025-09-01 Yaman Umuroglu, Christoph Berganski, Felix Jentzsch, Michal Danilowicz, Tomasz Kryjak, Charalampos Bezaitis, Magnus Sjalander, Ian Colbert, Thomas Preusser, Jakoba Petri-Koenig, Michaela Blott

Improving Quantization with Post-Training Model Expansion

The size of a model has been a strong predictor of its quality, as well as its cost. As such, the trade-off between model cost and quality has been well-studied. Post-training optimizations like quantization and pruning have typically…

Machine Learning · Computer Science 2025-08-29 Giuseppe Franco, Pablo Monteagudo-Lago, Ian Colbert, Nicholas Fraser, Michaela Blott

Accumulator-Aware Post-Training Quantization for Large Language Models

When quantizing weights and activations to increasingly narrower representations, the cost of additions begins to dominate that of multiplications in multiply-accumulate (MAC) units. Recent studies show that reducing addition costs via…

Machine Learning · Computer Science 2025-08-01 Ian Colbert, Giuseppe Franco, Fabian Grob, Jinjie Zhang, Rayan Saab

Path Generation and Evaluation in Video Games: A Nonparametric Statistical Approach

Navigation path traces play a crucial role in video game design, serving as a vital resource for both enhancing player engagement and fine-tuning non-playable character behavior. Generating such paths with human-like realism can enrich the…

Machine Learning · Computer Science 2025-06-05 Daniel Campa, Mehdi Saeedi, Ian Colbert, Srinjoy Das

Exploiting Unstructured Sparsity in Fully Homomorphic Encrypted DNNs

The deployment of deep neural networks (DNNs) in privacy-sensitive environments is constrained by computational overheads in fully homomorphic encryption (FHE). This paper explores unstructured sparsity in FHE matrix multiplication schemes…

Cryptography and Security · Computer Science 2025-04-04 Aidan Ferguson, Perry Gibson, Lara D'Agata, Parker McLeod, Ferhat Yaman, Amitabh Das, Ian Colbert, José Cano

A2Q+: Improving Accumulator-Aware Weight Quantization

Quantization techniques commonly reduce the inference costs of neural networks by restricting the precision of weights and activations. Recent studies show that also reducing the precision of the accumulator can further improve hardware…

Machine Learning · Computer Science 2024-01-22 Ian Colbert, Alessandro Pappalardo, Jakoba Petri-Koenig, Yaman Umuroglu

A2Q: Accumulator-Aware Quantization with Guaranteed Overflow Avoidance

We present accumulator-aware quantization (A2Q), a novel weight quantization method designed to train quantized neural networks (QNNs) to avoid overflow when using low-precision accumulators during inference. A2Q introduces a unique…

Machine Learning · Computer Science 2023-08-28 Ian Colbert, Alessandro Pappalardo, Jakoba Petri-Koenig

Quantized Neural Networks for Low-Precision Accumulation with Guaranteed Overflow Avoidance

We introduce a quantization-aware training algorithm that guarantees avoiding numerical overflow when reducing the precision of accumulators during inference. We leverage weight normalization as a means of constraining parameters during…

Machine Learning · Computer Science 2023-02-01 Ian Colbert, Alessandro Pappalardo, Jakoba Petri-Koenig

Robust Transferable Feature Extractors: Learning to Defend Pre-Trained Networks Against White Box Adversaries

The widespread adoption of deep neural networks in computer vision applications has brought forth a significant interest in adversarial robustness. Existing research has shown that maliciously perturbed inputs specifically tailored for a…

Machine Learning · Computer Science 2022-09-16 Alexander Cann, Ian Colbert, Ihab Amer

Human-Like Navigation Behavior: A Statistical Evaluation Framework

Recent advancements in deep reinforcement learning have brought forth an impressive display of highly skilled artificial agents capable of complex intelligent behavior. In video games, these artificial agents are increasingly deployed as…

Machine Learning · Statistics 2022-03-14 Ian Colbert, Mehdi Saeedi

Generating GPU Compiler Heuristics using Reinforcement Learning

GPU compilers are complex software programs with many optimizations specific to target hardware. These optimizations are often controlled by heuristics hand-designed by compiler experts using time- and resource-intensive processes. In this…

Machine Learning · Computer Science 2021-11-24 Ian Colbert, Jake Daly, Norm Rubin

Training Deep Neural Networks with Joint Quantization and Pruning of Weights and Activations

Quantization and pruning are core techniques used to reduce the inference costs of deep neural networks. State-of-the-art quantization techniques are currently applied to both the weights and activations; however, pruning is most often…

Machine Learning · Computer Science 2021-11-02 Xinyu Zhang, Ian Colbert, Ken Kreutz-Delgado, Srinjoy Das

An Energy-Efficient Edge Computing Paradigm for Convolution-based Image Upsampling

A novel energy-efficient edge computing paradigm is proposed for real-time deep learning-based image upsampling applications. State-of-the-art deep learning solutions for image upsampling are currently trained using either resize or…

Computer Vision and Pattern Recognition · Computer Science 2021-07-27 Ian Colbert, Ken Kreutz-Delgado, Srinjoy Das

A Competitive Edge: Can FPGAs Beat GPUs at DCNN Inference Acceleration in Resource-Limited Edge Computing Applications?

When trained as generative models, Deep Learning algorithms have shown exceptional performance on tasks involving high dimensional data such as image denoising and super-resolution. In an increasingly connected world dominated by mobile and…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-03-10 Ian Colbert, Jake Daly, Ken Kreutz-Delgado, Srinjoy Das

Generative and Discriminative Deep Belief Network Classifiers: Comparisons Under an Approximate Computing Framework

The use of Deep Learning hardware algorithms for embedded applications is characterized by challenges such as constraints on device power consumption, availability of labeled data, and limited internet bandwidth for frequent training on…

Machine Learning · Computer Science 2021-02-02 Siqiao Ruan, Ian Colbert, Ken Kreutz-Delgado, Srinjoy Das