Related papers: Efficient Soft-Error Detection for Low-precision D…

Online Soft Error Tolerance in ReRAM Crossbars for Deep Learning Accelerators

Resistive Random-Access Memory (ReRAM) crossbar arrays are promising candidates for in-situ matrix-vector multiplication (MVM), a frequent operation in Deep Learning algorithms. Despite their advantages, these emerging non-volatile memories…

Emerging Technologies · Computer Science 2024-12-05 Benyamin Khezeli, Hamid Reza Zarandi, Elham Cheshmikhani

ECRM: Efficient Fault Tolerance for Recommendation Model Training via Erasure Coding

Deep-learning-based recommendation models (DLRMs) are widely deployed to serve personalized content to users. DLRMs are large in size due to their use of large embedding tables, and are trained by distributing the model across the memory of…

Machine Learning · Computer Science 2021-04-06 Kaige Liu, Jack Kosaian, K. V. Rashmi

SoftDedup: an Efficient Data Reweighting Method for Speeding Up Language Model Pre-training

The effectiveness of large language models (LLMs) is often hindered by duplicated data in their extensive pre-training datasets. Current approaches primarily focus on detecting and removing duplicates, which risks the loss of valuable…

Computation and Language · Computer Science 2024-07-10 Nan He, Weichen Xiong, Hanwen Liu, Yi Liao, Lei Ding, Kai Zhang, Guohua Tang, Xiao Han, Wei Yang

Low-Complexity Soft-Feedback Detector for AFDM Systems

Affine frequency division multiplexing (AFDM), an emerging multi-carrier modulation scheme, has garnered significant attention due to its resilience to Doppler shifts and capability to achieve full diversity in doubly dispersive channels.…

Signal Processing · Electrical Eng. & Systems 2026-04-17 Taohe Chen, Yin Xu, Tianyao Ma, Aimin Tang, Qu Luo, Dazhi He, Wenjun Zhang

Compiler-Level Matrix Multiplication Optimization for Deep Learning

An important linear algebra routine, GEneral Matrix Multiplication (GEMM), is a fundamental operator in deep learning. Compilers need to translate these routines into low-level code optimized for specific hardware. Compiler-level…

Machine Learning · Computer Science 2019-09-25 Huaqing Zhang, Xiaolin Cheng, Hui Zang, Dae Hoon Park

BagPipe: Accelerating Deep Recommendation Model Training

Deep learning based recommendation models (DLRM) are widely used in several business critical applications. Training such recommendation models efficiently is challenging because they contain billions of embedding-based parameters, leading…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-11-02 Saurabh Agarwal, Chengpo Yan, Ziyi Zhang, Shivaram Venkataraman

Soft Expectation and Deep Maximization for Image Feature Detection

Central to the application of many multi-view geometry algorithms is the extraction of matching points between multiple viewpoints, enabling classical tasks such as camera pose estimation and 3D reconstruction. Many approaches that…

Computer Vision and Pattern Recognition · Computer Science 2021-10-15 Alexander Mai, Allen Yang, Dominique E. Meyer

Towards Automated Error Discovery: A Study in Conversational AI

Although LLM-based conversational agents demonstrate strong fluency and coherence, they still produce undesirable behaviors (errors) that are challenging to prevent from reaching users during deployment. Recent research leverages large…

Computation and Language · Computer Science 2025-09-16 Dominic Petrak, Thy Thy Tran, Iryna Gurevych

DeepGEMM: Accelerated Ultra Low-Precision Inference on CPU Architectures using Lookup Tables

A lot of recent progress has been made in ultra low-bit quantization, promising significant improvements in latency, memory footprint and energy consumption on edge devices. Quantization methods such as Learned Step Size Quantization can…

Machine Learning · Computer Science 2023-04-19 Darshan C. Ganji, Saad Ashfaq, Ehsan Saboori, Sudhakar Sah, Saptarshi Mitra, MohammadHossein AskariHemmat, Alexander Hoffman, Ahmed Hassanien, Mathieu Léonardon

SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition

Error correction in automatic speech recognition (ASR) aims to correct those incorrect words in sentences generated by ASR models. Since recent ASR models usually have low word error rate (WER), to avoid affecting originally correct tokens,…

Computation and Language · Computer Science 2023-12-21 Yichong Leng, Xu Tan, Wenjie Liu, Kaitao Song, Rui Wang, Xiang-Yang Li, Tao Qin, Edward Lin, Tie-Yan Liu

MPGemmFI: A Fault Injection Technique for Mixed Precision GEMM in ML Applications

Emerging deep learning workloads urgently need fast general matrix multiplication (GEMM). To meet such demand, one of the critical features of machine-learning-specific accelerators such as NVIDIA Tensor Cores, AMD Matrix Cores, and Google…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-11-13 Bo Fang, Xinyi Li, Harvey Dam, Cheng Tan, Siva Kumar Sastry Hari, Timothy Tsai, Ignacio Laguna, Dingwen Tao, Ganesh Gopalakrishnan, Prashant Nair, Kevin Barker, Ang Li

Learning During Detection: Continual Learning for Neural OFDM Receivers via DMRS

Deep neural networks (DNNs) have been increasingly explored for receiver design because they can handle complex environments without relying on explicit channel models. Nevertheless, because communication channels change rapidly, their…

Information Theory · Computer Science 2026-02-25 Mohanad Obeed, Ming Jian

Building a Performance Model for Deep Learning Recommendation Model Training on GPUs

We devise a performance model for GPU training of Deep Learning Recommendation Models (DLRM), whose GPU utilization is low compared to other well-optimized CV and NLP models. We show that both the device active time (the sum of kernel…

Machine Learning · Computer Science 2022-11-18 Zhongyi Lin, Louis Feng, Ehsan K. Ardestani, Jaewon Lee, John Lundell, Changkyu Kim, Arun Kejariwal, John D. Owens

Deep Learning Framework Testing via Heuristic Guidance Based on Multiple Model Measurements

Deep learning frameworks serve as the foundation for developing and deploying deep learning applications. To enhance the quality of deep learning frameworks, researchers have proposed numerous testing methods using deep learning models as…

Software Engineering · Computer Science 2025-10-22 Yinglong Zou, Juan Zhai, Chunrong Fang, Yanzhou Mu, Jiawei Liu, Zhenyu Chen

Soft Random Sampling: A Theoretical and Empirical Analysis

Soft random sampling (SRS) is a simple yet effective approach for efficient training of large-scale deep neural networks when dealing with massive data. SRS selects a subset uniformly at random with replacement from the full data set in…

Machine Learning · Computer Science 2023-11-27 Xiaodong Cui, Ashish Mittal, Songtao Lu, Wei Zhang, George Saon, Brian Kingsbury

Error Mitigation of Fault-Tolerant Quantum Circuits with Soft Information

Quantum error mitigation (QEM) is typically viewed as a suite of practical techniques for today's noisy intermediate-scale quantum devices, with limited relevance once fault-tolerant quantum computers become available. In this work, we…

Quantum Physics · Physics 2025-12-11 Zeyuan Zhou, Shaun Pexton, Aleksander Kubica, Yongshan Ding

Detecting Errors through Ensembling Prompts (DEEP): An End-to-End LLM Framework for Detecting Factual Errors

Accurate text summarization is one of the most common and important tasks performed by Large Language Models, where the costs of human review for an entire document may be high, but the costs of errors in summarization may be even greater.…

Computation and Language · Computer Science 2024-06-21 Alex Chandler, Devesh Surve, Hui Su

Low-Complexity Soft-Decision Detection for Combating DFE Burst Errors in IM/DD Links

The deployment of non-binary pulse amplitude modulation (PAM) and soft decision (SD)-forward error correction (FEC) in future intensity-modulation (IM)/direct-detection (DD) links is inevitable. However, high-speed IM/DD links suffer from…

Signal Processing · Electrical Eng. & Systems 2023-09-29 Kaiquan Wu, Gabriele Liga, Jamal Riani, Alex Alvarado

LM-Fix: Lightweight Bit-Flip Detection and Rapid Recovery Framework for Language Models

This paper presents LM-Fix, a lightweight detection and rapid recovery framework for faults in large language models (LLMs). Existing integrity approaches are often heavy or slow for modern LLMs. LM-Fix runs a short test-vector pass and…

Software Engineering · Computer Science 2026-02-25 Ahmad Tahmasivand, Noureldin Zahran, Saba Al-Sayouri, Mohammed Fouda, Khaled N. Khasawneh

SoftTriple Loss: Deep Metric Learning Without Triplet Sampling

Distance metric learning (DML) is to learn the embeddings where examples from the same class are closer than examples from different classes. It can be cast as an optimization problem with triplet constraints. Due to the vast number of…

Computer Vision and Pattern Recognition · Computer Science 2020-04-16 Qi Qian, Lei Shang, Baigui Sun, Juhua Hu, Hao Li, Rong Jin