Related papers: BEC: Bit-Level Static Analysis for Reliability aga…

200 papers

Reliability has been a major concern in embedded systems. Higher transistor density and lower voltage supply increase the vulnerability of embedded systems to soft errors. A Single Event Upset (SEU), which is also called a soft error, can…

Hardware Architecture · Computer Science 2024-05-21 Bing Xue, Mark Zwolinski

We present CLEAR (Cross-Layer Exploration for Architecting Resilience), a first of its kind framework which overcomes a major challenge in the design of digital systems that are resilient to reliability failures: achieve desired resilience…

High-performance and safety-critical system architects must accurately evaluate the application-level silent data corruption (SDC) rates of processors to soft errors. Such an evaluation requires error propagation all the way from particle…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-05-05 Siva Kumar Sastry Hari, Paolo Rech, Timothy Tsai, Mark Stephenson, Arslan Zulfiqar, Michael Sullivan, Philip Shirvani, Paul Racunas, Joel Emer, Stephen W. Keckler

In contemporary times, the increasing complexity of the system poses significant challenges to the reliability, trustworthiness, and security of the SACRES. Key issues include the susceptibility to phenomena such as instantaneous voltage…

Hardware Architecture · Computer Science 2024-12-23 Enrico Magliano, Alessio Carpegna, Alessandro Savino, Stefano Di Carlo

Efficient low complexity error correcting code (ECC) is considered as an effective technique for mitigation of multi-bit upset (MBU) in the configuration memory (CM) of static random access memory (SRAM) based Field Programmable Gate Array…

Hardware Architecture · Computer Science 2018-10-24 Swagata Mandal, Sreetama Sarkar, Wong Ming Ming, Anupam Chattopadhyay, Amlan Chakrabarti

Fault injection attacks represent a class of threats that can compromise embedded systems across multiple layers of abstraction, such as system software, instruction set architecture (ISA), microarchitecture, and physical implementation.…

Cryptography and Security · Computer Science 2025-05-07 Arsalan Ali Malik, Harshvadan Mihir, Aydin Aysu

We present a first of its kind framework which overcomes a major challenge in the design of digital systems that are resilient to reliability failures: achieve desired resilience targets at minimal costs (energy, power, execution time,…

In most error correction coding (ECC) frameworks, the typical error metric is the bit error rate (BER) which measures the number of bit errors. For this metric, the positions of the bits are not relevant to the decoding, and in many noise…

Signal Processing · Electrical Eng. & Systems 2021-10-11 Chai Wah Wu

Quantum error mitigation (QEM) is typically viewed as a suite of practical techniques for today's noisy intermediate-scale quantum devices, with limited relevance once fault-tolerant quantum computers become available. In this work, we…

Quantum Physics · Physics 2025-12-11 Zeyuan Zhou, Shaun Pexton, Aleksander Kubica, Yongshan Ding

Modern computer scaling trends in pursuit of larger component counts and power efficiency have, unfortunately, led to less reliable hardware and consequently soft errors escaping into application data ("silent data corruptions").…

Software Engineering · Computer Science 2016-07-05 Vishal Chandra Sharma, Ganesh Gopalakrishnan, Sriram Krishnamoorthy

Silent Errors within hardware devices occur when an internal defect manifests in a part of the circuit which does not have check logic to detect the incorrect circuit operation. The results of such a defect can range from flipping a single…

Hardware Architecture · Computer Science 2022-03-18 Harish Dattatraya Dixit, Laura Boyle, Gautham Vunnam, Sneha Pendharkar, Matt Beadon, Sriram Sankar

High energy particles from cosmic rays or packaging materials can generate a glitch or a current transient (single event transient or SET) in a logic circuit. This SET can eventually get captured in a register resulting in a flip of the…

Hardware Architecture · Computer Science 2017-06-16 Nanditha P. Rao, Madhav P. Desai

Stabilizer states are a central resource in quantum information processing, underpinning a wide range of applications. While they can be efficiently generated via Clifford circuits, the presence of coherent errors, such as small-angle…

Quantum Physics · Physics 2025-10-24 Giovanni Di Bartolomeo, Giulio Crognaletti, Angelo Bassi, Michele Vischi

Software Fault Localization refers to the activity of finding code elements (e.g., statements) that are related to a software failure. The state-of-the-art fault localization techniques, however, produce coarse-grained results that can…

Software Engineering · Computer Science 2021-11-16 Shangwen Wang, Kui Liu, Bo Lin, Li Li, Jacques Klein, Xiaoguang Mao, Tegawendé F. Bissyandé

Soft errors have a significant impact on the circuit reliability at nanoscale technologies. At the architectural level, soft errors are commonly modeled by a probabilistic bit-flip model. In developing such abstract fault models, an…

Hardware Architecture · Computer Science 2014-01-07 Nanditha P. Rao, Shahbaz Sarik, Madhav P. Desai

Alpha-particles and cosmic rays cause bit flips in chips. Protection circuits ease the problem, but cost chip area and power, and so designers try hard to optimize them. This leads to bugs: an undetected fault can bring miscalculations, the…

Other Computer Science · Computer Science 2017-12-13 Patrick Klampfl, Robert Koenighofer, Roderick Bloem, Ayrat Khalimov, Aiman Abu-Yonis, Shiri Moran

Static analysis plays a crucial role in software vulnerability detection, yet faces a persistent precision-scalability tradeoff. In large codebases like the Linux kernel, traditional static analysis tools often generate excessive false…

Software Engineering · Computer Science 2025-06-03 Haonan Li, Hang Zhang, Kexin Pei, Zhiyun Qian

Future extreme-scale computer systems may expose silent data corruption (SDC) to applications, in order to save energy or increase performance. However, resilience research struggles to come up with useful abstract programming models for…

Mathematical Software · Computer Science 2014-01-15 James Elliott, Mark Hoemmen, Frank Mueller

Silent data corruption (SDC) threatens the reliability of large-scale GPU clusters used for training large language models, yet its rarity and lack of explicit error signals make accurate high-level modeling challenging. To address this…

In large-scale datacenters, memory failure is a common cause of server crashes, with Uncorrectable Errors (UEs) being a major indicator of Dual Inline Memory Module (DIMM) defects. Existing approaches primarily focus on predicting UEs using…

Hardware Architecture · Computer Science 2023-12-19 Qiao Yu, Wengui Zhang, Jorge Cardoso, Odej Kao