Related papers: ALBERTA: ALgorithm-Based Error Resilience in Trans…

FT-Transformer: Resilient and Reliable Transformer with End-to-End Fault Tolerant Attention

Transformer models rely on High-Performance Computing (HPC) resources for inference, where soft errors are inevitable in large-scale systems, making the reliability of the model particularly critical. Existing fault tolerance frameworks for…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-08-14 Huangliang Dai , Shixun Wu , Jiajun Huang , Zizhe Jian , Yue Zhu , Haiyang Hu , Zizhong Chen

Custom Algorithm-based Fault Tolerance for Attention Layers in Transformers

Transformers and large language models (LLMs), powered by the attention mechanism, have transformed numerous AI applications, driving the need for specialized hardware accelerators. A major challenge in these accelerators is efficiently…

Machine Learning · Computer Science 2025-07-23 Vasileios Titopoulos , Kosmas Alexandridis , Giorgos Dimitrakopoulos

Soft Error Reliability Analysis of Vision Transformers

Vision Transformers (ViTs) that leverage self-attention mechanism have shown superior performance on many classical vision tasks compared to convolutional neural networks (CNNs) and gain increasing popularity recently. Existing ViTs works…

Cryptography and Security · Computer Science 2024-04-29 Xinghua Xue , Cheng Liu , Ying Wang , Bing Yang , Tao Luo , Lei Zhang , Huawei Li , Xiaowei Li

Making Convolutions Resilient via Algorithm-Based Error Detection Techniques

The ability of Convolutional Neural Networks (CNNs) to accurately process real-time telemetry has boosted their use in safety-critical and high-performance computing systems. As such systems require high levels of resilience to errors, CNNs…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-06-11 Siva Kumar Sastry Hari , Michael B. Sullivan , Timothy Tsai , Stephen W. Keckler

Efficient Fault Detection Architectures for Modular Exponentiation Targeting Cryptographic Applications Benchmarked on FPGAs

Whether stemming from malicious intent or natural occurrences, faults and errors can significantly undermine the reliability of any architecture. In response to this challenge, fault detection assumes a pivotal role in ensuring the secure…

Cryptography and Security · Computer Science 2024-02-29 Saeed Aghapour , Kasra Ahmadi , Mehran Mozaffari Kermani , Reza Azarderakhsh

Efficient Software Vulnerability Detection Using Transformer-based Models

Detecting software vulnerabilities is critical to ensuring the security and reliability of modern computer systems. Deep neural networks have shown promising results on vulnerability detection, but they lack the capability to capture global…

Cryptography and Security · Computer Science 2026-04-02 Sameer Shaik , Zhen Huang , Daniela Stan Raicu , Jacob Furst

Code Vulnerability Detection Across Different Programming Languages with AI Models

Security vulnerabilities present in a code that has been written in diverse programming languages are among the most critical yet complicated aspects of source code to detect. Static analysis tools based on rule-based patterns usually do…

Cryptography and Security · Computer Science 2025-08-19 Hael Abdulhakim Ali Humran , Ferdi Sonmez

Alert-ME: An Explainability-Driven Defense Against Adversarial Examples in Transformer-Based Text Classification

Transformer-based text classifiers such as BERT, RoBERTa, T5, and GPT have shown strong performance in natural language processing tasks but remain vulnerable to adversarial examples. These vulnerabilities raise significant security…

Computation and Language · Computer Science 2025-10-27 Bushra Sabir , Yansong Gao , Alsharif Abuadbba , M. Ali Babar

JavelinGuard: Low-Cost Transformer Architectures for LLM Security

We present JavelinGuard, a suite of low-cost, high-performance model architectures designed for detecting malicious intent in Large Language Model (LLM) interactions, optimized specifically for production deployment. Recent advances in…

Machine Learning · Computer Science 2025-06-10 Yash Datta , Sharath Rajasekar

Cross-Layer Optimization for Fault-Tolerant Deep Learning

Fault-tolerant deep learning accelerator is the basis for highly reliable deep learning processing and critical to deploy deep learning in safety-critical applications such as avionics and robotics. Since deep learning is known to be…

Hardware Architecture · Computer Science 2023-12-22 Qing Zhang , Cheng Liu , Bo Liu , Haitong Huang , Ying Wang , Huawei Li , Xiaowei Li

Towards Light Weight Object Detection System

Transformers are a popular choice for classification tasks and as backbones for object detection tasks. However, their high latency brings challenges in their adaptation to lightweight object detection systems. We present an approximation…

Computer Vision and Pattern Recognition · Computer Science 2022-10-11 Dharma KC , Venkata Ravi Kiran Dayana , Meng-Lin Wu , Venkateswara Rao Cherukuri , Hau Hwang

ReaLM: Reliable and Efficient Large Language Model Inference with Statistical Algorithm-Based Fault Tolerance

The demand for efficient large language model (LLM) inference has propelled the development of dedicated accelerators. As accelerators are vulnerable to hardware faults due to aging, variation, etc, existing accelerator designs often…

Hardware Architecture · Computer Science 2025-04-08 Tong Xie , Jiawang Zhao , Zishen Wan , Zuodong Zhang , Yuan Wang , Runsheng Wang , Ru Huang , Meng Li

Comparing Unidirectional, Bidirectional, and Word2vec Models for Discovering Vulnerabilities in Compiled Lifted Code

Ransomware and other forms of malware cause significant financial and operational damage to organizations by exploiting long-standing and often difficult-to-detect software vulnerabilities. To detect vulnerabilities such as buffer overflows…

Cryptography and Security · Computer Science 2025-06-05 Gary A. McCully , John D. Hastings , Shengjie Xu , Adam Fortier

SAFER: Sharpness Aware layer-selective Finetuning for Enhanced Robustness in vision transformers

Vision transformers (ViTs) have become essential backbones in advanced computer vision applications and multi-modal foundation models. Despite their strengths, ViTs remain vulnerable to adversarial perturbations, comparable to or even…

Computer Vision and Pattern Recognition · Computer Science 2025-01-06 Bhavna Gopal , Huanrui Yang , Mark Horton , Yiran Chen

CLEAR: Cross-Layer Exploration for Architecting Resilience - Combining Hardware and Software Techniques to Tolerate Soft Errors in Processor Cores

We present a first of its kind framework which overcomes a major challenge in the design of digital systems that are resilient to reliability failures: achieve desired resilience targets at minimal costs (energy, power, execution time,…

Hardware Architecture · Computer Science 2016-06-27 Eric Cheng , Shahrzad Mirkhani , Lukasz G. Szafaryn , Chen-Yong Cher , Hyungmin Cho , Kevin Skadron , Mircea R. Stan , Klas Lilja , Jacob A. Abraham , Pradip Bose , Subhasish Mitra

Detecting software vulnerabilities using Language Models

Recently, deep learning techniques have garnered substantial attention for their ability to identify vulnerable code patterns accurately. However, current state-of-the-art deep learning models, such as Convolutional Neural Networks (CNN),…

Cryptography and Security · Computer Science 2023-02-24 Marwan Omar

LogShield: A Transformer-based APT Detection System Leveraging Self-Attention

Cyber attacks are often identified using system and network logs. There have been significant prior works that utilize provenance graphs and ML techniques to detect attacks, specifically advanced persistent threats, which are very difficult…

Cryptography and Security · Computer Science 2023-11-13 Sihat Afnan , Mushtari Sadia , Shahrear Iqbal , Anindya Iqbal

PRIMA: Operational Patterns for Resilient Multi-Agent Research with Verifiable Identity and Convergent Feedback

Operating LLMs as coordinated multi-agent research systems over multi-hour runs surfaces failure modes that single-shot evaluation cannot: upstream providers throttle without warning, sub-agents drift the task to fit accessible tools,…

Artificial Intelligence · Computer Science 2026-05-26 Sasank Annapureddy

AWARE-NET: Adaptive Weighted Averaging for Robust Ensemble Network in Deepfake Detection

Deepfake detection has become increasingly important due to the rise of synthetic media, which poses significant risks to digital identity and cyber presence for security and trust. While multiple approaches have improved detection…

Computer Vision and Pattern Recognition · Computer Science 2025-05-02 Muhammad Salman , Iqra Tariq , Mishal Zulfiqar , Muqadas Jalal , Sami Aujla , Sumbal Fatima

Explainability-Guided Adversarial Attacks on Transformer-Based Malware Detectors Using Control Flow Graphs

Transformer-based malware detection systems operating on graph modalities such as control flow graphs (CFGs) achieve strong performance by modeling structural relationships in program behavior. However, their robustness to adversarial…

Cryptography and Security · Computer Science 2026-04-07 Andrew Wheeler , Kshitiz Aryal , Maanak Gupta