English
Related papers

Related papers: Branch Target Buffer Reverse Engineering on Arm

200 papers

High-performance branch target buffers (BTBs) and the L1I cache are key to high-performance front-end. Modern branch predictors are highly accurate, but with an increase in code footprint in modern-day server workloads, BTB and L1I misses…

Hardware Architecture · Computer Science 2021-07-06 Vishal Gupta , Biswabandan Panda

Many contemporary applications feature multi-megabyte instruction footprints that overwhelm the capacity of branch target buffers (BTB) and instruction caches (L1-I), causing frequent front-end stalls that inevitably hurt performance. BTB…

Hardware Architecture · Computer Science 2023-01-11 Truls Asheim , Boris Grot , Rakesh Kumar

Modern processors have suffered a deluge of threats exploiting branch instruction collisions inside the branch prediction unit (BPU), from eavesdropping on secret-related branch operations to triggering malicious speculative executions.…

Cryptography and Security · Computer Science 2022-04-22 Tao Zhang , Timothy Lesch , Kenneth Koltermann , Dmitry Evtyushkin

Performance-enhancing mechanisms such as branch prediction, out-of-order execution, and return stack buffer (RSB) have been widely employed in today's modern processing units. Although successful in increasing the CPU performance,…

Cryptography and Security · Computer Science 2023-02-21 Farhad Taheri , Siavash Bayat-Sarmadi , Alireza Sadeghpour , Seyed Parsa Tayefeh Morsal

Modern out-of-order CPUs heavily rely on speculative execution for performance optimization, with branch prediction serving as a cornerstone to minimize stalls and maximize efficiency. Whenever shared branch prediction resources lack proper…

Cryptography and Security · Computer Science 2025-06-10 Yuhui Zhu , Alessandro Biondi

Industries are recently considering the adoption of cloud computing for hosting safety critical applications. However, the use of multicore processors usually adopted in the cloud introduces temporal anomalies due to contention for shared…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-06-30 Giorgio Farina , Gautam Gala , Marcello Cinque , Gerhard Fohler

Branch predictor (BP) is a critical component of modern processors, and its accurate modeling is essential for compilers and applications. However, processor vendors have disclosed limited details about their BP implementations. Recent…

Hardware Architecture · Computer Science 2024-11-22 Jiajie Chen , Peng Qu , Youhui Zhang

General matrix multiplication (GEMM) is the computational backbone of modern AI workloads, and its efficiency is critically dependent on effective tiling strategies. Conventional approaches employ symmetric tile buffering, where the…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-11-21 Chengyue Wang , Wesley Pang , Xinrui Wu , Gregory Jun , Luis Romero , Endri Taka , Diana Marculescu , Tony Nowatzki , Pranathi Vasireddy , Joseph Melber , Deming Chen , Jason Cong

In the context of hardware trust and assurance, reverse engineering has been often considered as an illegal action. Generally speaking, reverse engineering aims to retrieve information from a product, i.e., integrated circuits (ICs) and…

We study exact sparse linear regression with an $\ell_0-\ell_2$ penalty and develop a branch-and-bound (BnB) algorithm explicitly designed for GPU execution. Starting from a perspective reformulation, we derive an interval relaxation that…

Optimization and Control · Mathematics 2026-02-05 Xiang Meng , Ryan Lucas , Rahul Mazumder

Prior work has observed that fetch-directed prefetching (FDIP) is highly effective at covering instruction cache misses. The key to FDIP's effectiveness is having a sufficiently large BTB to accommodate the application's branch working set.…

Hardware Architecture · Computer Science 2020-06-25 Truls Asheim , Rakesh Kumar , Boris Grot

Previous schemes for designing secure branch prediction unit (SBPU) based on physical isolation can only offer limited security and significantly affect BPU's prediction capability, leading to prominent performance degradation. Moreover,…

Cryptography and Security · Computer Science 2025-01-22 Zhe Zhou , Fei Tong , Hongyu Wang , Xiaoyu Cheng , Fang Jiang , Zhikun Zhang , Yuxing Mao

We describe some features of the A100 memory architecture. In particular, we give a technique to reverse-engineer some hardware layout information. Using this information, we show how to avoid TLB issues to obtain full-speed random HBM…

Performance · Computer Science 2024-05-21 Alden Walker

Exploring a substantial amount of unlabeled data, semi-supervised learning (SSL) boosts the recognition performance when only a limited number of labels are provided. However, traditional methods assume that the data distribution is…

Computer Vision and Pattern Recognition · Computer Science 2023-05-23 Wujian Peng , Zejia Weng , Hengduo Li , Zuxuan Wu

Spectre v1 information disclosure attacks, which exploit CPU conditional branch misprediction, remain unsolved in deployed software. Certain Spectre v1 gadgets can be exploited only by out-of-place mistraining, in which the attacker…

Cryptography and Security · Computer Science 2025-02-18 Adam Tuby , Adam Morrison

Backdoor attacks (BA) are an emerging threat to deep neural network classifiers. A classifier being attacked will predict to the attacker's target class when a test sample from a source class is embedded with the backdoor pattern (BP).…

Cryptography and Security · Computer Science 2021-10-22 Zhen Xiang , David J. Miller , Siheng Chen , Xi Li , George Kesidis

Motivated by the recent introduction and large-scale deployment of BBR congestion control algorithms, multiple studies have investigated the performance and fairness implications of this shift from loss-based to delay-based congestion…

Networking and Internet Architecture · Computer Science 2026-01-12 Kathrin Elmenhorst , Nils Aschenbruck

Existing anti-malware software and reverse engineering toolkits struggle with stealthy sub-OS rootkits due to limitations of run-time kernel-level monitoring. A malicious kernel-level driver can bypass OS-level anti-virus mechanisms easily.…

Speculative execution is an optimization technique that has been part of CPUs for over a decade. It predicts the outcome and target of branch instructions to avoid stalling the execution pipeline. However, until recently, the security…

Cryptography and Security · Computer Science 2018-08-21 Giorgi Maisuradze , Christian Rossow

The demand for precise information on DRAM microarchitectures and error characteristics has surged, driven by the need to explore processing in memory, enhance reliability, and mitigate security vulnerability. Nonetheless, DRAM…

Cryptography and Security · Computer Science 2024-05-07 Hwayong Nam , Seungmin Baek , Minbok Wi , Michael Jaemin Kim , Jaehyun Park , Chihun Song , Nam Sung Kim , Jung Ho Ahn
‹ Prev 1 2 3 10 Next ›