English
Related papers

Related papers: Optimizing Branch Predictor for Graph Applications

200 papers

Branch predictor (BP) is an essential component in modern processors since high BP accuracy can improve performance and reduce energy by decreasing the number of instructions executed on wrong-path. However, reducing latency and storage…

Hardware Architecture · Computer Science 2018-04-03 Sparsh Mittal

Major chip manufacturers have all introduced Multithreaded processors. These processors are used for running a variety of workloads. Efficient resource utilization is an important design aspect in such processors. Depending on the workload,…

Performance · Computer Science 2019-09-20 Murthy Durbhakula

Modern branch predictors predict the vast majority of conditional branch instructions with near-perfect accuracy, allowing superscalar, out-of-order processors to maximize speculative efficiency and thus performance. However, this…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-10-13 Chit-Kwan Lin , Stephen J. Tarsa

This paper quantifies the impact of branches and branch mispredictions on the single-core performance for two classes of graph problems. Specifically, we consider classical algorithms for computing connected components and breadth-first…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-11-11 Oded Green , Marat Dukhan , Richard Vuduc

Modern processors rely heavily on speculation to keep the pipeline filled and consequently execute and commit instructions as close to maximum capacity as possible. To improve instruction-level parallelism, the processor core needs to fetch…

Hardware Architecture · Computer Science 2021-10-19 Ilias Vougioukas , Andreas Sandberg , Nikos Nikoleris

Tree-based models have proven to be an effective solution for web ranking as well as other problems in diverse domains. This paper focuses on optimizing the runtime performance of applying such models to make predictions, given an…

Databases · Computer Science 2013-04-29 Nima Asadi , Jimmy Lin , Arjen P. de Vries

Branch prediction is a standard feature in most processors, significantly improving the run time of programs by allowing a processor to predict the direction of a branch before it has been evaluated. Current branch prediction methods can…

Hardware Architecture · Computer Science 2018-05-03 Adam Auten , Tanishq Dubey , Rohan Mathur

Branch misprediction latency is one of the most important contributors to performance degradation and wasted energy consumption in a modern core. State-of-the-art predictors generally perform very well but occasionally suffer from high…

Hardware Architecture · Computer Science 2026-01-22 Ioannis Constantinou , Arthur Perais , Yiannakis Sazeides

CPU branch prediction has hit a wall--existing techniques achieve near-perfect accuracy on 99% of static branches, and yet the mispredictions that remain hide major performance gains. In a companion report, we show that a primary source of…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-06-25 Stephen J Tarsa , Chit-Kwan Lin , Gokce Keskin , Gautham Chinya , Hong Wang

Graph processing is typically considered to be a memory-bound rather than compute-bound problem. One common line of thought is that more available memory bandwidth corresponds to better graph processing performance. However, in this work we…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-10-10 Oded Green , James Fox , Jeffrey Young , Jun Shirako , David Bader

Decision Tree (DT) Learning is a fundamental problem in Interpretable Machine Learning, yet it poses a formidable optimisation challenge. Practical algorithms have recently emerged, primarily leveraging Dynamic Programming and Branch &…

Machine Learning · Computer Science 2025-05-13 Ayman Chaouki , Jesse Read , Albert Bifet

Processing large-scale graph datasets is computationally intensive and time-consuming. Processor-centric CPU and GPU architectures, commonly used for graph applications, often face bottlenecks caused by extensive data movement between the…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-02-11 Marzieh Barkhordar , Alireza Tabatabaeian , Mohammad Sadrosadati , Christina Giannoula , Juan Gomez Luna , Izzat El Hajj , Onur Mutlu , Alaa R. Alameldeen

A processor's memory hierarchy has a major impact on the performance of running code. However, computing platforms, where the actual hardware characteristics are hidden from both the end user and the tools that mediate execution, such as a…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-07-10 Keith Cooper , Xiaoran Xu

Conditional branch prediction predicts the likely direction of a conditional branch instruction to support ILP extraction. Branch prediction is a pattern recognition problem that learns mappings between a context to the branch outcome. An…

Hardware Architecture · Computer Science 2025-12-19 FNU Vikas , Paul Gratz , Daniel Jiménez

High load latency that results from deep cache hierarchies and relatively slow main memory is an important limiter of single-thread performance. Data prefetch helps reduce this latency by fetching data up the hierarchy before it is…

Hardware Architecture · Computer Science 2021-03-30 Majid Jalili , Mattan Erez

Graph Neural Networks (GNNs) extend basic Neural Networks (NNs) by additionally making use of graph structure based on the relational inductive bias (edge bias), rather than treating the nodes as collections of independent and identically…

Machine Learning · Computer Science 2023-11-07 Sitao Luan , Chenqing Hua , Qincheng Lu , Jiaqi Zhu , Xiao-Wen Chang , Doina Precup

Branch prediction is arguably one of the most important speculative mechanisms within a high-performance processor architecture. A common approach to improve branch prediction accuracy is to employ lengthy history records of previously seen…

Hardware Architecture · Computer Science 2022-07-29 Anastasios Zouzias , Kleovoulos Kalaitzidis , Konstantin Berestizshevsky , Renzo Andri , Leeor Peled , Zhe Wang

Countless applications cast their computational core in terms of dense linear algebra operations. These operations can usually be implemented by combining the routines offered by standard linear algebra libraries such as BLAS and LAPACK,…

Performance · Computer Science 2014-10-01 Elmar Peise , Paolo Bientinesi

Graph streams are rapidly evolving sequences of edges that convey continuously changing relationships among entities, playing a crucial role in domains such as networking, finance, and cybersecurity. Their massive scale and high dynamism…

Databases · Computer Science 2026-02-18 Boyan Wang , Zhuochen Fan , Dayu Wang , Fangcheng Fu , Zeyu Luan , Lei Zou , Qing Li , Tong Yang

Future computing systems, from handhelds to supercomputers, will undoubtedly be more parallel and heterogeneous than todays systems to provide more performance and energy efficiency. Thus, GPUs are increasingly being used to accelerate…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-10-18 Saeed Taheri , Apan Qasem , Martin Burtscher
‹ Prev 1 2 3 10 Next ›