Distributed, Parallel, and Cluster Computing · Computer Science
A NUMA-Aware Provably-Efficient Task-Parallel Platform Based on the Work-First Principle
Justin Deters, Jiaye Wu, Yifan Xu, I-Ting Angelina Lee
2019-01-08
Distributed, Parallel, and Cluster Computing · Computer Science
Parallel scheduling of task trees with limited memory
Lionel Eyraud-Dubois, Loris Marchal, Oliver Sinnen, Frédéric Vivien
2014-10-02
Distributed, Parallel, and Cluster Computing · Computer Science
Exploring Fine-grained Task Parallelism on Simultaneous Multithreading Cores
Denis Los, Igor Petushkov
2024-10-03
Distributed, Parallel, and Cluster Computing · Computer Science
Parallelizing Maximal Clique Enumeration on GPUs
Mohammad Almasri, Yen-Hsiang Chang, Izzat El Hajj, Rakesh Nagi +2
2025-04-25
Distributed, Parallel, and Cluster Computing · Computer Science
Parallelizing Workload Execution in Embedded and High-Performance Heterogeneous Systems
Jose Nunez-Yanez, Mohammad Hosseinabady, Moslem Amiri, Andrés Rodríguez +4
2018-02-12
Distributed, Parallel, and Cluster Computing · Computer Science
An Empirical-cum-Statistical Approach to Power-Performance Characterization of Concurrent GPU Kernels
Nilanjan Goswami, Amer Qouneh, Chao Li, Tao Li
2020-11-06
Distributed, Parallel, and Cluster Computing · Computer Science
Accelerating Monte-Carlo Tree Search on CPU-FPGA Heterogeneous Platform
Yuan Meng, Rajgopal Kannan, Viktor Prasanna
2022-08-25
Data Structures and Algorithms · Computer Science
Co-Scheduling Algorithms for High-Throughput Workload Execution
Guillaume Aupy, Manu Shantharam, Anne Benoit, Yves Robert +1
2013-05-01
Distributed, Parallel, and Cluster Computing · Computer Science
Concurrent Scheduling of High-Level Parallel Programs on Multi-GPU Systems
Fabian Knorr, Philip Salzmann, Peter Thoman, Thomas Fahringer
2025-03-14
Distributed, Parallel, and Cluster Computing · Computer Science
Unleashing the Power of Preemptive Priority-based Scheduling for Real-Time GPU Tasks
Yidi Wang, Cong Liu, Daniel Wong, Hyoseung Kim
2024-01-31
Distributed, Parallel, and Cluster Computing · Computer Science
A Comparative Study of Asynchronous Many-Tasking Runtimes: Cilk, Charm++, ParalleX and AM++
Abhishek Kulkarni, Andrew Lumsdaine
2019-04-02
Distributed, Parallel, and Cluster Computing · Computer Science
Toward the Design of Fault-Tolerance- and Peak-Power-Aware Multi-Core Mixed-Criticality Systems
Behnaz Ranjbar, Ali Hosseinghorban, Mohammad Salehi, Alireza Ejlali +1
2021-06-01
Distributed, Parallel, and Cluster Computing · Computer Science
GPU First -- Execution of Legacy CPU Codes on GPUs
Shilei Tian, Tom Scogland, Barbara Chapman, Johannes Doerfert
2023-06-27
Distributed, Parallel, and Cluster Computing · Computer Science
Specx: a C++ task-based runtime system for heterogeneous distributed architectures
Paul Cardosi, Bérenger Bramas
2024-11-18
Distributed, Parallel, and Cluster Computing · Computer Science
Optimizing Fine-Grained Parallelism Through Dynamic Load Balancing on Multi-Socket Many-Core Systems
Wenyi Wang, Maxime Gonthier, Poornima Nookala, Haochen Pan +3
2025-03-20
Hardware Architecture · Computer Science
MGPU-TSM: A Multi-GPU System with Truly Shared Memory
Saiful A. Mojumder, Yifan Sun, Leila Delshadtehrani, Yenai Ma +5
2020-08-11
Hardware Architecture · Computer Science
Thread Batching for High-performance Energy-efficient GPU Memory Design
Bing Li, Mengjie Mao, Xiaoxiao Liu, Tao Liu +5
2019-06-17
Distributed, Parallel, and Cluster Computing · Computer Science
Concurrent CPU-GPU Task Programming using Modern C++
Tsung-Wei Huang, Yibo Lin
2022-03-17
Distributed, Parallel, and Cluster Computing · Computer Science
GTaP: A GPU-Resident Fork-Join Task-Parallel Runtime with a Pragma-Based Interface
Yuki Maeda, Kenjiro Taura
2026-04-08
Computational Physics · Physics
Parallel TREE code for two-component ultracold plasma analysis
Byoungseon Jeon, Joel D. Kress, Lee A. Collins, Niels Grønbech-Jensen
2009-05-04