Distributed, Parallel, and Cluster Computing · Computer Science
Co-Design of the Dense Linear Algebra Software Stack for Multicore Processors
Héctor Martínez, Sandra Catalán, Francisco D. Igual, José R. Herrero +2
2023-05-01
Programming Languages · Computer Science
PolyBlocks: A Compiler Infrastructure for AI Chips and Programming Frameworks
Uday Bondhugula, Akshay Baviskar, Navdeep Katel, Vimal Patel +2
2026-03-11
Hardware Architecture · Computer Science
Towards a Multi-array Architecture for Accelerating Large-scale Matrix Multiplication on FPGAs
Junzhong Shen, Yuran Qiao, You Huang, Mei Wen +1
2018-03-13
Databases · Computer Science
ArrayBridge: Interweaving declarative array processing with high-performance computing
Haoyuan Xing, Sofoklis Floratos, Spyros Blanas, Suren Byna +3
2017-02-28
Computational Physics · Physics
Hybrid programming-model strategies for GPU offloading of electronic structure calculation kernels
Jean-Luc Fattebert, Christian F. A. Negre, Joshua Finkelstein, Jamaludin Mohd-Yusof +5
2024-01-26
Programming Languages · Computer Science
Efficient Tree-Traversals: Reconciling Parallelism and Dense Data Representations
Chaitanya Koparkar, Mike Rainey, Michael Vollmer, Milind Kulkarni +1
2021-07-02
Distributed, Parallel, and Cluster Computing · Computer Science
Overview of the IBM Neural Computer Architecture
Pritish Narayanan, Charles E. Cox, Alexis Asseman, Nicolas Antoine +3
2020-03-26
Distributed, Parallel, and Cluster Computing · Computer Science
PGAbB: A Block-Based Graph Processing Framework for Heterogeneous Platforms
Abdurrahman Yasar, Sivasankaran Rajamanickam, Jonathan W. Berry, Umit V. Catalyurek
2022-09-13
Distributed, Parallel, and Cluster Computing · Computer Science
Parallel Programming Models for Heterogeneous Many-Cores : A Survey
Jianbin Fang, Chun Huang, Tao Tang, Zheng Wang
2020-05-11
Distributed, Parallel, and Cluster Computing · Computer Science
MSREP: A Fast yet Light Sparse Matrix Framework for Multi-GPU Systems
Jieyang Chen, Chenhao Xie, Jesun S Firoz, Jiajia Li +4
2022-09-19
Distributed, Parallel, and Cluster Computing · Computer Science
A Parallel Task-based Approach to Linear Algebra
Ashkan Tousimojarad, Wim Vanderbauwhede
2014-10-07
Optimization and Control · Mathematics
A Parallelizable Acceleration Framework for Packing Linear Programs
Palma London, Shai Vardi, Adam Wierman, Hanling Yi
2017-11-20
Distributed, Parallel, and Cluster Computing · Computer Science
Extreme-Scale Block-Structured Adaptive Mesh Refinement
Florian Schornbaum, Ulrich Rüde
2018-07-24
Distributed, Parallel, and Cluster Computing · Computer Science
Coarse-Grain Performance Estimator for Heterogeneous Parallel Computing Architectures like Zynq All-Programmable SoC
Daniel Jiménez-González, Carlos Álvarez, Antonio Filgueras, Xavier Martorell +3
2015-08-28
Mathematical Software · Computer Science
MIRGE: An Array-Based Computational Framework for Scientific Computing
Matthias Diener, Matthew J. Smith, Michael T. Campbell, Kaushik Kulkarni +5
2025-12-22
Programming Languages · Computer Science
AutoParallel: A Python module for automatic parallelization and distributed execution of affine loop nests
Cristian Ramon-Cortes, Ramon Amela, Jorge Ejarque, Philippe Clauss +1
2018-10-29
Distributed, Parallel, and Cluster Computing · Computer Science
High level programming abstractions for leveraging hierarchical memories with micro-core architectures
Maurice Jamieson, Nick Brown
2020-10-06
Systems and Control · Electrical Eng. & Systems
Data Generation for Stability Studies of Power Systems with High Penetration of Inverter-Based Resources
Francesca Rossi, Mauro Garcia Lorenzo, Eduardo Iraola de Acevedo, Elia Mateu Barriendos +4
2026-01-26
Distributed, Parallel, and Cluster Computing · Computer Science
RZBENCH: Performance evaluation of current HPC architectures using low-level and application benchmarks
Georg Hager, Holger Stengel, Thomas Zeiser, Gerhard Wellein
2007-12-21
Distributed, Parallel, and Cluster Computing · Computer Science
Para-B&B: Load-Balanced Deterministic Parallelization of Solving MIP
Jinyu Zhang, Di Huang, Yue Liu, Shuo Wang +2
2026-04-14
Distributed, Parallel, and Cluster Computing · Computer Science
ArborX: A Performance Portable Geometric Search Library
D. Lebrun-Grandié, A. Prokopenko, B. Turcksin, S. R. Slattery
2022-06-30
Machine Learning · Computer Science
A flexible and fast PyTorch toolkit for simulating training and inference on analog crossbar arrays
Malte J. Rasch, Diego Moreda, Tayfun Gokmen, Manuel Le Gallo +5
2023-02-17