Distributed, Parallel, and Cluster Computing · Computer Science
Leveraging shared caches for parallel temporal blocking of stencil codes on multicore processors and clusters
Markus Wittmann, Georg Hager, Jan Treibig, Gerhard Wellein
2010-06-17
Distributed, Parallel, and Cluster Computing · Computer Science
Block-Relaxation Methods for 3D Constant-Coefficient Stencils on GPUs and Multicore CPUs
Manuel Birke, Bobby Philip, Zhen Wang, Mark Berrill
2019-07-16
Distributed, Parallel, and Cluster Computing · Computer Science
Revisiting Temporal Blocking Stencil Optimizations
Lingqi Zhang, Mohamed Wahib, Peng Chen, Jintao Meng +3
2023-05-15
Distributed, Parallel, and Cluster Computing · Computer Science
Multi-dimensional intra-tile parallelization for memory-starved stencil computations
Tareq Malas, Georg Hager, Hatem Ltaief, David Keyes
2015-10-19
Computational Physics · Physics
3D Blocking for Matrix-free Smoothers in 2D Variable-Viscosity Stokes Equations with Applications to Geodynamics
Marcel Ferrari, Cyrill Püntener, Alexander Sotoudeh, Niklas Viebig
2025-09-24
Distributed, Parallel, and Cluster Computing · Computer Science
Multicore-optimized wavefront diamond blocking for optimizing stencil updates
Tareq Malas, Georg Hager, Hatem Ltaief, Holger Stengel +2
2015-07-28
Distributed, Parallel, and Cluster Computing · Computer Science
An Efficient Vectorization Scheme for Stencil Computation
Kun Li, Liang Yuan, Yunquan Zhang, Yue Yue +2
2021-03-19
Distributed, Parallel, and Cluster Computing · Computer Science
Temporal blocking of finite-difference stencil operators with sparse "off-the-grid" sources
George Bisbas, Fabio Luporini, Mathias Louboutin, Rhodri Nelson +2
2021-02-26
Mathematical Software · Computer Science
Temporal Vectorization for Stencils
Liang Yuan, Hang Cao, Yunquan Zhang, Kun Li +2
2020-10-13
Distributed, Parallel, and Cluster Computing · Computer Science
Improving Memory Hierarchy Utilisation for Stencil Computations on Multicore Machines
Alexandre Sena, Aline Nascimento, Cristina Boeres, Vinod E. F. Rebello +1
2013-10-31
Distributed, Parallel, and Cluster Computing · Computer Science
Scalable communication for high-order stencil computations using CUDA-aware MPI
Johannes Pekkilä, Miikka S. Väisälä, Maarit J. Käpylä, Matthias Rheinhardt +1
2022-05-11
Distributed, Parallel, and Cluster Computing · Computer Science
Exploiting Scratchpad Memory for Deep Temporal Blocking: A case study for 2D Jacobian 5-point iterative stencil kernel (j2d5pt)
Lingqi Zhang, Mohamed Wahib, Peng Chen, Jintao Meng +3
2023-06-07
Computational Engineering, Finance, and Science · Computer Science
A quantitative performance analysis for Stokes solvers at the extreme scale
Björn Gmeiner, Markus Huber, Lorenz John, Ulrich Rüde +1
2015-11-09
Distributed, Parallel, and Cluster Computing · Computer Science
MMStencil: Optimizing High-order Stencils on Multicore CPU using Matrix Unit
Yinuo Wang, Tianqi Mao, Lin Gan, Wubing Wan +7
2025-07-16
Distributed, Parallel, and Cluster Computing · Computer Science
AN5D: Automated Stencil Framework for High-Degree Temporal Blocking on GPUs
Kazuaki Matsumura, Hamid Reza Zohouri, Mohamed Wahib, Toshio Endo +1
2020-02-04
Distributed, Parallel, and Cluster Computing · Computer Science
Beyond 16GB: Out-of-Core Stencil Computations
Istvan Z Reguly, Gihan R Mudalige, Michael B Giles
2017-10-27
Computational Engineering, Finance, and Science · Computer Science
Optimization of an electromagnetics code with multicore wavefront diamond blocking and multi-dimensional intra-tile parallelization
Tareq M. Malas, Julian Hornich, Georg Hager, Hatem Ltaief +2
2015-10-20
Distributed, Parallel, and Cluster Computing · Computer Science
Real-time topological image smoothing on shared memory parallel machines
Ramzi Mahmoudi, Mohamed Akil
2016-04-01
Distributed, Parallel, and Cluster Computing · Computer Science
Stencil Computations on AMD and Nvidia Graphics Processors: Performance and Tuning Strategies
Johannes Pekkilä, Oskar Lappi, Fredrik Robertsén, Maarit J. Korpi-Lagg
2025-05-28
Distributed, Parallel, and Cluster Computing · Computer Science
Enhanced computation method of topological smoothing on shared memory parallel machines
Ramzi Mahmoudi, Mohamed Akil
2016-03-31
Data Structures and Algorithms · Computer Science
Fast Stencil Computations using Fast Fourier Transforms
Zafar Ahmad, Rezaul Chowdhury, Rathish Das, Pramod Ganapathi +2
2021-05-17
Distributed, Parallel, and Cluster Computing · Computer Science
StencilFlow: Mapping Large Stencil Programs to Distributed Spatial Computing Systems
Johannes de Fine Licht, Andreas Kuster, Tiziano De Matteis, Tal Ben-Nun +2
2021-01-12
Distributed, Parallel, and Cluster Computing · Computer Science
Combined Spatial and Temporal Blocking for High-Performance Stencil Computation on FPGAs Using OpenCL
Hamid Reza Zohouri, Artur Podobas, Satoshi Matsuoka
2019-10-16