
Related papers: Source-to-Source Transformations for GPU Code Gene…

200 papers

Using GPUs as general-purpose processors has revolutionized parallel computing by offering, for a large and growing set of algorithms, massive data-parallelization on desktop machines. An obstacle to widespread adoption, however, is the…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-10-14 Alexey Kolesnichenko, Christopher M. Poskitt, Sebastian Nanz, Bertrand Meyer

Utilizing GPUs is critical for high performance on heterogeneous systems. However, leveraging the full potential of GPUs for accelerating legacy CPU applications can be a challenging task for developers. The porting process requires…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-06-27 Shilei Tian, Tom Scogland, Barbara Chapman, Johannes Doerfert

Modern computing is shifting from homogeneous CPU-centric systems to heterogeneous systems with closely integrated CPUs and GPUs. While the CPU software stack has benefited from decades of memory safety hardening, the GPU software stack…

Cryptography and Security · Computer Science 2026-03-09 Mingkai Li, Joseph Devietti, Suman Jana, Tanvir Ahmed Khan

Current AI code generation systems suffer from significant latency bottlenecks due to CPU-GPU data transfers during compilation, execution, and testing phases. We establish theoretical foundations for three complementary approaches to…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-12-15 Adilet Metinov, Gulida M. Kudakeeva, Gulnara D. Kabaeva

The number of cores on graphics processing units (GPUs) now reaches the thousands, whereas the clock speed of processors stagnates. Unfortunately, constraint programming solvers do not yet take advantage of GPU parallelism. One reason…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-07-26 Pierre Talbot, Frédéric Pinel, Pascal Bouvry

Computational tools for rigorously verifying the performance of large-scale machine learning (ML) models have progressed significantly in recent years. The most successful solvers employ highly specialized, GPU-accelerated branch and bound…

Machine Learning · Computer Science 2023-09-11 Samuel Chevalier, Ilgiz Murzakhanov, Spyros Chatzivasileiadis

Tremendous advances in parallel computing and graphics hardware opened up several novel real-time GPU applications in the fields of computer vision, computer graphics as well as augmented reality (AR) and virtual reality (VR). Although…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-08-19 Patrick Stotko

Generating high-performance CUDA kernels remains challenging due to the need to navigate a combinatorial space of low-level transformations under noisy and expensive hardware feedback. Although large language models can synthesize…

Machine Learning · Computer Science 2026-02-16 Arijit Bhattacharjee, Heng Ping, Son Vu Le, Paul Bogdan, Nesreen K. Ahmed, Ali Jannesari

Graphics Processing Units (GPUs) are high-performance co-processors originally intended to improve the use and quality of computer graphics applications. Once researchers and practitioners noticed the potential of using GPUs for general…

Numerical Analysis · Computer Science 2016-07-12 K. Parand, Saeed Zafarvahedian, Sayyed A. Hossayni

High-level scripting languages are in many ways polar opposites to GPUs. GPUs are highly parallel, subject to hardware subtleties, and designed for maximum throughput, and they offer a tremendous advance in the performance achievable for a…

Software Engineering · Computer Science 2013-04-23 Andreas Klöckner, Nicolas Pinto, Bryan Catanzaro, Yunsup Lee, Paul Ivanov, Ahmed Fasih

Future computing systems, from handhelds to supercomputers, will undoubtedly be more parallel and heterogeneous than today's systems to provide more performance and energy efficiency. Thus, GPUs are increasingly being used to accelerate…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-10-18 Saeed Taheri, Apan Qasem, Martin Burtscher

Tree-based Genetic Programming (TGP) is a widely used evolutionary algorithm for tasks such as symbolic regression, classification, and robotic control. Due to the intensive computational demands of running TGP, GPU acceleration is crucial…

Neural and Evolutionary Computing · Computer Science 2026-02-17 Zhihong Wu, Lishuang Wang, Kebin Sun, Zhuozhao Li, Ran Cheng

We present a security framework that strengthens distributed machine learning by standardizing integrity protections across CPU and GPU platforms and significantly reducing verification overheads. Our approach co-locates integrity…

Cryptography and Security · Computer Science 2025-10-29 Marcin Spoczynski, Marcela S. Melara

The future of computation is the Graphical Processing Unit, i.e. the GPU. The promise that the graphics cards have shown in the field of image processing and accelerated rendering of 3D scenes, and the computational capability that these…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-02-21 Jayshree Ghorpade, Jitendra Parande, Madhura Kulkarni, Amit Bawaskar

Graphics processing units (GPUs) have evolved from specialized hardware capable of rendering high-quality graphics in games into commodity hardware for effectively processing blocks of data in a parallel schema. This evolution is particularly…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-03-26 Luis Cabellos

The General Purpose Graphics Processing Unit (GPGPU) is widely used to achieve high performance or high throughput in parallel programming. This capability of GPGPUs is well known today and is mostly used for scientific computing which…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-10-10 Vajira Thambawita, Roshan G. Ragel, Dhammike Elkaduwe

GPUs have gained significant popularity over the past decade, extending beyond their original role in graphics rendering. This evolution has brought GPU security and reliability to the forefront of concerns. Prior research has shown that…

Cryptography and Security · Computer Science 2026-01-06 Saurabh Singh, Ruobing Han, Jaewon Lee, Seonjin Na, Yonghae Kim, Taesoo Kim, Hyesoon Kim

As GPU availability has increased and programming support has matured, a wider variety of applications are being ported to these platforms. Many parallel applications contain fine-grained synchronization idioms; as such, their correct…

GPUs are popular devices for accelerating scientific calculations. However, as GPU code is usually written in low-level languages, it breaks the abstractions of high-level languages popular with scientific programmers. To overcome this, we…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-04-13 Tim Besard, Pieter Verstraete, Bjorn De Sutter

Graphics Processing Units (GPUs) offer tremendous computational power by following a throughput-oriented computing paradigm in which many thousands of computational units operate in parallel. Programming this massively parallel hardware is…

Programming Languages · Computer Science 2023-05-08 Bastian Köpcke, Sergei Gorlatch, Michel Steuwer