English
Related papers

Related papers: Supporting Dynamic Control-Flow Execution for Runt…

200 papers

Many recent machine learning models rely on fine-grained dynamic control flow for training and inference. In particular, models based on recurrent neural networks and on reinforcement learning depend on recurrence relations, data-dependent…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-05-09 Yuan Yu , Martín Abadi , Paul Barham , Eugene Brevdo , Mike Burrows , Andy Davis , Jeff Dean , Sanjay Ghemawat , Tim Harley , Peter Hawkins , Michael Isard , Manjunath Kudlur , Rajat Monga , Derek Murray , Xiaoqiang Zheng

Rigid body dynamics is a key technology in the robotics field. In trajectory optimization and model predictive control algorithms, there are usually a large number of rigid body dynamics computing tasks. Using CPUs to process these tasks…

Robotics · Computer Science 2023-09-29 Yuxin Yang , Xiaoming Chen , Yinhe Han

Lockstep processing is a recognized technique for helping to secure functional-safety relevant processing against, for instance, single upset errors that might cause faulty execution of code. Lockstepping processors does however bind…

Hardware Architecture · Computer Science 2021-07-20 Hans Dermot Doran , Timo Lang

The rapid progress and advancement in electronic chips technology provide a variety of new implementation options for system engineers. The choice varies between the flexible programs running on a general-purpose processor (GPP) and the…

Hardware Architecture · Computer Science 2019-04-11 Issam Damaj

Many dedicated embedded processors do not have memory or computational resources to coexist with traditional (host-based) security solutions. As a result, there is interest in using out-of-band analog side-channel measurements and their…

Cryptography and Security · Computer Science 2019-11-01 Mark Chilenski , George Cybenko , Isaac Dekine , Piyush Kumar , Gil Raz

Based on the two observations that diverse applications perform better on different multicore architectures, and that different phases of an application may have vastly different resource requirements, Pal et al. proposed a novel…

Programming Languages · Computer Science 2016-06-21 Sanjiva Prasad

Intra-device parallelism addresses resource under-utilization in ML inference and training by overlapping the execution of operators with different resource usage. However, its wide adoption is hindered by a fundamental conflict with the…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-22 Yi Pan , Yile Gu , Jinbin Luo , Yibo Wu , Ziren Wang , Hongtao Zhang , Ziyi Xu , Shengkai Lin , Baris Kasikci , Stephanie Wang

Stream workflow application such as online anomaly detection or online traffic monitoring, integrates multiple streaming big data applications into data analysis pipeline. This application can be highly dynamic in nature, where the data…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-12-19 Mutaz Barika , Saurabh Garg , Rajiv Ranjan

A new approach to designing processor accelerators is presented. A new computing model and a special kind of accelerator with dynamic (end-user programmable) architecture is suggested. The new model considers a processor, in which a newly…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-07-07 János Végh

We discuss computational superstructures that, using repeated, appropriately initialized short calls, enable temporal process simulators to perform alternative tasks such as fixed point computation, stability analysis and projective…

Optimization and Control · Mathematics 2025-10-20 C. I. Siettos , C. C. Pantelides , I. G. Kevrekidis

Autotuning of performance-relevant source-code parameters allows to automatically tune applications without hard coding optimizations and thus helps with keeping the performance portable. In this paper, we introduce a benchmark set of ten…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-03-02 Filip Petrovič , David Střelák , Jana Hozzová , Jaroslav Oľha , Richard Trembecký , Siegfried Benkner , Jiří Filipovič

Hybrid workflows combining traditional HPC and novel ML methodologies are transforming scientific computing. This paper presents the architecture and implementation of a scalable runtime system that extends RADICAL-Pilot with service-based…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-03-18 Andre Merzky , Mikhail Titov , Matteo Turilli , Ozgur Kilic , Tianle Wang , Shantenu Jha

FastFlow is a programming environment specifically targeting cache-coherent shared-memory multi-cores. FastFlow is implemented as a stack of C++ template libraries built on top of lock-free (fence-free) synchronization mechanisms. In this…

Distributed, Parallel, and Cluster Computing · Computer Science 2010-02-26 Marco Aldinucci , Marco Danelutto , Peter Kilpatrick , Massimiliano Meneghin , Massimo Torquati

This work details a hardware-assisted approach for information flow tracking implemented on reconfigurable chips. Current solutions are either time-consuming or hardly portable (modifications of both sofware/hardware layers). This work…

Cryptography and Security · Computer Science 2018-11-15 Muhammad Abdul Wahab , Pascal Cotret , Mounir Nasr Allah , Guillaume Hiet , Vianney Lapotre , Guy Gogniat

Shared memory multiprocessors come back to popularity thanks to rapid spreading of commodity multi-core architectures. As ever, shared memory programs are fairly easy to write and quite hard to optimise; providing multi-core programmers…

Distributed, Parallel, and Cluster Computing · Computer Science 2009-09-10 Marco Aldinucci , Massimo Torquati , Massimiliano Meneghin

Microprocessor roadmaps clearly show a trend towards multiple core CPUs. Modern operating systems already make use of these CPU architectures by distributing tasks between processing cores thereby increasing system performance. This review…

Software Engineering · Computer Science 2016-09-08 M. Vaidehi , T. R. Gopalakrishnan Nair

The software configurable processor finds best use in the embedded systems. These processors have onchip logic like FPGA (Field Programmable Gate Array) and thus can be configured to implement custom hardware functionality. The digital…

Hardware Architecture · Computer Science 2025-05-13 Ganesh Prabhu , Steevan Rodrigues , Niranjan Chiplunkar , Niranjan U. C

Information Fusion Systems are now widely used in different fusion contexts, like scientific processing, sensor networks, video and image processing. One of the current trends in this area is to cope with distributed systems. In this…

Distributed, Parallel, and Cluster Computing · Computer Science 2009-06-26 Eric Benoit , Marc-Philippe Huget , Patrice Moreaux , Olivier Passalacqua

Elasticity is highly desirable for stream processing systems to guarantee low latency against workload dynamics, such as surges in data arrival rate and fluctuations in data distribution. Existing systems achieve elasticity following a…

Databases · Computer Science 2017-11-06 Li Wang , Tom Z. J. Fu , Richard T. B. Ma , Marianne Winslett , Zhenjie Zhang

Modern embedded and cyber-physical systems require every day more performance, power efficiency and flexibility, to execute several profiles and functionalities targeting the ever growing adaptivity needs and preserving execution…

Hardware Architecture · Computer Science 2021-03-08 Carlo Sau , Tiziana Fanni , Claudio Rubattu , Luigi Raffo , Francesca Palumbo
‹ Prev 1 2 3 10 Next ›