Related papers: Multicore Dynamic Kernel Modules Attachment Techni…

A Deep Learning Approach To Multiple Kernel Fusion

Kernel fusion is a popular and effective approach for combining multiple features that characterize different aspects of data. Traditional approaches for Multiple Kernel Learning (MKL) attempt to learn the parameters for combining the…

Machine Learning · Statistics 2016-12-30 Huan Song , Jayaraman J. Thiagarajan , Prasanna Sattigeri , Karthikeyan Natesan Ramamurthy , Andreas Spanias

Manifold Adaptive Multiple Kernel K-Means for Clustering

Multiple kernel methods based on k-means aims to integrate a group of kernels to improve the performance of kernel k-means clustering. However, we observe that most existing multiple kernel k-means methods exploit the nonlinear relationship…

Machine Learning · Computer Science 2020-10-01 Liang Du , Haiying Zhang , Xin Ren , Xiaolin Lv

Execution of Compound Multi-Kernel OpenCL Computations in Multi-CPU/Multi-GPU Environments

Current computational systems are heterogeneous by nature, featuring a combination of CPUs and GPUs. As the latter are becoming an established platform for high-performance computing, the focus is shifting towards the seamless programming…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-10-23 Fábio Soldado , Fernando Alexandre , Hervé Paulino

Exploiting Application-to-Architecture Dependencies for Designing Scalable OS

With the advent of hundreds of cores on a chip to accelerate applications, the operating system (OS) needs to exploit the existing parallelism provided by the underlying hardware resources to determine the right amount of processes to be…

Operating Systems · Computer Science 2025-01-07 Yao Xiao , Nikos Kanakaris , Anzhe Cheng , Chenzhong Yin , Nesreen K. Ahmed , Shahin Nazarian , Andrei Irimia , Paul Bogdan

Securing Monolithic Kernels using Compartmentalization

Monolithic operating systems, where all kernel functionality resides in a single, shared address space, are the foundation of most mainstream computer systems. However, a single flaw, even in a non-essential part of the kernel (e.g., device…

Cryptography and Security · Computer Science 2024-04-16 Soo Yee Lim , Sidhartha Agrawal , Xueyuan Han , David Eyers , Dan O'Keeffe , Thomas Pasquier

Soft Tiles: Capturing Physical Implementation Flexibility for Tightly-Coupled Parallel Processing Clusters

Modern high-performance computing architectures (Multicore, GPU, Manycore) are based on tightly-coupled clusters of processing elements, physically implemented as rectangular tiles. Their size and aspect ratio strongly impact the achievable…

Hardware Architecture · Computer Science 2022-09-05 Gianna Paulin , Matheus Cavalcante , Paul Scheffler , Luca Bertaccini , Yichao Zhang , Frank Gürkaynak , Luca Benini

Ring-Mesh: A Scalable and High-Performance Approach for Manycore Accelerators

There are increasing number of works addressing the design challenges of fast, scalable solutions for the growing number of new type of applications. Recently, many of the solutions aimed at improving processing element capabilities to…

Hardware Architecture · Computer Science 2019-12-16 Somnath Mazumdar , Alberto Scionti

Multicore Quantum Computing

Any architecture for practical quantum computing must be scalable. An attractive approach is to create multiple cores, computing regions of fixed size that are well-spaced but interlinked with communication channels. This exploded…

Quantum Physics · Physics 2022-11-08 Hamza Jnane , Brennan Undseth , Zhenyu Cai , Simon C Benjamin , Bálint Koczor

KernelDNA: Dynamic Kernel Sharing via Decoupled Naive Adapters

Dynamic convolution enhances model capacity by adaptively combining multiple kernels, yet faces critical trade-offs: prior works either (1) incur significant parameter overhead by scaling kernel numbers linearly, (2) compromise inference…

Computer Vision and Pattern Recognition · Computer Science 2025-11-18 Haiduo Huang , Yadong Zhang , Yinghui Xu , Pengju Ren

The Dynamical Kernel Scheduler - Part 1

Emerging processor architectures such as GPUs and Intel MICs provide a huge performance potential for high performance computing. However developing software using these hardware accelerators introduces additional challenges for the…

Computational Physics · Physics 2016-09-21 Andreas Adelmann , Uldis Locans , Andreas Suter

Dynamic autotuning of adaptive fast multipole methods on hybrid multicore CPU & GPU systems

We discuss an implementation of adaptive fast multipole methods targeting hybrid multicore CPU- and GPU-systems. From previous experiences with the computational profile of our version of the fast multipole algorithm, suitable parts are…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-09-03 Marcus Holm , Stefan Engblom , Anders Goude , Sverker Holmgren

Self-weighted Multiple Kernel Learning for Graph-based Clustering and Semi-supervised Classification

Multiple kernel learning (MKL) method is generally believed to perform better than single kernel method. However, some empirical studies show that this is not always true: the combination of multiple kernels may even yield an even worse…

Machine Learning · Statistics 2018-06-21 Zhao Kang , Xiao Lu , Jinfeng Yi , Zenglin Xu

Multicore Applications in Real Time Systems

Microprocessor roadmaps clearly show a trend towards multiple core CPUs. Modern operating systems already make use of these CPU architectures by distributing tasks between processing cores thereby increasing system performance. This review…

Software Engineering · Computer Science 2016-09-08 M. Vaidehi , T. R. Gopalakrishnan Nair

Transkernel: Bridging Monolithic Kernels to Peripheral Cores

Smart devices see a large number of ephemeral tasks driven by background activities. In order to execute such a task, the OS kernel wakes up the platform beforehand and puts it back to sleep afterwards. In doing so, the kernel operates…

Operating Systems · Computer Science 2019-06-07 Liwei Guo , Shuang Zhai , Yi Qiao , Felix Xiaozhu Lin

Memory Aware Load Balance Strategy on a Parallel Branch-and-Bound Application

The latest trends in high-performance computing systems show an increasing demand on the use of a large scale multicore systems in a efficient way, so that high compute-intensive applications can be executed reasonably well. However, the…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-02-25 Juliana M. N. Silva , Cristina Boeres , Lúcia M. A. Drummond , Artur A. Pessoa

Efficient Kernel Object Management for Tiered Memory Systems with KLOC

Software-controlled heterogeneous memory systems have the potential to improve performance, efficiency, and cost tradeoffs in emerging systems. Delivering on this promise requires an efficient operating system (OS) mechanisms and policies…

Operating Systems · Computer Science 2020-04-13 Sudarsun Kannan , Yujie Ren , Abhishek Bhatacharjee

An Implementation and Experimental Comparison of Dynamic Ordered Sets

It is becoming increasingly difficult to improve the performance of a a single process (thread) on a computer due to physical limitations. Modern systems use multi-core processors in which multiple processes (threads) may run concurrently.…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-11-05 Jordan Malek

Dynamic Convolution: Attention over Convolution Kernels

Light-weight convolutional neural networks (CNNs) suffer performance degradation as their low computational budgets constrain both the depth (number of convolution layers) and the width (number of channels) of CNNs, resulting in limited…

Computer Vision and Pattern Recognition · Computer Science 2020-04-02 Yinpeng Chen , Xiyang Dai , Mengchen Liu , Dongdong Chen , Lu Yuan , Zicheng Liu

Fast Dynamic Memory Integration in Co-Simulation Frameworks for Multiprocessor System on-Chip

In this paper is proposed a technique to integrate and simulate a dynamic memory in a multiprocessor framework based on C/C++/SystemC. Using host machine's memory management capabilities, dynamic data processing is supported without…

Hardware Architecture · Computer Science 2011-11-09 O. Villa , P. Schaumont , I. Verbauwhede , M. Monchiero , G. Palermo

Supervised Multiple Kernel Learning approaches for multi-omics data integration

Advances in high-throughput technologies have originated an ever-increasing availability of omics datasets. The integration of multiple heterogeneous data sources is currently an issue for biology and bioinformatics. Multiple kernel…

Machine Learning · Statistics 2024-12-04 Mitja Briscik , Gabriele Tazza , Marie-Agnes Dillies , László Vidács , Sébastien Dejean