Related papers: Nancy: An efficient parallel Network Calculus libr…

Extending the Network Calculus Algorithmic Toolbox for Ultimately Pseudo-Periodic Functions: Pseudo-Inverse and Composition

Network Calculus (NC) is an algebraic theory that represents traffic and service guarantees as curves in a Cartesian plane, in order to compute performance guarantees for flows traversing a network. NC uses transformation operations, e.g.,…

Networking and Internet Architecture · Computer Science 2026-01-15 Raffaele Zippo, Paul Nikolaus, Giovanni Stea

Extending Network Calculus To Deal With Partially Negative And Decreasing Service Curves

Network Calculus (NC) is a versatile analytical methodology to efficiently compute performance bounds in networked systems. The arrival and service curve abstractions make it possible to model diverse and heterogeneous distributed systems. The…

Networking and Internet Architecture · Computer Science 2024-03-28 Anja Hamscher, Vlad-Cristian Constantin, Jens B. Schmitt

Automatic Parallelization of Software Network Functions

Software network functions (NFs) trade off flexibility and ease of deployment for an increased challenge of performance. The traditional way to increase NF performance is by distributing traffic to multiple CPU cores, but this poses a…

Networking and Internet Architecture · Computer Science 2023-10-16 Francisco Pereira, Fernando M. V. Ramos, Luis Pedrosa

ABCpy: A High-Performance Computing Perspective to Approximate Bayesian Computation

ABCpy is a highly modular scientific library for Approximate Bayesian Computation (ABC) written in Python. The main contribution of this paper is to document a software engineering effort that enables domain scientists to easily apply ABC…

Computation · Statistics 2021-12-20 Ritabrata Dutta, Marcel Schoengens, Lorenzo Pacchiardi, Avinash Ummadisingu, Nicole Widmer, Pierre Künzli, Jukka-Pekka Onnela, Antonietta Mira

Parallel Computation of functions of matrices and their action on vectors

We present a novel class of methods to compute functions of matrices or their action on vectors that are suitable for parallel programming. Solving appropriate simple linear systems of equations in parallel (or computing the inverse of…

Numerical Analysis · Mathematics 2022-10-10 Sergio Blanes

ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks

Neural networks (NNs) with intensive multiplications (e.g., convolutions and transformers) are capable yet power hungry, impeding their more extensive deployment into resource-constrained devices. As such, multiplication-free networks,…

Machine Learning · Computer Science 2025-03-04 Haoran You, Baopu Li, Huihong Shi, Yonggan Fu, Yingyan Celine Lin

NeuralMatrix: Compute the Entire Neural Networks with Linear Matrix Operations for Efficient Inference

The inherent diversity of computation types within the deep neural network (DNN) models often requires a variety of specialized units in hardware processors, which limits computational efficiency, increasing both inference latency and power…

Machine Learning · Computer Science 2024-08-21 Ruiqi Sun, Siwei Ye, Jie Zhao, Xin He, Jianzhe Lin, Yiran Li, An Zou

dMath: A Scalable Linear Algebra and Math Library for Heterogeneous GP-GPU Architectures

A new scalable parallel math library, dMath, is presented in this paper that demonstrates leading scaling when using intranode, or internode, hybrid-parallelism for deep-learning. dMath provides easy-to-use distributed base primitives and a…

Neural and Evolutionary Computing · Computer Science 2016-04-07 Steven Eliuk, Cameron Upright, Anthony Skjellum

Network Calculus Bounds for Time-Sensitive Networks: A Revisit

Network calculus (NC), particularly its min-plus branch, has been extensively utilized to construct service models and compute delay bounds for time-sensitive networks (TSNs). This paper provides a revisit to the fundamental results. In…

Networking and Internet Architecture · Computer Science 2024-03-21 Yuming Jiang

On the energy efficiency of sparse matrix computations on multi-GPU clusters

We investigate the energy efficiency of a library designed for parallel computations with sparse matrices. The library leverages high-performance, energy-efficient Graphics Processing Unit (GPU) accelerators to enable large-scale scientific…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-04-16 Massimo Bernaschi, Alessandro Celestini, Pasqua D'Ambra, Giorgio Richelli

Fast Parallel Exact Inference on Bayesian Networks: Poster

Bayesian networks (BNs) are attractive, because they are graphical and interpretable machine learning models. However, exact inference on BNs is time-consuming, especially for complex problems. To improve the efficiency, we propose a fast…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-03-14 Jiantong Jiang, Zeyi Wen, Atif Mansoor, Ajmal Mian

Chunks and Tasks: a programming model for parallelization of dynamic algorithms

We propose Chunks and Tasks, a parallel programming model built on abstractions for both data and work. The application programmer specifies how data and work can be split into smaller pieces, chunks and tasks, respectively. The Chunks and…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-07-29 Emanuel H. Rubensson, Elias Rudberg

Cellular neural networks for NP-hard optimization problems

Nowadays, Cellular Neural Networks (CNN) are practically implemented in parallel, analog computers, showing a fast developing trend. Physicists must be aware that such computers are appropriate for solving in an elegant manner practically…

Disordered Systems and Neural Networks · Physics 2016-02-17 Mária Ercsey-Ravasz, Tamás Roska, Zoltán Néda

Computationally Efficient Worst-Case Analysis of Flow-Controlled Networks with Network Calculus

Networks with hop-by-hop flow control occur in several contexts, from data centers to systems architectures (e.g., wormhole-routing networks on chip). A worst-case end-to-end delay in such networks can be computed using Network Calculus…

Networking and Internet Architecture · Computer Science 2023-07-10 Raffaele Zippo, Giovanni Stea

Enabling Massive Deep Neural Networks with the GraphBLAS

Deep Neural Networks (DNNs) have emerged as a core tool for machine learning. The computations performed during DNN training and inference are dominated by operations on the weight matrices describing the DNN. As DNNs incorporate more…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-03-06 Jeremy Kepner, Manoj Kumar, José Moreira, Pratap Pattnaik, Mauricio Serrano, Henry Tufo

Computing with Networks of Nonlinear Mechanical Oscillators

As it is getting increasingly difficult to achieve gains in the density and power efficiency of microelectronic computing devices because of lithographic techniques reaching fundamental physical limits, new approaches are required to…

Emerging Technologies · Computer Science 2017-07-05 Jean C. Coulombe, Mark C. A. York, Julien Sylvestre

Parallel Software to Offset the Cost of Higher Precision

Hardware double precision is often insufficient to solve large scientific problems accurately. Computing in higher precision defined by software causes significant computational overhead. The application of parallel algorithms compensates…

Mathematical Software · Computer Science 2020-12-15 Jan Verschelde

AXNet: ApproXimate computing using an end-to-end trainable neural network

Neural network based approximate computing is a universal architecture promising to gain tremendous energy-efficiency for many error resilient applications. To guarantee the approximation quality, existing works deploy two neural networks…

Machine Learning · Computer Science 2018-12-19 Zhenghao Peng, Xuyang Chen, Chengwen Xu, Naifeng Jing, Xiaoyao Liang, Cewu Lu, Li Jiang

BCNN: Binary Complex Neural Network

Binarized neural networks, or BNNs, show great promise in edge-side applications with resource limited hardware, but raise the concerns of reduced accuracy. Motivated by the complex neural networks, in this paper we introduce complex…

Neural and Evolutionary Computing · Computer Science 2021-04-21 Yanfei Li, Tong Geng, Ang Li, Huimin Yu

Evaluating a Novel Neuroevolution and Neural Architecture Search System

The choice of neural network features can have a large impact on both the accuracy and speed of the network. Despite the current industry shift towards large transformer models, specialized binary classifiers remain critical for numerous…

Neural and Evolutionary Computing · Computer Science 2025-03-17 Benjamin David Winter, William John Teahan