English
Related papers

Related papers: Maximal Atomic irRedundant Sets: a Usage-based Dat…

200 papers

Optimization pipelines targeting polyhedral programs try to maximize the compute throughput. Traditional approaches favor reuse and temporal locality; while the communicated volume can be low, failure to optimize spatial locality may cause…

Programming Languages · Computer Science 2023-12-07 Corentin Ferry , Steven Derrien , Sanjay Rajopadhye

Parallel dataflow systems are a central part of most analytic pipelines for big data. The iterative nature of many analysis and machine learning algorithms, however, is still a challenge for current systems. While certain types of bulk…

Databases · Computer Science 2012-08-02 Stephan Ewen , Kostas Tzoumas , Moritz Kaufmann , Volker Markl

Execution graphs of parallel loop programs exhibit a nested, repeating structure. We show how such graphs that are the result of nested repetition can be represented by succinct parametric structures. This parametric graph template…

Data Structures and Algorithms · Computer Science 2023-07-18 Tal Ben-Nun , Lukas Gianinazzi , Torsten Hoefler , Yishai Oltchik

We introduce an efficient combination of polyhedral analysis and predicate partitioning. Template polyhedral analysis abstracts numerical variables inside a program by one polyhedron per control location, with a priori fixed directions for…

Logic in Computer Science · Computer Science 2014-10-06 David Monniaux , Peter Schrammel

Discrete Fracture Network models are largely used for very large scale geological flow simulations. For this reason numerical methods require an investigation of tools for efficient parallel solutions on High Performance Computing systems.…

Numerical Analysis · Mathematics 2021-06-21 Stefano Berrone , Alice Raeli

Partitioning large matrices is an important problem in distributed linear algebra computing (used in ML among others). Briefly, our goal is to perform a sequence of matrix algebra operations in a distributed manner (whenever possible) on…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-06-30 Avah Banerjee , Guoli Ding , Maxwell Reeser

Algorithms for extracting hydrologic features and properties from digital elevation models (DEMs) are challenged by large datasets, which often cannot fit within a computer's RAM. Depression filling is an important preconditioning step to…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-08-17 Richard Barnes

A rising research challenge is running costly machine learning (ML) networks locally on resource-constrained edge devices. ML networks with large convolutional layers can easily exceed available memory, increasing latency due to excessive…

Machine Learning · Computer Science 2023-07-20 Jackson Farley , Andreas Gerstlauer

We present Cyclotron, a framework and compiler for using recurrence equations to express streaming dataflow algorithms, which then get portably compiled to distributed topologies of interlinked processors. Our framework provides an input…

Programming Languages · Computer Science 2025-11-14 Shiv Sundram , Akhilesh Balasingam , Nathan Zhang , Kunle Olukotun , Fredrik Kjolstad

Spatial dataflow accelerators are a promising direction for next-generation computer systems because they can reduce the memory bottlenecks of traditional von Neumann machines such as CPUs and GPUs. They organize computation around…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-13 Wei Li , Zhenyu Bai , Heru Wang , Pranav Dangi , Zhiqiang Zhang , Cheng Tan , Huiying Lan , Weng-Fai Wong , Tulika Mitra

The paper introduces a novel algorithm for computing the output admissible set of linear discrete-time systems subject to input saturation. The proposed method takes advantage of the piecewise-affine dynamics to propagate the output…

Optimization and Control · Mathematics 2023-11-29 Yaashia Gautam , Marco M. Nicotra

We present a shared-memory parallelization of flow-based refinement, which is considered the most powerful iterative improvement technique for hypergraph partitioning at the moment. Flow-based refinement works on bipartitions, so current…

Data Structures and Algorithms · Computer Science 2022-01-06 Lars Gottesbüren , Tobias Heuer , Peter Sanders

Several high-throughput distributed data-processing applications require multi-hop processing of streams of data. These applications include continual processing on data streams originating from a network of sensors, composing a multimedia…

Distributed, Parallel, and Cluster Computing · Computer Science 2009-03-26 Shah Asaduzzaman , Muthucumaru Maheswaran

Partitioning a graph into balanced blocks such that few edges run between blocks is a key problem for large-scale distributed processing. A current trend for partitioning huge graphs are streaming algorithms, which use low computational…

Data Structures and Algorithms · Computer Science 2022-02-02 Marcelo Fonseca Faraj , Christian Schulz

Edge-AI applications demand high-throughput, low-latency inference on FPGAs under tight resource and power constraints. This survey provides a comprehensive review of two key architectural decisions for FPGA-based neural network…

Hardware Architecture · Computer Science 2025-06-03 Richie Li

In this paper, we study the problem of finding an integral multiflow which maximizes the sum of flow values between every two terminals in an undirected tree with a nonnegative integer edge capacity and a set of terminals. In general, it is…

Data Structures and Algorithms · Computer Science 2016-11-29 Mingyu Xiao , Hiroshi Nagamochi

In unsplittable network flow problems, certain nodes must satisfy a combinatorial requirement that the incoming arc flows cannot be split or merged when routed through outgoing arcs. This so-called "no-split no-merge" requirement arises in…

Optimization and Control · Mathematics 2024-03-27 Hosseinali Salemi , Danial Davarnia

An algorithm for irreducible decomposition of representations of finite groups over fields of characteristic zero is described. The algorithm uses the fact that the decomposition induces a partition of the invariant inner product into a…

Representation Theory · Mathematics 2019-06-05 Vladimir V Kornyak

Data flow analysis and optimization is considered for homogeneous rectangular mesh networks. We propose a flow matrix equation which allows a closed-form characterization of the nature of the minimal time solution, speedup and a simple…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-12-30 Junwei Zhang , Yang Liu , Li Shi , Thomas G. Robertazzi

Computing maximum independent sets in graphs is an important problem in computer science. In this paper, we develop an evolutionary algorithm to tackle the problem. The core innovations of the algorithm are very natural combine operations…

Data Structures and Algorithms · Computer Science 2015-02-06 Sebastian Lamm , Peter Sanders , Christian Schulz
‹ Prev 1 2 3 10 Next ›