Related papers: Optimizing Distributed Tensor Contractions using N…

Distributed and heterogeneous tensor-vector contraction algorithms for high performance computing

The tensor-vector contraction (TVC) is the most memory-bound operation of its class and a core component of the higher-order power method (HOPM). This paper brings distributed-memory parallelization to a native TVC algorithm for dense…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-02-26 Pedro J. Martinez-Ferrer , Albert-Jan Yzelman , Vicenç Beltran

Communication Lower Bounds and Optimal Algorithms for Multiple Tensor-Times-Matrix Computation

Multiple Tensor-Times-Matrix (Multi-TTM) is a key computation in algorithms for computing and operating with the Tucker tensor decomposition, which is frequently used in multidimensional data analysis. We establish communication lower…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-02-03 Hussam Al Daas , Grey Ballard , Laura Grigori , Suraj Kumar , Kathryn Rouse

Communication/Computation Tradeoffs in Consensus-Based Distributed Optimization

We study the scalability of consensus-based distributed optimization algorithms by considering two questions: How many processors should we use for a given problem, and how often should they communicate when communication is not free?…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-09-06 Konstantinos I. Tsianos , Sean Lawlor , Michael G. Rabbat

Tensor-based framework for training flexible neural networks

Activation functions (AFs) are an important part of the design of neural networks (NNs), and their choice plays a predominant role in the performance of a NN. In this work, we are particularly interested in the estimation of flexible…

Machine Learning · Computer Science 2021-06-28 Yassine Zniyed , Konstantin Usevich , Sebastian Miron , David Brie

Optimizing Tensor Network Partitioning using Simulated Annealing

Tensor networks have proven to be a valuable tool, for instance, in the classical simulation of (strongly correlated) quantum systems. As the size of the systems increases, contracting larger tensor networks becomes computationally…

Quantum Physics · Physics 2025-07-29 Manuel Geiger , Qunsheng Huang , Christian B. Mendl

Communication-Efficient Algorithms For Distributed Optimization

This thesis is concerned with the design of distributed algorithms for solving optimization problems. We consider networks where each node has exclusive access to a cost function, and design algorithms that make all nodes cooperate to find…

Optimization and Control · Mathematics 2013-12-03 João F. C. Mota

Coded Computing for Half-Duplex Wireless Distributed Computing Systems via Interference Alignment

Distributed computing frameworks such as MapReduce and Spark are often used to process large-scale data computing jobs. In wireless scenarios, exchanging data among distributed nodes would seriously suffer from the communication bottleneck…

Information Theory · Computer Science 2023-10-25 Youlong Wu , Zhenhao Huang , Kai Yuan , Shuai Ma , Yue Bi

Fast, Accurate, and Scalable Method for Sparse Coupled Matrix-Tensor Factorization

How can we capture the hidden properties from a tensor and a matrix data simultaneously in a fast, accurate, and scalable way? Coupled matrix-tensor factorization (CMTF) is a major tool to extract latent factors from a tensor and matrices…

Numerical Analysis · Computer Science 2017-12-06 Dongjin Choi , Jun-Gi Jang , U Kang

Constrained In-network Computing with Low Congestion in Datacenter Networks

Distributed computing has become a common practice nowadays, where the recent focus has been given to the usage of smart networking devices with in-network computing capabilities. State-of-the-art switches with near-line rate computing and…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-01-13 Raz Segal , Chen Avin , Gabriel Scalosub

A Flocking-based Approach for Distributed Stochastic Optimization

In recent years, the paradigm of cloud computing has emerged as an architecture for computing that makes use of distributed (networked) computing resources. In this paper, we consider a distributed computing algorithmic scheme for…

Optimization and Control · Mathematics 2017-09-22 Shi Pu , Alfredo Garcia

Communication Efficient Generalized Tensor Factorization for Decentralized Healthcare Networks

Tensor factorization has been proved as an efficient unsupervised learning approach for health data analysis, especially for computational phenotyping, where the high-dimensional Electronic Health Records (EHRs) with patients' history of…

Machine Learning · Computer Science 2022-11-04 Jing Ma , Qiuchen Zhang , Jian Lou , Li Xiong , Sivasubramanium Bhavani , Joyce C. Ho

Tie-Line Characteristics based Partitioning for Distributed Optimization of Power Systems

The convergence performance of distributed optimization algorithms is of significant importance to solve optimal power flow (OPF) in a distributed fashion. In this paper, we aim to provide some insights on how to partition a power system to…

Signal Processing · Electrical Eng. & Systems 2019-02-13 Ali Mohammadi , Mahdi Mehrtash , Amin Kargarian , Masoud Barati

Distributed Priority-Based Load Shedding over Time-Varying Communication Networks

We study the problem of distributed optimal resource allocation on networks with actions defined on discrete spaces, with applications to adaptive under-frequency load-shedding in power systems. In this context, the primary objective is to…

Optimization and Control · Mathematics 2024-12-25 Adel Aghajan , Miguel Jimenez-Aparicio , Michael E. Ropp , Jorge I. Poveda

Performance Analysis and Optimal Node-Aware Communication for Enlarged Conjugate Gradient Methods

Krylov methods are a key way of solving large sparse linear systems of equations, but suffer from poor strong scalabilty on distributed memory machines. This is due to high synchronization costs from large numbers of collective…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-03-14 Shelby Lockhart , Amanda Bienz , William Gropp , Luke Olson

Flow-Through Tensors: A Unified Computational Graph Architecture for Multi-Layer Transportation Network Optimization

Modern transportation network modeling increasingly involves the integration of diverse methodologies including sensor-based forecasting, reinforcement learning, classical flow optimization, and demand modeling that have traditionally been…

Optimization and Control · Mathematics 2025-07-08 Xuesong , Zhou , Taehooie Kim , Mostafa Ameli , Henan , Zhu , Yu- dai Honma , Ram M. Pendyala

Distributed Optimization With Local Domains: Applications in MPC and Network Flows

In this paper we consider a network with $P$ nodes, where each node has exclusive access to a local cost function. Our contribution is a communication-efficient distributed algorithm that finds a vector $x^\star$ minimizing the sum of all…

Optimization and Control · Mathematics 2016-11-15 João F. C. Mota , João M. F. Xavier , Pedro M. Q. Aguiar , Markus Püschel

A Model for Communication in Clusters of Multi-core Machines

A common paradigm for scientific computing is distributed message-passing systems, and a common approach to these systems is to implement them across clusters of high-performance workstations. As multi-core architectures become increasingly…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-05-01 Christine Task , Arun Chauhan

Flexible Coded Distributed Convolution Computing for Enhanced Straggler Resilience and Numerical Stability in Distributed CNNs

Deploying Convolutional Neural Networks (CNNs) on resource-constrained devices necessitates efficient management of computational resources, often via distributed environments susceptible to latency from straggler nodes. This paper…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-09-09 Shuo Tan , Rui Liu , Xuesong Han , XianLei Long , Kai Wan , Linqi Song , Yong Li

Near Optimal Coded Data Shuffling for Distributed Learning

Data shuffling between distributed cluster of nodes is one of the critical steps in implementing large-scale learning algorithms. Randomly shuffling the data-set among a cluster of workers allows different nodes to obtain fresh data…

Information Theory · Computer Science 2018-01-08 Mohamed A. Attia , Ravi Tandon

Hardware-Efficient Mixed-Precision CP Tensor Decomposition

Tensor decomposition has been widely used in machine learning and high-volume data analysis. However, large-scale tensor factorization often consumes huge memory and computing cost. Meanwhile, modernized computing hardware such as tensor…

Optimization and Control · Mathematics 2022-09-12 Zi Yang , Junnan Shan , Zheng Zhang