English
Related papers

Related papers: Distributed and heterogeneous tensor-vector contra…

200 papers

The Tucker decomposition generalizes the notion of Singular Value Decomposition (SVD) to tensors, the higher dimensional analogues of matrices. We study the problem of constructing the Tucker decomposition of sparse tensors on distributed…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-01-22 Venkatesan T. Chakaravarthy , Jee W. Choi , Douglas J. Joseph , Prakash Murali , Shivmaran S. Pandian , Yogish Sabharwal , Dheeraj Sreedhar

We propose an algorithm that aims at minimizing the inter-node communication volume for distributed and memory-efficient tensor contraction schemes on modern multi-core compute nodes. The key idea is to define processor grids that optimize…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-07-19 Andreas Irmler , Raghavendra Kanakagiri , Sebastian T. Ohlmann , Edgar Solomonik , Andreas Grüneis

The Tucker decomposition expresses a given tensor as the product of a small core tensor and a set of factor matrices. Apart from providing data compression, the construction is useful in performing analysis such as principal component…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-07-19 Venkatesan T Chakaravarthy , Jee W Choi , Douglas J Joseph , Xing Liu , Prakash Murali , Yogish Sabharwal , Dheeraj Sreedhar

Best rank-one approximation is one of the most fundamental tasks in tensor computation. In order to fully exploit modern multi-core parallel computers, it is necessary to develop decoupling algorithms for computing the best rank-one…

Numerical Analysis · Mathematics 2024-03-05 Chuanfu Xiao , Zeyu Li , Chao Yang

Higher-order singular value decomposition (HOSVD) is a celebrated tool for tensor data analysis. The sequential HOSVD was recently generalized to the quaternion domain, while a naive quaternion extension of the classical HOSVD% by De…

Numerical Analysis · Mathematics 2025-06-26 Hanxin Ya , Yuning Yang

In this paper we propose a parallel coordinate descent algorithm for solving smooth convex optimization problems with separable constraints that may arise e.g. in distributed model predictive control (MPC) for linear network systems. Our…

Optimization and Control · Mathematics 2014-11-19 Ion Necoara , Dragos Clipici

Big data analysis has become a crucial part of new emerging technologies such as the internet of things, cyber-physical analysis, deep learning, anomaly detection, etc. Among many other techniques, dimensionality reduction plays a key role…

This article provides next step towards solving speed bottleneck of any system that intensively uses convolutions operations (e.g. CNN). Method described in the article is applied on deformable part models (DPM) algorithm. Method described…

Computer Vision and Pattern Recognition · Computer Science 2017-07-12 D. V. Parkhomenko , I. L. Mazurenko

Improving the computational efficiency of quantum many-body calculations from a hardware perspective remains a critical challenge. Although field-programmable gate arrays (FPGAs) have recently been exploited to improve the computational…

Strongly Correlated Electrons · Physics 2026-02-06 Songtai Lv , Yang Liang , Rui Zhu , Qibin Zheng , Haiyuan Zou

Current high-performance computer systems used for scientific computing typically combine shared memory computational nodes in a distributed memory environment. Extracting high performance from these complex systems requires tailored…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-01-14 Afshin Zafari , Elisabeth Larsson , Martin Tillenius

Distributed training of deep neural networks has received significant research interest, and its major approaches include implementations on multiple GPUs and clusters. Parallelization can dramatically improve the efficiency of training…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-11-29 Jaehee Jang , Byungook Na , Sungroh Yoon

We introduce a data distribution scheme for $\mathcal{H}$-matrices and a distributed-memory algorithm for $\mathcal{H}$-matrix-vector multiplication. Our data distribution scheme avoids an expensive $\Omega(P^2)$ scheduling procedure used…

Numerical Analysis · Mathematics 2020-09-23 Yingzhou Li , Jack Poulson , Lexing Ying

With data sizes constantly expanding, and with classical machine learning algorithms that analyze such data requiring larger and larger amounts of computation time and storage space, the need to distribute computation and memory…

Machine Learning · Computer Science 2015-12-08 Aruna Govada , Shree Ranjani , Aditi Viswanathan , S. K. Sahay

Motivated by machine learning applications in networks of sensors, internet-of-things (IoT) devices, and autonomous agents, we propose techniques for distributed stochastic convex learning from high-rate data streams. The setup involves a…

Machine Learning · Statistics 2019-06-11 Matthew Nokleby , Waheed U. Bajwa

Deep convolutional neural networks (ConvNets) of 3-dimensional kernels allow joint modeling of spatiotemporal features. These networks have improved performance of video and volumetric image analysis, but have been limited in size due to…

Computer Vision and Pattern Recognition · Computer Science 2017-06-13 David Budden , Alexander Matveev , Shibani Santurkar , Shraman Ray Chaudhuri , Nir Shavit

In this paper, we present a distributed algorithm for solving convex, constraint-coupled, optimization problems over peer-to-peer networks. We consider a network of processors that aim to cooperatively minimize the sum of local cost…

Optimization and Control · Mathematics 2021-04-14 Andrea Camisa , Alessia Benevento , Giuseppe Notarstefano

The CP tensor decomposition is a low-rank approximation of a tensor. We present a distributed-memory parallel algorithm and implementation of an alternating optimization method for computing a CP decomposition of dense tensor data that can…

Numerical Analysis · Computer Science 2018-06-22 Grey Ballard , Koby Hayashi , Ramakrishnan Kannan

Hyperdimensional computing (HDC) is an emerging computing paradigm that represents, manipulates, and communicates data using very long random vectors (aka hypervectors). Among different hardware platforms capable of executing HDC…

Hardware Architecture · Computer Science 2022-05-24 Robert Guirado , Abbas Rahimi , Geethan Karunaratne , Eduard Alarcón , Abu Sebastian , Sergi Abadal

Hypergraphs are a popular paradigm to represent complex real-world networks exhibiting multi-way relationships of varying sizes. Mining centrality in hypergraphs via symmetric adjacency tensors has only recently become computationally…

Numerical Analysis · Mathematics 2023-11-16 Shruti Shivakumar , Ilya Amburg , Sinan G. Aksoy , Jiajia Li , Stephen J. Young , Srinivas Aluru

Higher-order tensor decompositions are analogous to the familiar Singular Value Decomposition (SVD), but they transcend the limitations of matrices (second-order tensors). SVD is a powerful tool that has achieved impressive results in…

Machine Learning · Computer Science 2007-11-14 Peter D. Turney
‹ Prev 1 2 3 10 Next ›