English
Related papers

Related papers: A Simple and Efficient Tensor Calculus for Machine…

200 papers

An efficient coordinate-free notation is elucidated for differentiating matrix expressions and other functions between higher-dimensional vector spaces. This method of differentiation is known, but not explained well, in the literature.…

History and Overview · Mathematics 2013-10-03 Jonathan H. Manton

Computing multivariate derivatives of matrix-like expressions in the compact, coordinate free fashion is very important for both theory and applied computations (e.g. optimization and machine learning). The critical components of such…

Symbolic Computation · Computer Science 2019-12-02 Maciej Skorski

We consider the problem of automatically decomposing operations over tensors or arrays so that they can be executed in parallel on multiple devices. We address two, closely-linked questions. First, what programming abstraction should…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-10-04 Daniel Bourgeois , Zhimin Ding , Dimitrije Jankov , Jiehui Li , Mahmoud Sleem , Yuxin Tang , Jiawen Yao , Xinyu Yao , Chris Jermaine

The general linear model is a universally accepted method to conduct and test multiple linear regression models. Using this model one has the ability to simultaneously regress covariates among different groups of data. Moreover, there are…

Methodology · Statistics 2024-10-15 Gavin T. Kress

In scientific computing and machine learning applications, matrices and more general multidimensional arrays (tensors) can often be approximated with the help of low-rank decompositions. Since matrices and tensors of fixed rank form smooth…

Optimization and Control · Mathematics 2021-10-26 Alexander Novikov , Maxim Rakhuba , Ivan Oseledets

In this paper, we propose a method for importing tensor index notation, including Einstein summation notation, into functional programming. This method involves introducing two types of parameters, i.e, scalar and tensor parameters, and…

Programming Languages · Computer Science 2018-08-31 Satoshi Egi

A \emph{tensor-relational} computation is a relational computation where individual tuples carry vectors, matrices, or higher-dimensional arrays. An advantage of tensor-relational computation is that the overall computation can be executed…

Mathematical Software · Computer Science 2026-03-11 Yuxin Tang , Zhiyuan Xin , Zhimin Ding , Xinyu Yao , Daniel Bourgeois , Tirthak Patel , Chris Jermaine

We present new results on the classical algorithm of variable elimination, which underlies many algorithms including for probabilistic inference. The results relate to exploiting functional dependencies, allowing one to perform inference…

Artificial Intelligence · Computer Science 2020-04-21 Adnan Darwiche

Tensor methods have become a promising tool to solve high-dimensional problems in the big data era. By exploiting possible low-rank tensor factorization, many high-dimensional model-based or data-driven problems can be solved to facilitate…

Optimization and Control · Mathematics 2019-08-22 Chunfeng Cui , Cole Hawkins , Zheng Zhang

Many critical EDA problems suffer from the curse of dimensionality, i.e. the very fast-scaling computational burden produced by large number of parameters and/or unknown variables. This phenomenon may be caused by multiple spatial or…

Numerical Analysis · Computer Science 2016-11-18 Zheng Zhang , Kim Batselier , Haotian Liu , Luca Daniel , Ngai Wong

We propose a notation for tensors with named axes, which relieves the author, reader, and future implementers of machine learning models from the burden of keeping track of the order of axes and the purpose of each. The notation makes it…

Machine Learning · Computer Science 2023-01-19 David Chiang , Alexander M. Rush , Boaz Barak

High-dimensional data arise naturally in many areas of science and engineering, including machine learning, signal processing, computational physics, and statistics. Such data are often represented as tensors, multi-dimensional…

Machine Learning · Computer Science 2026-05-19 Beheshteh T. Rakhshan , Guillaume Rabusseau

We introduce a simple algorithm that efficiently computes tensor products of Pauli matrices. This is done by tailoring the calculations to this specific case, which allows to avoid unnecessary calculations. The strength of this strategy is…

Quantum Physics · Physics 2023-12-20 Sebastián V. Romero , Juan Santos-Suárez

How does one compile derivatives of tensor programs, such that the resulting code is purely functional (hence easier to optimize and parallelize) and provably efficient relative to the original program? We show that naively differentiating…

Programming Languages · Computer Science 2020-10-01 Gilbert Bernstein , Michael Mara , Tzu-Mao Li , Dougal Maclaurin , Jonathan Ragan-Kelley

Efficient matrix determinant calculations have been studied since the 19th century. Computers expand the range of determinants that are practically calculable to include matrices with symbolic entries. However, the fastest determinant…

Symbolic Computation · Computer Science 2013-04-18 Tanya Khovanova , Ziv Scully

Classical regression methods treat covariates as a vector and estimate a corresponding vector of regression coefficients. Modern applications in medical imaging generate covariates of more complex form such as multidimensional arrays…

Methodology · Statistics 2013-10-22 Hua Zhou , Lexin Li , Hongtu Zhu

ITensor is a system for programming tensor network calculations with an interface modeled on tensor diagram notation, which allows users to focus on the connectivity of a tensor network without manually bookkeeping tensor indices. The…

Mathematical Software · Computer Science 2023-03-07 Matthew Fishman , Steven R. White , E. Miles Stoudenmire

The Hessian matrix has a number of important applications in a variety of different fields, such as optimzation, image processing and statistics. In this paper we focus on the practical aspects of efficiently computing Hessian matrices in…

Machine Learning · Computer Science 2021-04-08 Geir K. Nilsen , Antonella Z. Munthe-Kaas , Hans J. Skaug , Morten Brun

When training large models, such as neural networks, the full derivatives of order 2 and beyond are usually inaccessible, due to their computational cost. Therefore, among the second-order optimization methods, it is common to bypass the…

Machine Learning · Computer Science 2025-10-01 Pierre Wolinski

Computing with discrete representations of high-dimensional probability distributions is fundamental to uncertainty quantification, Bayesian inference, and stochastic modeling. However, storing and manipulating such distributions suffers…

Numerical Analysis · Mathematics 2025-10-03 Gerhard Kirsten , Bilgesu Bilgin , Janith Petangoda , Phillip Stanley-Marbell
‹ Prev 1 2 3 10 Next ›