Related papers: BLASFEO: basic linear algebra subroutines for embe…

200 papers

BLASFEO is a dense linear algebra library providing high-performance implementations of BLAS- and LAPACK-like routines for use in embedded optimization and other applications targeting relatively small matrices. BLASFEO defines an API which…

Mathematical Software · Computer Science 2020-06-09 Gianluca Frison, Tommaso Sartor, Andrea Zanelli, Moritz Diehl

This work introduces CLBlast, an open-source BLAS library providing optimized OpenCL routines to accelerate dense linear algebra for a wide variety of devices. It is targeted at machine learning and HPC applications and thus provides a fast…

Mathematical Software · Computer Science 2018-04-30 Cedric Nugteren

Countless applications cast their computational core in terms of dense linear algebra operations. These operations can usually be implemented by combining the routines offered by standard linear algebra libraries such as BLAS and LAPACK,…

Performance · Computer Science 2014-10-01 Elmar Peise, Paolo Bientinesi

Basic Linear Algebra Subprograms (BLAS) are a set of low-level linear algebra kernels widely adopted by applications in deep learning and scientific computing. The massive and economical computing power brought forth by the…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-10-20 Linnan Wang, Wei Wu, Jianxiong Xiao, Yi Yang

In the past two decades, some major efforts have been made to reduce exact (e.g. integer, rational, polynomial) linear algebra problems to matrix multiplication in order to provide algorithms with optimal asymptotic complexity. To provide…

Symbolic Computation · Computer Science 2009-01-14 Jean-Guillaume Dumas, Pascal Giorgi, Clément Pernet

Matrix-matrix multiplication is a fundamental operation of great importance to scientific computing and, increasingly, machine learning. It is a simple enough concept to be introduced in a typical high school algebra course yet in practice…

Mathematical Software · Computer Science 2016-09-02 Jianyu Huang, Robert A. van de Geijn

Basic Linear Algebra Subprograms (BLAS) is a core library in scientific computing and machine learning. This paper presents FT-BLAS, a new implementation of BLAS routines that not only tolerates soft errors on the fly, but also provides…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-04-09 Yujia Zhai, Elisabeth Giem, Quan Fan, Kai Zhao, Jinyang Liu, Zizhong Chen

KBLAS is a new open source high performance library that provides optimized kernels for a subset of Level 2 BLAS functionalities on CUDA-enabled GPUs. Since performance of dense matrix-vector multiplication is hindered by the overhead of…

Mathematical Software · Computer Science 2014-10-08 Ahmad Abdelfattah, David Keyes, Hatem Ltaief

Dense linear algebra libraries, such as BLAS and LAPACK, provide a relevant collection of numerical tools for many scientific and engineering applications. While there exist high performance implementations of the BLAS (and LAPACK)…

Mathematical Software · Computer Science 2015-11-09 Sandra Catalán, José R. Herrero, Francisco D. Igual, Rafael Rodríguez-Sánchez, Enrique S. Quintana-Ortí

Limbo is an open-source C++11 library for Bayesian optimization which is designed to be both highly flexible and very fast. It can be used to optimize functions for which the gradient is unknown, evaluations are expensive, and runtime cost…

Machine Learning · Computer Science 2016-11-23 Antoine Cully, Konstantinos Chatzilygeroudis, Federico Allocati, Jean-Baptiste Mouret

Scientific programmers often turn to vendor-tuned Basic Linear Algebra Subprograms (BLAS) to obtain portable high performance. However, many numerical algorithms require several BLAS calls in sequence, and those successive calls result in…

Mathematical Software · Computer Science 2012-05-09 Geoffrey Belter, Elizabeth Jessup, Thomas Nelson, Boyana Norris, Jeremy G. Siek

We leverage highly successful prior projects sponsored by multiple NSF grants and gifts from industry: the BLAS-like Library Instantiation Software (BLIS) and the libflame efforts to lay the foundation for a new flexible framework by…

Mathematical Software · Computer Science 2026-04-09 Devin A. Matthews, Tze Meng Low, Margaret E. Myers, Devangi N. Parikh, Robert A. van de Geijn

This paper presents the acados software package, a collection of solvers for fast embedded optimization, intended for fast embedded applications. Its interfaces to higher-level languages make it useful for quickly designing an…

Spatial (dataflow) computer architectures can mitigate the control and performance overhead of classical von Neumann architectures such as traditional CPUs. Driven by the popularity of Machine Learning (ML) workloads, spatial devices are…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-10-02 Tristan Laan, Tiziano De Matteis

It is well known that the behavior of dense linear algebra algorithms is greatly influenced by factors like target architecture, underlying libraries and even problem size; because of this, the accurate prediction of their performance is a…

Mathematical Software · Computer Science 2012-12-11 Elmar Peise, Paolo Bientinesi

Most, if not all, modern scientific simulation packages use matrix algebra operations. Among linear algebra operations, one of the most important kernels is the multiplication of matrices, both dense and sparse. Examples of…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-10-14 Ilia Sivkov, Alfio Lazzaro, Juerg Hutter

Matrix libraries often focus on achieving high performance for problems considered to be either "small" or "large", as these two scenarios tend to respond best to different optimization strategies. We propose a unified technique for…

Mathematical Software · Computer Science 2023-02-20 RuQing G. Xu, Field G. Van Zee, Robert A. van de Geijn

The standardization of an interface for dense linear algebra operations in the BLAS standard has enabled interoperability between different linear algebra libraries, thereby boosting the success of scientific computing, in particular in…

Large-scale foundation models have demonstrated exceptional performance in language and vision tasks. However, the numerous dense matrix-vector operations involved in these large networks pose significant computational challenges during…

Machine Learning · Computer Science 2024-10-31 Changwoo Lee, Soo Min Kwon, Qing Qu, Hun-Seok Kim

This paper introduces LMFAO (Layered Multiple Functional Aggregate Optimization), an in-memory optimization and execution engine for batches of aggregates over the input database. The primary motivation for this work stems from the…

Databases · Computer Science 2019-06-21 Maximilian Schleich, Dan Olteanu, Mahmoud Abo Khamis, Hung Q. Ngo, XuanLong Nguyen