English
Related papers

Related papers: The LevelArray: A Fast, Practical Long-Lived Renam…

200 papers

We consider the task of assigning unique integers to a group of processes in an asynchronous distributed system of a total of $n$ processes prone to crashes that communicate through shared read-write registers. In the Renaming problem, an…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-10-01 Bogdan S. Chlebus , Dariusz R. Kowalski

We study the space complexity of implementing long-lived and one-shot adaptive renaming from multi-reader multi-writer registers, in an asynchronous distributed system with $n$ processes. As a result of an $f$-adaptive renaming algorithm…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-05-17 Maryam Helmi , Lisa Higham , Philipp Woelfel

Parallelization techniques have become ubiquitous for accelerating inference and training of deep neural networks. Despite this, several operations are still performed in a sequential manner. For instance, the forward and backward passes…

Machine Learning · Computer Science 2023-10-30 Federico Danieli , Miguel Sarabia , Xavier Suau , Pau Rodríguez , Luca Zappella

In plenty of data analysis tasks, a basic and time-consuming process is to produce a large number of solutions and feed them into downstream processing. Various enumeration algorithms have been developed for this purpose. An enumeration…

Data Structures and Algorithms · Computer Science 2023-02-28 Pengyu Chen , Dongjing Miao , Weitian Tong , Zizheng Guo , Jianzhong Li , Zhipeng Cai

The aggressive application of scalar replacement to array references substantially reduces the number of memory operations at the expense of a possibly very large number of registers. In this paper we describe a register allocation…

Programming Languages · Computer Science 2011-11-09 Nastaran Baradaran , Pedro C. Diniz

Iterative refinement is particularly popular for numerical solution of linear systems of equations. We extend it to Low Rank Approximation of a matrix (LRA) and observe close link of the resulting algorithm to oversampling techniques,…

Numerical Analysis · Mathematics 2024-11-28 Victor Y. Pan , Qi Luan , Soo Go

A matrix algorithm runs at {\em sublinear cost} if it uses much fewer memory cells and arithmetic operations than the input matrix has entries. Such algorithms are indispensable for Big Data Mining and Analysis. Quite typically in that area…

Numerical Analysis · Mathematics 2021-04-02 Qi Luan , Victor Y. Pan , John Svadlenka

There are many challenging problems in the person re-identification (ReID) task, such as the occlusion and scale variation. Existing works usually tried to solve them by employing a one-branch network. This one-branch network needs to be…

Computer Vision and Pattern Recognition · Computer Science 2022-03-01 Xianghao Zang , Ge Li , Wei Gao , Xiujun Shu

Structured low-rank (SLR) algorithms, which exploit annihilation relations between the Fourier samples of a signal resulting from different properties, is a powerful image reconstruction framework in several applications. This scheme relies…

Machine Learning · Computer Science 2020-08-11 Aniket Pramanik , Hemant Aggarwal , Mathews Jacob

Serializability is a well-understood concurrency control mechanism that eases reasoning about highly-concurrent database programs. Unfortunately, enforcing serializability has a high-performance cost, especially on geographically…

Programming Languages · Computer Science 2021-03-10 Kia Rahmani , Kartik Nagar , Benjamin Delaware , Suresh Jagannathan

In this paper we develop optimal algorithms in the binary-forking model for a variety of fundamental problems, including sorting, semisorting, list ranking, tree contraction, range minima, and ordered set union, intersection and difference.…

Data Structures and Algorithms · Computer Science 2020-06-26 Guy E. Blelloch , Jeremy T. Fineman , Yan Gu , Yihan Sun

As neural network algorithms show high performance in many applications, their efficient inference on mobile and embedded systems are of great interests. When a single stream recurrent neural network (RNN) is executed for a personal user in…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-04-02 Wonyong Sung , Jinhwan Park

Pipelines combining SQL-style business intelligence (BI) queries and linear algebra (LA) are becoming increasingly common in industry. As a result, there is a growing need to unify these workloads in a single framework. Unfortunately,…

Databases · Computer Science 2017-08-29 Christopher R. Aberger , Andrew Lamb , Kunle Olukotun , Christopher Ré

Several classic problems in graph processing and computational geometry are solved via incremental algorithms, which split computation into a series of small tasks acting on shared state, which gets updated progressively. While the…

Data Structures and Algorithms · Computer Science 2020-03-24 Dan Alistarh , Nikita Koval , Giorgi Nadiradze

How can one quickly answer the most and top popular objects at any time, given a large log stream in a system of billions of users? It is equivalent to find the mode and top-frequent elements in a dynamic array corresponding to the log…

Data Structures and Algorithms · Computer Science 2018-12-14 Dingcheng Yang , Wenjian Yu , Junhui Deng , Shenghua Liu

The primary challenge for handwriting recognition systems lies in managing long-range contextual dependencies, an issue that traditional models often struggle with. To mitigate it, attention mechanisms have recently been employed to enhance…

Computer Vision and Pattern Recognition · Computer Science 2024-09-10 Sara Ferro , Alessandro Torcinovich , Arianna Traviglia , Marcello Pelillo

In computer science, sorting algorithms are crucial for data processing and machine learning. Large datasets and high efficiency requirements provide challenges for comparison-based algorithms like Quicksort and Merge sort, which achieve…

Data Structures and Algorithms · Computer Science 2024-10-01 Amin Amini

Long-range sequence modeling is a crucial aspect of natural language processing and time series analysis. However, traditional models like Recurrent Neural Networks (RNNs) and Transformers suffer from computational and memory…

Artificial Intelligence · Computer Science 2025-01-15 Mohamed A. Taha

A practical large language model (LLM) service may involve a long system prompt, which specifies the instructions, examples, and knowledge documents of the task and is reused across requests. However, the long system prompt causes…

Computation and Language · Computer Science 2024-05-31 Lei Zhu , Xinjiang Wang , Wayne Zhang , Rynson W. H. Lau

Multicore architectures dominate today's processor market. Even though the number of cores and threads are pretty high and continues to grow, inherently serial algorithms do not benefit from the abundance of cores and threads. In this…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-05-21 Mohammad Bakhshalipour , Hamid Sarbazi-Azad
‹ Prev 1 2 3 10 Next ›