Related papers: Counter Pools: Counter Representation for Efficien…

Detailed Network Measurements Using Sparse Graph Counters: The Theory

Measuring network flow sizes is important for tasks like accounting/billing, network forensics and security. Per-flow accounting is considered hard because it requires that many counters be updated at a very high speed; however, the large…

Networking and Internet Architecture · Computer Science 2007-10-07 Yi Lu , Andrea Montanari , Balaji Prabhakar

Algorithms for Efficient, Compact Online Data Stream Curation

Data stream algorithms tackle operations on high-volume sequences of read-once data items. Data stream scenarios include inherently real-time systems like sensor networks and financial markets. They also arise in purely-computational…

Data Structures and Algorithms · Computer Science 2024-03-04 Matthew Andres Moreno , Santiago Rodriguez Papa , Emily Dolson

Sublime: Sublinear Error & Space for Unbounded Skewed Streams

Modern stream processing systems often need to track the frequency of distinct keys in a data stream in real-time. Since maintaining exact counts can require a prohibitive amount of memory, many applications rely on compact, probabilistic…

Data Structures and Algorithms · Computer Science 2026-04-29 Navid Eslami , Ioana O. Bercea , Rasmus Pagh , Niv Dayan

SALSA: Self-Adjusting Lean Streaming Analytics

Counters are the fundamental building block of many data sketching schemes, which hash items to a small number of counters and account for collisions to provide good approximations for frequencies and other measures. Most existing methods…

Data Structures and Algorithms · Computer Science 2021-02-26 Ran Ben Basat , Gil Einziger , Michael Mitzenmacher , Shay Vargaftik

Scaling Ordered Stream Processing on Shared-Memory Multicores

Many modern applications require real-time processing of large volumes of high-speed data. Such data processing needs can be modeled as a streaming computation. A streaming computation is specified as a dataflow graph that exposes multiple…

Databases · Computer Science 2018-04-02 Guna Prasaad , G. Ramalingam , Kaushik Rajan

Bitstream Organization for Parallel Entropy Coding on Neural Network-based Video Codecs

Video compression systems must support increasing bandwidth and data throughput at low cost and power, and can be limited by entropy coding bottlenecks. Efficiency can be greatly improved by parallelizing coding, which can be done at much…

Image and Video Processing · Electrical Eng. & Systems 2023-12-05 Amir Said , Hoang Le , Farzad Farhadzadeh

The Graphics Card as a Streaming Computer

Massive data sets have radically changed our understanding of how to design efficient algorithms; the streaming paradigm, whether it in terms of number of passes of an external memory algorithm, or the single pass and limited memory of a…

Graphics · Computer Science 2007-05-23 Suresh Venkatasubramanian

On the Value of Multiple Read/Write Streams for Data Compression

We study whether, when restricted to using polylogarithmic memory and polylogarithmic passes, we can achieve qualitatively better data compression with multiple read/write streams than we can with only one. We first show how we can achieve…

Data Structures and Algorithms · Computer Science 2012-04-06 Travis Gagie

StreamTensor: Make Tensors Stream in Dataflow Accelerators for LLMs

Efficient execution of deep learning workloads on dataflow architectures is crucial for overcoming memory bottlenecks and maximizing performance. While streaming intermediate results between computation kernels can significantly improve…

Hardware Architecture · Computer Science 2025-09-24 Hanchen Ye , Deming Chen

Streaming Approximation Scheme for Minimizing Total Completion Time on Parallel Machines Subject to Varying Processing Capacity

We study the problem of minimizing total completion time on parallel machines subject to varying processing capacity. In this paper, we develop an approximation scheme for the problem under the data stream model where the input data is…

Data Structures and Algorithms · Computer Science 2022-04-06 Bin Fu , Yumei Huo , Hairong Zhao

New Algorithms and Lower Bounds for Sequential-Access Data Compression

This thesis concerns sequential-access data compression, i.e., by algorithms that read the input one or more times from beginning to end. In one chapter we consider adaptive prefix coding, for which we must read the input character by…

Information Theory · Computer Science 2009-02-03 Travis Gagie

Performance of Balanced Fairness in Resource Pools: A Recursive Approach

Understanding the performance of a pool of servers is crucial for proper dimensioning. One of the main challenges is to take into account the complex interactions between servers that are pooled to process jobs. In particular, a job can…

Networking and Internet Architecture · Computer Science 2019-10-03 Thomas Bonald , Céline Comte , Fabien Mathieu

Communication-Efficient (Weighted) Reservoir Sampling from Fully Distributed Data Streams

We consider communication-efficient weighted and unweighted (uniform) random sampling from distributed data streams presented as a sequence of mini-batches of items. This is a natural model for distributed streaming computation, and our…

Data Structures and Algorithms · Computer Science 2020-02-26 Lorenz Hübschle-Schneider , Peter Sanders

Stream quantiles via maximal entropy histograms

We address the problem of estimating the running quantile of a data stream when the memory for storing observations is limited. We (i) highlight the limitations of approaches previously described in the literature which make them unsuitable…

Data Structures and Algorithms · Computer Science 2014-09-26 Ognjen Arandjelovic , Ducson Pham , Svetha Venkatesh

Asymptotically exact streaming algorithms

We introduce a new computational model for data streams: asymptotically exact streaming algorithms. These algorithms have an approximation ratio that tends to one as the length of the stream goes to infinity while the memory used by the…

Data Structures and Algorithms · Computer Science 2014-08-11 Marc Heinrich , Alexander Munteanu , Christian Sohler

Matrix Norms in Data Streams: Faster, Multi-Pass and Row-Order

A central problem in data streams is to characterize which functions of an underlying frequency vector can be approximated efficiently. Recently there has been considerable effort in extending this problem to that of estimating functions of…

Data Structures and Algorithms · Computer Science 2018-10-25 Vladimir Braverman , Stephen R. Chestnut , Robert Krauthgamer , Yi Li , David P. Woodruff , Lin F. Yang

An Empirical Study on Budget-Aware Online Kernel Algorithms for Streams of Graphs

Kernel methods are considered an effective technique for on-line learning. Many approaches have been developed for compactly representing the dual solution of a kernel method when the problem imposes memory constraints. However, in…

Machine Learning · Computer Science 2016-07-21 Giovanni Da San Martino , Nicolò Navarin , Alessandro Sperduti

Downstream: efficient cross-platform algorithms for fixed-capacity stream downsampling

Due to ongoing accrual over long durations, a defining characteristic of real-world data streams is the requirement for rolling, often real-time, mechanisms to coarsen or summarize stream history. One common data structure for this purpose…

Data Structures and Algorithms · Computer Science 2025-06-17 Connor Yang , Joey Wagner , Emily Dolson , Luis Zaman , Matthew Andres Moreno

Scheduling of Intermittent Query Processing

Stream processing is usually done either on a tuple-by-tuple basis or in micro-batches. There are many applications where tuples over a predefined duration/window must be processed within certain deadlines. Processing such queries using…

Databases · Computer Science 2024-09-23 Saranya Chandrasekaran , S. Sudarshan

Structured Downsampling for Fast, Memory-efficient Curation of Online Data Streams

Operations over data streams typically hinge on efficient mechanisms to aggregate or summarize history on a rolling basis. For high-volume data steams, it is critical to manage state in a manner that is fast and memory efficient --…

Data Structures and Algorithms · Computer Science 2024-09-24 Matthew Andres Moreno , Luis Zaman , Emily Dolson