English
Related papers

Related papers: Universal Streaming

200 papers

Due to recent advances in data collection techniques, massive amounts of data are being collected at an extremely fast pace. Also, these data are potentially unbounded. Boundless streams of data collected from sensors, equipments, and other…

Databases · Computer Science 2012-03-12 T Soni Madhulatha

We introduce a new computational model for data streams: asymptotically exact streaming algorithms. These algorithms have an approximation ratio that tends to one as the length of the stream goes to infinity while the memory used by the…

Data Structures and Algorithms · Computer Science 2014-08-11 Marc Heinrich , Alexander Munteanu , Christian Sohler

This paper investigates parallel random sampling from a potentially-unending data stream whose elements are revealed in a series of element sequences (minibatches). While sampling from a stream was extensively studied sequentially, not much…

Data Structures and Algorithms · Computer Science 2019-06-11 Kanat Tangwongsan , Srikanta Tirthapura

Much of the research in differential privacy has focused on offline applications with the assumption that all data is available at once. When these algorithms are applied in practice to streams where data is collected over time, this either…

Databases · Computer Science 2024-02-01 Girish Kumar , Thomas Strohmer , Roman Vershynin

With the dawn of the Big Data era, data sets are growing rapidly. Data is streaming from everywhere - from cameras, mobile phones, cars, and other electronic devices. Clustering streaming data is a very challenging problem. Unlike the…

Machine Learning · Computer Science 2019-02-08 Shlomo Bugdary , Shay Maymon

Data streaming, in which a large dataset is received as a "stream" of updates, is an important model in the study of space-bounded computation. Starting with the work of Le Gall [SPAA `06], it has been known that quantum streaming…

Quantum Physics · Physics 2021-11-16 John Kallaugher

Modern applications require processing streams of data for estimating statistical quantities such as quantiles with small amount of memory. In many such applications, in fact, one needs to compute such statistical quantities for each of a…

Databases · Computer Science 2014-07-07 Qiang Ma , S. Muthukrishnan , Mark Sandler

Big data problems frequently require processing datasets in a streaming fashion, either because all data are available at once but collectively are larger than available memory or because the data intrinsically arrive one data point at a…

Computation · Statistics 2018-08-08 Andrea Giovannucci , Victor Minden , Cengiz Pehlevan , Dmitri B. Chklovskii

In this paper we propose a new algorithm for streaming principal component analysis. With limited memory, small devices cannot store all the samples in the high-dimensional regime. Streaming principal component analysis aims to find the…

Machine Learning · Statistics 2018-02-16 Puyudi Yang , Cho-Jui Hsieh , Jane-Ling Wang

This work is devoted to a certain class of probabilistic snapshots for elements of the observed data stream. We show you how one can control their probabilistic properties and we show some potential applications. Our solution can be used to…

Information Retrieval · Computer Science 2022-06-24 Dominik Bojko , Jacek Cichoń

The literature on data sanitization aims to design algorithms that take an input dataset and produce a privacy-preserving version of it, that captures some of its statistical properties. In this note we study this question from a streaming…

Data Structures and Algorithms · Computer Science 2021-11-30 Haim Kaplan , Uri Stemmer

Estimating the number of subgraphs in data streams is a fundamental problem that has received great attention in the past decade. In this paper, we give improved streaming algorithms for approximately counting the number of occurrences of…

Data Structures and Algorithms · Computer Science 2022-03-29 Hendrik Fichtenberger , Pan Peng

Massive data sets have radically changed our understanding of how to design efficient algorithms; the streaming paradigm, whether it in terms of number of passes of an external memory algorithm, or the single pass and limited memory of a…

Graphics · Computer Science 2007-05-23 Suresh Venkatasubramanian

Streaming data can arise from a variety of contexts. Important use cases are continuous sensor measurements such as temperature, light or radiation values. In the process, streaming data may also contain data errors that should be cleaned…

Databases · Computer Science 2025-07-29 Valerie Restat , Niklas Rodenhausen , Carina Antonin , Uta Störl

We study high-dimensional robust statistics tasks in the streaming model. A recent line of work obtained computationally efficient algorithms for a range of high-dimensional robust estimation tasks. Unfortunately, all previous algorithms…

Data Structures and Algorithms · Computer Science 2023-05-04 Ilias Diakonikolas , Daniel M. Kane , Ankit Pensia , Thanasis Pittas

Data stream algorithms tackle operations on high-volume sequences of read-once data items. Data stream scenarios include inherently real-time systems like sensor networks and financial markets. They also arise in purely-computational…

Data Structures and Algorithms · Computer Science 2024-03-04 Matthew Andres Moreno , Santiago Rodriguez Papa , Emily Dolson

In this paper we study how to perform distinct sampling in the streaming model where data contain near-duplicates. The goal of distinct sampling is to return a distinct element uniformly at random from the universe of elements, given that…

Data Structures and Algorithms · Computer Science 2018-10-31 Jiecao Chen , Qin Zhang

The past decade has witnessed many interesting algorithms for maintaining statistics over a data stream. This paper initiates a theoretical study of algorithms for monitoring distributed data streams over a time-based sliding window (which…

Data Structures and Algorithms · Computer Science 2010-02-03 Ho-Leung Chan , Tak-Wah Lam , Lap-Kei Lee , Hing-Fung Ting

For many modern applications in science and engineering, data are collected in a streaming fashion carrying time-varying information, and practitioners need to process them with a limited amount of memory and computational resources in a…

Machine Learning · Statistics 2018-06-13 Laura Balzano , Yuejie Chi , Yue M. Lu

Machine learning from data streams is an active and growing research area. Research on learning from streaming data typically makes strict assumptions linked to computational resource constraints, including requirements for stream mining…

Machine Learning · Computer Science 2023-11-01 Indre Zliobaite , Jesse Read
‹ Prev 1 2 3 10 Next ›