English
Related papers

Related papers: Validating XML Documents in the Streaming Model wi…

200 papers

We study the problem of solving semidefinite programs (SDP) in the streaming model. Specifically, $m$ constraint matrices and a target matrix $C$, all of size $n\times n$ together with a vector $b\in \mathbb{R}^m$ are streamed to us…

Data Structures and Algorithms · Computer Science 2023-09-12 Zhao Song , Mingquan Ye , Lichen Zhang

Some of the most relevant document schemas used online, such as XML and JSON, have a nested format. In the last decade, the task of extracting data from nested documents over streams has become especially relevant. We focus on the streaming…

Databases · Computer Science 2022-01-11 Martín Muñoz , Cristian Riveros

Depth first search (DFS) tree is a fundamental data structure for solving various graph problems. The classical DFS algorithm requires $O(m+n)$ time for a graph having $n$ vertices and $m$ edges. In the streaming model, an algorithm is…

Data Structures and Algorithms · Computer Science 2019-01-14 Shahbaz Khan , Shashank K. Mehta

Depth first search is a fundamental graph problem having a wide range of applications. For a graph $G=(V,E)$ having $n$ vertices and $m$ edges, the DFS tree can be computed in $O(m+n)$ using $O(m)$ space where $m=O(n^2)$. In the streaming…

Data Structures and Algorithms · Computer Science 2024-06-10 Kancharla Nikhilesh Bhagavan , Macharla Sri Vardhan , Madamanchi Ashok Chowdary , Shahbaz Khan

We study the complexity of the following problems in the streaming model. Membership testing for \DLIN We show that every language in \DLIN\ can be recognised by a randomized one-pass $O(\log n)$ space algorithm with inverse polynomial…

Data Structures and Algorithms · Computer Science 2011-04-06 Ajesh Babu , Nutan Limaye , Jaikumar Radhakrishnan , Girish Varma

Big data problems frequently require processing datasets in a streaming fashion, either because all data are available at once but collectively are larger than available memory or because the data intrinsically arrive one data point at a…

Computation · Statistics 2018-08-08 Andrea Giovannucci , Victor Minden , Cengiz Pehlevan , Dmitri B. Chklovskii

Problems involving the efficient arrangement of simple objects, as captured by bin packing and makespan scheduling, are fundamental tasks in combinatorial optimization. These are well understood in the traditional online and offline cases,…

Data Structures and Algorithms · Computer Science 2026-01-27 Graham Cormode , Pavel Veselý

We consider the classic Set Cover problem in the data stream model. For $n$ elements and $m$ sets ($m\geq n$) we give a $O(1/\delta)$-pass algorithm with a strongly sub-linear $\tilde{O}(mn^{\delta})$ space and logarithmic approximation…

Data Structures and Algorithms · Computer Science 2016-05-03 Sariel Har-Peled , Piotr Indyk , Sepideh Mahabadi , Ali Vakilian

We initiate a broad study of classical problems in the streaming model with insertions and deletions in the setting where we allow the approximation factor $\alpha$ to be much larger than $1$. Such algorithms can use significantly less…

Data Structures and Algorithms · Computer Science 2022-07-19 Yi Li , Honghao Lin , David P. Woodruff , Yuheng Zhang

Attention computation takes both the time complexity of $O(n^2)$ and the space complexity of $O(n^2)$ simultaneously, which makes deploying Large Language Models (LLMs) in streaming applications that involve long contexts requiring…

Machine Learning · Computer Science 2024-02-06 Raghav Addanki , Chenyang Li , Zhao Song , Chiwun Yang

We consider the problem of estimating the value of max cut in a graph in the streaming model of computation. At one extreme, there is a trivial $2$-approximation for this problem that uses only $O(\log n)$ space, namely, count the number of…

Data Structures and Algorithms · Computer Science 2014-09-09 Michael Kapralov , Sanjeev Khanna , Madhu Sudan

We consider the problem of monotone, submodular maximization over a ground set of size $n$ subject to cardinality constraint $k$. For this problem, we introduce the first deterministic algorithms with linear time complexity; these…

Data Structures and Algorithms · Computer Science 2021-03-09 Alan Kuhnle

The distinct elements problem is one of the fundamental problems in streaming algorithms --- given a stream of integers in the range $\{1,\ldots,n\}$, we wish to provide a $(1+\varepsilon)$ approximation to the number of distinct elements…

Data Structures and Algorithms · Computer Science 2019-01-07 Jarosław Błasiok

Finding dense subgraphs is a fundamental algorithmic tool in data mining, community detection, and clustering. In this problem, one aims to find an induced subgraph whose edge-to-vertex ratio is maximized. We study the directed case of this…

Data Structures and Algorithms · Computer Science 2023-11-21 Slobodan Mitrović , Theodore Pan

We propose two one-pass streaming algorithms for the $\mathcal{NP}$-hard hypergraph matching problem. The first algorithm stores a small subset of potential matching edges in a stack using dual variables to select edges. It has an…

Data Structures and Algorithms · Computer Science 2025-07-09 Henrik Reinstädtler , S M Ferdous , Alex Pothen , Bora Uçar , Christian Schulz

Tracking and approximating data matrices in streaming fashion is a fundamental challenge. The problem requires more care and attention when data comes from multiple distributed sites, each receiving a stream of data. This paper considers…

Databases · Computer Science 2014-05-01 Mina Ghashami , Jeff M. Phillips , Feifei Li

We study the general problem of computing frequency-based functions, i.e., the sum of any given function of data stream frequencies. Special cases include fundamental data stream problems such as computing the number of distinct elements…

Data Structures and Algorithms · Computer Science 2020-10-08 Prantar Ghosh

We study the classic NP-Hard problem of finding the maximum $k$-set coverage in the data stream model: given a set system of $m$ sets that are subsets of a universe $\{1,\ldots,n \}$, find the $k$ sets that cover the most number of distinct…

Data Structures and Algorithms · Computer Science 2018-05-11 Andrew McGregor , Hoa T. Vu

We present a new streaming algorithm to validate JSON documents against a set of constraints given as a JSON schema. Among the possible values a JSON document can hold, objects are unordered collections of key-value pairs while arrays are…

Formal Languages and Automata Theory · Computer Science 2023-05-15 Véronique Bruyère , Guillermo A. Perez , Gaëtan Staquet

The eXtensible Markup Language (XML) provides a powerful and flexible means of encoding and exchanging data. As it turns out, its main advantage as an encoding format (namely, its requirement that all open and close markup tags are present…

Databases · Computer Science 2015-05-13 Gregory Leighton , Denilson Barbosa
‹ Prev 1 2 3 10 Next ›