Nearly Optimal Distinct Elements and Heavy Hitters on Sliding Windows
May 01, 2018 Β· Declared Dead Β· π International Workshop and International Workshop on Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Vladimir Braverman, Elena Grigorescu, Harry Lang, David P. Woodruff, Samson Zhou
arXiv ID
1805.00212
Category
cs.DS: Data Structures & Algorithms
Citations
32
Venue
International Workshop and International Workshop on Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques
Last Checked
3 months ago
Abstract
We study the distinct elements and $\ell_p$-heavy hitters problems in the sliding window model, where only the most recent $n$ elements in the data stream form the underlying set. We first introduce the composable histogram, a simple twist on the exponential (Datar et al., SODA 2002) and smooth histograms (Braverman and Ostrovsky, FOCS 2007) that may be of independent interest. We then show that the composable histogram along with a careful combination of existing techniques to track either the identity or frequency of a few specific items suffices to obtain algorithms for both distinct elements and $\ell_p$-heavy hitters that are nearly optimal in both $n$ and $Ξ΅$. Applying our new composable histogram framework, we provide an algorithm that outputs a $(1+Ξ΅)$-approximation to the number of distinct elements in the sliding window model and uses $\mathcal{O}\left(\frac{1}{Ξ΅^2}\log n\log\frac{1}Ξ΅\log\log n+\frac{1}Ξ΅\log^2 n\right)$ bits of space. For $\ell_p$-heavy hitters, we provide an algorithm using space $\mathcal{O}\left(\frac{1}{Ξ΅^p}\log^3 n\left(\log\log n+\log\frac{1}Ξ΅\right)\right)$ for $0<p\le 2$, improving upon the best-known algorithm for $\ell_2$-heavy hitters (Braverman et al., COCOON 2014), which has space complexity $\mathcal{O}\left(\frac{1}{Ξ΅^4}\log^3 n\right)$. We also show lower bounds of $Ξ©\left(\frac{1}Ξ΅\log^2 n+\frac{1}{Ξ΅^2}\log n\right)$ for distinct elements and $Ξ©\left(\frac{1}{Ξ΅^p}\log^2 n\right)$ for $\ell_p$-heavy hitters.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β Data Structures & Algorithms
π
π
The Cartographer
R.I.P.
π»
Ghosted
Route Planning in Transportation Networks
R.I.P.
π»
Ghosted
Near-linear time approximation algorithms for optimal transport via Sinkhorn iteration
R.I.P.
π»
Ghosted
Hierarchical Clustering: Objective Functions and Algorithms
R.I.P.
π»
Ghosted
Graph Isomorphism in Quasipolynomial Time
π
π
The Cartographer
Simulation optimization: A review of algorithms and applications
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted