Revisiting Frequency Moment Estimation in Random Order Streams

March 06, 2018 Β· Declared Dead Β· πŸ› International Colloquium on Automata, Languages and Programming

πŸ‘» CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Vladimir Braverman, Emanuele Viola, David Woodruff, Lin F. Yang arXiv ID 1803.02270 Category cs.DS: Data Structures & Algorithms Citations 18 Venue International Colloquium on Automata, Languages and Programming Last Checked 3 months ago
Abstract
We revisit one of the classic problems in the data stream literature, namely, that of estimating the frequency moments $F_p$ for $0 < p < 2$ of an underlying $n$-dimensional vector presented as a sequence of additive updates in a stream. It is well-known that using $p$-stable distributions one can approximate any of these moments up to a multiplicative $(1+Ξ΅)$-factor using $O(Ξ΅^{-2} \log n)$ bits of space, and this space bound is optimal up to a constant factor in the turnstile streaming model. We show that surprisingly, if one instead considers the popular random-order model of insertion-only streams, in which the updates to the underlying vector arrive in a random order, then one can beat this space bound and achieve $\tilde{O}(Ξ΅^{-2} + \log n)$ bits of space, where the $\tilde{O}$ hides poly$(\log(1/Ξ΅) + \log \log n)$ factors. If $Ξ΅^{-2} \approx \log n$, this represents a roughly quadratic improvement in the space achievable in turnstile streams. Our algorithm is in fact deterministic, and we show our space bound is optimal up to poly$(\log(1/Ξ΅) + \log \log n)$ factors for deterministic algorithms in the random order model. We also obtain a similar improvement in space for $p = 2$ whenever $F_2 \gtrsim \log n\cdot F_1$.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

πŸ“œ Similar Papers

In the same crypt β€” Data Structures & Algorithms

Died the same way β€” πŸ‘» Ghosted