The Full Landscape of Robust Mean Testing: Sharp Separations between Oblivious and Adaptive Contamination
July 18, 2023 Β· Declared Dead Β· π IEEE Annual Symposium on Foundations of Computer Science
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
ClΓ©ment L. Canonne, Samuel B. Hopkins, Jerry Li, Allen Liu, Shyam Narayanan
arXiv ID
2307.10273
Category
cs.DS: Data Structures & Algorithms
Cross-listed
math.ST
Citations
8
Venue
IEEE Annual Symposium on Foundations of Computer Science
Last Checked
4 months ago
Abstract
We consider the question of Gaussian mean testing, a fundamental task in high-dimensional distribution testing and signal processing, subject to adversarial corruptions of the samples. We focus on the relative power of different adversaries, and show that, in contrast to the common wisdom in robust statistics, there exists a strict separation between adaptive adversaries (strong contamination) and oblivious ones (weak contamination) for this task. Specifically, we resolve both the information-theoretic and computational landscapes for robust mean testing. In the exponential-time setting, we establish the tight sample complexity of testing $\mathcal{N}(0,I)$ against $\mathcal{N}(Ξ±v, I)$, where $\|v\|_2 = 1$, with an $\varepsilon$-fraction of adversarial corruptions, to be \[ \tildeΞ\!\left(\max\left(\frac{\sqrt{d}}{Ξ±^2}, \frac{d\varepsilon^3}{Ξ±^4},\min\left(\frac{d^{2/3}\varepsilon^{2/3}}{Ξ±^{8/3}}, \frac{d \varepsilon}{Ξ±^2}\right)\right) \right) \,, \] while the complexity against adaptive adversaries is \[ \tildeΞ\!\left(\max\left(\frac{\sqrt{d}}{Ξ±^2}, \frac{d\varepsilon^2}{Ξ±^4} \right)\right) \,, \] which is strictly worse for a large range of vanishing $\varepsilon,Ξ±$. To the best of our knowledge, ours is the first separation in sample complexity between the strong and weak contamination models. In the polynomial-time setting, we close a gap in the literature by providing a polynomial-time algorithm against adaptive adversaries achieving the above sample complexity $\tildeΞ(\max({\sqrt{d}}/{Ξ±^2}, {d\varepsilon^2}/{Ξ±^4} ))$, and a low-degree lower bound (which complements an existing reduction from planted clique) suggesting that all efficient algorithms require this many samples, even in the oblivious-adversary setting.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β Data Structures & Algorithms
π
π
The Cartographer
R.I.P.
π»
Ghosted
Route Planning in Transportation Networks
R.I.P.
π»
Ghosted
Near-linear time approximation algorithms for optimal transport via Sinkhorn iteration
R.I.P.
π»
Ghosted
Hierarchical Clustering: Objective Functions and Algorithms
R.I.P.
π»
Ghosted
Graph Isomorphism in Quasipolynomial Time
π
π
The Cartographer
Simulation optimization: A review of algorithms and applications
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted