Random Smoothing Might be Unable to Certify $\ell_\infty$ Robustness for High-Dimensional Images

February 10, 2020 · Entered Twilight · 🏛 Journal of machine learning research

"Last commit was 5.0 years ago (≥5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: LICENSE, README.md, checkpoints, code, results

Authors Avrim Blum, Travis Dick, Naren Manoj, Hongyang Zhang arXiv ID 2002.03517 Category cs.LG: Machine Learning Cross-listed cs.CR, stat.ML Citations 83 Venue Journal of machine learning research Repository https://github.com/hongyanz/TRADES-smoothing ⭐ 14 Last Checked 1 month ago

Abstract

We show a hardness result for random smoothing to achieve certified adversarial robustness against attacks in the $\ell_p$ ball of radius $ε$ when $p>2$. Although random smoothing has been well understood for the $\ell_2$ case using the Gaussian distribution, much remains unknown concerning the existence of a noise distribution that works for the case of $p>2$. This has been posed as an open problem by Cohen et al. (2019) and includes many significant paradigms such as the $\ell_\infty$ threat model. In this work, we show that any noise distribution $\mathcal{D}$ over $\mathbb{R}^d$ that provides $\ell_p$ robustness for all base classifiers with $p>2$ must satisfy $\mathbb{E}η_i^2=Ω(d^{1-2/p}ε^2(1-δ)/δ^2)$ for 99% of the features (pixels) of vector $η\sim\mathcal{D}$, where $ε$ is the robust radius and $δ$ is the score gap between the highest-scored class and the runner-up. Therefore, for high-dimensional images with pixel values bounded in $[0,255]$, the required noise will eventually dominate the useful information in the images, leading to trivial smoothed classifiers.