False Discoveries Occur Early on the Lasso Path

November 05, 2015 · Declared Dead · 🏛 arXiv.org

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: Weijie Su, Malgorzata Bogdan, Emmanuel Candes
arXiv ID: 1511.01957
Category: math.ST
Cross-listed: cs.IT, stat.ML
Citations: 199
Venue: arXiv.org
Last Checked: 1 month ago

Abstract

In regression settings where explanatory variables have very low correlations and there are relatively few effects, each of large magnitude, we expect the Lasso to find the important variables with few errors, if any. This paper shows that in a regime of linear sparsity---meaning that the fraction of variables with a non-vanishing effect tends to a constant, however small---this cannot really be the case, even when the design variables are stochastically independent. We demonstrate that true features and null features are always interspersed on the Lasso path, and that this phenomenon occurs no matter how strong the effect sizes are. We derive a sharp asymptotic trade-off between false and true positive rates or, equivalently, between measures of type I and type II errors along the Lasso path. This trade-off states that if we ever want to achieve a type II error (false negative rate) under a critical value, then anywhere on the Lasso path the type I error (false positive rate) will need to exceed a given threshold so that we can never have both errors at a low level at the same time. Our analysis uses tools from approximate message passing (AMP) theory as well as novel elements to deal with a possibly adaptive selection of the Lasso regularizing parameter.
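
The trade-off described in the abstract can be explored numerically. The sketch below is not the authors' code (this page records that none was found); it is a minimal illustration under assumptions chosen here: an independent Gaussian design, a fixed fraction k/p of equally strong true effects, and a plain coordinate-descent Lasso solver written with NumPy only. The function names `lasso_cd` and `fdp_tpp_path` and all default parameter values are hypothetical. At each point on the path it records the true positive proportion (selected true features divided by k) and the false discovery proportion (selected null features divided by the number of selected features).

```python
import numpy as np


def lasso_cd(X, y, lam, beta, n_sweeps=100, tol=1e-6):
    """Coordinate descent for 0.5*||y - X b||^2 + lam*||b||_1, warm-started at beta."""
    beta = beta.copy()
    col_sq = (X ** 2).sum(axis=0)          # squared column norms
    resid = y - X @ beta
    for _ in range(n_sweeps):
        max_step = 0.0
        for j in range(X.shape[1]):
            old = beta[j]
            # Correlation of column j with the partial residual (feature j removed).
            rho = X[:, j] @ resid + col_sq[j] * old
            # Soft-thresholding update for coordinate j.
            new = np.sign(rho) * max(abs(rho) - lam, 0.0) / col_sq[j]
            if new != old:
                resid -= X[:, j] * (new - old)
                max_step = max(max_step, abs(new - old))
                beta[j] = new
        if max_step < tol:
            break
    return beta


def fdp_tpp_path(n=120, p=120, k=24, effect=4.0, sigma=1.0, n_lams=20, seed=0):
    """Return an array with rows (lambda, TPP, FDP) along a decreasing lambda grid.

    All defaults are illustrative assumptions, not values taken from the paper.
    """
    rng = np.random.default_rng(seed)
    X = rng.normal(0.0, 1.0 / np.sqrt(n), size=(n, p))   # independent columns
    beta_true = np.zeros(p)
    beta_true[:k] = effect                               # first k features are true
    y = X @ beta_true + sigma * rng.normal(size=n)

    lam_max = np.max(np.abs(X.T @ y))                    # above this, nothing is selected
    lams = lam_max * np.geomspace(0.95, 0.05, n_lams)

    beta = np.zeros(p)
    rows = []
    for lam in lams:
        beta = lasso_cd(X, y, lam, beta)                 # warm start from previous lambda
        selected = beta != 0
        n_true = selected[:k].sum()
        n_false = selected[k:].sum()
        tpp = n_true / k
        fdp = n_false / max(n_true + n_false, 1)
        rows.append((lam, tpp, fdp))
    return np.array(rows)


if __name__ == "__main__":
    for lam, tpp, fdp in fdp_tpp_path():
        print(f"lambda={lam:7.3f}  TPP={tpp:5.2f}  FDP={fdp:5.2f}")
```

Printing the rows shows how TPP and FDP evolve together as the regularization parameter decreases. Raising `effect` or changing the ratio k/p is one way to probe the abstract's claim that null features enter the path before all true features are found regardless of effect size; a small finite-sample simulation like this can only suggest the asymptotic behavior, not confirm it.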

📜 Similar Papers

In the same crypt — math.ST

R.I.P. 👻 Ghosted

Nonparametric regression using deep neural networks with ReLU activation function

Johannes Schmidt-Hieber

math.ST 🏛 Annals of Statistics 📚 949 cites 8 years ago

R.I.P. 👻 Ghosted

An introduction to Topological Data Analysis: fundamental and practical aspects for data scientists

Frédéric Chazal, Bertrand Michel

math.ST 🏛 AI 📚 727 cites 8 years ago

R.I.P. 👻 Ghosted

Minimax Optimal Procedures for Locally Private Estimation

John Duchi, Martin Wainwright, Michael Jordan

math.ST 🏛 arXiv 📚 481 cites 9 years ago

R.I.P. 👻 Ghosted

Optimal Best Arm Identification with Fixed Confidence

Aurélien Garivier, Emilie Kaufmann

math.ST 🏛 COLT 📚 384 cites 10 years ago

R.I.P. 👻 Ghosted

Fast low-rank estimation by projected gradient descent: General statistical and algorithmic guarantees

Yudong Chen, Martin J. Wainwright

math.ST 🏛 arXiv 📚 329 cites 10 years ago

R.I.P. 👻 Ghosted

User-friendly guarantees for the Langevin Monte Carlo with inaccurate gradient

Arnak S. Dalalyan, Avetik G. Karagulyan

math.ST 🏛 Stochastic Processes and their Applications 📚 319 cites 8 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Language Models are Few-Shot Learners

Tom B. Brown, Benjamin Mann, ... (+29 more)

cs.CL 🏛 NeurIPS 📚 54.2K cites 5 years ago

R.I.P. 👻 Ghosted

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, Sam Gross, ... (+19 more)

cs.LG 🏛 NeurIPS 📚 49.7K cites 6 years ago

R.I.P. 👻 Ghosted

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, Carlos Guestrin

cs.LG 🏛 KDD 📚 49.2K cites 10 years ago

R.I.P. 👻 Ghosted

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, Christian Szegedy

cs.LG 🏛 ICML 📚 46.0K cites 11 years ago