SARAH: A Novel Method for Machine Learning Problems Using Stochastic Recursive Gradient

March 01, 2017 · Declared Dead · 🏛 International Conference on Machine Learning

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Lam M. Nguyen, Jie Liu, Katya Scheinberg, Martin Takáč arXiv ID 1703.00102 Category stat.ML: Machine Learning (Stat) Cross-listed cs.LG, math.OC Citations 685 Venue International Conference on Machine Learning Last Checked 1 month ago

Abstract

In this paper, we propose a StochAstic Recursive grAdient algoritHm (SARAH), as well as its practical variant SARAH+, as a novel approach to the finite-sum minimization problems. Different from the vanilla SGD and other modern stochastic methods such as SVRG, S2GD, SAG and SAGA, SARAH admits a simple recursive framework for updating stochastic gradient estimates; when comparing to SAG/SAGA, SARAH does not require a storage of past gradients. The linear convergence rate of SARAH is proven under strong convexity assumption. We also prove a linear convergence rate (in the strongly convex case) for an inner loop of SARAH, the property that SVRG does not possess. Numerical experiments demonstrate the efficiency of our algorithm.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — Machine Learning (Stat)

R.I.P. 👻 Ghosted

Graph Attention Networks

Petar Veličković, Guillem Cucurull, ... (+4 more)

stat.ML 🏛 ICLR 📚 24.7K cites 8 years ago

R.I.P. 👻 Ghosted

Distilling the Knowledge in a Neural Network

Geoffrey Hinton, Oriol Vinyals, Jeff Dean

stat.ML 🏛 arXiv 📚 22.9K cites 11 years ago

R.I.P. 👻 Ghosted

Layer Normalization

Jimmy Lei Ba, Jamie Ryan Kiros, Geoffrey E. Hinton

stat.ML 🏛 arXiv 📚 12.0K cites 9 years ago

R.I.P. 👻 Ghosted

Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning

Yarin Gal, Zoubin Ghahramani

stat.ML 🏛 ICML 📚 11.0K cites 10 years ago

R.I.P. 👻 Ghosted

Domain-Adversarial Training of Neural Networks

Yaroslav Ganin, Evgeniya Ustinova, ... (+6 more)

stat.ML 🏛 JMLR 📚 10.8K cites 10 years ago

R.I.P. 👻 Ghosted

Deep Learning with Differential Privacy

Martín Abadi, Andy Chu, ... (+5 more)

stat.ML 🏛 CCS 📚 7.2K cites 9 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Language Models are Few-Shot Learners

Tom B. Brown, Benjamin Mann, ... (+29 more)

cs.CL 🏛 NeurIPS 📚 54.2K cites 5 years ago

R.I.P. 👻 Ghosted

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, Sam Gross, ... (+19 more)

cs.LG 🏛 NeurIPS 📚 49.7K cites 6 years ago

R.I.P. 👻 Ghosted

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, Carlos Guestrin

cs.LG 🏛 KDD 📚 49.2K cites 10 years ago

R.I.P. 👻 Ghosted

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, Christian Szegedy

cs.LG 🏛 ICML 📚 46.0K cites 11 years ago