Natasha: Faster Non-Convex Stochastic Optimization Via Strongly Non-Convex Parameter
February 02, 2017 Β· Declared Dead Β· π International Conference on Machine Learning
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Zeyuan Allen-Zhu
arXiv ID
1702.00763
Category
math.OC: Optimization & Control
Cross-listed
cs.DS,
cs.LG,
stat.ML
Citations
82
Venue
International Conference on Machine Learning
Last Checked
3 months ago
Abstract
Given a nonconvex function that is an average of $n$ smooth functions, we design stochastic first-order methods to find its approximate stationary points. The convergence of our new methods depends on the smallest (negative) eigenvalue $-Ο$ of the Hessian, a parameter that describes how nonconvex the function is. Our methods outperform known results for a range of parameter $Ο$, and can be used to find approximate local minima. Our result implies an interesting dichotomy: there exists a threshold $Ο_0$ so that the currently fastest methods for $Ο>Ο_0$ and for $Ο<Ο_0$ have different behaviors: the former scales with $n^{2/3}$ and the latter scales with $n^{3/4}$.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β Optimization & Control
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
Local SGD Converges Fast and Communicates Little
R.I.P.
π»
Ghosted
On Lazy Training in Differentiable Programming
π
π
The Cartographer
A Review on Bilevel Optimization: From Classical to Evolutionary Approaches and Applications
R.I.P.
π»
Ghosted
Learned Primal-dual Reconstruction
R.I.P.
π»
Ghosted
On the Global Convergence of Gradient Descent for Over-parameterized Models using Optimal Transport
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted