Natasha: Faster Non-Convex Stochastic Optimization Via Strongly Non-Convex Parameter

February 02, 2017 Β· Declared Dead Β· πŸ› International Conference on Machine Learning

πŸ‘» CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Zeyuan Allen-Zhu arXiv ID 1702.00763 Category math.OC: Optimization & Control Cross-listed cs.DS, cs.LG, stat.ML Citations 82 Venue International Conference on Machine Learning Last Checked 3 months ago
Abstract
Given a nonconvex function that is an average of $n$ smooth functions, we design stochastic first-order methods to find its approximate stationary points. The convergence of our new methods depends on the smallest (negative) eigenvalue $-Οƒ$ of the Hessian, a parameter that describes how nonconvex the function is. Our methods outperform known results for a range of parameter $Οƒ$, and can be used to find approximate local minima. Our result implies an interesting dichotomy: there exists a threshold $Οƒ_0$ so that the currently fastest methods for $Οƒ>Οƒ_0$ and for $Οƒ<Οƒ_0$ have different behaviors: the former scales with $n^{2/3}$ and the latter scales with $n^{3/4}$.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

πŸ“œ Similar Papers

In the same crypt β€” Optimization & Control

Died the same way β€” πŸ‘» Ghosted