Accelerated Decentralized Optimization with Local Updates for Smooth and Strongly Convex Objectives
October 05, 2018 Β· Declared Dead Β· π International Conference on Artificial Intelligence and Statistics
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Hadrien Hendrikx, Francis Bach, Laurent MassouliΓ©
arXiv ID
1810.02660
Category
math.OC: Optimization & Control
Cross-listed
cs.DC,
cs.LG
Citations
46
Venue
International Conference on Artificial Intelligence and Statistics
Last Checked
3 months ago
Abstract
In this paper, we study the problem of minimizing a sum of smooth and strongly convex functions split over the nodes of a network in a decentralized fashion. We propose the algorithm $ESDACD$, a decentralized accelerated algorithm that only requires local synchrony. Its rate depends on the condition number $ΞΊ$ of the local functions as well as the network topology and delays. Under mild assumptions on the topology of the graph, $ESDACD$ takes a time $O((Ο_{\max} + Ξ_{\max})\sqrt{ΞΊ/Ξ³}\ln(Ξ΅^{-1}))$ to reach a precision $Ξ΅$ where $Ξ³$ is the spectral gap of the graph, $Ο_{\max}$ the maximum communication delay and $Ξ_{\max}$ the maximum computation time. Therefore, it matches the rate of $SSDA$, which is optimal when $Ο_{\max} = Ξ©\left(Ξ_{\max}\right)$. Applying $ESDACD$ to quadratic local functions leads to an accelerated randomized gossip algorithm of rate $O( \sqrt{ΞΈ_{\rm gossip}/n})$ where $ΞΈ_{\rm gossip}$ is the rate of the standard randomized gossip. To the best of our knowledge, it is the first asynchronous gossip algorithm with a provably improved rate of convergence of the second moment of the error. We illustrate these results with experiments in idealized settings.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β Optimization & Control
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
Local SGD Converges Fast and Communicates Little
R.I.P.
π»
Ghosted
On Lazy Training in Differentiable Programming
R.I.P.
π»
Ghosted
A Review on Bilevel Optimization: From Classical to Evolutionary Approaches and Applications
R.I.P.
π»
Ghosted
Learned Primal-dual Reconstruction
R.I.P.
π»
Ghosted
On the Global Convergence of Gradient Descent for Over-parameterized Models using Optimal Transport
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Language Models are Few-Shot Learners
R.I.P.
π»
Ghosted
PyTorch: An Imperative Style, High-Performance Deep Learning Library
R.I.P.
π»
Ghosted
XGBoost: A Scalable Tree Boosting System
R.I.P.
π»
Ghosted