Correlated Quantization for Faster Nonconvex Distributed Optimization

January 10, 2024 · Declared Dead · 🏛 Conference on Uncertainty in Artificial Intelligence

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Andrei Panferov, Yury Demidovich, Ahmad Rammal, Peter Richtárik arXiv ID 2401.05518 Category cs.LG: Machine Learning Cross-listed cs.AI, cs.DC, math.OC Citations 4 Venue Conference on Uncertainty in Artificial Intelligence Last Checked 3 months ago

Abstract

Quantization (Alistarh et al., 2017) is an important (stochastic) compression technique that reduces the volume of transmitted bits during each communication round in distributed model training. Suresh et al. (2022) introduce correlated quantizers and show their advantages over independent counterparts by analyzing distributed SGD communication complexity. We analyze the forefront distributed non-convex optimization algorithm MARINA (Gorbunov et al., 2022) utilizing the proposed correlated quantizers and show that it outperforms the original MARINA and distributed SGD of Suresh et al. (2022) with regard to the communication complexity. We significantly refine the original analysis of MARINA without any additional assumptions using the weighted Hessian variance (Tyurin et al., 2022), and then we expand the theoretical framework of MARINA to accommodate a substantially broader range of potentially correlated and biased compressors, thus dilating the applicability of the method beyond the conventional independent unbiased compressor setup. Extensive experimental results corroborate our theoretical findings.