An Improved Cutting Plane Method for Convex Optimization, Convex-Concave Games and its Applications
April 08, 2020 ยท Declared Dead ยท ๐ Symposium on the Theory of Computing
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Haotian Jiang, Yin Tat Lee, Zhao Song, Sam Chiu-wai Wong
arXiv ID
2004.04250
Category
cs.DS: Data Structures & Algorithms
Cross-listed
cs.LG,
stat.ML
Citations
117
Venue
Symposium on the Theory of Computing
Last Checked
1 month ago
Abstract
Given a separation oracle for a convex set $K \subset \mathbb{R}^n$ that is contained in a box of radius $R$, the goal is to either compute a point in $K$ or prove that $K$ does not contain a ball of radius $ฮต$. We propose a new cutting plane algorithm that uses an optimal $O(n \log (ฮบ))$ evaluations of the oracle and an additional $O(n^2)$ time per evaluation, where $ฮบ= nR/ฮต$. $\bullet$ This improves upon Vaidya's $O( \text{SO} \cdot n \log (ฮบ) + n^{ฯ+1} \log (ฮบ))$ time algorithm [Vaidya, FOCS 1989a] in terms of polynomial dependence on $n$, where $ฯ< 2.373$ is the exponent of matrix multiplication and $\text{SO}$ is the time for oracle evaluation. $\bullet$ This improves upon Lee-Sidford-Wong's $O( \text{SO} \cdot n \log (ฮบ) + n^3 \log^{O(1)} (ฮบ))$ time algorithm [Lee, Sidford and Wong, FOCS 2015] in terms of dependence on $ฮบ$. For many important applications in economics, $ฮบ= ฮฉ(\exp(n))$ and this leads to a significant difference between $\log(ฮบ)$ and $\mathrm{poly}(\log (ฮบ))$. We also provide evidence that the $n^2$ time per evaluation cannot be improved and thus our running time is optimal. A bottleneck of previous cutting plane methods is to compute leverage scores, a measure of the relative importance of past constraints. Our result is achieved by a novel multi-layered data structure for leverage score maintenance, which is a sophisticated combination of diverse techniques such as random projection, batched low-rank update, inverse maintenance, polynomial interpolation, and fast rectangular matrix multiplication. Interestingly, our method requires a combination of different fast rectangular matrix multiplication algorithms.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Data Structures & Algorithms
R.I.P.
๐ป
Ghosted
R.I.P.
๐ป
Ghosted
Relief-Based Feature Selection: Introduction and Review
R.I.P.
๐ป
Ghosted
Route Planning in Transportation Networks
R.I.P.
๐ป
Ghosted
Near-linear time approximation algorithms for optimal transport via Sinkhorn iteration
R.I.P.
๐ป
Ghosted
Hierarchical Clustering: Objective Functions and Algorithms
R.I.P.
๐ป
Ghosted
Graph Isomorphism in Quasipolynomial Time
Died the same way โ ๐ป Ghosted
R.I.P.
๐ป
Ghosted
Language Models are Few-Shot Learners
R.I.P.
๐ป
Ghosted
PyTorch: An Imperative Style, High-Performance Deep Learning Library
R.I.P.
๐ป
Ghosted
XGBoost: A Scalable Tree Boosting System
R.I.P.
๐ป
Ghosted