Stochastic Multi-armed Bandits in Constant Space

December 25, 2017 · Declared Dead · 🏛 International Conference on Artificial Intelligence and Statistics

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors David Liau, Eric Price, Zhao Song, Ger Yang arXiv ID 1712.09007 Category cs.DS: Data Structures & Algorithms Cross-listed cs.LG, stat.ML Citations 35 Venue International Conference on Artificial Intelligence and Statistics Last Checked 3 months ago

Abstract

We consider the stochastic bandit problem in the sublinear space setting, where one cannot record the win-loss record for all $K$ arms. We give an algorithm using $O(1)$ words of space with regret \[ \sum_{i=1}^{K}\frac{1}{Δ_i}\log \frac{Δ_i}Δ\log T \] where $Δ_i$ is the gap between the best arm and arm $i$ and $Δ$ is the gap between the best and the second-best arms. If the rewards are bounded away from $0$ and $1$, this is within an $O(\log 1/Δ)$ factor of the optimum regret possible without space constraints.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — Data Structures & Algorithms

R.I.P. 👻 Ghosted

Efficient and robust approximate nearest neighbor search using Hierarchical Navigable Small World graphs

Yu. A. Malkov, D. A. Yashunin

cs.DS 🏛 IEEE TPAMI 📚 2.0K cites 10 years ago

R.I.P. 👻 Ghosted

Relief-Based Feature Selection: Introduction and Review

Ryan J. Urbanowicz, Melissa Meeker, ... (+3 more)

cs.DS 🏛 J.BI 📚 1.1K cites 8 years ago

R.I.P. 👻 Ghosted

Route Planning in Transportation Networks

Hannah Bast, Daniel Delling, ... (+6 more)

cs.DS 🏛 Algorithm Engineering 📚 759 cites 11 years ago

R.I.P. 👻 Ghosted

Near-linear time approximation algorithms for optimal transport via Sinkhorn iteration

Jason Altschuler, Jonathan Weed, Philippe Rigollet

cs.DS 🏛 NeurIPS 📚 654 cites 9 years ago

R.I.P. 👻 Ghosted

Hierarchical Clustering: Objective Functions and Algorithms

Vincent Cohen-Addad, Varun Kanade, ... (+2 more)

cs.DS 🏛 SODA 📚 637 cites 9 years ago

R.I.P. 👻 Ghosted

Graph Isomorphism in Quasipolynomial Time

László Babai

cs.DS 🏛 STOC 📚 616 cites 10 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Language Models are Few-Shot Learners

Tom B. Brown, Benjamin Mann, ... (+29 more)

cs.CL 🏛 NeurIPS 📚 54.2K cites 6 years ago

R.I.P. 👻 Ghosted

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, Sam Gross, ... (+19 more)

cs.LG 🏛 NeurIPS 📚 49.7K cites 6 years ago

R.I.P. 👻 Ghosted

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, Carlos Guestrin

cs.LG 🏛 KDD 📚 49.2K cites 10 years ago

R.I.P. 👻 Ghosted

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, Christian Szegedy

cs.LG 🏛 ICML 📚 46.0K cites 11 years ago