Logarithmic regret bounds for Bandits with Knapsacks

October 07, 2015 · Declared Dead

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: Arthur Flajolet, Patrick Jaillet
arXiv ID: 1510.01800
Category: cs.DS (Data Structures & Algorithms)
Citations: 18
Last Checked: 3 months ago

Abstract

Optimal regret bounds for Multi-Armed Bandit problems are now well documented. They fall into two categories based on their growth rate with respect to the time horizon $T$: (i) small, distribution-dependent bounds of order $\ln(T)$ and (ii) robust, distribution-free bounds of order $\sqrt{T}$. The Bandits with Knapsacks model, an extension of the framework that models resource consumption, lacks this clear-cut distinction. While several algorithms have been shown to achieve asymptotically optimal distribution-free bounds on regret, there has been little progress toward the development of small distribution-dependent regret bounds. We partially bridge the gap by designing a general-purpose algorithm with distribution-dependent regret bounds that are logarithmic in the initial endowments of resources in several important cases that cover many practical applications, including dynamic pricing with limited supply, bid optimization in online advertisement auctions, and dynamic procurement.
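
To make the setting in the abstract concrete, here is a minimal Python sketch of a single-resource Bandits with Knapsacks loop. It is not the authors' algorithm: the arm-selection rule (an optimistic reward-to-cost ratio), the Bernoulli rewards and costs, and every name in it (`run_bwk`, `reward_means`, `cost_means`, `budget`) are assumptions made for illustration. What it shows is the feature that separates this model from the classical bandit problem: play stops when the resource endowment runs out, not only when the horizon is reached.

```python
import math
import random


def run_bwk(reward_means, cost_means, budget, horizon, seed=0):
    """Simulate one single-resource Bandits with Knapsacks instance.

    Illustrative heuristic only (not the method from the paper): pull the
    arm with the highest optimistic reward-to-cost ratio, and stop when
    either the budget or the time horizon is exhausted.
    Assumes at least one arm and Bernoulli rewards and costs.
    """
    rng = random.Random(seed)
    k = len(reward_means)
    pulls = [0] * k
    reward_sum = [0.0] * k
    cost_sum = [0.0] * k
    total_reward = 0.0
    remaining = float(budget)

    for t in range(1, horizon + 1):
        if remaining < 1.0:
            # A pull costs at most 1, so stopping here keeps the budget >= 0.
            break

        if t <= k:
            arm = t - 1  # pull every arm once to initialise the estimates
        else:
            def ratio_index(i):
                bonus = math.sqrt(2.0 * math.log(t) / pulls[i])
                reward_ucb = min(1.0, reward_sum[i] / pulls[i] + bonus)
                # Floor the cost estimate to avoid dividing by zero.
                cost_lcb = max(1e-3, cost_sum[i] / pulls[i] - bonus)
                return reward_ucb / cost_lcb

            arm = max(range(k), key=ratio_index)

        reward = 1.0 if rng.random() < reward_means[arm] else 0.0
        cost = 1.0 if rng.random() < cost_means[arm] else 0.0

        pulls[arm] += 1
        reward_sum[arm] += reward
        cost_sum[arm] += cost
        total_reward += reward
        remaining -= cost

    return total_reward, pulls, remaining


if __name__ == "__main__":
    total, pulls, left = run_bwk([0.9, 0.5], [0.5, 0.5], budget=50, horizon=1000)
    print("reward:", total, "pulls:", pulls, "budget left:", left)
```

Comparing the collected reward against a benchmark policy for growing values of `budget` is one way to see empirically whether regret grows logarithmically or like a square root in the endowment; which benchmark to use is left open here.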

📜 Similar Papers

In the same crypt — Data Structures & Algorithms

📚 The Cartographer

Relief-Based Feature Selection: Introduction and Review

Ryan J. Urbanowicz, Melissa Meeker, ... (+3 more)

cs.DS 🏛 J.BI 📚 1.1K cites 8 years ago

R.I.P. 👻 Ghosted

Route Planning in Transportation Networks

Hannah Bast, Daniel Delling, ... (+6 more)

cs.DS 🏛 Algorithm Engineering 📚 759 cites 11 years ago

R.I.P. 👻 Ghosted

Near-linear time approximation algorithms for optimal transport via Sinkhorn iteration

Jason Altschuler, Jonathan Weed, Philippe Rigollet

cs.DS 🏛 NeurIPS 📚 654 cites 9 years ago

R.I.P. 👻 Ghosted

Hierarchical Clustering: Objective Functions and Algorithms

Vincent Cohen-Addad, Varun Kanade, ... (+2 more)

cs.DS 🏛 SODA 📚 637 cites 9 years ago

R.I.P. 👻 Ghosted

Graph Isomorphism in Quasipolynomial Time

László Babai

cs.DS 🏛 STOC 📚 616 cites 10 years ago

📚 The Cartographer

Simulation optimization: A review of algorithms and applications

Satyajith Amaran, Nikolaos V. Sahinidis, ... (+2 more)

cs.DS 🏛 4OR 📚 588 cites 8 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 8 years ago