Active Ranking with Subset-wise Preferences

October 23, 2018 ยท Declared Dead ยท ๐Ÿ› International Conference on Artificial Intelligence and Statistics

๐Ÿ‘ป CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Aadirupa Saha, Aditya Gopalan arXiv ID 1810.10321 Category cs.LG: Machine Learning Cross-listed stat.ML Citations 29 Venue International Conference on Artificial Intelligence and Statistics Last Checked 3 months ago
Abstract
We consider the problem of probably approximately correct (PAC) ranking $n$ items by adaptively eliciting subset-wise preference feedback. At each round, the learner chooses a subset of $k$ items and observes stochastic feedback indicating preference information of the winner (most preferred) item of the chosen subset drawn according to a Plackett-Luce (PL) subset choice model unknown a priori. The objective is to identify an $ฮต$-optimal ranking of the $n$ items with probability at least $1 - ฮด$. When the feedback in each subset round is a single Plackett-Luce-sampled item, we show $(ฮต, ฮด)$-PAC algorithms with a sample complexity of $O\left(\frac{n}{ฮต^2} \ln \frac{n}ฮด \right)$ rounds, which we establish as being order-optimal by exhibiting a matching sample complexity lower bound of $ฮฉ\left(\frac{n}{ฮต^2} \ln \frac{n}ฮด \right)$---this shows that there is essentially no improvement possible from the pairwise comparisons setting ($k = 2$). When, however, it is possible to elicit top-$m$ ($\leq k$) ranking feedback according to the PL model from each adaptively chosen subset of size $k$, we show that an $(ฮต, ฮด)$-PAC ranking sample complexity of $O\left(\frac{n}{m ฮต^2} \ln \frac{n}ฮด \right)$ is achievable with explicit algorithms, which represents an $m$-wise reduction in sample complexity compared to the pairwise case. This again turns out to be order-wise unimprovable across the class of symmetric ranking algorithms. Our algorithms rely on a novel {pivot trick} to maintain only $n$ itemwise score estimates, unlike $O(n^2)$ pairwise score estimates that has been used in prior work. We report results of numerical experiments that corroborate our findings.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Machine Learning

Died the same way โ€” ๐Ÿ‘ป Ghosted