FIESTA: Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms
June 28, 2019 ยท Entered Twilight ยท ๐ Annual Meeting of the Association for Computational Linguistics
"Last commit was 6.0 years ago (โฅ5 year threshold)"
Evidence collected by the PWNC Scanner
Repo contents: .github, .gitignore, .travis.yml, AUTHORS.txt, CONTRIBUTING.md, LICENSE, README.md, allennlp_training_configs, docs, fiesta, notebooks, pytest.ini, requirements.txt, results, setup.py, tests
Authors
Henry B. Moss, Andrew Moore, David S. Leslie, Paul Rayson
arXiv ID
1906.12230
Category
cs.LG: Machine Learning
Cross-listed
cs.CL,
stat.ML
Citations
5
Venue
Annual Meeting of the Association for Computational Linguistics
Repository
https://github.com/apmoore1/fiesta
โญ 14
Last Checked
1 month ago
Abstract
We present FIESTA, a model selection approach that significantly reduces the computational resources required to reliably identify state-of-the-art performance from large collections of candidate models. Despite being known to produce unreliable comparisons, it is still common practice to compare model evaluations based on single choices of random seeds. We show that reliable model selection also requires evaluations based on multiple train-test splits (contrary to common practice in many shared tasks). Using bandit theory from the statistics literature, we are able to adaptively determine appropriate numbers of data splits and random seeds used to evaluate each model, focusing computational resources on the evaluation of promising models whilst avoiding wasting evaluations on models with lower performance. Furthermore, our user-friendly Python implementation produces confidence guarantees of correctly selecting the optimal model. We evaluate our algorithms by selecting between 8 target-dependent sentiment analysis methods using dramatically fewer model evaluations than current model selection approaches.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Machine Learning
R.I.P.
๐ป
Ghosted
R.I.P.
๐ป
Ghosted
XGBoost: A Scalable Tree Boosting System
R.I.P.
๐ป
Ghosted
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
R.I.P.
๐ป
Ghosted
Semi-Supervised Classification with Graph Convolutional Networks
R.I.P.
๐ป
Ghosted
Proximal Policy Optimization Algorithms
R.I.P.
๐ป
Ghosted