Revisiting Distillation and Incremental Classifier Learning
July 08, 2018 · Entered Twilight · Asian Conference on Computer Vision
"Last commit was 6.0 years ago (โฅ5 year threshold)"
Evidence collected by the PWNC Scanner
Repo contents: .gitignore, .idea, README.md, data_handler, experiment, images, mnist_missing_experiment.py, model, plots, plotter, requirements.txt, run_experiment.py, trainer, utils
Authors
Khurram Javed, Faisal Shafait
arXiv ID
1807.02802
Category
cs.LG: Machine Learning
Cross-listed
cs.CV, stat.ML
Citations
68
Venue
Asian Conference on Computer Vision
Repository
https://github.com/Khurramjaved96/incremental-learning
⭐ 109
Last Checked
1 month ago
Abstract
One of the key differences between the learning mechanism of humans and Artificial Neural Networks (ANNs) is the ability of humans to learn one task at a time. ANNs, on the other hand, can only learn multiple tasks simultaneously. Any attempts at learning new tasks incrementally cause them to completely forget about previous tasks. This lack of ability to learn incrementally, called Catastrophic Forgetting, is considered a major hurdle in building a true AI system. In this paper, our goal is to isolate the truly effective existing ideas for incremental learning from those that only work under certain conditions. To this end, we first thoroughly analyze the current state-of-the-art (iCaRL) method for incremental learning and demonstrate that its good performance is not due to the reasons presented in the existing literature. We conclude that the success of iCaRL is primarily due to knowledge distillation, and we recognize a key limitation of knowledge distillation, i.e., it often leads to bias in classifiers. Finally, we propose a dynamic threshold moving algorithm that is able to successfully remove this bias. We demonstrate the effectiveness of our algorithm on the CIFAR100 and MNIST datasets, showing near-optimal results. Our implementation is available at https://github.com/Khurramjaved96/incremental-learning.
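For readers skimming the abstract, the core mechanics can be pictured as a cross-entropy loss on the new classes plus a distillation loss that preserves the old model's soft outputs, followed by a per-class rescaling of the predictions to undo the bias that distillation introduces. The PyTorch sketch below is only a minimal illustration of that idea; the helper names (`distillation_loss`, `threshold_moving`), the temperature `T=2.0`, and the fixed 1.5 boost for new classes are assumptions made for this example and are not the paper's exact dynamic threshold-moving formulation.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, T=2.0):
    # Hinton-style soft-target distillation on the previously learned
    # classes: match the frozen old model's temperature-softened outputs.
    soft_targets = F.softmax(teacher_logits / T, dim=1)
    log_probs = F.log_softmax(student_logits / T, dim=1)
    return F.kl_div(log_probs, soft_targets, reduction="batchmean") * (T * T)

def threshold_moving(logits, class_scale):
    # Rescale softmax probabilities with a per-class factor to counter
    # the classifier bias that distillation introduces, then renormalize.
    probs = F.softmax(logits, dim=1) * class_scale
    return probs / probs.sum(dim=1, keepdim=True)

# Toy setup: 90 previously learned classes distilled from a frozen
# teacher, 10 newly added classes trained with ordinary cross-entropy.
num_old, num_new = 90, 10
student_logits = torch.randn(4, num_old + num_new, requires_grad=True)
teacher_logits = torch.randn(4, num_old)          # old model's outputs
labels = torch.randint(num_old, num_old + num_new, (4,))

loss = F.cross_entropy(student_logits, labels) + \
       distillation_loss(student_logits[:, :num_old], teacher_logits)
loss.backward()

# At prediction time, boost the classes the distilled model under-predicts
# (assumed here to be the new ones) before taking the argmax.
scale = torch.cat([torch.ones(num_old), 1.5 * torch.ones(num_new)])
preds = threshold_moving(student_logits.detach(), scale).argmax(dim=1)
```

The repository linked above contains the authors' actual implementation, in which the correction is derived dynamically rather than fixed by hand; the constant scale here only illustrates the direction of the adjustment.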
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
Similar Papers
In the same crypt – Machine Learning
XGBoost: A Scalable Tree Boosting System · R.I.P. 👻 Ghosted
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift · R.I.P. 👻 Ghosted
Semi-Supervised Classification with Graph Convolutional Networks · R.I.P. 👻 Ghosted
Proximal Policy Optimization Algorithms · R.I.P. 👻 Ghosted