R.I.P. 👻 Ghosted
On the Effectiveness of Lipschitz-Driven Rehearsal in Continual Learning
October 12, 2022 · Entered Twilight · Neural Information Processing Systems
Repo contents: .gitignore, README.md, backbone, data, datasets, extra_illustrations.pdf, gem_license, models, stgcn_license, utils
Authors
Lorenzo Bonicelli, Matteo Boschini, Angelo Porrello, Concetto Spampinato, Simone Calderara
arXiv ID
2210.06443
Category
cs.LG: Machine Learning
Cross-listed
cs.AI
Citations
58
Venue
Neural Information Processing Systems
Repository
https://github.com/aimagelab/LiDER
⭐ 14
Last Checked
1 month ago
Abstract
Rehearsal approaches enjoy immense popularity with Continual Learning (CL) practitioners. These methods collect samples from previously encountered data distributions in a small memory buffer and repeatedly optimize on these stored samples to prevent catastrophic forgetting. This work draws attention to a hidden pitfall of this widespread practice: repeated optimization on a small pool of data inevitably leads to tight and unstable decision boundaries, which are a major hindrance to generalization. To address this issue, we propose Lipschitz-DrivEn Rehearsal (LiDER), a surrogate objective that induces smoothness in the backbone network by constraining its layer-wise Lipschitz constants w.r.t. replay examples. By means of extensive experiments, we show that applying LiDER delivers a stable performance gain to several state-of-the-art rehearsal CL methods across multiple datasets, both in the presence and absence of pre-training. Through additional ablative experiments, we highlight peculiar aspects of buffer overfitting in CL and better characterize the effect produced by LiDER. Code is available at https://github.com/aimagelab/LiDER
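The abstract's core idea lends itself to a quick illustration. Below is a minimal PyTorch sketch of a Lipschitz-style surrogate penalty evaluated on buffer samples; the pairwise expansion-ratio estimator, the name lipschitz_penalty, and the weight lam are illustrative assumptions, not the authors' method (their exact layer-wise estimator lives in the linked repository).

```python
import torch

def lipschitz_penalty(activations, eps=1e-8):
    """Hypothetical surrogate in the spirit of LiDER: discourage each layer
    from expanding distances between replay samples by penalizing the largest
    pairwise output/input distance ratio observed within a buffer batch.

    activations: list of tensors of shape [B, ...], one per layer, computed
                 on a batch drawn from the rehearsal buffer.
    """
    penalty = torch.zeros((), device=activations[0].device)
    for prev, curr in zip(activations[:-1], activations[1:]):
        prev, curr = prev.flatten(1), curr.flatten(1)
        d_in = torch.cdist(prev, prev) + eps   # pairwise input distances
        d_out = torch.cdist(curr, curr)        # pairwise output distances
        off_diag = ~torch.eye(d_in.size(0), dtype=torch.bool, device=d_in.device)
        # crude empirical lower bound on the layer's Lipschitz constant
        penalty = penalty + (d_out / d_in)[off_diag].max()
    return penalty

# Example use inside a rehearsal training step (names hypothetical):
#   loss = task_loss + replay_loss + lam * lipschitz_penalty(buffer_activations)
```

Note that an empirical expansion ratio like this only lower-bounds a layer's true Lipschitz constant, so the sketch conveys the flavor of the smoothness constraint rather than the paper's actual objective.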
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
Similar Papers
In the same crypt · Machine Learning
XGBoost: A Scalable Tree Boosting System
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Semi-Supervised Classification with Graph Convolutional Networks
Proximal Policy Optimization Algorithms