Principled Training of Neural Networks with Direct Feedback Alignment

June 11, 2019 · Entered Twilight · 🏛 arXiv.org

"Last commit was 6.0 years ago (≥5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: .gitignore, README.md, best_practices.py, bottleneck.py, bottleneck_from_log.py, dfatools, log_retrieval.py, mltools

Authors: Julien Launay, Iacopo Poli, Florent Krzakala
arXiv ID: 1906.04554
Category: stat.ML: Machine Learning (Stat)
Cross-listed: cs.LG, cs.NE
Citations: 37
Venue: arXiv.org
Repository: https://github.com/lightonai/principled-dfa-training ⭐ 22
Last Checked: 1 month ago

Abstract

The backpropagation algorithm has long been the canonical training method for neural networks. Modern paradigms are implicitly optimized for it, and numerous guidelines exist to ensure its proper use. Recently, synthetic gradient methods, where the error gradient is only roughly approximated, have garnered interest. These methods not only better portray how biological brains learn, but also open new computational possibilities, such as updating layers asynchronously. Even so, they have failed to scale past simple tasks like MNIST or CIFAR-10. This is in part due to a lack of standards, leading to ill-suited models and practices that prevent such methods from performing to the best of their abilities. In this work, we focus on direct feedback alignment and present a set of best practices justified by observations of the alignment angles. We characterize a bottleneck effect that prevents alignment in narrow layers, and hypothesize it may explain why feedback alignment methods have yet to scale to large convolutional networks.
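
To make the two central notions of the abstract concrete, the following is a minimal NumPy sketch of direct feedback alignment (DFA) and of an alignment measurement. It is not taken from the paper's repository and does not use its `dfatools` package. The function names `cosine` and `train_dfa`, the layer sizes, the tanh activation, the learning rate, and the toy linear-teacher regression task are all illustrative assumptions. In DFA, each hidden layer receives the output error through its own fixed random matrix instead of the transposed forward weights. The sketch tracks the cosine of the angle between that signal and the backpropagation signal, which is computed here for comparison only and never used in an update.

```python
import numpy as np


def cosine(a, b):
    """Cosine of the angle between two arrays, treated as flat vectors."""
    a, b = a.ravel(), b.ravel()
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12))


def train_dfa(steps=200, lr=0.01, seed=0):
    """Train a small tanh MLP with DFA on a toy regression task.

    Returns the per-step mean squared error and, for each of the two
    hidden layers, the cosine between the DFA and backprop signals.
    All sizes and hyperparameters are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)
    n_in, n_hid, n_out, batch = 10, 64, 5, 32

    # Hypothetical task: regress onto a fixed random linear teacher.
    teacher = rng.normal(size=(n_in, n_out))

    # Forward weights (trained).
    W1 = rng.normal(scale=0.1, size=(n_in, n_hid))
    W2 = rng.normal(scale=0.1, size=(n_hid, n_hid))
    W3 = rng.normal(scale=0.1, size=(n_hid, n_out))

    # Feedback matrices: random, fixed, one per hidden layer.
    B1 = rng.normal(scale=0.1, size=(n_out, n_hid))
    B2 = rng.normal(scale=0.1, size=(n_out, n_hid))

    losses, cosines = [], []
    for _ in range(steps):
        x = rng.normal(size=(batch, n_in))
        y = x @ teacher

        # Forward pass.
        h1 = np.tanh(x @ W1)
        h2 = np.tanh(h1 @ W2)
        out = h2 @ W3
        e = out - y  # error at the output (gradient of 0.5 * squared error)

        # Backprop signals at the hidden pre-activations (reference only).
        bp2 = (e @ W3.T) * (1.0 - h2 ** 2)
        bp1 = (bp2 @ W2.T) * (1.0 - h1 ** 2)

        # DFA signals: the output error is projected directly to each layer.
        d2 = (e @ B2) * (1.0 - h2 ** 2)
        d1 = (e @ B1) * (1.0 - h1 ** 2)

        losses.append(float(np.mean(e ** 2)))
        cosines.append((cosine(d1, bp1), cosine(d2, bp2)))

        # The output layer uses the true gradient; hidden layers use DFA.
        W3 -= lr * h2.T @ e / batch
        W2 -= lr * h1.T @ d2 / batch
        W1 -= lr * x.T @ d1 / batch

    return np.array(losses), np.array(cosines)


if __name__ == "__main__":
    losses, cosines = train_dfa()
    print("first / last loss:", losses[0], losses[-1])
    print("first / last cosines (layer 1, layer 2):", cosines[0], cosines[-1])
```

A cosine near 1 means the DFA signal points almost in the same direction as the backpropagation signal, and a cosine near 0 means the two are close to orthogonal. Shrinking `n_hid` in this sketch is one informal way to probe how layer width affects the measured alignment, though it does not reproduce the paper's experiments.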