Revisiting Batch Normalization for Training Low-latency Deep Spiking Neural Networks from Scratch

October 05, 2020 · Declared Dead · 🏛 Frontiers in Neuroscience

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: Youngeun Kim, Priyadarshini Panda
arXiv ID: 2010.01729
Category: cs.CV (Computer Vision)
Cross-listed: cs.AI, cs.NE
Citations: 196
Venue: Frontiers in Neuroscience
Last Checked: 4 months ago

Abstract

Spiking Neural Networks (SNNs) have recently emerged as an alternative to deep learning owing to sparse, asynchronous and binary event (or spike) driven processing, which can yield huge energy efficiency benefits on neuromorphic hardware. However, training high-accuracy and low-latency SNNs from scratch suffers from the non-differentiable nature of a spiking neuron. To address this training issue in SNNs, we revisit batch normalization and propose a temporal Batch Normalization Through Time (BNTT) technique. Most prior SNN works have disregarded batch normalization, deeming it ineffective for training temporal SNNs. Different from previous works, our proposed BNTT decouples the parameters in a BNTT layer along the time axis to capture the temporal dynamics of spikes. The temporally evolving learnable parameters in BNTT allow a neuron to control its spike rate through different time-steps, enabling low-latency and low-energy training from scratch. We conduct experiments on CIFAR-10, CIFAR-100, Tiny-ImageNet and event-driven DVS-CIFAR10 datasets. BNTT allows us to train deep SNN architectures from scratch, for the first time, on complex datasets with just 25-30 time-steps. We also propose an early exit algorithm using the distribution of parameters in BNTT to reduce the latency at inference, which further improves the energy efficiency.
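To make the idea of decoupling batch-normalization parameters along the time axis concrete, here is a minimal sketch in plain Python. It is not the authors' implementation. The function names `bntt_normalize` and `run_through_time`, the leaky integrate-and-fire update, the `leak` and `threshold` values, the hard reset, and the use of a scale parameter without a shift are all assumptions made for illustration. The only point taken from the abstract is that each time-step owns its own learnable parameter and its own batch statistics.

```python
import math

def bntt_normalize(x_t, gamma_t, eps=1e-5):
    """Normalize one neuron's inputs over the batch at a single time-step.

    x_t     : list of floats, one value per sample in the batch
    gamma_t : scale parameter that belongs to this time-step only
    """
    n = len(x_t)
    mean = sum(x_t) / n
    var = sum((v - mean) ** 2 for v in x_t) / n  # biased batch variance
    return [gamma_t * (v - mean) / math.sqrt(var + eps) for v in x_t]

def run_through_time(inputs, gammas, leak=0.99, threshold=1.0):
    """Drive one leaky integrate-and-fire neuron (assumed model) over T steps.

    inputs[t][b] : weighted input at time-step t for batch sample b
    gammas[t]    : per-time-step scale, i.e. the time-decoupled parameter
    Returns spikes[t][b] as 0/1 values.
    """
    batch = len(inputs[0])
    u = [0.0] * batch  # membrane potential per sample
    spikes = []
    for x_t, gamma_t in zip(inputs, gammas):
        normed = bntt_normalize(x_t, gamma_t)
        # Leak the old potential, then add the normalized input.
        u = [leak * ui + ni for ui, ni in zip(u, normed)]
        out = [1 if ui >= threshold else 0 for ui in u]
        # Hard reset of samples that fired (an assumption of this sketch).
        u = [0.0 if o else ui for ui, o in zip(u, out)]
        spikes.append(out)
    return spikes

# Same input at both time-steps, but different per-step scales.
spikes = run_through_time([[0.0, 2.0], [0.0, 2.0]], gammas=[2.0, 0.0])
print(spikes)
```

In this toy run the second time-step has a scale of zero, so the normalized input contributes nothing and the neuron stays silent at that step even though the raw input is unchanged. That is the sense in which a per-time-step parameter can modulate the spike rate over time.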