Understanding Convolutional Neural Networks with Information Theory: An Initial Exploration

April 18, 2018 · Entered Twilight · 🏛 IEEE Transactions on Neural Networks and Learning Systems

🌅 TWILIGHT: Old Age
Predates the code-sharing era: a pioneer of its time

"Last commit was 7.0 years ago (β‰₯5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: VGG16.py, cifar_10_loader.py, readme.txt

Authors: Shujian Yu, Kristoffer Wickstrøm, Robert Jenssen, Jose C. Principe
arXiv ID: 1804.06537
Category: cs.LG: Machine Learning
Cross-listed: cs.IT, stat.ML
Citations: 83
Venue: IEEE Transactions on Neural Networks and Learning Systems
Repository: https://github.com/Wickstrom/InfExperiment (⭐ 3)
Last Checked: 1 month ago
Abstract
The matrix-based Rényi's α-entropy functional and its multivariate extension were recently developed in terms of the normalized eigenspectrum of a Hermitian matrix of the projected data in a reproducing kernel Hilbert space (RKHS). However, these estimators are recent, and their utility and possible applications remain largely unknown to practitioners. In this paper, we first show that our estimators enable straightforward measurement of information flow in realistic convolutional neural networks (CNNs) without any approximation. Then, we introduce the partial information decomposition (PID) framework and develop three quantities to analyze the synergy and redundancy in convolutional layer representations. Our results validate two fundamental data processing inequalities and reveal some essential properties concerning the training of CNNs.
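For a concrete sense of the quantities the abstract refers to: the matrix-based Rényi's α-entropy is computed from the eigenspectrum of a normalized Gram matrix of a data batch, and mutual information between two representations follows from a Hadamard product of their Gram matrices. The sketch below is not the authors' released code; it is a minimal NumPy illustration assuming a Gaussian kernel with a hand-picked width sigma, with random matrices standing in for actual CNN layer activations.

```python
import numpy as np

def gram_matrix(x, sigma=1.0):
    """Gaussian Gram matrix K_ij = exp(-||x_i - x_j||^2 / (2*sigma^2))."""
    x = x.reshape(len(x), -1)                        # flatten each sample
    sq = np.sum(x ** 2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * x @ x.T, 0.0)
    return np.exp(-d2 / (2.0 * sigma ** 2))

def normalize(k):
    """Scale K to a PSD matrix with unit trace: A_ij = K_ij / (n * sqrt(K_ii K_jj))."""
    d = np.sqrt(np.diag(k))
    return k / np.outer(d, d) / len(k)

def renyi_entropy(a, alpha=1.01):
    """Matrix-based Renyi alpha-entropy: S = log2(sum_i lambda_i^alpha) / (1 - alpha)."""
    lam = np.clip(np.linalg.eigvalsh(a), 0.0, None)  # clip tiny negative eigenvalues
    return np.log2(np.sum(lam ** alpha)) / (1.0 - alpha)

def mutual_information(a, b, alpha=1.01):
    """I(A;B) = S(A) + S(B) - S(A,B), joint via the trace-normalized Hadamard product."""
    ab = a * b
    ab = ab / np.trace(ab)
    return renyi_entropy(a, alpha) + renyi_entropy(b, alpha) - renyi_entropy(ab, alpha)

# Toy check of the data processing inequality I(X;T1) >= I(X;T2),
# with random "activations" standing in for real CNN layers.
rng = np.random.default_rng(0)
x  = rng.normal(size=(128, 32))                      # input batch
t1 = np.tanh(x @ rng.normal(size=(32, 16)))          # "layer 1" output
t2 = np.tanh(t1 @ rng.normal(size=(16, 8)))          # "layer 2" output
ax, a1, a2 = (normalize(gram_matrix(z)) for z in (x, t1, t2))
print(f"I(X;T1) = {mutual_information(ax, a1):.3f}")
print(f"I(X;T2) = {mutual_information(ax, a2):.3f}")  # expect I(X;T1) >= I(X;T2)
```

In practice the kernel width strongly affects the estimates, and the paper's multivariate extension for tracking information flow across several layers builds on the same eigenspectrum recipe.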
