Bayesian Compression for Deep Learning

May 24, 2017 · Declared Dead · 🏛 Neural Information Processing Systems

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Christos Louizos, Karen Ullrich, Max Welling arXiv ID 1705.08665 Category stat.ML: Machine Learning (Stat) Cross-listed cs.LG Citations 494 Venue Neural Information Processing Systems Last Checked 1 month ago

Abstract

Compression and computational efficiency in deep learning have become a problem of great significance. In this work, we argue that the most principled and effective way to attack this problem is by adopting a Bayesian point of view, where through sparsity inducing priors we prune large parts of the network. We introduce two novelties in this paper: 1) we use hierarchical priors to prune nodes instead of individual weights, and 2) we use the posterior uncertainties to determine the optimal fixed point precision to encode the weights. Both factors significantly contribute to achieving the state of the art in terms of compression rates, while still staying competitive with methods designed to optimize for speed or energy efficiency.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — Machine Learning (Stat)

R.I.P. 👻 Ghosted

Graph Attention Networks

Petar Veličković, Guillem Cucurull, ... (+4 more)

stat.ML 🏛 ICLR 📚 24.7K cites 8 years ago

R.I.P. 👻 Ghosted

Distilling the Knowledge in a Neural Network

Geoffrey Hinton, Oriol Vinyals, Jeff Dean

stat.ML 🏛 arXiv 📚 22.9K cites 11 years ago

R.I.P. 👻 Ghosted

Layer Normalization

Jimmy Lei Ba, Jamie Ryan Kiros, Geoffrey E. Hinton

stat.ML 🏛 arXiv 📚 12.0K cites 9 years ago

R.I.P. 👻 Ghosted

Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning

Yarin Gal, Zoubin Ghahramani

stat.ML 🏛 ICML 📚 11.0K cites 10 years ago

R.I.P. 👻 Ghosted

Domain-Adversarial Training of Neural Networks

Yaroslav Ganin, Evgeniya Ustinova, ... (+6 more)

stat.ML 🏛 JMLR 📚 10.8K cites 10 years ago

R.I.P. 👻 Ghosted

Deep Learning with Differential Privacy

Martín Abadi, Andy Chu, ... (+5 more)

stat.ML 🏛 CCS 📚 7.2K cites 9 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Language Models are Few-Shot Learners

Tom B. Brown, Benjamin Mann, ... (+29 more)

cs.CL 🏛 NeurIPS 📚 54.2K cites 5 years ago

R.I.P. 👻 Ghosted

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, Sam Gross, ... (+19 more)

cs.LG 🏛 NeurIPS 📚 49.7K cites 6 years ago

R.I.P. 👻 Ghosted

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, Carlos Guestrin

cs.LG 🏛 KDD 📚 49.2K cites 10 years ago

R.I.P. 👻 Ghosted

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, Christian Szegedy

cs.LG 🏛 ICML 📚 46.0K cites 11 years ago