The (Un)reliability of saliency methods

November 02, 2017 · Declared Dead · 🏛 Explainable AI

👻 CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: Pieter-Jan Kindermans, Sara Hooker, Julius Adebayo, Maximilian Alber, Kristof T. Schütt, Sven Dähne, Dumitru Erhan, Been Kim
arXiv ID: 1711.00867
Category: stat.ML (Machine Learning, Stat) · Cross-listed: cs.LG
Citations: 753
Venue: Explainable AI
Last checked: 1 month ago
Abstract
Saliency methods aim to explain the predictions of deep neural networks. These methods lack reliability when the explanation is sensitive to factors that do not contribute to the model prediction. We use a simple and common pre-processing step (adding a constant shift to the input data) to show that a transformation with no effect on the model can cause numerous methods to incorrectly attribute. In order to guarantee reliability, we posit that methods should fulfill input invariance, the requirement that a saliency method mirror the sensitivity of the model with respect to transformations of the input. We show, through several examples, that saliency methods that do not satisfy input invariance result in misleading attribution.
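The constant-shift test described in the abstract is easy to reproduce in miniature. The sketch below is a hypothetical PyTorch example, not the authors' implementation: the architecture, the shift value, and the choice of gradient-times-input saliency are illustrative assumptions. It folds a constant input shift into a copy of the network's first-layer bias so the two models predict identically, then checks whether the attributions agree.

```python
# Minimal sketch of an input-invariance check (illustrative, not the paper's code).
import torch
import torch.nn as nn

torch.manual_seed(0)

x = torch.randn(1, 4)                       # toy input
shift = 2.0 * torch.ones_like(x)            # constant shift added to the input

f1 = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 3))

# Build f2 so that f2(x + shift) == f1(x) for every x:
# fold the shift into the first-layer bias.
f2 = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 3))
f2.load_state_dict(f1.state_dict())
with torch.no_grad():
    f2[0].bias -= f1[0].weight @ shift.squeeze(0)

def grad_times_input(model, inp):
    """Gradient-times-input saliency for the top-scoring class."""
    inp = inp.clone().detach().requires_grad_(True)
    score = model(inp)[0].max()
    score.backward()
    return inp.grad * inp

s1 = grad_times_input(f1, x)
s2 = grad_times_input(f2, x + shift)

# The models are functionally identical on shifted inputs...
print(torch.allclose(f1(x), f2(x + shift), atol=1e-6))   # True
# ...but the attributions differ, because the "input" factor carries the shift.
print(torch.allclose(s1, s2, atol=1e-6))                  # typically False
```

In this toy setup the raw gradients of the two models coincide; it is the multiplication by the shifted input that breaks invariance, which is the kind of misleading attribution the abstract refers to.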
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt: Machine Learning (Stat)

R.I.P. 👻 Ghosted

Graph Attention Networks

Petar Veličković, Guillem Cucurull, ... (+4 more)

stat.ML · 🏛 ICLR · 📚 24.7K cites · 8 years ago
R.I.P. 👻 Ghosted

Layer Normalization

Jimmy Lei Ba, Jamie Ryan Kiros, Geoffrey E. Hinton

stat.ML · 🏛 arXiv · 📚 12.0K cites · 9 years ago

Died the same way: 👻 Ghosted