E-LPIPS: Robust Perceptual Image Similarity via Random Transformation Ensembles

June 10, 2019 Β· Entered Twilight Β· πŸ› arXiv.org

πŸŒ… TWILIGHT: Old Age
Predates the code-sharing era β€” a pioneer of its time

"Last commit was 6.0 years ago (β‰₯5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: LICENSE, README.md, darc.py, elpips, environment.yml, ex_bary10.py, ex_compare_distances.py, ex_evaluate_distance.py, ex_pairwise_average.py, ex_simple_distance.py, inputs, lpips_scripts, media, train.py, train_dataset.py, train_run.py, train_squeeze_ensemble.sh, train_test_2afc.py, train_test_2afc_squeeze_example.sh

Authors Markus Kettunen, Erik HÀrkânen, Jaakko Lehtinen arXiv ID 1906.03973 Category cs.CV: Computer Vision Cross-listed cs.NE Citations 80 Venue arXiv.org Repository https://github.com/mkettune/elpips/ ⭐ 105 Last Checked 1 month ago
Abstract
It has been recently shown that the hidden variables of convolutional neural networks make for an efficient perceptual similarity metric that accurately predicts human judgment on relative image similarity assessment. First, we show that such learned perceptual similarity metrics (LPIPS) are susceptible to adversarial attacks that dramatically contradict human visual similarity judgment. While this is not surprising in light of neural networks' well-known weakness to adversarial perturbations, we proceed to show that self-ensembling with an infinite family of random transformations of the input --- a technique known not to render classification networks robust --- is enough to turn the metric robust against attack, while retaining predictive power on human judgments. Finally, we study the geometry imposed by our our novel self-ensembled metric (E-LPIPS) on the space of natural images. We find evidence of "perceptual convexity" by showing that convex combinations of similar-looking images retain appearance, and that discrete geodesics yield meaningful frame interpolation and texture morphing, all without explicit correspondences.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

πŸ“œ Similar Papers

In the same crypt β€” Computer Vision