Improving Statistical Fidelity for Neural Image Compression with Implicit Local Likelihood Models

January 26, 2023 · Declared Dead · 🏛 International Conference on Machine Learning

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Matthew J. Muckley, Alaaeldin El-Nouby, Karen Ullrich, Hervé Jégou, Jakob Verbeek arXiv ID 2301.11189 Category eess.IV: Image & Video Processing Cross-listed cs.AI, cs.CV, cs.IT Citations 94 Venue International Conference on Machine Learning Last Checked 3 months ago

Abstract

Lossy image compression aims to represent images in as few bits as possible while maintaining fidelity to the original. Theoretical results indicate that optimizing distortion metrics such as PSNR or MS-SSIM necessarily leads to a discrepancy in the statistics of original images from those of reconstructions, in particular at low bitrates, often manifested by the blurring of the compressed images. Previous work has leveraged adversarial discriminators to improve statistical fidelity. Yet these binary discriminators adopted from generative modeling tasks may not be ideal for image compression. In this paper, we introduce a non-binary discriminator that is conditioned on quantized local image representations obtained via VQ-VAE autoencoders. Our evaluations on the CLIC2020, DIV2K and Kodak datasets show that our discriminator is more effective for jointly optimizing distortion (e.g., PSNR) and statistical fidelity (e.g., FID) than the PatchGAN of the state-of-the-art HiFiC model. On CLIC2020, we obtain the same FID as HiFiC with 30-40\% fewer bits.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — Image & Video Processing

R.I.P. 👻 Ghosted

Variational image compression with a scale hyperprior

Johannes Ballé, David Minnen, ... (+3 more)

eess.IV 🏛 ICLR 📚 2.2K cites 8 years ago

📚 📚 The Cartographer

Deep Learning for Hyperspectral Image Classification: An Overview

Shutao Li, Weiwei Song, ... (+4 more)

eess.IV 🏛 IEEE TGRS 📚 1.5K cites 6 years ago

R.I.P. 👻 Ghosted

U-Net and its variants for medical image segmentation: theory and applications

Nahian Siddique, Paheding Sidike, ... (+2 more)

eess.IV 🏛 IEEE Access 📚 1.4K cites 5 years ago

R.I.P. 👻 Ghosted

Algorithm Unrolling: Interpretable, Efficient Deep Learning for Signal and Image Processing

Vishal Monga, Yuelong Li, Yonina C. Eldar

eess.IV 🏛 IEEE Signal Processing Magazine 📚 1.3K cites 6 years ago

R.I.P. 💀 404 Not Found

Lightweight Image Super-Resolution with Information Multi-distillation Network

Zheng Hui, Xinbo Gao, ... (+2 more)

eess.IV 🏛 ACM MM 📚 1.1K cites 6 years ago

R.I.P. 👻 Ghosted

Deep Learning on Image Denoising: An overview

Chunwei Tian, Lunke Fei, ... (+4 more)

eess.IV 🏛 Neural Networks 📚 941 cites 6 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 8 years ago