Towards Modality Transferable Visual Information Representation with Optimal Model Compression

August 13, 2020 · Declared Dead · 🏛 ACM Multimedia

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Rongqun Lin, Linwei Zhu, Shiqi Wang, Sam Kwong arXiv ID 2008.05642 Category eess.IV: Image & Video Processing Cross-listed cs.CV, cs.LG, cs.MM Citations 2 Venue ACM Multimedia Last Checked 3 months ago

Abstract

Compactly representing the visual signals is of fundamental importance in various image/video-centered applications. Although numerous approaches were developed for improving the image and video coding performance by removing the redundancies within visual signals, much less work has been dedicated to the transformation of the visual signals to another well-established modality for better representation capability. In this paper, we propose a new scheme for visual signal representation that leverages the philosophy of transferable modality. In particular, the deep learning model, which characterizes and absorbs the statistics of the input scene with online training, could be efficiently represented in the sense of rate-utility optimization to serve as the enhancement layer in the bitstream. As such, the overall performance can be further guaranteed by optimizing the new modality incorporated. The proposed framework is implemented on the state-of-the-art video coding standard (i.e., versatile video coding), and significantly better representation capability has been observed based on extensive evaluations.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — Image & Video Processing

R.I.P. 👻 Ghosted

Variational image compression with a scale hyperprior

Johannes Ballé, David Minnen, ... (+3 more)

eess.IV 🏛 ICLR 📚 2.2K cites 8 years ago

📚 📚 The Cartographer

Deep Learning for Hyperspectral Image Classification: An Overview

Shutao Li, Weiwei Song, ... (+4 more)

eess.IV 🏛 IEEE TGRS 📚 1.5K cites 6 years ago

R.I.P. 👻 Ghosted

U-Net and its variants for medical image segmentation: theory and applications

Nahian Siddique, Paheding Sidike, ... (+2 more)

eess.IV 🏛 IEEE Access 📚 1.4K cites 5 years ago

R.I.P. 👻 Ghosted

Algorithm Unrolling: Interpretable, Efficient Deep Learning for Signal and Image Processing

Vishal Monga, Yuelong Li, Yonina C. Eldar

eess.IV 🏛 IEEE Signal Processing Magazine 📚 1.3K cites 6 years ago

R.I.P. 💀 404 Not Found

Lightweight Image Super-Resolution with Information Multi-distillation Network

Zheng Hui, Xinbo Gao, ... (+2 more)

eess.IV 🏛 ACM MM 📚 1.1K cites 6 years ago

R.I.P. 👻 Ghosted

Deep Learning on Image Denoising: An overview

Chunwei Tian, Lunke Fei, ... (+4 more)

eess.IV 🏛 Neural Networks 📚 941 cites 6 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 8 years ago