Generative AI for Multimedia Communication: Recent Advances, An Information-Theoretic Framework, and Future Opportunities

August 23, 2025 · Declared Dead · 🏛 ACM Multimedia

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Yili Jin, Xue Liu, Jiangchuan Liu arXiv ID 2508.17163 Category cs.MM: Multimedia Cross-listed eess.IV Citations 0 Venue ACM Multimedia Last Checked 3 months ago

Abstract

Recent breakthroughs in generative artificial intelligence (AI) are transforming multimedia communication. This paper systematically reviews key recent advancements across generative AI for multimedia communication, emphasizing transformative models like diffusion and transformers. However, conventional information-theoretic frameworks fail to address semantic fidelity, critical to human perception. We propose an innovative semantic information-theoretic framework, introducing semantic entropy, mutual information, channel capacity, and rate-distortion concepts specifically adapted to multimedia applications. This framework redefines multimedia communication from purely syntactic data transmission to semantic information conveyance. We further highlight future opportunities and critical research directions. We chart a path toward robust, efficient, and semantically meaningful multimedia communication systems by bridging generative AI innovations with information theory. This exploratory paper aims to inspire a semantic-first paradigm shift, offering a fresh perspective with significant implications for future multimedia research.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — Multimedia

🌅 🌅 Old Age

Quality Assessment of In-the-Wild Videos

Dingquan Li, Tingting Jiang, Ming Jiang

cs.MM 🏛 ACM MM 📚 375 cites 6 years ago

R.I.P. 👻 Ghosted

Viewport-Adaptive Navigable 360-Degree Video Delivery

Xavier Corbillon, Gwendal Simon, ... (+2 more)

cs.MM 🏛 ICC 📚 328 cites 9 years ago

📚 📚 The Cartographer

A Comprehensive Survey on Cross-modal Retrieval

Kaiye Wang, Qiyue Yin, ... (+3 more)

cs.MM 🏛 arXiv 📚 322 cites 9 years ago

📚 📚 The Cartographer

An Overview of Cross-media Retrieval: Concepts, Methodologies, Benchmarks and Challenges

Yuxin Peng, Xin Huang, Yunzhen Zhao

cs.MM 🏛 IEEE TCSVT 📚 309 cites 9 years ago

R.I.P. 👻 Ghosted

A Convolutional Neural Network Approach for Post-Processing in HEVC Intra Coding

Yuanying Dai, Dong Liu, Feng Wu

cs.MM 🏛 ICMM 📚 305 cites 9 years ago

R.I.P. 👻 Ghosted

Video Generation From Text

Yitong Li, Martin Renqiang Min, ... (+3 more)

cs.MM 🏛 AAAI 📚 300 cites 8 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 8 years ago