FRDiff : Feature Reuse for Universal Training-free Acceleration of Diffusion Models

December 06, 2023 · Declared Dead · 🏛 European Conference on Computer Vision

Authors: Junhyuk So, Jungwon Lee, Eunhyeok Park
arXiv ID: 2312.03517
Category: cs.CV (Computer Vision)
Cross-listed: cs.AI
Citations: 16
Venue: European Conference on Computer Vision
Repository: https://github.com/ECoLab-POSTECH/FRDiff
Last Checked: 1 month ago

Abstract

The substantial computational costs of diffusion models, especially due to the repeated denoising steps necessary for high-quality image generation, present a major obstacle to their widespread adoption. While several studies have attempted to address this issue by reducing the number of score function evaluations (NFE) using advanced ODE solvers without fine-tuning, the decreased number of denoising iterations misses the opportunity to update fine details, resulting in noticeable quality degradation. In our work, we introduce an advanced acceleration technique that leverages the temporal redundancy inherent in diffusion models. Reusing feature maps with high temporal similarity opens up a new opportunity to save computation resources without compromising output quality. To realize the practical benefits of this intuition, we conduct an extensive analysis and propose a novel method, FRDiff. FRDiff is designed to harness the advantages of both reduced NFE and feature reuse, achieving a Pareto frontier that balances fidelity and latency trade-offs in various generative tasks.
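
The feature-reuse idea in the abstract can be illustrated with a toy loop. The sketch below is not the FRDiff implementation: `ToyResidualBlock`, `denoise`, and `reuse_interval` are hypothetical names, and the "denoiser" is a stand-in function rather than a real diffusion model. It only shows the general pattern of caching an expensive block's output and reusing it across adjacent timesteps, assuming those outputs change slowly over time.

```python
import math


class ToyResidualBlock:
    """Hypothetical stand-in for an expensive residual block.

    It counts how many times it is evaluated so the saving is visible.
    """

    def __init__(self):
        self.calls = 0

    def __call__(self, x, t):
        self.calls += 1
        # Arbitrary smooth function of the input and the (toy) timestep.
        return [-math.tanh(v) * t for v in x]


def denoise(x, num_steps, reuse_interval, block):
    """Toy denoising loop with a skip connection: x <- x + 0.1 * block(x, t).

    The block is recomputed only every `reuse_interval` steps; in between,
    the cached output from the last full evaluation is reused.
    """
    x = list(x)
    cached = None
    for step in range(num_steps):
        t = 1.0 - step / num_steps  # toy time running from 1 towards 0
        if cached is None or step % reuse_interval == 0:
            cached = block(x, t)  # full evaluation, refresh the cache
        x = [xi + 0.1 * ri for xi, ri in zip(x, cached)]
    return x


if __name__ == "__main__":
    x0 = [0.5, -1.0, 2.0]

    full_block = ToyResidualBlock()
    out_full = denoise(x0, num_steps=20, reuse_interval=1, block=full_block)

    reuse_block = ToyResidualBlock()
    out_reuse = denoise(x0, num_steps=20, reuse_interval=4, block=reuse_block)

    print("block evaluations without reuse:", full_block.calls)
    print("block evaluations with reuse:   ", reuse_block.calls)
    print("max abs difference in output:   ",
          max(abs(a - b) for a, b in zip(out_full, out_reuse)))
```

In this toy setting the number of sampling steps stays the same while the block is evaluated less often. How far the reused output drifts from the fully recomputed one depends on how similar the features are between adjacent steps, which is the property the abstract refers to as temporal redundancy.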


📜 Similar Papers

In the same crypt — Computer Vision

🌅 Old Age

Deep Residual Learning for Image Recognition

Kaiming He, Xiangyu Zhang, ... (+2 more)

cs.CV 🏛 CVPR 📚 220.4K cites 10 years ago

🌅 Old Age

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Shaoqing Ren, Kaiming He, ... (+2 more)

cs.CV 🏛 IEEE TPAMI 📚 70.4K cites 10 years ago

R.I.P. 👻 Ghosted

You Only Look Once: Unified, Real-Time Object Detection

Joseph Redmon, Santosh Divvala, ... (+2 more)

cs.CV 🏛 CVPR 📚 43.4K cites 10 years ago

🌅 Old Age

SSD: Single Shot MultiBox Detector

Wei Liu, Dragomir Anguelov, ... (+5 more)

cs.CV 🏛 ECCV 📚 33.8K cites 10 years ago

🌅 Old Age

Squeeze-and-Excitation Networks

Jie Hu, Li Shen, ... (+3 more)

cs.CV 🏛 CVPR 📚 32.3K cites 8 years ago

R.I.P. 👻 Ghosted

Rethinking the Inception Architecture for Computer Vision

Christian Szegedy, Vincent Vanhoucke, ... (+3 more)

cs.CV 🏛 CVPR 📚 30.2K cites 10 years ago

Died the same way — ⚰️ The Empty Tomb

R.I.P. ⚰️ The Empty Tomb

DSFD: Dual Shot Face Detector

Jian Li, Yabiao Wang, ... (+7 more)

cs.CV 🏛 CVPR 📚 462 cites 7 years ago

R.I.P. ⚰️ The Empty Tomb

InstanceCut: from Edges to Instances with MultiCut

Alexander Kirillov, Evgeny Levinkov, ... (+3 more)

cs.CV 🏛 CVPR 📚 261 cites 9 years ago

R.I.P. ⚰️ The Empty Tomb

FLNet: Landmark Driven Fetching and Learning Network for Faithful Talking Facial Animation Synthesis

Kuangxiao Gu, Yuqian Zhou, Thomas Huang

cs.CV 🏛 AAAI 📚 62 cites 6 years ago

R.I.P. ⚰️ The Empty Tomb

Personalized Showcases: Generating Multi-Modal Explanations for Recommendations

An Yan, Zhankui He, ... (+3 more)

cs.IR 🏛 SIGIR 📚 58 cites 3 years ago