Old Age
LCM-LoRA: A Universal Stable-Diffusion Acceleration Module
November 09, 2023 · Entered Twilight · arXiv.org
Repo contents: LCM-LoRA Technical Report, LCM_Training_Script, LICENSE, README.md, cog.yaml, img2img_demo, lcm_logo.png, local_gradio, predict.py, speed_fid.png, teaser.png, tungsten_model.py
Authors
Simian Luo, Yiqin Tan, Suraj Patil, Daniel Gu, Patrick von Platen, Apolinário Passos, Longbo Huang, Jian Li, Hang Zhao
arXiv ID
2311.05556
Category
cs.CV: Computer Vision
Cross-listed
cs.LG
Citations
218
Venue
arXiv.org
Repository
https://github.com/luosiallen/latent-consistency-model
★ 4615
Last Checked
1 month ago
Abstract
Latent Consistency Models (LCMs) have achieved impressive performance in accelerating text-to-image generative tasks, producing high-quality images with minimal inference steps. LCMs are distilled from pre-trained latent diffusion models (LDMs), requiring only ~32 A100 GPU training hours. This report further extends LCMs' potential in two aspects: First, by applying LoRA distillation to Stable-Diffusion models including SD-V1.5, SSD-1B, and SDXL, we have expanded LCM's scope to larger models with significantly less memory consumption, achieving superior image generation quality. Second, we identify the LoRA parameters obtained through LCM distillation as a universal Stable-Diffusion acceleration module, named LCM-LoRA. LCM-LoRA can be directly plugged into various Stable-Diffusion fine-tuned models or LoRAs without training, thus representing a universally applicable accelerator for diverse image generation tasks. Compared with previous numerical PF-ODE solvers such as DDIM, DPM-Solver, LCM-LoRA can be viewed as a plug-in neural PF-ODE solver that possesses strong generalization abilities. Project page: https://github.com/luosiallen/latent-consistency-model.
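The abstract describes LCM-LoRA as a set of LoRA parameters that can be plugged into fine-tuned models or combined with other LoRAs without further training. As a rough illustration of why that is possible, the sketch below uses the general LoRA idea that an adapter is a low-rank additive update to a weight matrix, so several adapters can be summed onto the same base weight. This is a toy example on plain Python lists, not the authors' implementation: the matrices, the names `W_finetuned`, `B_acc`, `A_acc`, `B_style`, `A_style`, the helper `plug_in`, and the scale values are all hypothetical.

```python
# Toy illustration of plugging low-rank (LoRA) updates into a weight matrix.
# All values and names are hypothetical; no real model weights are involved.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def add_scaled(base, delta, scale=1.0):
    """Return base + scale * delta, element by element."""
    return [[w + scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(base, delta)]

def plug_in(weight, loras):
    """Add each LoRA update B @ A, weighted by its scale, to a copy of weight.

    loras is a list of (B, A, scale) tuples, where B is (out x r) and A is (r x in).
    """
    merged = [row[:] for row in weight]  # leave the original weight untouched
    for B, A, scale in loras:
        merged = add_scaled(merged, matmul(B, A), scale)
    return merged

# Hypothetical 2x2 layer weight of an already fine-tuned model.
W_finetuned = [[1.0, 0.0], [0.0, 1.0]]

# Hypothetical rank-1 factors standing in for an acceleration LoRA.
B_acc = [[1.0], [2.0]]
A_acc = [[0.5, -0.5]]

# Hypothetical rank-1 factors standing in for a separate style LoRA.
B_style = [[0.0], [1.0]]
A_style = [[1.0, 1.0]]

# Acceleration adapter alone, with no retraining of the base weight.
W_fast = plug_in(W_finetuned, [(B_acc, A_acc, 1.0)])

# Acceleration adapter combined with a style adapter at an assumed scale of 0.5.
W_fast_styled = plug_in(W_finetuned, [(B_acc, A_acc, 1.0), (B_style, A_style, 0.5)])

print(W_fast)
print(W_fast_styled)
```

Because each adapter only contributes an additive term, the base weight is never modified and adapters can be attached or removed independently, which is consistent with the "plug-in" framing in the abstract.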
Similar Papers
In the same crypt: Computer Vision
Old Age
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
R.I.P.
Ghosted
You Only Look Once: Unified, Real-Time Object Detection
Old Age
SSD: Single Shot MultiBox Detector
Old Age
Squeeze-and-Excitation Networks
R.I.P.
Ghosted