Disguised Copyright Infringement of Latent Diffusion Models
April 10, 2024 ยท Entered Twilight ยท ๐ International Conference on Machine Learning
Repo contents: FSSAAD, LICENSE, README.md, config.pkl, configs, create_poison.py, detection.ipynb, environment.yaml, evaluation, generate.sh, generate_style.sh, img, invert.sh, invert_style.sh, ldm, main.py, merge_embeddings.py, models, poison, scripts, setup.py
Authors
Yiwei Lu, Matthew Y. R. Yang, Zuoqiu Liu, Gautam Kamath, Yaoliang Yu
arXiv ID
2404.06737
Category
cs.LG: Machine Learning
Cross-listed
cs.CR
Citations
9
Venue
International Conference on Machine Learning
Repository
https://github.com/watml/disguised_copyright_infringement
โญ 2
Last Checked
1 month ago
Abstract
Copyright infringement may occur when a generative model produces samples substantially similar to some copyrighted data that it had access to during the training phase. The notion of access usually refers to including copyrighted samples directly in the training dataset, which one may inspect to identify an infringement. We argue that such visual auditing largely overlooks a concealed copyright infringement, where one constructs a disguise that looks drastically different from the copyrighted sample yet still induces the effect of training Latent Diffusion Models on it. Such disguises only require indirect access to the copyrighted material and cannot be visually distinguished, thus easily circumventing the current auditing tools. In this paper, we provide a better understanding of such disguised copyright infringement by uncovering the disguises generation algorithm, the revelation of the disguises, and importantly, how to detect them to augment the existing toolbox. Additionally, we introduce a broader notion of acknowledgment for comprehending such indirect access. Our code is available at https://github.com/watml/disguised_copyright_infringement.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Machine Learning
R.I.P.
๐ป
Ghosted
R.I.P.
๐ป
Ghosted
XGBoost: A Scalable Tree Boosting System
R.I.P.
๐ป
Ghosted
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
R.I.P.
๐ป
Ghosted
Semi-Supervised Classification with Graph Convolutional Networks
R.I.P.
๐ป
Ghosted
Proximal Policy Optimization Algorithms
R.I.P.
๐ป
Ghosted