Tunable Soft Prompts are Messengers in Federated Learning
November 12, 2023 · Entered Twilight · Conference on Empirical Methods in Natural Language Processing
Repo contents: .flake8, .github, .gitignore, .pre-commit-config.yaml, .style.yapf, LICENSE, README.md, benchmark, doc, environment, federatedscope, materials, meta.yaml, scripts, setup.py, tests
Authors
Chenhe Dong, Yuexiang Xie, Bolin Ding, Ying Shen, Yaliang Li
arXiv ID
2311.06805
Category
cs.CL: Computation & Language
Citations
12
Venue
Conference on Empirical Methods in Natural Language Processing
Repository
https://github.com/alibaba/FederatedScope/tree/fedsp/federatedscope/nlp/fedsp
⭐ 1510
Last Checked
1 month ago
Abstract
Federated learning (FL) enables multiple participants to collaboratively train machine learning models using decentralized data sources, alleviating privacy concerns that arise from directly sharing local data. However, the lack of model privacy protection in FL becomes an unneglectable challenge, especially when people want to federally finetune models based on a proprietary large language model. In this study, we propose a novel FL training approach that accomplishes information exchange among participants via tunable soft prompts. These soft prompts, updated and transmitted between the server and clients, assume the role of the global model parameters and serve as messengers to deliver useful knowledge from the local data and global model. As the global model itself is not required to be shared and the local training is conducted based on an auxiliary model with fewer parameters than the global model, the proposed approach provides protection for the global model while reducing communication and computation costs in FL. Extensive experiments show the effectiveness of the proposed approach compared to several baselines. We have released the source code at https://github.com/alibaba/FederatedScope/tree/fedsp/federatedscope/nlp/fedsp.
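The abstract describes a federated round in which tunable soft prompts, rather than the global model's weights, travel between the server and the clients, while each client trains a smaller auxiliary model locally. Below is a minimal PyTorch sketch of what such a round could look like; it is not the released FedSP implementation (see the linked repository for that), and the class and function names (SoftPrompt, client_update, server_aggregate), the auxiliary model, and the FedAvg-style prompt averaging are illustrative assumptions.

```python
# Minimal sketch (NOT the released FedSP code) of one federated round where
# tunable soft prompts, rather than full model weights, are exchanged between
# server and clients. All names and the aggregation rule are hypothetical.
import torch
import torch.nn as nn


class SoftPrompt(nn.Module):
    """A small bank of tunable prompt embeddings prepended to the input."""

    def __init__(self, prompt_len: int, hidden_dim: int):
        super().__init__()
        self.embeddings = nn.Parameter(torch.randn(prompt_len, hidden_dim) * 0.02)

    def forward(self, input_embeds: torch.Tensor) -> torch.Tensor:
        # Prepend the prompt embeddings to every sequence in the batch.
        batch = input_embeds.size(0)
        prompt = self.embeddings.unsqueeze(0).expand(batch, -1, -1)
        return torch.cat([prompt, input_embeds], dim=1)


def client_update(auxiliary_model: nn.Module, prompt: SoftPrompt,
                  data_loader, loss_fn, lr: float = 1e-3, epochs: int = 1):
    """Local step: only the prompt and the small auxiliary model are updated;
    the proprietary global model never leaves the server."""
    optimizer = torch.optim.AdamW(
        list(prompt.parameters()) + list(auxiliary_model.parameters()), lr=lr)
    for _ in range(epochs):
        for input_embeds, labels in data_loader:
            logits = auxiliary_model(prompt(input_embeds))
            loss = loss_fn(logits, labels)
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
    # Only the prompt parameters are returned, keeping communication small.
    return prompt.embeddings.detach().clone()


def server_aggregate(client_prompts):
    """Average the returned prompts into the next-round global prompt
    (simple FedAvg-style aggregation, assumed here for illustration)."""
    return torch.stack(client_prompts, dim=0).mean(dim=0)
```

In this sketch, the only tensors crossing the network are the prompt embeddings, which is how the approach avoids exposing the global model while keeping per-round communication far below full-model exchange.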
Similar Papers
In the same crypt · Computation & Language
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding · R.I.P. · 👻 Ghosted
Language Models are Few-Shot Learners · R.I.P. · 👻 Ghosted
RoBERTa: A Robustly Optimized BERT Pretraining Approach · R.I.P. · 👻 Ghosted
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension · R.I.P. · 👻 Ghosted