A Survey on Efficient Federated Learning Methods for Foundation Model Training

January 09, 2024 ยท The Cartographer ยท ๐Ÿ› International Joint Conference on Artificial Intelligence

๐Ÿ“š THE CARTOGRAPHER: The Cartographer
Survey/review paper โ€” maps the landscape rather than implementing a method.

"No code URL or promise found in abstract"
"Title-pattern auto-detect: A Survey on Efficient Federated Learning Methods for Foundation Model Training"

Evidence collected by the PWNC Scanner

Authors Herbert Woisetschlรคger, Alexander Isenko, Shiqiang Wang, Ruben Mayer, Hans-Arno Jacobsen arXiv ID 2401.04472 Category cs.LG: Machine Learning Cross-listed cs.AI, cs.DC Citations 41 Venue International Joint Conference on Artificial Intelligence Last Checked 8 days ago
Abstract
Federated Learning (FL) has become an established technique to facilitate privacy-preserving collaborative training across a multitude of clients. However, new approaches to FL often discuss their contributions involving small deep-learning models only and focus on training full models on clients. In the wake of Foundation Models (FM), the reality is different for many deep learning applications. Typically, FMs have already been pre-trained across a wide variety of tasks and can be fine-tuned to specific downstream tasks over significantly smaller datasets than required for full model training. However, access to such datasets is often challenging. By its design, FL can help to open data silos. With this survey, we introduce a novel taxonomy focused on computational and communication efficiency, the vital elements to make use of FMs in FL systems. We discuss the benefits and drawbacks of parameter-efficient fine-tuning (PEFT) for FL applications, elaborate on the readiness of FL frameworks to work with FMs, and provide future research opportunities on how to evaluate generative models in FL as well as the interplay of privacy and PEFT.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Machine Learning