A Survey on Efficient Federated Learning Methods for Foundation Model Training
January 09, 2024 ยท The Cartographer ยท ๐ International Joint Conference on Artificial Intelligence
"No code URL or promise found in abstract"
"Title-pattern auto-detect: A Survey on Efficient Federated Learning Methods for Foundation Model Training"
Evidence collected by the PWNC Scanner
Authors
Herbert Woisetschlรคger, Alexander Isenko, Shiqiang Wang, Ruben Mayer, Hans-Arno Jacobsen
arXiv ID
2401.04472
Category
cs.LG: Machine Learning
Cross-listed
cs.AI,
cs.DC
Citations
41
Venue
International Joint Conference on Artificial Intelligence
Last Checked
8 days ago
Abstract
Federated Learning (FL) has become an established technique to facilitate privacy-preserving collaborative training across a multitude of clients. However, new approaches to FL often discuss their contributions involving small deep-learning models only and focus on training full models on clients. In the wake of Foundation Models (FM), the reality is different for many deep learning applications. Typically, FMs have already been pre-trained across a wide variety of tasks and can be fine-tuned to specific downstream tasks over significantly smaller datasets than required for full model training. However, access to such datasets is often challenging. By its design, FL can help to open data silos. With this survey, we introduce a novel taxonomy focused on computational and communication efficiency, the vital elements to make use of FMs in FL systems. We discuss the benefits and drawbacks of parameter-efficient fine-tuning (PEFT) for FL applications, elaborate on the readiness of FL frameworks to work with FMs, and provide future research opportunities on how to evaluate generative models in FL as well as the interplay of privacy and PEFT.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Machine Learning
๐ฎ
๐ฎ
The Ethereal
๐ฎ
๐ฎ
The Ethereal
Continuous control with deep reinforcement learning
๐
๐
Old Age
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
๐
๐
Old Age
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
๐
๐
Old Age
SGDR: Stochastic Gradient Descent with Warm Restarts
๐ฎ
๐ฎ
The Ethereal