R.I.P.
๐ป
Ghosted
Rethinking Data Heterogeneity in Federated Learning: Introducing a New Notion and Standard Benchmarks
September 30, 2022 ยท Entered Twilight ยท ๐ IEEE Transactions on Artificial Intelligence
Repo contents: .gitignore, Angle Measure Demo.ipynb, LICENSE, README.md, main.py, mix4, scripts, src
Authors
Mahdi Morafah, Saeed Vahidian, Chen Chen, Mubarak Shah, Bill Lin
arXiv ID
2209.15595
Category
cs.LG: Machine Learning
Cross-listed
cs.AI
Citations
74
Venue
IEEE Transactions on Artificial Intelligence
Repository
https://github.com/MMorafah/FL-SC-NIID
โญ 26
Last Checked
1 month ago
Abstract
Though successful, federated learning presents new challenges for machine learning, especially when the issue of data heterogeneity, also known as Non-IID data, arises. To cope with the statistical heterogeneity, previous works incorporated a proximal term in local optimization or modified the model aggregation scheme at the server side or advocated clustered federated learning approaches where the central server groups agent population into clusters with jointly trainable data distributions to take the advantage of a certain level of personalization. While effective, they lack a deep elaboration on what kind of data heterogeneity and how the data heterogeneity impacts the accuracy performance of the participating clients. In contrast to many of the prior federated learning approaches, we demonstrate not only the issue of data heterogeneity in current setups is not necessarily a problem but also in fact it can be beneficial for the FL participants. Our observations are intuitive: (1) Dissimilar labels of clients (label skew) are not necessarily considered data heterogeneity, and (2) the principal angle between the agents' data subspaces spanned by their corresponding principal vectors of data is a better estimate of the data heterogeneity. Our code is available at https://github.com/MMorafah/FL-SC-NIID.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Machine Learning
R.I.P.
๐ป
Ghosted
XGBoost: A Scalable Tree Boosting System
R.I.P.
๐ป
Ghosted
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
R.I.P.
๐ป
Ghosted
Semi-Supervised Classification with Graph Convolutional Networks
R.I.P.
๐ป
Ghosted
Proximal Policy Optimization Algorithms
R.I.P.
๐ป
Ghosted