TY - GEN
T1 - Towards Instant Clustering Approach for Federated Learning Client Selection
AU - Arisdakessian, Sarhad
AU - Wahab, Omar Abdel
AU - Mourad, Azzam
AU - Otrok, Hadi
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - In just few years, Federated Learning (FL) started to gain unprecedented attention given its ability to solve some fundamental privacy and communication challenges of traditional machine learning. Client selection is one of the main challenges in FL and is usually done in a random fashion, where the central server arbitrarily selects a certain number of clients to participate in each training round. However, given the heterogeneity of the client devices in terms of data quality and resource availability, randomly selecting clients is likely to result in long local training time and thus delayed global model's convergence. To address this problem, in this work, we propose a real-time and on-demand client selection mechanism that employs the DBSCAN (Density-Based Spatial clustering of Applications with Noise) clustering technique from machine learning to group the clients into a set of homogeneous clusters based on a set of criteria defined by the FL task owners, such as resource availability, data quality, data size, data freshness and non-IID degree. Based on the requirements of each FL task, the server then intelligently selects the clusters of clients that best match with each task's requirements, thus improving the performance of the overall federated learning process. Experiments suggest that our solution significantly improves the accuracy of FL compared to the Vanilla FL approach.
AB - In just few years, Federated Learning (FL) started to gain unprecedented attention given its ability to solve some fundamental privacy and communication challenges of traditional machine learning. Client selection is one of the main challenges in FL and is usually done in a random fashion, where the central server arbitrarily selects a certain number of clients to participate in each training round. However, given the heterogeneity of the client devices in terms of data quality and resource availability, randomly selecting clients is likely to result in long local training time and thus delayed global model's convergence. To address this problem, in this work, we propose a real-time and on-demand client selection mechanism that employs the DBSCAN (Density-Based Spatial clustering of Applications with Noise) clustering technique from machine learning to group the clients into a set of homogeneous clusters based on a set of criteria defined by the FL task owners, such as resource availability, data quality, data size, data freshness and non-IID degree. Based on the requirements of each FL task, the server then intelligently selects the clusters of clients that best match with each task's requirements, thus improving the performance of the overall federated learning process. Experiments suggest that our solution significantly improves the accuracy of FL compared to the Vanilla FL approach.
KW - Client Selection
KW - Clustering
KW - Federated Learning
KW - Heterogeneity in Federated Learning
UR - https://www.scopus.com/pages/publications/85152035493
U2 - 10.1109/ICNC57223.2023.10074237
DO - 10.1109/ICNC57223.2023.10074237
M3 - Conference contribution
AN - SCOPUS:85152035493
T3 - 2023 International Conference on Computing, Networking and Communications, ICNC 2023
SP - 409
EP - 413
BT - 2023 International Conference on Computing, Networking and Communications, ICNC 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2023 International Conference on Computing, Networking and Communications, ICNC 2023
Y2 - 20 February 2023 through 22 February 2023
ER -