TY - GEN
T1 - FedCLF – towards efficient participant selection for federated learning in heterogeneous iov networks
AU - Wijethilake, Kasun Eranda
AU - Mahmood, Adnan
AU - Sheng, Quan Z.
PY - 2025
Y1 - 2025
N2 - Federated Learning (FL) is a distributed machine learning technique that preserves data privacy by sharing only the trained parameters instead of the client data. This makes FL ideal for highly dynamic, heterogeneous, and time-critical applications, in particular, the Internet of Vehicles (IoV) networks. However, FL encounters considerable challenges in such networks owing to the high data and device heterogeneity. To address these challenges, we propose FedCLF, i.e., FL with Calibrated Loss and Feedback control, which introduces calibrated loss as a utility in the participant selection process and a feedback control mechanism to dynamically adjust the sampling frequency of the clients. The envisaged approach (a) enhances the overall model accuracy in case of highly heterogeneous data and (b) optimizes the resource utilization for resource constrained IoV networks, thereby leading to increased efficiency in the FL process. We evaluated FedCLF vis-à-vis baseline models, i.e., FedAvg, Newt, and Oort, using CIFAR-10 dataset with varying data heterogeneity. Our results depict that FedCLF significantly outperforms the baseline models by up to a 16% improvement in high data heterogeneity-related scenarios with improved efficiency via reduced sampling frequency.
AB - Federated Learning (FL) is a distributed machine learning technique that preserves data privacy by sharing only the trained parameters instead of the client data. This makes FL ideal for highly dynamic, heterogeneous, and time-critical applications, in particular, the Internet of Vehicles (IoV) networks. However, FL encounters considerable challenges in such networks owing to the high data and device heterogeneity. To address these challenges, we propose FedCLF, i.e., FL with Calibrated Loss and Feedback control, which introduces calibrated loss as a utility in the participant selection process and a feedback control mechanism to dynamically adjust the sampling frequency of the clients. The envisaged approach (a) enhances the overall model accuracy in case of highly heterogeneous data and (b) optimizes the resource utilization for resource constrained IoV networks, thereby leading to increased efficiency in the FL process. We evaluated FedCLF vis-à-vis baseline models, i.e., FedAvg, Newt, and Oort, using CIFAR-10 dataset with varying data heterogeneity. Our results depict that FedCLF significantly outperforms the baseline models by up to a 16% improvement in high data heterogeneity-related scenarios with improved efficiency via reduced sampling frequency.
KW - Federated Learning
KW - Participant Selection
KW - Internet of Vehicles
KW - Statistical Utility
KW - Feedback Control
KW - Data Heterogeneity
UR - http://www.scopus.com/inward/record.url?scp=85213297498&partnerID=8YFLogxK
U2 - 10.1007/978-981-96-0814-0_15
DO - 10.1007/978-981-96-0814-0_15
M3 - Conference proceeding contribution
AN - SCOPUS:85213297498
SN - 9789819608133
T3 - Lecture Notes in Computer Science
SP - 223
EP - 238
BT - Advanced Data Mining and Applications
A2 - Sheng, Quan Z.
A2 - Dobbie, Gill
A2 - Jiang, Jing
A2 - Zhang, Xuyun
A2 - Zhang, Wei Emma
A2 - Manolopoulos, Yannis
A2 - Wu, Jia
A2 - Mansoor, Wathiq
A2 - Ma, Congbo
PB - Springer, Springer Nature
CY - Singapore
T2 - 20th International Conference on Advanced Data Mining Applications, ADMA 2024
Y2 - 3 December 2024 through 5 December 2024
ER -