|
| Abstract: |
| User heterogeneity in Federated Learning (FL) necessitates the re-optimization of local models, results in the loss of global knowledge, and leads to slow convergence and degraded performance. When dealing with heterogeneous clients in FL, Knowledge Distillation (KD) is a standard approach to increasing efficiency and improving generalization. However, KD relies on proxy datasets, and the underutilization of client knowledge in guiding local model learning has a negative impact on the quality of the aggregation model. Regarding these, a new FL method is proposed based on a Generative Adversarial Network (GAN) and KD, which has two training stages. At the first stage, client collaboration pre-trains a GAN to generate secondary datasets, overcoming the limitations of proxy datasets. In the second stage, the mutual KD process is implemented through the dynamic adjustment of the weights of client models to tackle the underutilization of integrated client knowledge. In the training phase, the pre-trained generator after fine-tuning can transfer knowledge from multiple local models to a global model to enhance the efficiency of KD. On the introduced benchmark datasets, the experimental results show that the proposed FL method needs fewer communication rounds and reflects better generalization than the state-of-the-art FL methods. |
| Key words: federated learning non-independently and identically distributed knowledge distillation generative adversarial network |
| DOI:10.11916/j.issn.1005-9113.25011 |
| Clc Number:TP391.9 |
| Fund: |
|
| Descriptions in Chinese: |
| 基于互知识蒸馏与生成对抗网络的非独立同分布数据联邦学习 何洋1彭维民1,2 1.杭州电子科技大学 计算机科学与技术学院,杭州 310000 2. 杭州电子科技大学 计算机科学与技术学院 离散工业物联网重点实验室,杭州 310000 摘要: 联邦学习(FL)中的用户异质性会导致局部模型需重新优化、全局知识流失,进而造成收敛速度变慢、模型性能下降。在处理联邦学习中的异构客户端问题时,知识蒸馏(KD)是提升训练效率与泛化能力的常用方法。然而,知识蒸馏依赖代理数据集,且客户端知识未能充分用于指导局部模型学习,会对聚合模型的质量产生不利影响。针对上述问题,本文提出一种基于生成对抗网络(GAN)与知识蒸馏的新型联邦学习方法,该方法包含两个训练阶段。第一阶段,各客户端协作预训练一个生成对抗网络以生成辅助数据集,从而突破代理数据集的限制;第二阶段,通过动态调整客户端模型权重实现互知识蒸馏过程,解决客户端整体知识利用不足的问题。在训练阶段,经微调后的预训练生成器可将多个局部模型的知识迁移至全局模型,提升知识蒸馏效率。在引入的基准数据集上的实验结果表明,与当前先进的联邦学习方法相比,本文方法所需通信轮次更少,泛化性能更优。 关键词:联邦学习;非独立同分布;知识蒸馏;生成对抗网络 |