Non-IID Data Based Federated Learning Using Mutual Knowledge Distillation and Generative Adversarial Network

Yang He; Weimin Peng

Please submit manuscripts in either of the following two submission systems

ScholarOne Manuscripts

ScholarOne

勤云稿件系统

Search by Issue

Search by Keywords

News & AnnouncementMORE

【03-29】2015 Outstanding Reviewers
【03-27】2014 Outstanding Reviewers
【02-18】2013 Outstanding Reviewers
【12-29】The First Outstanding Reviewers
【05-04】Copyright Transfer Agreement
【04-04】To authors

Supervised by Ministry of Industry and Information Technology of The People's Republic of China Sponsored by Harbin Institute of Technology Editor-in-chief Yu Zhou ISSNISSN 1005-9113 CNCN 23-1378/T

期刊网站二维码

微信公众号二维码

Related citation:

Yang He,Weimin Peng.Non-IID Data Based Federated Learning Using Mutual Knowledge Distillation and Generative Adversarial Network[J].Journal of Harbin Institute Of Technology(New Series),2026,33(2):17-30.DOI:10.11916/j.issn.1005-9113.25011.

【Print】【HTML】【PDF download】【View/Add Comment】【Download reader】【 Close 】

←Previous|Next→

Back Issue Advanced Search

This paper has been: browsed 300times downloaded 34times	码上扫一扫！
Shared by: Wechat More Font:larger+\|default\|smaller-
Non-IID Data Based Federated Learning Using Mutual Knowledge Distillation and Generative Adversarial Network

Author Name	Affiliation
Yang He	School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou 310000, China
Weimin Peng	Key Laboratory of Discrete Industrial Internet of Things, Hangzhou Dianzi University, Hangzhou 310000, China

Abstract:

User heterogeneity in Federated Learning (FL) necessitates the re-optimization of local models, results in the loss of global knowledge, and leads to slow convergence and degraded performance. When dealing with heterogeneous clients in FL, Knowledge Distillation (KD) is a standard approach to increasing efficiency and improving generalization. However, KD relies on proxy datasets, and the underutilization of client knowledge in guiding local model learning has a negative impact on the quality of the aggregation model. Regarding these, a new FL method is proposed based on a Generative Adversarial Network (GAN) and KD, which has two training stages. At the first stage, client collaboration pre-trains a GAN to generate secondary datasets, overcoming the limitations of proxy datasets. In the second stage, the mutual KD process is implemented through the dynamic adjustment of the weights of client models to tackle the underutilization of integrated client knowledge. In the training phase, the pre-trained generator after fine-tuning can transfer knowledge from multiple local models to a global model to enhance the efficiency of KD. On the introduced benchmark datasets, the experimental results show that the proposed FL method needs fewer communication rounds and reflects better generalization than the state-of-the-art FL methods.

Key words: federated learning non-independently and identically distributed knowledge distillation generative adversarial network

DOI：10.11916/j.issn.1005-9113.25011

Clc Number:TP391.9

Fund:

Descriptions in Chinese:

基于互知识蒸馏与生成对抗网络的非独立同分布数据联邦学习

何洋¹彭维民^1,2

1.杭州电子科技大学计算机科学与技术学院，杭州 310000

2. 杭州电子科技大学计算机科学与技术学院离散工业物联网重点实验室，杭州 310000

摘要：

联邦学习（FL）中的用户异质性会导致局部模型需重新优化、全局知识流失，进而造成收敛速度变慢、模型性能下降。在处理联邦学习中的异构客户端问题时，知识蒸馏（KD）是提升训练效率与泛化能力的常用方法。然而，知识蒸馏依赖代理数据集，且客户端知识未能充分用于指导局部模型学习，会对聚合模型的质量产生不利影响。针对上述问题，本文提出一种基于生成对抗网络（GAN）与知识蒸馏的新型联邦学习方法，该方法包含两个训练阶段。第一阶段，各客户端协作预训练一个生成对抗网络以生成辅助数据集，从而突破代理数据集的限制；第二阶段，通过动态调整客户端模型权重实现互知识蒸馏过程，解决客户端整体知识利用不足的问题。在训练阶段，经微调后的预训练生成器可将多个局部模型的知识迁移至全局模型，提升知识蒸馏效率。在引入的基准数据集上的实验结果表明，与当前先进的联邦学习方法相比，本文方法所需通信轮次更少，泛化性能更优。

关键词：联邦学习；非独立同分布；知识蒸馏；生成对抗网络

Search by Issue

Search by Keywords

News & AnnouncementMORE

LINKS