Non-IID Data Based Federated Learning Using Mutual Knowledge Distillation and Generative Adversarial Network

Yang He; Weimin Peng

Please submit manuscripts in either of the following two submission systems

ScholarOne Manuscripts

ScholarOne

勤云稿件系统

Search by Issue

Search by Keywords

News & AnnouncementMORE

【03-29】2015 Outstanding Reviewers
【03-27】2014 Outstanding Reviewers
【02-18】2013 Outstanding Reviewers
【12-29】The First Outstanding Reviewers
【05-04】Copyright Transfer Agreement
【04-04】To authors

Supervised by Ministry of Industry and Information Technology of The People's Republic of China Sponsored by Harbin Institute of Technology Editor-in-chief Yu Zhou ISSNISSN 1005-9113 CNCN 23-1378/T

期刊网站二维码

微信公众号二维码

Related citation:

【Print】【HTML】【PDF download】【View/Add Comment】【Download reader】【 Close 】

Back Issue Advanced Search

This paper has been: browsed 178times downloaded 159times
Shared by: Wechat More Font:larger+\|default\|smaller-
Non-IID Data Based Federated Learning Using Mutual Knowledge Distillation and Generative Adversarial Network

Author Name	Affiliation	Postcode
Yang He	School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou 310000, China	310000
Weimin Peng^*	Key Laboratory of Discrete Industrial Internet of Things, School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou 310000, China	310000

Abstract:

User heterogeneity in federated learning (FL) necessitates the re-optimization of local models, results in the loss of global knowledge, and leads to slow convergence and degraded performance. When dealing with heterogeneous clients in FL, knowledge distillation (KD) is a standard approach to increasing efficiency and improving generalization. However, KD relies on proxy datasets, and the underutilization of client knowledge in guiding local model learning has a negative impact on the quality of the aggregation model. Regarding these, a new FL method is proposed based on generative adversarial network (GAN) and KD, which has two training stages. At the first stage, client collaboration pre-trains a GAN to generate secondary datasets, overcoming the limitations of proxy datasets. In the second stage, the mutual KD process is implemented through the dynamic adjustment of the weights of client models to tackle the underutilization of integrated client knowledge. In the training phase, the pre-trained generator after fine-tuning can transfer knowledge from multiple local models to a global model to enhance the efficiency of KD. On the introduced benchmark datasets, the experimental results show that the proposed FL method needs fewer communication rounds and reflects better generalization than the state-of-the-art FL methods.

Key words: federated learning, non-independently and identically distributed, knowledge distillation, generative adversarial network

DOI：10.11916/j.issn.1005-9113.25011

Clc Number:TP391.9

Fund:

Search by Issue

Search by Keywords

News & AnnouncementMORE

LINKS