基于双向自举蒸馏的异质云-端医疗对话联邦

doi:10.3785/j.issn.1008-973X.2024.10.009

浙江大学学报(工学版)

2024, Vol. 58

Issue (10): 2062-2068 DOI: 10.3785/j.issn.1008-973X.2024.10.009

计算机与控制工程

基于双向自举蒸馏的异质云-端医疗对话联邦

刘宇鹏(

),林明豪,张江,姚登举

哈尔滨理工大学计算机科学与技术学院，黑龙江哈尔滨 150080

Heterogeneous cloud-end medical dialogue federation based on bi-directional bootstrapping distillation

Yupeng LIU(

),Minghao LIN,Jiang ZHANG,Dengju YAO

School of Computer Science and Technology, Harbin University of Science and Technology, Harbin 150080, China

全文: PDF(721 KB) HTML

摘要：

医疗对话场景下的数据/模型异质、数据类型不同，为此提出新的联邦学习方法. 云模型和端模型以相互自举蒸馏的方式进行知识递进传递. 端到云的自举蒸馏过程为多教师-单学生模式，知识被从多个局部模型蒸馏统一到全局模型；云到端的自举蒸馏过程为单教师-多学生模式，知识被从全局模型蒸馏回多个局部模型. 在医疗对话ReMeDi和MedDG数据集上，所提方法与经典基线相比通过文本生成指标评价获得了显著提高，训练速度有所提升.

关键词： 自举蒸馏; 异质数据; 异质模型; 结构正则; 医疗对话

Abstract:

A new federated learning method was proposed in the medical dialogue scene for the heterogeneous data/models and different types of data. The cloud model and the end model transferred knowledge by mutual bootstrapping distillation. The end-to-cloud bootstrapping distillation process was a multi-teacher-single-student model, and knowledge was distilled from multiple local models to a global model. The cloud-to-end bootstrapping distillation process was a single-teacher-multi-student model, and knowledge was distilled from the global model back to multiple local models. On the medical dialogue ReMeDi and MedDG data sets, the proposed method is significantly improved compared with the classical baseline by the text generation evaluation criterion, and the training speed has also been improved.

Key words: bootstrapping distillation heterogenous data heterogenous model structure regularization medical dialogue

收稿日期: 2023-07-29 出版日期: 2024-09-27

CLC:

TP 393

基金资助: 国家自然科学基金资助项目（61300115, 62172128）.

作者简介: 刘宇鹏（1978—），男，博士，教授，从事自然语言处理研究. orcid.org/0000-0002-8437-6894. E-mail：flyeaglelyp@hrbust.edu.cn

	服务
	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	作者相关文章
	刘宇鹏
	林明豪
	张江
	姚登举

引用本文:

刘宇鹏,林明豪,张江,姚登举. 基于双向自举蒸馏的异质云-端医疗对话联邦[J]. 浙江大学学报(工学版), 2024, 58(10): 2062-2068.

Yupeng LIU,Minghao LIN,Jiang ZHANG,Dengju YAO. Heterogeneous cloud-end medical dialogue federation based on bi-directional bootstrapping distillation. Journal of ZheJiang University (Engineering Science), 2024, 58(10): 2062-2068.

链接本文:

https://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2024.10.009 或 https://www.zjujournals.com/eng/CN/Y2024/V58/I10/2062

图 1 基于双向自举蒸馏的联邦学习方法

表 1 不同联邦学习方法在2个数据集上的性能比较

图 2 同质数据下的模型表现

图 3 异质数据下的模型表现

表 2 各客户端的模型参数

图 4 各客户端的模型性能变化

表 3 温度对模型性能的影响

表 4 不同模型参数对模型性能的影响

1	YAO A C. Protocols for secure computations [C]// Proceedings of 23rd Annual Symposium on Foundations of Computer Science . Chicago: IEEE, 1982: 160–164.
2	GOLDREICH O, MICALI S, WIGDERSON A. How to play any mental game [C]// Proceedings of the Nineteenth Annual ACM Symposium on Theory of Computing . New York: [s.n.], 1987: 218–229.
3	SHAMIR A How to share a secret[J]. Communications of the ACM, 1979, 22 (11): 612- 613 doi: 10.1145/359168.359176
4	KONEČNÝ J, MCMAHAN H B, RAMAGE D, et al. Federated optimization: distributed machine learning for on-device intelligence [EB/OL]. (2016−10−08) [2022−12−01]. https://arxiv.org/pdf/1610.02527.
5	TAN Y, LONG G, LIU L, et al. FedProto: federated prototype learning across heterogeneous clients [C]// Proceedings of the AAAI Conference on Artificial Intelligence . [S.l.]: AAAI Press, 2022: 8432−8440.
6	LI D, WANG J. FedMD: heterogenous federated learning via model distillation [EB/OL]. (2019−10−08)[2022−12−01]. https://arxiv.org/pdf/1910.03581.
7	MCMAHAN H B, MOORE E, RAMAGE D, et al. Communication-efficient learning of deep networks from decentralized data [EB/OL]. (2023−01−26) [2023−12−01]. https://arxiv.org/pdf/1602.05629.
8	HANZELY F, RICHTÁRIK P. Federated learning of a mixture of global and local models [EB/OL]. (2021−02−12)[2022−12−01]. https://arxiv.org/pdf/2002.05516.
9	HUANG L, YIN Y, FU Z, et al. LoAdaBoost: loss-based AdaBoost federated machine learning with reduced computational complexity on IID and non-IID intensive care data [J]. PLoS ONE , 2020, 15(4): e0230706.
10	HINTON G, VINYALS O, DEAN J. Distilling the knowledge in a neural network [EB/OL]. (2015−03−09)[2022−12−01]. https://arxiv.org/pdf/1503.02531.
11	FURLANELLO T, LIPTON Z C, TSCHANNEN M, et al. Born-again neural networks [EB/OL]. (2018−06−29)[2022−12−01]. https://arxiv.org/pdf/1805.04770.
12	KIMURA A, GHAHRAMANI Z, TAKEUCHI K, et al. Few-shot learning of neural networks from scratch by pseudo example optimization [EB/OL]. (2018−07−05)[2022−12−01]. https://arxiv.org/pdf/1802.03039.
13	LOPES R G, FENU S, STARNER T. Data-free knowledge distillation for deep neural networks [EB/OL]. (2017−11−23)[2022−12−01]. https://arxiv.org/pdf/1710.07535.
14	NAYAK G K, MOPURI K R, SHAJ V, et al. Zero-shot knowledge distillation in deep networks [EB/OL]. (2019−05−20) [2022−12−01]. https://arxiv.org/pdf/1905.08114.
15	CHEN H, WANG Y, XU C, et al. Data-free learning of student networks [C]// Proceedings of the IEEE/CVF International Conference on Computer Vision . Seoul: IEEE, 2019: 3514–3522.
16	FANG G, SONG J, SHEN C, et al. Data-free adversarial distillation [EB/OL]. (2020−03−02) [2022−12−01]. https://arxiv.org/pdf/1912.11006.
17	JEONG E, OH S, KIM H, et al. Communication-efficient on-device machine learning: federated distillation and augmentation under non-IID private data [EB/OL]. (2023−10−19)[2023−12−01]. https://arxiv.org/pdf/1811.11479.
18	ITAHARA S, NISHIO T, KODA Y, et al Distillation-based semi-supervised federated learning for communication-efficient collaborative training with non-IID private data[J]. IEEE Transactions on Mobile Computing, 2021, 22 (1): 191- 205
19	LIN T, KONG L, STICH S U, et al. Ensemble distillation for robust model fusion in federated learning [C]// Proceedings of the 34th International Conference on Neural Information Processing Systems . [S.l.]: CAI, 2020: 2351−2363.
20	CHANDRAKALA S, JAYALAKSHMI S L Generative model driven representation learning in a hybrid framework for environmental audio scene and sound event recognition[J]. IEEE Transactions on Multimedia, 2019, 22 (1): 3- 14
21	ARIVAZHAGAN M G, AGGARWAL V, SINGH A K, et al. Federated learning with personalization layers [EB/OL]. (2019−12−02) [2022−12−01]. https://arxiv.org/pdf/1912.00818.
22	ZHU Z, HONG J, ZHOU J. Data-free knowledge distillation for heterogeneous federated learning [EB/OL]. (2021−06−09) [2022−12−01]. https://arxiv.org/pdf/2105.10056.
23	RADFORD A, WU J, CHILD R, et al. Language models are unsupervised multitask learners [EB/OL]. [2022−12−01]. https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf.
24	YAN G, PEI J, REN P, et al. ReMeDi: resources for multi-domain, multi-service, medical dialogues [EB/OL]. (2022−03−01) [2022−12−01]. https://arxiv.org/pdf/2109.00430.
25	LIU W, TANG J, CHENG Y, et al. MedDG: an entity-centric medical consultation dataset for entity-aware medical dialogue generation [C]// Natural Language Processing and Chinese Computing . [S.l.]: Springer, 2022: 447−459.

[1]	王友卫,王炜琦,凤丽洲,朱建明,李洋. 基于广度-深度采样和图卷积网络的谣言检测方法[J]. 浙江大学学报(工学版), 2024, 58(10): 2040-2052.
[2]	温夏露,黄鹤,王会峰,杨澜,高涛. 基于秃鹰搜索算法优化的三维多无人机低空突防[J]. 浙江大学学报(工学版), 2024, 58(10): 2020-2030.
[3]	刘雪娇,赵祥,夏莹杰,曹天聪. 空地协同场景下具有隐私保护的高效异构认证方案[J]. 浙江大学学报(工学版), 2024, 58(10): 1981-1991.
[4]	刘欢,李云红,张蕾涛,郭越,苏雪平,朱耀麟,侯乐乐. 基于MA-ConvNext网络和分步关系知识蒸馏的苹果叶片病害识别[J]. 浙江大学学报(工学版), 2024, 58(9): 1757-1767.
[5]	叶宝林,孙瑞涛,吴维敏,陈滨,姚青. 基于异步优势演员-评论家的交通信号控制方法[J]. 浙江大学学报(工学版), 2024, 58(8): 1671-1680.
[6]	胡涛涛,贺韶君,王栋. 考虑层理倾角的炭质板岩蠕变损伤本构模型[J]. 浙江大学学报(工学版), 2024, 58(8): 1704-1716.
[7]	郝树仁,李天娥,彭辉,武海全,闫月勤,李海旺,苏宁. 联排凹曲面屋盖的风荷载特性试验研究[J]. 浙江大学学报(工学版), 2024, 58(7): 1467-1478.
[8]	李昕阳,刘为锋,郭旭宁,李云玲,朱非林,钟平安. 流域风光水电出力互补特性[J]. 浙江大学学报(工学版), 2024, 58(7): 1505-1515.
[9]	邵子豪,霍如,王志浩,倪东,谢人超. 基于区块链的移动群智感知数据处理研究综述[J]. 浙江大学学报(工学版), 2024, 58(6): 1091-1106.
[10]	叶倩琳,王万良,王铮. 多目标粒子群优化算法及其应用研究综述[J]. 浙江大学学报(工学版), 2024, 58(6): 1107-1120.
[11]	霍育福,金蓓弘,廖肇翊. 多模态信息增强的短视频推荐模型[J]. 浙江大学学报(工学版), 2024, 58(6): 1142-1152.
[12]	宋娟,贺龙喜,龙会平. 基于深度学习的隧道衬砌多病害检测算法[J]. 浙江大学学报(工学版), 2024, 58(6): 1161-1173.
[13]	邢海军,叶宇静,刘哲远,江伟建,张文博,田书欣. 含多种灵活性资源的综合能源系统低碳优化调度[J]. 浙江大学学报(工学版), 2024, 58(6): 1243-1254.
[14]	舒晴,刘喜平,谭钊,李希,万常选,刘德喜,廖国琼. 基于依存关系图注意力网络的SQL生成方法[J]. 浙江大学学报(工学版), 2024, 58(5): 908-917.
[15]	刘议丹,朱小飞,尹雅博. 基于异质图卷积神经网络的论点对抽取模型[J]. 浙江大学学报(工学版), 2024, 58(5): 900-907.

Viewed

Full text

Abstract

Cited

Shared

Discussed