基于双向自举蒸馏的异质云-端医疗对话联邦
刘宇鹏,林明豪,张江,姚登举

Heterogeneous cloud-end medical dialogue federation based on bi-directional bootstrapping distillation
Yupeng LIU,Minghao LIN,Jiang ZHANG,Dengju YAO
表 4 不同模型参数对模型性能的影响
Tab.4 Effects of different model parameters on model performance
模型层数隐藏层维度np/106BLEU
GPT-2-small1276811716.75
GPT-224102434519.91
GPT-2-large36128076220.62
GPT-2-max481600154222.03