Image generation for power personnel behaviors based on diffusion model with multimodal prompts
Zhihang ZHU, Yunfeng YAN, Donglian QI

Tab.2 Quantitative results of different methods under multiple metrics
Method                                       FID     KID    CLIP-Score   PCK/%  OKS
ControlNet[18]                               274.72  6.62   67.33/30.31  47.5   0.872
HumanSD[20]                                  331.05  11.65  62.21/29.24  89.4   0.946
PoseNet                                      130.79  5.12   87.23/30.32  75.4   0.889
PoseNet + image filter                       130.43  4.90   89.81/30.43  90.4   0.978
PoseNet + image filter + two-stage training  128.25  4.56   91.02/31.44  94.2   0.979