基于多模态语义信息的文本生成图像方法
杨冰,周家辉,姚金良,向学勤

Text-to-image generation method based on multimodal semantic information
Bing YANG,Jiahui ZHOU,Jinliang YAO,Xueqin XIANG
表 3 基于多模态语义信息的文本生成图像方法的模块消融实验
Tab.3 Module ablation study of text-to-image generation method based on multimodal semantic information
基线星模块卷积可变形卷积语义对齐鉴别器CUB数据集COCO数据集
FID↓SCLIPFID↓SCLIP
10.080.31645.850.3338
9.700.31845.760.3352
9.890.31765.800.3343
9.970.31915.720.3368
9.680.32055.690.3341
9.620.32225.650.3382
9.930.31985.710.3375
9.560.32595.620.3405