基于多模态语义信息的文本生成图像方法
杨冰,周家辉,姚金良,向学勤

Text-to-image generation method based on multimodal semantic information
Bing YANG,Jiahui ZHOU,Jinliang YAO,Xueqin XIANG
图 3 语义对齐块结构
Fig.3 Structure of semantic alignment block