多模态大模型边缘部署与推理加速技术综述
陈思如,舒元超

Survey on edge deployment and inference acceleration of multimodal large language models
Siru CHEN,Yuanchao SHU
图 1 典型多模态大语言模型结构
Fig.1 Typical architecture of MLLMs