基于大语言模型的中文隐喻多维度评估
黄孝喜,查正超,陆诗佳

Multi-dimensional evaluation of Chinese metaphors based on large language models
Xiaoxi HUANG,Zhengchao ZHA,Shijia LU
表 3 不同评分主体之间的皮尔逊相关系数
Tab.3 Pearson correlation coefficient between different rating entities
评分主体A评分主体Brp
Qwen2.5GLM-4-Plus0.8910.008 7
ERNIE-4.0GPT-40.8780.008 2
ERNIE-4.0Qwen2.50.9130.007 3
GPT-4人工基准0.7660.011 9
ERNIE-4.0人工基准0.8070.010 1
Qwen2.5人工基准0.7830.016 3
GLM-4-Plus人工基准0.7780.014 5