Please wait a minute...
浙江大学学报(工学版)  2024, Vol. 58 Issue (5): 900-907    DOI: 10.3785/j.issn.1008-973X.2024.05.003
计算机技术、通信技术     
基于异质图卷积神经网络的论点对抽取模型
刘议丹(),朱小飞*(),尹雅博
重庆理工大学 计算机科学与工程学院,重庆 400054
Heterogeneous graph convolutional neural network for argument pair extraction
Yidan LIU(),Xiaofei ZHU*(),Yabo YIN
College of Computer Science and Engineering, Chongqing University of Technology, Chongqing 400054, China
 全文: PDF(850 KB)   HTML
摘要:

针对论点对抽取任务中存在着评审段和反驳段之间交互信息难以捕获以及忽略了对句子间的相对位置信息进行建模问题, 提出基于异质图卷积神经网络的论点对抽取模型. 该模型在评审段和反驳段中构建异质图,定义2种不同类型的节点及4种不同类型的边,通过关系图卷积神经网络来更新图中节点的表示. 提出位置感知的句子对生成器,利用旋转位置编码来建模评审段和反驳段句子间的相对位置信息. 在RR-passage和RR-submission-v2数据集上进行实验,实验结果表明,提出模型的性能均优于所有的基线模型. 这表明通过构建异质图区分不同的节点类型和边的类型,设计位置感知的句子对生成器,能够提升论点对抽取模型的效果.

关键词: 论辩挖掘论点对抽取图神经网络旋转位置编码自然语言处理    
Abstract:

An argument pair extraction model based on heterogeneous graph convolutional neural network was proposed aiming at the issue of difficulty in capturing interactive information between review passage and rebuttal passage and neglecting to model relative positional information between sentences. Heterogeneous graphs were constructed within the review passage and rebuttal passage. Two types of nodes and four types of edges were defined. The relational graph convolutional neural network was utilized to update the representations of nodes within the graph. A position-aware sentence pair generator was introduced, and rotary position embedding was employed to model the relative positional information between sentences in review passage and rebuttal passage. Experimental evaluations on the RR-passage and RR-submission-v2 datasets demonstrate that the proposed model outperforms all baseline models. The performance of the argument pair extraction model can be enhanced by constructing heterogeneous graphs to distinguish between different types of nodes and edges and designing a position-aware sentence pair generator.

Key words: argument mining    argument pair extraction    graph neural network    rotary position embedding    natural language processing
收稿日期: 2023-07-03 出版日期: 2024-04-26
CLC:  TP 393  
基金资助: 国家自然科学基金资助项目(62141201);重庆市自然科学基金资助项目 (CSTB2022NSCQ-MSX1672);重庆市教育委员会科学技术研究计划资助项目(KJZD-M202201102);重庆理工大学研究生教育高质量发展行动计划资助项目(gzlcx20233230).
通讯作者: 朱小飞     E-mail: lyd@stu.cqut.edu.cn;zxf@cqut.edu.cn
作者简介: 刘议丹(1999—),男,硕士生,从事论辩挖掘的研究. orcid.org/0009-0005-1101-8294.E-mail:lyd@stu.cqut.edu.cn
服务  
把本文推荐给朋友
加入引用管理器
E-mail Alert
作者相关文章  
刘议丹
朱小飞
尹雅博

引用本文:

刘议丹,朱小飞,尹雅博. 基于异质图卷积神经网络的论点对抽取模型[J]. 浙江大学学报(工学版), 2024, 58(5): 900-907.

Yidan LIU,Xiaofei ZHU,Yabo YIN. Heterogeneous graph convolutional neural network for argument pair extraction. Journal of ZheJiang University (Engineering Science), 2024, 58(5): 900-907.

链接本文:

https://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2024.05.003        https://www.zjujournals.com/eng/CN/Y2024/V58/I5/900

图 1  HGCN-APE模型架构
图 2  HGCN-APE异质图构建
图 3  HGCN-APE超参数$\lambda $的影响
评审反驳评审反驳
NvbNtrNdevNtNfNaNsNfNaNs
4764381147647799 80023 20058 50094 90017 70067 500
表 1  论点对抽取数据集统计
数据集模型论点预测句子对预测论点对预测
P/%R/%F1/%P/%R/%F1/%P/%R/%F1/%
RR-submission-v2MT-H-LSTM-CRF70.7469.4670.0952.0546.7449.2527.2426.0026.61
MLMC69.5373.2771.3560.0146.8252.6037.1529.3832.81
MGF70.4071.8771.1334.2334.5734.40
MRC-APE-Bert73.3668.3570.7742.2634.0637.72
HGCN-APE71.8671.8071.8366.1259.4062.5842.7036.0539.09
RR-passageMT-H-LSTM-CRF71.8571.0171.4354.2843.2448.1330.0829.5529.81
MLMC66.7972.1769.3762.4942.3350.5340.2729.5334.07
MGF73.6270.8872.2238.0335.6836.82
MRC-APE-Bert66.8169.8468.2934.7035.5335.51
HGCN-APE72.5071.6172.0567.6859.2563.1845.7638.3241.71
表 2  HGCN-APE在RR-passage和RR-submission-v2数据集上的性能对比
模型论点对预测
P/%R/%F1/%
HGCN-APE45.7638.3241.71
w/o v2v44.3638.2641.09
w/o b2b44.6937.6340.86
w/o v2b41.5835.6338.38
w/o b2v42.6832.8437.12
w/o pos44.5637.6840.83
表 3  HGCN-APE在RR-passage数据集上的消融实验
数据集LP/%R/%F1/%
RR-passage133.4725.9429.23
244.0335.1139.02
345.7638.3241.71
443.7739.7441.66
544.4938.1141.05
RR-submission-v2133.4725.9429.23
240.4732.5036.05
342.7036.0539.09
442.0235.6838.59
540.9834.1537.25
表 4  HGCN-APE超参数$L$的影响
图 4  HGCN-APE在RR-passage和RR-submission-v2数据集上的损失和性能对比
样本真实评审段论点预测评审段论点真实反驳段论点预测反驳段论点真实论点对预测论点对
18-98-91-71-7(8-9)-(1-7)(6-7)-(1-7)
10-1510-158-138-13(10-15)-(8-13)(10-15)-(8-13)
6-7
210-1110-111-81-8(10-11)-(1-8)(10-11)-(1-8)
12-1212-129-119-11(12-12)-(9-11)(12-12)-(9-11)
16-1717-17(16-17)-(1-8)(17-17)-(1-8)
36-106-101-111-11(6-10)-(1-11)(6-10)-(1-11)
12-1312-1314-1814-18(12-13)-(14-18)(12-13)-(14-18)
14-1514-1521-2421-24(14-15)-(21-24)(14-15)-(21-24)
表 5  HGCN-APE案例研究
1 EGER S, DAXENBERGER J, GUREVYCH I. Neural end-to-end learning for computational argumentation mining [C]// Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. Vancouver: ACL, 2017: 11-22.
2 KURIBAYASHI T, OUCHI H, INOUE N, et al. An empirical study of span representations in argumentation structure parsing [C]// Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Florence: ACL, 2019: 4691-4698.
3 MORIO G, FUJITA K. End-to-end argument mining for discussion threads based on parallel constrained pointer architecture [C]// Proceedings of the 5th Workshop on Argument Mining. Brussels: ACL, 2018: 11-21.
4 CHAKRABARTY T, HIDEY C, MURESAN S, et al. AMPERSAND: argument mining for PERSuAsive oNline discussions [C]// Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. Hong Kong: ACL, 2019: 2933–2943.
5 魏忠钰 计算论辩技术: 迈向智能人类辩手之路[J]. 世界科学, 2023, (5): 46- 49
WEI Zhongyu Computational argumentation techniques: toward the path of intelligent human debaters[J]. World Science, 2023, (5): 46- 49
doi: 10.3969/j.issn.1000-0968.2023.05.013
6 CHENG L, BING L, YU Q, et al. APE: argument pair extraction from peer review and rebuttal via multi-task learning [C]// Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing. [S. l. ]: ACL, 2020: 7000-7011.
7 CHENG L, WU T, BING L, et al. Argument pair extraction via attention-guided multi-layer multi-cross encoding [C]// Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing. [S. l. ]: ACL, 2021: 6341-6353.
8 BAO J, LIANG B, SUN J, et al. Argument pair extraction with mutual guidance and inter-sentence relation graph [C]// Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Punta Cana: ACL, 2021: 3923-3934.
9 BAO J, SUN J, ZHU Q, et al. Have my arguments been replied to? argument pair extraction as machine reading comprehension [C]// Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics. Dublin: ACL, 2022: 29-35.
10 WALKER V, FOERSTER D, PONCE J M, et al. Evidence types, credibility factors, and patterns or soft rules for weighing conflicting evidence: argument mining in the context of legal rules governing evidence assessment [C]// Proceedings of the 5th Workshop on Argument Mining. Brussels: ACL, 2018: 68-78.
11 LE D T, NGUYEN C T, NGUYEN K A. Dave the debater: a retrieval-based and generative argumentative dialogue agent [C]// Proceedings of the 5th Workshop on Argument Mining. Brussels: ACL, 2018: 121-130.
12 ZHANG F, LITMAN D. Using context to predict the purpose of argumentative writing revisions [C]// Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. San Diego: ACL, 2016: 1424-1430.
13 JI L, WEI Z, LI J, et al. Discrete argument representation learning for interactive argument pair identification [C]// Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. [S. l. ]: ACL, 2016: 5467-5478.
14 YUAN J, WEI Z, ZHAO D, et al. Leveraging argumentation knowledge graph for interactive argument pair identification [C]// Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing. [S. l. ]: ACL, 2021: 2310-2319.
15 SHI L, GIUNCHIGLIA F, SONG R, et al. A simple contrastive learning framework for interactive argument pair identification via argument-context extraction [C]// Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. Abu Dhabi: ACL, 2022: 10027-10039.
16 KIPF T N, WELLING M. Semi-supervised classification with graph convolutional networks [EB/OL]. (2017-02-22). https://arxiv.org/abs/1609.02907.
17 VELICKOVIC P, CUCURULL G, CASANOVA A, et al. Graph attention networks [EB/OL]. (2017-10-30)[2023-06-20]. https://arxiv.org/abs/1710.10903.
18 SCHLICHTKRULL M, KIPF T N, BLOEM P, et al. Modeling relational data with graph convolutional networks [C]// The Semantic Web: 15th International Conference, Extended Semantic Web Conference. Heraklion: Springer, 2018: 593-607.
19 WANG K, SHEN W, YANG Y, et al. Relational graph attention network for aspect-based sentiment analysis [C]// Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. [S. l. ]: ACL, 2020: 3229-3238.
20 HU L, YANG T, SHI C, et al. Heterogeneous graph attention networks for semi-supervised short text classification [C]// Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. Hong Kong: ACL, 2019: 4821-4830.
21 VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need [C]// Proceedings of the 31st lnternational Conference on Neural lnformation Processing Systems. California: Curran Associates Inc., 2017: 6000-6010.
22 DEVLIN J, CHANG M W, LEE K, et al. Bert: pre-training of deep bidirectional transformers for language understanding [C]// Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics . Minneapolis: ACL, 2019: 4171-4186.
23 SRIVASTAVA N, HINTON G, KRIZHEVSKY A, et al Dropout: a simple way to prevent neural networks from overfitting[J]. The Journal of Machine Learning Research, 2014, 15 (1): 1929- 1958
24 HOCHREITER S, SCHMIDHUBER J Long short-term memory[J]. Neural Computation, 1997, 9 (8): 1735- 1780
doi: 10.1162/neco.1997.9.8.1735
25 SU J, LU Y, PAN S, et al. Roformer: enhanced transformer with rotary position embedding [EB/OL]. (2021-04-09). https://arxiv.org/abs/2104.09864.
26 LAFFERTY J, MCCALLUM A, PEREIRA F C N. Conditional random fields: probabilistic models for segmenting and labeling sequence data [C]// Proceedings of the 18th International Conference on Machine Learning. San Francisco: Morgan Kaufmann, 2001: 282-289.
27 BELTAGY I, PETERS M E, COHAN A. Longformer: the long-document transformer [EB/OL]. (2020-05-10). https://arxiv.org/abs/2004.05150.
[1] 姚鑫骅,于涛,封森文,马梓健,栾丛丛,沈洪垚. 基于图神经网络的零件机加工特征识别方法[J]. 浙江大学学报(工学版), 2024, 58(2): 349-359.
[2] 程艳芬,吴家俊,何凡. 基于关系门控图卷积网络的方面级情感分析[J]. 浙江大学学报(工学版), 2023, 57(3): 437-445.
[3] 周青松,蔡晓东,刘家良. 结合社交影响和长短期偏好的个性化推荐算法[J]. 浙江大学学报(工学版), 2023, 57(3): 495-502.
[4] 曾菊香,王平辉,丁益东,兰林,蔡林熹,管晓宏. 面向节点分类的图神经网络节点嵌入增强模型[J]. 浙江大学学报(工学版), 2023, 57(2): 219-225.
[5] 张京京,张兆功,许鑫. 融合图增强和采样策略的图卷积协同过滤模型[J]. 浙江大学学报(工学版), 2023, 57(2): 243-251.
[6] 周天琪,杨艳,张继杰,殷少伟,郭增强. 基于无负样本损失和自适应增强的图对比学习[J]. 浙江大学学报(工学版), 2023, 57(2): 259-266.
[7] 陆佳炜,李端倪,王策策,徐俊,肖刚. 基于超图卷积神经网络的多行为感知服务推荐方法[J]. 浙江大学学报(工学版), 2023, 57(10): 1977-1986.
[8] 王婷,朱小飞,唐顾. 基于知识增强的图卷积神经网络的文本分类[J]. 浙江大学学报(工学版), 2022, 56(2): 322-328.
[9] 许佳辉,王敬昌,陈岭,吴勇. 基于图神经网络的地表水水质预测模型[J]. 浙江大学学报(工学版), 2021, 55(4): 601-607.
[10] 张晗, 罗森林, 邹丽丽, 石秀民. 融合句义分析的跨文本人名消歧[J]. 浙江大学学报(工学版), 2015, 49(4): 717-723.
[11] 林萌, 罗森林, 贾丛飞, 韩磊, 原玉娇, 潘丽敏. 融合句义结构模型的微博话题摘要算法[J]. 浙江大学学报(工学版), 2015, 49(12): 2316-2325.
[12] 韩磊, 罗森林, 潘丽敏, 魏超. 融合词法和句法特征的汉语谓词高精度识别方法[J]. 浙江大学学报(工学版), 2014, 48(12): 2107-2114.