Please wait a minute...
Journal of ZheJiang University (Engineering Science)  2024, Vol. 58 Issue (9): 1790-1800    DOI: 10.3785/j.issn.1008-973X.2024.09.004
    
Zero-shot object rumor detection based on contrastive learning
Ke CHEN1(),Wenhao ZHANG2
1. School of Computer, Guangdong University of Petrochemical Technology, Maoming 525000, China
2. School of Electronic and Information Engineering, Guangdong University of Petrochemical Technology, Maoming 525000, China
Download: HTML     PDF(931KB) HTML
Export: BibTeX | EndNote (RIS)      

Abstract  

Existing rumor detection models often rely on large-scale manually annotated rumor datasets, which are costly and limited in their ability to detect unknown rumors due to the reliance on features derived from debunked rumors. To address this limitation, an approach for rumor detection targeted at different objects was proposed. Leveraging the zero-shot learning, the rumor dataset was divided into multiple datasets with non-overlapping samples and contents based on different objects, enabling the zero-shot object-oriented rumor detection task. Correspondingly, a universal mask feature was constructed to represent the relationship between objects, and a proxy task was designed to differentiate the universal mask feature. Additionally, object-oriented information-assisted text was introduced to reduce noise caused by data augmentation and was linearly transformed with the original vector semantics. Then, a proxy task-based hierarchical contrastive learning model (ZPTHCL) was presented for zero-shot object-oriented rumor detection, which leveraged transfer learning for rumor detection. Finally, experiments were conducted on a zero-shot rumor dataset based on objects and four publicly available datasets, Ma-Weibo, Weibo20, Twitter15 and Twitter16, demonstrating superior performance of the proposed contrastive learning zero-shot object-oriented rumor detection model.



Key wordsrumor detection      zero-shot learning      transfer learning      proxy task      contrastive learning     
Received: 20 May 2023      Published: 30 August 2024
CLC:  TP 18  
Fund:  国家自然科学基金资助项目(61172145);广东省自然科学基金资助项目(2018A030307032);广东省普通高校重点科研平台和项目(2020ZDZX3038).
Cite this article:

Ke CHEN,Wenhao ZHANG. Zero-shot object rumor detection based on contrastive learning. Journal of ZheJiang University (Engineering Science), 2024, 58(9): 1790-1800.

URL:

https://www.zjujournals.com/eng/10.3785/j.issn.1008-973X.2024.09.004     OR     https://www.zjujournals.com/eng/Y2024/V58/I9/1790


基于对比学习的零样本对象谣言检测

现有的谣言检测模型通常依赖大规模人工标注的谣言数据集,标注成本高且谣言特征来源于已被辟谣的谣言. 为了提高模型对未知谣言的检测能力,提出面向不同对象的谣言检测方法. 基于零样本学习,将谣言数据集按照不同的对象划分为样本与内容互不重叠的多个数据集,从而实现零样本对象谣言检测任务;为了表征对象之间的关系构建通义掩码特征,从而设计区分通义掩码特征的代理任务;为了减少数据增强带来的噪声,引入面向对象的信息辅助文本作为特征,并将其与原语义向量进行线性变换. 在此基础上,提出面向零样本对象谣言检测的基于代理任务的分层对比学习模型(ZPTHCL),可以通过迁移学习进行谣言检测. 在一个基于对象的零样本谣言数据集和Ma-Weibo、Weibo20、Twitter15、Twitter16这4个公开数据集上进行实验,结果表明所提出的对比学习零样本对象谣言检测模型性能更优.


关键词: 谣言检测,  零样本学习,  迁移学习,  代理任务,  对比学习 
Fig.1 Overall framework of ZPTHCL model
Fig.2 Data augmentation process
Fig.3 Example of topic-related words
Fig.4 Masked sample example
Fig.5 Information auxiliary text corresponding to each object
数据集N
言论非谣言谣言
Ma-Weibo466423512313
Weibo20606830343034
Twitter15742370372
Twitter16412205207
Tab.1 Data statistics of four rumor detection data sets
数据集N
言论非谣言谣言
中国507247260
刘翔1386969
北京924646
地震1788989
日本884939
死亡308150158
美国1297158
Tab.2 Statistics of zero-shot object rumor data set Zeo-Weibo
方法A
Ma-WeiboWeibo20Twitter15Twitter16
基于机器学
习的方法
SVM-TS[33]88.4689.3273.8576.46
基于图结
构的方法
Ma-RvNN[38]94.8194.3193.9292.68
GCNN[37]95.1093.3187.2192.14
Bi-GCN[17]96.1291.1295.9695.15
UMLARD[22]92.8085.7090.10
基于Transformer
的方法
BERT[29]93.0396.2196.6793.20
RoBERTa[38]96.0396.1193.5693.69
Longformer[37]90.8495.6190.5790.78
PLAN[38]92.2692.5692.1394.23
ToBERT[23]98.12
基于模型融
合的方法
Wu-Stacking[41]93.4893.5292.8692.86
Bagging-BERT(2)[40]96.6796.6896.5096.50
Geng-Ensemble[43]95.6095.6795.1295.12
STANKER[21]97.4597.4697.1797.17
基于对比学
习的方法
ZPTHCL
(本研究)
98.9798.8698.8998.89
Tab.3 Accuracy of different methods on four rumor detection data sets %
方法Avg-F1Avg-RAvg-PAvg-ASCO
CNN[33]68.8276.4371.7182.9774.98
BiLSTM[34]70.6872.0270.6676.5972.48
CNN-BiLSTM[35]64.0166.5862.8475.8867.32
Arc1[22]79.6480.2679.7381.5880.30
Arc2[22]76.2977.9976.9782.3978.41
Arc3[22]74.7777.3376.4082.0277.63
BERT[29]85.2487.0983.9085.3985.40
PT-HCL[12]85.3585.0986.3984.8885.43
ZPTHCL
(本研究)
87.5887.2588.2487.1387.55
Tab.4 Results of different methods on Zeo-Weibo object rumor detection dataset %
方法AMac-F1R-F1NR-F1
PT-HCL[12]49.8848.3957.1439.64
ZPTHCL(本研究)56.5153.6041.9765.23
w/o cl51.3550.0842.1158.05
w/o au53.8153.1847.7858.59
w/o text54.0553.8756.8150.92
Tab.5 Rumor detection results from Chinese training dataset to English test dataset %
方法AMac-F1R-F1NR-F1
PT-HCL[12]50.2048.3338.5258.15
ZPTHCL(本研究)54.6050.6836.7764.59
w/o cl51.4051.3353.1849.48
w/o au52.8052.1157.8646.36
w/o text48.6048.2552.5044.01
Tab.6 Rumor detection results from English training dataset to Chinese test dataset %
ρ/%A
Ma-WeiboWeibo20Twitter15Twitter16
594.7092.0085.7574.20
1096.7493.7891.7288.45
1597.5394.1496.6194.84
2098.3395.3197.5696.07
Tab.7 Accuracy of ZPTHCL model in absence of labels on four rumor detection datasets %
方法A
北京地震刘翔美国日本死亡中国
w/o cl86.3681.6778.1793.5482.4492.3482.18
w/o au85.5682.3477.4392.9679.0091.7084.51
w/o text86.3286.0175.9094.2283.5293.5383.32
ZPTHCL(本研究)86.7187.6479.0594.9284.9393.6795.90
Tab.8 Ablation experimental results on seven object datasets %
[1]   KANTAR M. Social Media Trends [R]. London: Kantar Media, 2019.
[2]   KAPFERER J. Rumeurs-Le plus vieux média du monde [M]// Pari: Editions du Seuil, 1987: 31−33.
[3]   LAROCHELLE H, ERHAN D, BENGIO Y. Zero-data learning of new tasks [C]// Proceedings of the 23rd AAAI Conference on Artificial Intelligence . Chicago: AAAI Press, 2008: 646−651 .
[4]   CHANG M W, RATINOV L, ROTH D, et al. Importance of semantic representation: dataless classification [C]// Proceedings of the 23rd AAAI Conference on Artificial Intelligence. Chicago: AAAI Press, 2008: 830−835.
[5]   LIN H, YI P, MA J, et al. Zero-shot rumor detection with propagation structure via prompt learning [C]// Proceedings of the AAAI Conference on Artificial Intelligence . Washington: AAAI Press, 2023: 5213−5221.
[6]   SONG Y, UPADHYAY S, PENG H, et al Toward any-language zero-shot topic classification of textual documents[J]. Artificial Intelligence, 2019, 274 (C): 133- 150
[7]   SONG Y, UPADHYAY S, PENG H, et al. Cross-lingual dataless classification for many languages [C]// Proceedings of the 25th International Joint Conference on Artificial Intelligence . New York: AAAI Press, 2016: 2901−2907.
[8]   GOODFELLOW I, POUGET-ABADIE J, MIRZA M, et al Generative adversarial networks[J]. Communications of the ACM, 2020, 63 (11): 139- 44
doi: 10.1145/3422622
[9]   KINGMA D P, WELLING M. Auto-encoding variational bayes [C]// Proceedings of the International Conference on Learning Representations . Ithaca: ArXiv, 2014: 14−16.
[10]   CHEN T, KORNBLITH S, NOROUZI M, et al. A simple framework for contrastive learning of visual representations [C]// Proceedings of the International Conference on Machine Learning . [s. l. ]: PMLR, 2020: 1597−1607.
[11]   HE K, FAN H, WU Y, et al. Momentum contrast for unsupervised visual representation learning [C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition . Seattle: IEEE, 2020: 9726−9735.
[12]   LIANG B, CHEN Z X, GUI L, et al. Zero-shot stance detection via contrastive learning [C]// Proceedings of the ACM Web Conference. Lyon: ACM, 2022: 2738−2747.
[13]   VICARIO M D, QUATTROCIOCCHI W, SCALA A, et al Polarization and fake news: early warning of potential misinformation targets[J]. ACM Transactions on the Web, 2019, 13 (2): 1- 22
[14]   MEEL P, VISHWAKARMA D K Fake news, rumor, information pollution in social media and web: a contemporary survey of state-of-the-arts, challenges and opportunities[J]. Expert Systems with Applications, 2020, 153 (1): 112986
[15]   WANG Z, GUO Y Rumor events detection enhanced by encoding sentimental information into time series division and word representations[J]. Neurocomputing, 2020, 397 (2): 224- 243
[16]   KUMAR S, CARLEY K M. Tree LSTMs with convolution units to predict stance and rumor veracity in social media conversations [C]// Proceedings of the 57th annual meeting of the association for computational linguistics . Florence: ACL, 2019: 5047−5058.
[17]   BIAN T, XIAO X, XU T, et al. Rumor detection on social media with bi-directional graph convolutional networks [C]// Proceedings of the AAAI Conference on Artificial Intelligence . New York: AAAI Press, 2020: 546−556.
[18]   ZHANG Q, LIPANI A, LIANG S, et al. Reply-aided detection of misinformation via bayesian deep learning [C]// Proceedings of the World Wide Web Conference . San Francisco: ACM, 2019: 2333−2343.
[19]   RIEDEL B, AUGENSTEIN I, SPITHOURAKIS G P, et al. A simple but tough-to-beat baseline for the fake news challenge stance detection task [EB/OL]. (2018−05−21). https://doi.org/10.48550/arXiv.1707.03264.
[20]   LU Y J, LI C T. GCAN: graph-aware co-attention networks for explainable fake news detection on social media [C]// Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics . [s. l. ]: ACL, 2020: 505−514.
[21]   RAO D, MIAO X, JIANG Z, et al. STANKER: stacking network based on level-grained attention-masked BERT for rumor detection on social media [C]// Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing . Online and Punta Cana: ACL, 2021: 3347−3363.
[22]   CHEN X, ZHOU F, TRAJCEVSKI G, et al Multi-view learning with distinguishable feature fusion for rumor detection[J]. Knowledge-Based Systems, 2022, 240 (8): 108085
[23]   XU Y, GUO J, QIU W, et al. "Comments matter and the more the better!": improving rumor detection with user comments [C]// International Conference on Trust, Security and Privacy in Computing and Communications . Wuhan: IEEE, 2022: 383−390.
[24]   PUSHP P K, SRIVASTAVA M M. Train once, test anywhere: zero-shot learning for text classification [EB/OL]. (2017−12−23). https://doi.org/10.48550/arXiv.1 712.05972.
[25]   陆恒杨, 范晨悠, 吴小俊. 面向网络社交媒体的少样本新 冠谣言检测 [J]. 中文信息学报, 2022, 36(1): 135−144.
LU Hengyang, FAN Chenyou, WU Xiaojun. Few-shot COVID-19 rumor detection for online social media [J]. Journal of Chinese Information Processing . 2022, 36(1): 135−144.
[26]   ZHOU H, MA T, RONG H, et al MDMN: multi-task and domain adaptation based multi-modal network for early rumor detection[J]. Expert Systems with Applications, 2022, 195 (3): 116517
[27]   RAN H, JIA C. Unsupervised cross-domain rumor detection with contrastive learning and cross-attention [C]// Proceedings of the AAAI Conference on Artificial Intelligence . Washington: AAAI Press, 2023: 13510−13518.
[28]   MA J, GAO W, MITRA P, et al. Detecting rumors from microblogs with recurrent neural networks [C]// Proceedings of the 25th International Joint Conference on Artificial Intelligence . New York: AAAI Press, 2016: 3818−3824.
[29]   DEVLIN J, CHANG M, LEE K, et al. BERT: pre-training of deep bidirectional transformers for language understanding [C]// Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics . Minneapolis: ACL, 2019: 4171−4186.
[30]   BLEI D M, NG A Y, JORDAN M I Latent dirichlet allocation[J]. Journal of Machine Learning Research, 2003, 3 (1): 993- 1022
[31]   MA J, GAO W, WEI Z, et al. Detect rumors using time series of social context information on microblogging websites [C]// Proceedings of the 24th ACM International on Conference on Information and Knowledge Management . Melbourne : ACM , 2015: 1751−1754.
[32]   MA J, GAO W, WONG K F. Detect rumors in microblog posts using propagation structure via kernel learning [C]// Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics . Vancouver: ACL, 2017: 708−717.
[33]   LIU Z, WEI Z, ZHANG R Rumor detection based on convolutional neural network[J]. Journal of Computer Applications, 2017, 37 (11): 3053
[34]   SUJANA Y, LI J, KAO H Y. Rumor detection on twitter using multiloss hierarchical bilstm with an attenuation factor [C]// Asian Chapter of the Association for Computational Linguistics . [s. l. ]: ACL, 2020: 18−26.
[35]   RANI N, DAS P, BHARDWAJ A K. A hybrid deep learning model based on CNN-BiLSTM for rumor detection [C]// Proceedings of the 2021 6th International Conference on Communication and Electronics Systems . Coimbatre: IEEE, 2021: 1423−1427.
[36]   MA J, GAO W, JOTY S, et al An attention-based rumor detection model with tree-structured recursive neural networks[J]. ACM Transactions on Intelligent Systems and Technology, 2020, 11 (4): 1- 28
[37]   TU K, CHEN C, HOU C, et al Rumor2vec: a rumor detection framework with joint text and propagation structure representation learning[J]. Information Sciences, 2021, 560 (1): 137- 151
[38]   LIU Y, OTT M, GOYAL N, et al. Roberta: a robustly optimized Bert pretraining approach [C]// Proceedings of the 20th Chinese National Conference on Computational Linguistics . Huhhot: Chinese Information Processing Society of China, 2021: 1218−1227.
[39]   BELTAGY I, PETERS M E, COHAN A. Longformer: the long-document transformer [EB/OL]. [2020-12-02]. https://doi.org/10.48550/arXiv.2004.05150.
[40]   KHOO L M S, CHIEU H L, QIAN Z, et al. Interpretable rumor detection in microblogs by attending to user interactions [C]// Proceedings of the AAAI Conference on Artificial Intelligence . California: AAAI Press, 2020: 8783-8790.
[41]   WU Y, ZENG Y, YANG J, et al Weibo rumor recognition based on communication and stacking ensemble learning[J]. Discrete Dynamics in Nature and Society, 2020, 2020: 1- 12
[42]   RISCH J, KRESTEL R. Bagging bert models for robust aggression identification [C]// Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying . Marseille: ELRA, 2020: 55−61.
[43]   GENG Y, LIN Z, FU P, et al. Rumor detection on social media: a multi-view model using self-attention mechanism [C]// Proceedings of the Computational Science-ICCS 2019: 19th International Conference . Faro: Springer-Verlag, 2019: 339−352.
[1] Zihan ZHOU,Xumeng WANG,Wei CHEN. Interactive visualization generation method for time series data based on transfer learning[J]. Journal of ZheJiang University (Engineering Science), 2024, 58(2): 239-246.
[2] Tian-qi ZHOU,Yan YANG,Ji-jie ZHANG,Shao-wei YIN,Zeng-qiang GUO. Graph contrastive learning based on negative-sample-free loss and adaptive augmentation[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(2): 259-266.
[3] Gui-mei GU,Yao-hua JIA,Yan-hao ZHAO,Wen-hui ZHANG,Bing-xu YAN. Defect identification for catenary dropper line based on compositional zero-shot learning[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(11): 2285-2293.
[4] Xia HUA,Xin-qing WANG,Ting RUI,Fa-ming SHAO,Dong WANG. Vision-driven end-to-end maneuvering object tracking of UAV[J]. Journal of ZheJiang University (Engineering Science), 2022, 56(7): 1464-1472.
[5] You-wei WANG,Shuang TONG,Li-zhou FENG,Jian-ming ZHU,Yang LI,Fu CHEN. New inductive microblog rumor detection method based on graph convolutional network[J]. Journal of ZheJiang University (Engineering Science), 2022, 56(5): 956-966.
[6] Yi-cong GAO,Yan-kun WANG,Shao-mei FEI,Qiong LIN. Intelligent proofreading method of engineering drawing based on transfer learning[J]. Journal of ZheJiang University (Engineering Science), 2022, 56(5): 856-863, 889.
[7] Su-jia ZENG,Shan-min PANG,Wen-yu HAO. Zero-shot image classification method base on deep supervised alignment[J]. Journal of ZheJiang University (Engineering Science), 2022, 56(11): 2204-2214.
[8] Xiao-feng FU,Li NIU. Micro-expression classification based on deep convolution and auto-encoder enhancement[J]. Journal of ZheJiang University (Engineering Science), 2022, 56(10): 1948-1957.
[9] Zhi-chao CHEN,Hai-ning JIAO,Jie YANG,Hua-fu ZENG. Garbage image classification algorithm based on improved MobileNet v2[J]. Journal of ZheJiang University (Engineering Science), 2021, 55(8): 1490-1499.
[10] Zhuang KANG,Jie YANG,Hao-qi GUO. Automatic garbage classification system based on machine vision[J]. Journal of ZheJiang University (Engineering Science), 2020, 54(7): 1272-1280.
[11] Zong-li SHEN,Jian-bo YU. Wafer map defect recognition based on transfer learning and deep forest[J]. Journal of ZheJiang University (Engineering Science), 2020, 54(6): 1228-1239.
[12] Xiao-feng FU,Li NIU,Zhuo-qun HU,Jian-jun LI,Qing WU. Deep micro-expression spotting network training based on concept of transition frame[J]. Journal of ZheJiang University (Engineering Science), 2020, 54(11): 2128-2137.