Please wait a minute...
JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE)
    
Survey of opinion mining
CHEN Min1, ZHU Fan-wei2, WU Ming-hui2, YING Jing1,2
1. College of Computer Science, Zhejiang University, Hangzhou 310027, China; 2. Department of Computer Science and Engineering, Zhejiang University City College, Hangzhou 310015, China
Download:   PDF(2081KB) HTML
Export: BibTeX | EndNote (RIS)      

Abstract  

With the rapid development of Internet technology, the use of the network in people’s daily life has become increasingly popular. The appearance of blog, social networking, twitter and some other platforms, makes people express their personal views on the Internet more conveniently. Analyzing these views can provide the most authentic and useful information for consumers, businesses and government. As a result, it becomes particularly important to extract, analyze and summarize information from the massive data, which will be formed as a targeted, readable analysis result for users. In this paper, with the process of opinion mining for clues, we firstly gave a comprehensive analysis on opinion mining from the view of opinion extraction, polarity analyze and opinion summarization, describing the tasks required to complete in each step. Opinion extraction section introduced the concept of domain-dependency, and shows some methods used in extraction. In the polarity analysis section, supervised learning and unsupervised learning, qualitative judgment and quantitative judgment were compared. Summary section gave a brief introduction of several summary forms, such as aspect based, opinion comparison and key works.. Then we introduced different methods on different mining levels, such as document level, sentence level, word level and aspect level, involving techniques like information retrieval, natural language processing, machine learning, etc. Later, we summarized some remaining challenging problems on opinion mining, and proposed some improved methods respectively. For example, sometimes objective sentences also contain opinions which often be overlooked by researchers. At the same time, we still unable to determine the polarity of some complex sentence. In the end of the paper, we predicted the future research trends of opinion mining.



Published: 01 August 2014
CLC:  TP 391  
Cite this article:

CHEN Min, ZHU Fan-wei, WU Ming-hui, YING Jing. Survey of opinion mining. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2014, 48(8): 1461-1472.

URL:

http://www.zjujournals.com/eng/10.3785/j.issn.1008-973X.2014.08.016     OR     http://www.zjujournals.com/eng/Y2014/V48/I8/1461


观点挖掘综述

随着互联网技术的飞速发展,博客、社交网络、微博等平台的出现,使得人们在网络上发表个人观点变得更为方便快捷.从海量的数据中提取出消费者、商家、政府等群体需要的信息,并对其进行分析、总结,从而形成有针对性的、可读性强的分析结果.以观点挖掘的流程为线索,从观点提取、极性分析、观点总结等方面对观点挖掘做了较为全面的分析.其中观点提取部分介绍了域依赖性的概念,并例举观点句提取所采用的主要技术;极性分析部分对有监督学习与无监督学习、定性判断极性与定量判断极性进行对比;而观点总结部分则简要介绍了基于方面、观点对比和关键语句等总结的形式.还介绍了在不同的挖掘粒度上所运用的技术,涉及信息检索、自然语言处理、机器学习等.最后,总结了观点挖掘技术中依然存在的具有挑战性的问题,并预测未来观点挖掘领域的研究趋势.

[1] RAYMOND N, PATRICIA C, DENILSON B, et al. Perspectives on business intelligence [R]. Ontario: University of Waterloo, 2013.
[2] DAVE K, LAWRENCE S, PENNOCK D. Mining the peanut gallery: opinion extraction and semantic classification of product reviews [C] ∥ Proc of WWW ′03. New York: ACM, 2003: 519-529.
[3] YANG Hui, LUO Si, CALLAN J. Knowledge transfer and opinion detection in the TREC2006 blog track [C] ∥ Proc of TREC. Pennsylvania: NIST, 2006: 163-168.
[4] BLITZER J, DREDZE M, PEREIRA F. Biographies, bollywood, boom-boxes and blenders: domain adaptation for sentiment classification [C] ∥ Proc of the 45th Annual Meeting of the Association of Computational Linguistics. Prague: Cambridge University Press, 2007: 440-447.
[5] OWSLEY S, SOOD S, HAMMOND K. Domain specific affective classification of documents [C] ∥ Proc of CAAW ′06. California: AAAI, 2006: 181-183.
[6] SEKI K, UEHARA K. Adaptive subjective triggers for opinionated document retrieval [C] ∥ Proc of WSDM ′09. New York: ACM, 2009: 25-33.
[7] NA S, LEE Y, NAM S, et al. Improving opinion retrieval based on query-specific sentiment Lexicon [J]. Advances in Information Retrieval, 2009, 5478(8): 734-738.
[8] MISSEN M, BOUGHANEM M, CABANAC G. Opinion mining: reviewed from word to document level [J]. Social Network Analysis and Mining, 2013, 3(1): 107-125.
[9] HU Min-qing, LIU Bing. Mining opinion features in customer reviews [C] ∥ Proc of National Conference on Artificial. California: AAAI, 2004: 755-760.
[10] LIU Li-zhen, LV Zhi-xin. Extract product features in Chinese web for opinion mining [J]. Journal of Software, 2013, 8(3): 627-632.
[11] PAPPAS N, KATSIMPRAS G, STAMATATOS E. Extracting informative textual parts from web pages containing user-generated content [C] ∥ Proc of the 12th International Conference on Knowledge Management and Knowledge Technologies. New York: ACM, 2012.
[12] PAPPAS N, KATSIMPRAS G, STAMATATOS E. Distinguishing the popularity between topics a system for up-to-date opinion retrieval and mining in the web [J]. Computational Linguistics and Intelligent Text Processing, 2013, 7817: 197-209.
[13] CHAOVALIT P, ZHOU Li-na. Movie review mining: a comparison between supervised and unsupervised classification approaches [C] ∥ Proc of the 38th Hawaii International Conference on System Sciences. Hawaii: IEEE, 2005: 112c112c.
[14] HU Min-qing, LIU Bing. Mining and summarizing customer reviews [C] ∥ Proc of KDD ’04. New York: ACM, 2004: 168-177.
[15] KIM S, HOVY E. Determining the sentiment of opinions [C] ∥ Proc of COLING ′04. Stroudsburg: ACL, 2004
[16] ESULI A, SEBASTIANI F. SentiWordNet: a publicly available lexical resource for opinion mining [C] ∥ Proc of LREC. Genoa: European Language Resources Association, 2006: 417-422.
[17] LIU Bing. Sentiment analysis and opinion mining [M]. Florida: Morgan & Claypool Publishers, 2012.
[18] KIM H, ZHAI Cheng-xiang. Generating comparative summaries of contradictory opinions in text [C] ∥ Proc of CIKM ′09. New York: ACM, 2009: 385-394.
[19] BEINEKE P, HASTIE T, MANNING C, et al. An exploration of sentiment summarization [C] ∥ Proc of AAAI ′03. California: AAAI, 2003: 12-15.
[20] SEkI Y, EGUCHI K, KANDO N, et al. Opinion focused summarization and its analysis at DUC 2006 [C] ∥ Proc of HLT-NAACL ′06. New York: Johns Hopkins U., 2006: 122130.
[21] TURNEY P. Thumbs up or thumbs down: semantic orientation applied to unsupervised classification of reviews [C] ∥ Proc of ACL ′02. Stroudsburg: Cambridge University Press, 2002: 417-424.
[22] MISHNE G. Multiple ranking strategies for opinion retrieval in blogs [C] ∥ Proc of TREC. Pennsylvania: NIST, 2006: 78-81.
[23] GERANI S, CARMAN M, CRESTANI F. Investigating learning approaches for blog post opinion retrieval [J]. Advances in Information Retrieval, 2009, Volume 5478: 313-324.
[24] HE Ben, MACDONALD C, HE Ji-yin, et al. An effective statistical approach to blog post opinion retrieval [C] ∥ Proc of CIKM ’08. New York: ACM, 2008: 1063-1072.
[25] ZHANG Zi-qiong, YE Qiang, LAW R, et al. Automatic detection of subjective sentences based on chinese subjective patterns [J]. Cutting-edge Research Topics on Multiple Criteria Decision Making, 2009, 35(6): 29-36.
[26] RILOFF E, WIEBE J, WIKSON T. Learning subjective nouns using extraction pattern bootstrapping [C] ∥ Proc of CONLL ′03. Stroudsburg: ACL, 2003: 25-32.
[27] ZHANG Yi, XU Wei. Exact maximum likelihood estimation for word mixtures [C] ∥ Proc of ICML’02. Sydney: Morgan Kaufmann Publishers, 2002.
[28] YU Hong, HATZIVASSILOGLOU V. Towards answering opinion questions: separating facts from opinions and identifying the polarity of opinion sentences [C] ∥ Proc of EMNLP ′03. Stroudsburg: ACL, 2003: 129-136.
[29] GAMON M, AUE A, CORSTON-OLIVER S. Pulse: mining customer opinions from free text [J]. Advances in Intelligent Data Analysis VI, 2005, 3646(9): 121-132.
[30] JINDAL N, LIU Bing. Identifying comparative sentences in text documents [C] ∥ Proc of SIGIR ′06. New York: ACM, 2006: 244-251.
[31] JINDAL N, LIU Bing. Mining comparative sentences and relations [C] ∥ Proc of AAAI ′06. California: AAAI, 2006: 1331-1336.
[32] VALITUTTI A, STRAPPARAVVA C, STOCK O. Developing affective lexical resources [J]. PsychNology Journal, 2004, 2(1): 61-83.
[33] MOHAMMAD S, DUNNE C, DORR B. Generating highcoverage semantic orientation lexicons from overtly marked words and a thesaurus [C] ∥ Proc of EMNLP ′09. Stroudsburg: ACL, 2009: 599-608.
[34] KAMPS J, MARX M, MOKKEN R, et al. Using WordNet to measure semantic orientation of adjectives [C] ∥ Proc of LREC ′04. Genoa: European Language Resources Association, 2004: 1115-1118.
[35] ESULI A, SEBASTIANI F. Determining the semantic orientation of terms through gloss classification [C] ∥ Proc of CIKM ′05. New York: ACM, 2005: 617-624.
[36] HATZIVASSILOGLOU V, MCKEOWN K. Predicting the semantic orientation of adjectives [C] ∥ Proc of ACL ′98. Stroudsburg: ACL, 1998: 174-181.
[37] WILSON T, WIEBE J, HOFFMANN P. Recognizing contextual polarity in phrase-level sentiment analysis [C] ∥ Proc of HLT ′05. Stroudsburg: ACL, 2005: 347-354.
[38] MARIO C, ANDREA B, ILARIA T. Good location, terrible food: detecting feature sentiment in user-generated reviews [J]. Social Network Analysis and Mining, 2013, 3(4): 1149-1163.
[39] BARONI M, VEGNADUZZO S. Identifying subjective adjectives through web-based mutual information [C] ∥ Proc of KONVENS ′04. Vienna: UTpublications, 2004: 17-24.
[40] CAMBRiIA E, HAVASI C, HUSSAIN A. SenticNet 2: a semantic and affective resource for opinion mining and sentiment analysis [C] ∥ Proc of FLAIRS ′12. California: AAAI, 2012: 202-207.
[41] BLAIR-GOLDENSOHN S, HANNAN K. Building a sentiment summarizer for local service reviews [C] ∥ Proc of WWW ′08. Beijing: ACM, 2008.
[42] ZHUANG Li, JING Feng, ZHU Xiao-yan. Movie review mining and summarization [C] ∥ Proc of CIKM ′06. New York: ACM, 2006: 43-50.
[43] WU Yuan-bin, ZHANG Qi, HUANG Xuan-jing, et al. Phrase dependency parsing for opinion mining [C] ∥ Proc of EMNLP ′09. Stroudsburg: ACL, 2009: 1533-1541.
[44] JAKOB N, GUREVYCH I. Extracting opinion targets in a singleand cross-domain setting with conditional random fields [C] ∥ Proc of EMNLP ′10. Stroudsburg: ACL, 2010: 1035-1045.
[45] LI Fang-tao, HUANG Min-lie, ZHU Xiao-yan. Sentiment analysis with global topics and local dependency [C] ∥ Proc of AAAI ′10. California: AAAI, 2010: 1371-1376.
[46] ZHEN Hai, CHANG Kui-yu, KIM J. Implicit feature identification via co-occurrence association rule mining [J]. Computational Linguistics and Intelligent Text Processing, 2011, 6608(2): 393-404.
[47] AGGARWAL C, ZHAI Cheng-xiang. Mining text data [M].New York: Springer, 2012.
[48] BROOKE J, TOFILOSKI M, TABOADA M. Cross-linguistic sentiment analysis: from english to spanish [C] ∥ Proc of International Conference RANLP. Borovets: ACL, 2009: 50-54.
[49] WAN Xiao-jun. Using bilingual knowledge and ensemble techniques for unsupervised Chinese sentiment analysis [C] ∥ Proc of EMNLP ′08. Stroudsburg: ACL, 2008: 553561.
[50] WAN Xiao-jun. Co-training for cross-lingual sentiment classification [C] ∥ Proc of ACL ′09. Stroudsburg: ACL, 2009: 235-243.
[51] KIM S, HOVY E. Identifying and analyzing judgment opinions [C] ∥ Proc of HLT-NAACL ′06. Stroudsburg: ACL, 2006: 200-207.
[52] MIHALCEA R, BANEA C, WIEBE J. Learning multilingual subjective language via cross-lingual projections [C] ∥ Proc of ACL ′07. Stroudsburg: ACL, 2007: 976-983.
[53] NARAYANAN R, LIU Bing, Choudhary A. Sentiment analysis of conditional sentences [C] ∥ Proc of EMNLP ′09. Stroudsburg: ACL,2009: 180-189.

[1] HE Xue-jun, WANG Jin, LU Guo-dong, LIU Zhen-yu, CHEN Li, JIN Jing. 3D head portrait sculpture by industrial robot based on triangular mesh slicing and collision detection[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(6): 1104-1110.
[2] WANG Hua, HAN Tong-yang, ZHOU Ke. KeyGraph-based community detection algorithm for public security intelligence[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(6): 1173-1180.
[3] YOU Hai-hui, MA Zeng-yi, TANG Yi-jun, WANG Yue-lan, ZHENG Lin, YU Zhong, JI Cheng-jun. Soft measurement of heating value of burning municipal solid waste for circulating fluidized bed[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(6): 1163-1172.
[4] BI Xiao-jun, WANG Jia-hui. Teaching-learning-based optimization algorithm with hybrid learning strategy[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(5): 1024-1031.
[5] WANG Liang, YU Zhi-wen, GUO Bin. Moving trajectory prediction model based on double layer multi-granularity knowledge discovery[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(4): 669-674.
[6] LIAO Miao, ZHAO Yu-qian, ZENG Ye-zhan, HUANG Zhong-chao, ZHANG Bing-kui, ZOU Bei-ji. Automatic segmentation for cell images based on support vector machine and ellipse fitting[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(4): 722-728.
[7] HUANG Zheng-yu, JIANG Xin-long, LIU Jun-fa, CHEN Yi-qiang, GU Yang. Fusion feature based semi-supervised manifold localization method[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(4): 655-662.
[8] JIANG Xin-long, CHEN Yi-qiang, LIU Jun-fa, HU Li-sha, SHEN Jian-fei. Wearable system to support proximity awareness for people with autism[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(4): 637-647.
[9] MU Jing-jing, ZHAO Xin-yue, HE Zai-xing, ZHANG Shu-you. Contour reconstruction of overlapped bubbles based on concave-convex transformation and circle fitting[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(4): 714-721.
[10] DAI Cai-yan, CHEN Ling, LI Bin, CHEN Bo-lun. Sampling-based link prediction in complex networks[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(3): 554-561.
[11] LIU Lei, YANG Peng, LIU Zuo-jun. Locomotion-Mode recognition using multiple kernel relevance vector machine[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(3): 562-571.
[12] GUO Meng-li, DA Fei-peng, DENG Xing, GAI Shao-yan. 3D face recognition based on keypoints and local feature[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(3): 584-589.
[13] WANG Hai jun, GE Hong juan, ZHANG Sheng yan. Fast object tracking algorithm via kernel collaborative presentation[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(2): 399-407.
[14] ZHANG Ya nan, CHEN De yun, WANG Ying jie, LIU Yu peng. Incremental graph pattern matching based dynamic recommendation method for cold-start user[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(2): 408-415.
[15] LIU Yu peng, QIAO Xiu ming, ZHAO Shi lei, MA Chun guang. Deep combination of large-scale features in statistical machine translation[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(1): 46-56.