Please wait a minute...
Front. Inform. Technol. Electron. Eng.  2010, Vol. 11 Issue (5): 340-355    DOI: 10.1631/jzus.C0910245
    
Online detection of bursty events and their evolution in news streams
Wei Chen, Chun Chen, Li-jun Zhang, Can Wang*, Jia-jun Bu
Zhejiang Laboratory of Service Robot, Zhejiang University, Hangzhou 310027, China
Online detection of bursty events and their evolution in news streams
Wei Chen, Chun Chen, Li-jun Zhang, Can Wang*, Jia-jun Bu
Zhejiang Laboratory of Service Robot, Zhejiang University, Hangzhou 310027, China
 全文: PDF(250 KB)  
摘要: Online monitoring of temporally-sequenced news streams for interesting patterns and trends has gained popularity in the last decade. In this paper, we study a particular news stream monitoring task: timely detection of bursty events which have happened recently and discovery of their evolutionary patterns along the timeline. Here, a news stream is represented as feature streams of tens of thousands of features (i.e., keyword. Each news story consists of a set of keywords.). A bursty event therefore is composed of a group of bursty features, which show bursty rises in frequency as the related event emerges. In this paper, we give a formal definition to the above problem and present a solution with the following steps: (1) applying an online multi-resolution burst detection method to identify bursty features with different bursty durations within a recent time period; (2) clustering bursty features to form bursty events and associating each event with a power value which reflects its bursty level; (3) applying an information retrieval method based on cosine similarity to discover the event’s evolution (i.e., highly related bursty events in history) along the timeline. We extensively evaluate the proposed methods on the Reuters Corpus Volume 1. Experimental results show that our methods can detect bursty events in a timely way and effectively discover their evolution. The power values used in our model not only measure event’s bursty level or relative importance well at a certain time point but also show relative strengths of events along the same evolution.
关键词: Online event detectionEvent’s evolutionNews streamAffinity propagation    
Abstract: Online monitoring of temporally-sequenced news streams for interesting patterns and trends has gained popularity in the last decade. In this paper, we study a particular news stream monitoring task: timely detection of bursty events which have happened recently and discovery of their evolutionary patterns along the timeline. Here, a news stream is represented as feature streams of tens of thousands of features (i.e., keyword. Each news story consists of a set of keywords.). A bursty event therefore is composed of a group of bursty features, which show bursty rises in frequency as the related event emerges. In this paper, we give a formal definition to the above problem and present a solution with the following steps: (1) applying an online multi-resolution burst detection method to identify bursty features with different bursty durations within a recent time period; (2) clustering bursty features to form bursty events and associating each event with a power value which reflects its bursty level; (3) applying an information retrieval method based on cosine similarity to discover the event’s evolution (i.e., highly related bursty events in history) along the timeline. We extensively evaluate the proposed methods on the Reuters Corpus Volume 1. Experimental results show that our methods can detect bursty events in a timely way and effectively discover their evolution. The power values used in our model not only measure event’s bursty level or relative importance well at a certain time point but also show relative strengths of events along the same evolution.
Key words: Online event detection    Event’s evolution    News stream    Affinity propagation
收稿日期: 2009-04-29 出版日期: 2010-04-28
CLC:  TP391  
基金资助: Project  (No.  2008BAH26B00)  supported  by  the  National  Key Technology R & D Program of China
通讯作者: Can WANG     E-mail: wcan@zju.edu.cn
服务  
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章  
Wei Chen
Chun Chen
Li-jun Zhang
Can Wang
Jia-jun Bu

引用本文:

Wei Chen, Chun Chen, Li-jun Zhang, Can Wang, Jia-jun Bu. Online detection of bursty events and their evolution in news streams. Front. Inform. Technol. Electron. Eng., 2010, 11(5): 340-355.

链接本文:

http://www.zjujournals.com/xueshu/fitee/CN/10.1631/jzus.C0910245        http://www.zjujournals.com/xueshu/fitee/CN/Y2010/V11/I5/340

[1] Xin-zheng Xu, Shi-fei Ding, Zhong-zhi Shi, Hong Zhu. Optimizing radial basis function neural network based on rough sets and affinity propagation clustering algorithm[J]. Front. Inform. Technol. Electron. Eng., 2012, 13(2): 131-138.