Front. Inform. Technol. Electron. Eng.  2016, Vol. 17 Issue (2): 122-134    DOI: 10.1631/FITEE.1500187
A social tag clustering method based on common co-occurrence group similarity
Hui-zong Li, Xue-gang Hu, Yao-jin Lin, Wei He, Jian-han Pan
1. School of Computer and Information, Hefei University of Technology, Hefei 230009, China; 2. School of Economics and Management, Anhui University of Science and Technology, Huainan 232001, China; 3. School of Computer, Minnan Normal University, Zhangzhou 363000, China
Abstract: Social tagging systems are widely used in Web 2.0, where many users rely on them to create, organize, manage, and share Internet resources freely. However, the many ambiguous and uncontrolled tags produced by these systems not only degrade the user experience but also reduce the efficiency of resource retrieval. Tag clustering aggregates tags with similar semantics and helps mitigate these problems. In this paper, we first present an approach based on common co-occurrence group similarity, which uses the ternary relation among users, resources, and tags to measure the semantic relevance between tags. We then propose a spectral clustering method to address the high dimensionality and sparsity of the annotation data. Finally, experimental results show that the proposed method is useful and efficient.

Key words: Social tagging systems; Tag co-occurrence; Spectral clustering; Group similarity
Received: 11 June 2015      Published: 02 February 2016
Cite this article:

Hui-zong Li, Xue-gang Hu, Yao-jin Lin, Wei He, Jian-han Pan. A social tag clustering method based on common co-occurrence group similarity. Front. Inform. Technol. Electron. Eng., 2016, 17(2): 122-134.

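Method: The similarity between each pair of tags is computed using common co-occurrence group similarity, and a similarity matrix is built from these values (Eq. (4)). The tags are then clustered with a spectral clustering algorithm. First, the similarity matrix is normalized by the Laplacian transformation to obtain the normalized Laplacian matrix of the tags. Next, the first k eigenvalues of this matrix and their corresponding eigenvectors are computed, and these k eigenvectors form a new feature space. Finally, the K-means algorithm groups the tags into k clusters in this space (Algorithm 1).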
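The clustering steps described above can be illustrated with a minimal Python sketch. It is not the authors' Algorithm 1. It assumes that a symmetric, non-negative tag similarity matrix `W` is already available (the common co-occurrence group similarity of Eq. (4) is not reproduced here), that the normalized Laplacian is the symmetric form L = I - D^{-1/2} W D^{-1/2}, and that "the first k eigenvalues" means the k smallest ones. The row normalization of the embedding, the farthest-point initialization of K-means, the function names, and the toy matrix are all illustrative choices rather than details from the paper.

```python
import numpy as np


def kmeans(X, k, n_iter=100):
    """Plain K-means with a deterministic farthest-point initialization (an assumption)."""
    centers = [X[0]]
    for _ in range(1, k):
        # Squared distance from every point to its nearest already-chosen center.
        d2 = np.min([((X - c) ** 2).sum(axis=1) for c in centers], axis=0)
        centers.append(X[d2.argmax()])
    centers = np.array(centers)

    for _ in range(n_iter):
        dists = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        # Recompute each center; keep the old one if a cluster becomes empty.
        new_centers = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels


def spectral_cluster(W, k):
    """Cluster tags from a symmetric, non-negative similarity matrix W."""
    W = np.asarray(W, dtype=float)
    degree = W.sum(axis=1)
    # Isolated tags (zero degree) get a zero scaling factor instead of a division by zero.
    d_inv_sqrt = np.where(degree > 0, 1.0 / np.sqrt(np.maximum(degree, 1e-12)), 0.0)
    # Symmetric normalized Laplacian: L = I - D^{-1/2} W D^{-1/2}.
    L = np.eye(len(W)) - d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]
    # eigh returns eigenvalues in ascending order, so the first k columns
    # are the eigenvectors of the k smallest eigenvalues.
    _, eigvecs = np.linalg.eigh(L)
    U = eigvecs[:, :k]
    # Normalize each row to unit length before K-means (an assumed, common variant).
    norms = np.linalg.norm(U, axis=1, keepdims=True)
    U = U / np.where(norms > 0, norms, 1.0)
    return kmeans(U, k)


# Toy similarity matrix: tags 0-2 and tags 3-5 form two tightly related groups.
W = np.full((6, 6), 0.05)
W[:3, :3] = 0.9
W[3:, 3:] = 0.9
np.fill_diagonal(W, 0.0)

labels = spectral_cluster(W, k=2)
print(labels.tolist())
```

On the toy matrix, the two blocks of mutually similar tags should end up in different clusters. For real annotation data, `W` would be sparse and much larger, so a sparse eigensolver would likely be preferable to the dense `np.linalg.eigh` used here.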

