Front. Inform. Technol. Electron. Eng.  2018, Vol. 19 Issue (11): 1409-1419    
    
Semantic composition of distributed representations for query subtopic mining
Wei SONG, Ying LIU, Li-zhen LIU, Han-shi WANG
Information and Engineering College, Capital Normal University, Beijing 100048, China
Abstract: Inferring query intent is important in information retrieval tasks. Query subtopic mining aims to find possible subtopics for a given query to represent its potential intents. Subtopic mining is challenging because queries are short. Methods for learning distributed representations of words or word sequences have developed rapidly in recent years and have had a great impact on many fields. It is still not clear whether distributed representations are effective in alleviating the challenges of query subtopic mining. In this paper, we explore and compare the main semantic composition strategies of distributed representations for query subtopic mining. Specifically, we focus on two types of distributed representations: the paragraph vector, which directly represents word sequences of arbitrary length, and word vector composition. We thoroughly investigate the impacts of semantic composition strategies and of the types of data used for learning distributed representations. Experiments were conducted on a public dataset offered by the National Institute of Informatics Testbeds and Community for Information Access Research (NTCIR). The empirical results show that distributed semantic representations can achieve outstanding performance for query subtopic mining compared with traditional semantic representations. Further insights are also reported.
Key words: Subtopic mining; Query intent; Distributed representation; Semantic composition
Received: 2016-08-15    Published: 2019-06-13

Cite this article:

Wei SONG, Ying LIU, Li-zhen LIU, Han-shi WANG. Semantic composition of distributed representations for query subtopic mining. Front. Inform. Technol. Electron. Eng., 2018, 19(11): 1409-1419.

Link to this article:

http://www.zjujournals.com/xueshu/fitee/CN/
http://www.zjujournals.com/xueshu/fitee/CN/Y2018/V19/I11/1409
