Please wait a minute...
Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering)  2008, Vol. 9 Issue (6): 744-757    DOI: 10.1631/jzus.A071551
Electrical & Electronic Engineering     
Bottom-up mining of XML query patterns to improve XML querying
Yi-jun BEI, Gang CHEN, Jin-xiang DONG, Ke CHEN
School of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China
Download:     PDF (0 KB)     
Export: BibTeX | EndNote (RIS)      

Abstract  Querying XML data is a computationally expensive process due to the complex nature of both the XML data and the XML queries. In this paper we propose an approach to expedite XML query processing by caching the results of frequent queries. We discover frequent query patterns from user-issued queries using an efficient bottom-up mining approach called VBUXMiner. VBUXMiner consists of two main steps. First, all queries are merged into a summary structure named “compressed global tree guide” (CGTG). Second, a bottom-up traversal scheme based on the CGTG is employed to generate frequent query patterns. We use the frequent query patterns in a cache mechanism to improve the XML query performance. Experimental results show that our proposed mining approach outperforms the previous mining algorithms for XML queries, such as XQPMinerTID and FastXMiner, and that by caching the results of frequent query patterns, XML query performance can be dramatically improved.

Key wordsXML querying      XML mining      Caching      Data mining     
Received: 16 October 2007      Published: 09 May 2008
CLC:  TP311.13  
Cite this article:

Yi-jun BEI, Gang CHEN, Jin-xiang DONG, Ke CHEN. Bottom-up mining of XML query patterns to improve XML querying. Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2008, 9(6): 744-757.

URL:

http://www.zjujournals.com/xueshu/zjus-a/10.1631/jzus.A071551     OR     http://www.zjujournals.com/xueshu/zjus-a/Y2008/V9/I6/744

[1] Feng LI, Jin MA, Jian-hua LI. Distributed anonymous data perturbation method for privacy-preserving data mining[J]. Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2009, 10(7): 952-963.
[2] Tao JIANG, Yu-cai FENG, Bin ZHANG, Zhong-sheng CAO, Ge FU, Jie SHI. Monitoring correlative financial data streams by local pattern similarity[J]. Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2009, 10(7): 937-951.
[3] Qian YE, Ling CHEN, Gen-cai CHEN. Personal continuous route pattern mining[J]. Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2009, 10(2): 221-231.
[4] Jin-hua JIANG, Ke CHEN, Xiao-yan LI, Gang CHEN, Li-dan SHOU. Efficient processing of ordered XML twig pattern matching based on extended Dewey[J]. Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2009, 10(12): 1769-1783.
[5] LI Guo-qi, SHENG Huan-ye. Classification analysis of microarray data based on ontological engineering[J]. Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2007, 8(4): 638-643.
[6] GUO Pan-hong, YANG Yang, LI Xin-you. A P2P streaming service architecture with distributed caching[J]. Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2007, 8(4): 605-614.
[7] LIU Jun-qiang, PAN Yun-he. An efficient algorithm for mining closed itemsets[J]. Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2004, 5(1): 8-15.