Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering)  2005, Vol. 6 Issue (11): 23-    DOI: 10.1631/jzus.2005.A1341
    
Preserving the literary past, looking to the future: the first Hong Kong Literature Database
MA Leo F.H., WONG Rita, LAU Paul
University Library System, The Chinese University of Hong Kong, Hong Kong, China

Abstract  In the last two decades of the 20th century, interest in and emphasis on the study of Hong Kong literature increased among both academics and the general public in Hong Kong. Recognizing the emerging need for resources on Hong Kong literature, the University Library System of the Chinese University of Hong Kong set up the Hong Kong Literature Database (the “Database”) in 2000, the first Chinese literature database on the Internet. The paper examines how the Database is constructed using XML technology and a metadata schema. The Database employs Unicode UTF-8 as its internal encoding. A mapping table for traditional and simplified Chinese characters, created from Unihan, is used behind the scenes so that a user can enter either traditional or simplified Chinese characters and retrieve results in both. Currently, OCR technology has been applied to 65% of the journals, making full-text searching possible. The Chinese OCR technology is examined in greater detail. Special features of the Database, such as a page-by-page browse mode, position highlighting for full-page newspapers, and links to tables of contents and book jackets from the Library catalogue, are described. The paper also raises the problem of massive downloading and compares state-of-the-art technologies and their shortcomings. It shows how the Hong Kong Literature Database facilitates future collaboration and data exchange by using open standards, a shareable structure and the latest technology.
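
The script-independent retrieval described in the abstract can be illustrated with a minimal sketch. It is not the Database's actual implementation: the table `TRAD_TO_SIMP`, the helpers `fold` and `search`, and the sample titles in `DOCS` are all hypothetical. The sketch assumes a table derived from Unihan variant data (for example the `kSimplifiedVariant` field) and folds both the query and the indexed text to one script before comparing them.

```python
# Hypothetical excerpt of a traditional-to-simplified table. A real table
# would be generated from Unihan variant data and be far larger.
TRAD_TO_SIMP = {
    "學": "学",
    "資": "资",
    "庫": "库",
    "國": "国",
}


def fold(text: str) -> str:
    """Replace each traditional character with its simplified form.

    Characters without an entry in the table are left unchanged.
    """
    return "".join(TRAD_TO_SIMP.get(ch, ch) for ch in text)


def search(query: str, documents: list) -> list:
    """Return the documents containing the query, ignoring script differences."""
    key = fold(query)
    return [doc for doc in documents if key in fold(doc)]


# Hypothetical sample titles, two of them differing only in script.
DOCS = ["香港文學資料庫", "香港文学资料库", "中國文學史"]

if __name__ == "__main__":
    print(search("文学", DOCS))    # simplified query
    print(search("資料庫", DOCS))  # traditional query
```

Folding toward simplified characters is chosen here only because the reverse direction is one-to-many for some characters (for example 发 corresponds to both 發 and 髮); a production table would need to handle such cases explicitly.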

Key words: Hong Kong Literature; Hong Kong Literature Database; XML; Metadata schema; Database structure; Unicode UTF-8; OCR technology
Received: 05 August 2005     
CLC:  TP391  
Cite this article:

MA Leo F.H., WONG Rita, LAU Paul. Preserving the literary past, looking to the future: the first Hong Kong Literature Database. Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2005, 6(11): 23-.

URL:

http://www.zjujournals.com/xueshu/zjus-a/10.1631/jzus.2005.A1341     OR     http://www.zjujournals.com/xueshu/zjus-a/Y2005/V6/I11/23
