|
Towards a multilingual, multimedia and multimodal digital library platform
HUANG Tie-jun, TIAN Yong-hong, WANG Chun-li, SHI Xiao-dong, GAO Wen
Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2005, 6(11): 3-.
https://doi.org/10.1631/jzus.2005.A1188
The China-US Million Book Digital Library Project (Million Book Project) is an international cooperation program between China and the US. However, one million digitized books are considered not to be the ultimate goal of the project, but a first step towards universal access to human knowledge. In particular, there are four challenges about the new way to analyze, process, operate, visualize and interact with digital media resource in this library. To tackle these challenges, North China Centre of Million Book Project (in Chinese Academy of Sciences) has initiated several innovative research projects in areas such as multimedia content analysis and retrieval, bilingual services, multimodal information presentation, and knowledge-based organization and services. In this keynote speech, we simply review our work in these areas, and argue that by technological cooperation with these innovation research topics, the project will develop a top-level digital library platform for the million book library.
|
|
Towards universal access to all knowledge—Internet Archive
KAHLE Brewster
Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2005, 6(11): 4-.
https://doi.org/10.1631/jzus.2005.A1193
Advances in computing and communications mean that we can cost-effectively store every book, sound recording, movie, software package, and public Web page ever created, and provide access to these collections via the Internet to students and adults all over the world. By mostly using existing institutions and funding sources, we can build this as well as compensate authors within the current worldwide library budget. We should take advantage of our new technologies and our open societies to make a Universal Library again, and go the next step and make all knowledge easily available to every man, woman and child around the world.
|
|
Global Memory Net and the development of digital image information management system: Experience and practice
ZHANG Sheng-qiang, CHEN Ching-chih
Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2005, 6(11): 9-.
https://doi.org/10.1631/jzus.2005.A1216
Global Memory Net (GMNet) is intended to be an effective gateway to the world cultural, historical, and heritage image collections from selected academic educational and research partners in the world. Much of these unique collections of great value to education and research are not currently accessible due to distance, form, and technical barriers. This project is to find new ways to enable users to access and exploit these significant research collections via global network. As GMNet is ending its first 5-year phase in October 2005, it has contributed substantially to the community building in digital library development by accommodating numerous collaborators and technical staff from various parts of the world to spend 3 to 5 months as a full-member of the GMNet team in Boston. They have come from different parts of China—such as Sichuan, Hainan, Shanghai and Xi’an; Croatia; and Hanoi, Vietnam. In addition to contribute to the overall system development and enhancement of system functionalities, they have brought valuable sample image collections of their own institutions/countries, and actually developed prototype collections as a part of GMNet. This paper describes the exciting and productive experience of the first of this visiting research group in developing the GMNet’s Version 2.0 PHP-based system under Prof. Chen’s overall supervision. It also describes both the system’s technical level structure—user/Web-based application/data, and complex functionalities with multi-collection, multi-lingual, multi-modal searching capabilities; system management capabilities; as well as provisions for user uploads and retrieval for our own projects. This Version 2.0 system is built on the Linux/Apache/PHP/MySQL platform. What is described in this paper is an actual case which has formed a base for further new development by others in the research group. It demonstrates fully the value of the synergistic collaboration among global partners for universal digital library development. More information can be found in http://www.memorynet.org/.
|
|
Spatial/temporal indexing and information visualization genre for environmental digital libraries
CHEN Su-shing, GRUNWALD Sabine
Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2005, 6(11): 12-.
https://doi.org/10.1631/jzus.2005.A1235
Protecting and preserving our environmental systems require the ability to understand the spatio-temporal distribution of soils, parent material, topography, and land cover as well as the effects of human activities on ecosystems. Space-time modelling of ecosystems in an environmental digital library is essential for visualizing past, present, and future impacts of changes occurring within such landscapes (e.g., shift in land use practices). In this paper, we describe three novel features, spatio-temporal indexing, visualization, and geostatistical genre, for the environmental digital library, Environmental Visualization and Geographic Enterprise System (ENVISAGE), currently in progress at the University of Florida.
|
|
Technical features in the Portal to CADAL
WU Jiang-qin, ZHUANG Yue-ting, PAN Yun-he
Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2005, 6(11): 13-.
https://doi.org/10.1631/jzus.2005.A1249
China-America Digital Academic Library Project (CADAL) is a collaborative project between universities and institutes in China and the USA, which aims to provide universal access to large scale digital resources and explore the ways of applying multimedia and virtual reality technologies to digital library. The distinct characteristic of the resources in CADAL is that it not only contains one million digital books of different languages, but also contains Terabyte level multimedia resources (image, video, and so on), which are utilized for education and research purposes. So, in the Portal to CADAL, both the traditional services of browsing and searching of digital books, and the services of quickly retrieving and structurally browsing of multimedia documents should be provided. In addition, the services of visual presentation of retrieved results are required too. In this paper, the underlying novel multimedia retrieval methods as well as visualization techniques, which are used in the CADAL portal, are investigated.
|
|
Exploiting multi-context analysis in semantic image classification
TIAN Yong-hong, HUANG Tie-jun, GAO Wen
Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2005, 6(11): 15-.
https://doi.org/10.1631/jzus.2005.A1268
As the popularity of digital images is rapidly increasing on the Internet, research on technologies for semantic image classification has become an important research topic. However, the well-known content-based image classification methods do not overcome the so-called semantic gap problem in which low-level visual features cannot represent the high-level semantic content of images. Image classification using visual and textual information often performs poorly since the extracted textual features are often too limited to accurately represent the images. In this paper, we propose a semantic image classification approach using multi-context analysis. For a given image, we model the relevant textual information as its multi-modal context, and regard the related images connected by hyperlinks as its link context. Two kinds of context analysis models, i.e., cross-modal correlation analysis and link-based correlation model, are used to capture the correlation among different modals of features and the topical dependency among images induced by the link structure. We propose a new collective classification model called relational support vector classifier (RSVC) based on the well-known Support Vector Machines (SVMs) and the link-based correlation model. Experiments showed that the proposed approach significantly improved classification accuracy over that of SVM classifiers using visual and/or textual features.
|
|
Enhancing the usage pattern mining performance with temporal segmentation of QPop Increment in digital libraries
CAO San-xing, KLEIN R. Rody, LIU Jian-bo
Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2005, 6(11): 17-.
https://doi.org/10.1631/jzus.2005.A1290
The convergence of next-generation Networks and the emergence of new media systems have made media-rich digital libraries popular in application and research. The discovery of media content objects’ usage patterns, where QPop Increment is the characteristic feature under study, is the basis of intelligent data migration scheduling, the very key issue for these systems to manage effectively the massive storage facilities in their backbones. In this paper, a clustering algorithm is established, on the basis of temporal segmentation of QPop Increment, so as to improve the mining performance. We employed the standard C-Means algorithm as the clustering kernel, and carried out the experimental mining process with segmented QPop Increases obtained in actual applications. The results indicated that the improved algorithm is more advantageous than the basic one in important indices such as the clustering cohesion. The experimental study in this paper is based on a Media Assets Library prototype developed for the use of the advertainment movie production project for Olympics 2008, under the support of both the Humanistic Olympics Study Center in Beijing, and China State Administration of Radio, Film and TV.
|
|
Resource discovery in distributed digital libraries through visual knowledge navigation
GU Qian-yi, AHMAD Faisal, SUMNER Tamara
Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2005, 6(11): 19-.
https://doi.org/10.1631/jzus.2005.A1306
In order to support users to search and browse for various resources, digital libraries are composed of discovery systems which provide user interface and information retrieve system. Recent researches in Information Retrieval have investigated different techniques through improving precision and recall to enhance the effectiveness of discovery system in digital libraries. In this paper, we present our work to enhance discovery system effectiveness with a different approach, through resource discovery based on visual knowledge navigation. In our Strand Map Services project under National Science Digital Library, we introduce the visual resource discovery system called conceptual browsing interfaces, to help educators and learners to locate, comprehend and use educational resources in digital libraries. The paper begins with a short introduction of the Strand Map Services. Then we illustrate the service architecture, the design and implementation of its major components. We will focus our discussion of how the visualization system of the Strand Map Services supports the visual knowledge navigation for distributed digital libraries. This includes the knowledge acquisition of the conceptual browsing interfaces, different knowledge representations in the system perspective and user interface perspective, visualization system modules, algorithm and Web services integration to use visual knowledge navigation to enhance resource discovery in digital libraries.
|
|
The Million Book Project at Bibliotheca Alexandrina
ELDAKAR Youssef, EL-GAZZAR Khalid, ADLY Noha, NAGI Magdy
Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2005, 6(11): 22-.
https://doi.org/10.1631/jzus.2005.A1327
The Bibliotheca Alexandrina (BA) has been developing and putting to use a workflow for turning printed books into digital books as its contribution to the building of a Universal Digital Library. This workflow is a process consisting of multiple phases, namely, scanning, image processing, OCR, digital archiving, document encoding, and publishing. Over the past couple of years, the BA has defined procedures and special techniques for the scanning, processing, OCR and publishing, especially of Arabic books. This workflow has been automated, allowing the governance of the different phases and making possible the production of 18000 books so far. The BA has also designed and implemented a framework for the encoding of digital books that allows publishing as well as a software system for managing the creation, maintenance, and publishing of the overall digital repository.
|
|
Preserving the literary past, looking to the future: the first Hong Kong Literature Database
MA Leo F.H., WONG Rita, LAU Paul
Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2005, 6(11): 23-.
https://doi.org/10.1631/jzus.2005.A1341
In the last two decades of the 20th century, there has been an increasing interest in and emphasis on the study of the Hong Kong literature in both the academic and general public in Hong Kong. Recognizing the emergent need of the resources on Hong Kong literature, the University Library System of the Chinese University of Hong Kong set up the Hong Kong Literature Database (the “Database”), which was the first Chinese literature database in the Internet in 2000. The paper will examine how the database is constructed using XML technology and metadata schema. The database also employs Unicode UTF-8 as the internal code. A mapping table for traditional and simplified Chinese characters was created based on Unihan and is used behind the scene so that a user can either input traditional or simplified Chinese characters and retrieval will give both traditional and simplified Chinese characters. Currently 65% of journals use OCR technology so that full-text searching is possible. The Chinese OCR technology will be examined in greater detail. Special features of the Database such as, page-by-page browse mode, position-highlight for full-page newspaper, linking Table-Of-Contents and book jackets from the Library catalogue, etc. are described. The paper will also bring out the problem of massive downloading and compare the state-of-the-art technology and their shortcomings. This paper shows how the Hong Kong Literature Database facilitates future collaboration and data exchange by using open standard, shareable structure and the latest technology.
|
|
An experiment on digital library based on the method of TQM system
YANG Ling-yun, ZHU Hai-kang
Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2005, 6(11): 26-.
https://doi.org/10.1631/jzus.2005.A1362
The principles and methods of TQM (total quality management) were applied to digital library management. TQM puts emphasis on process control and continual improvement. The TQM theories and measures explained clearly in this paper were the scientific tools used in the experiment in the Wenzhou University Digital Library. This paper deals with a very broad range of subjects related to a digital library: system model of TQM for digital library, organization structure, and tasks of Wenzhou University Digital Library, task flow chart of electronic reading room, cause and effect of electronic reading room quality management, systematic model of enhancing electronic information acquisition rate, quality comparison of electronic reading room service, etc.
|
|
New mode of universal access and Global Memory Net
CHEN Ching-chih
Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2005, 6(11): 1206-1215.
https://doi.org/10.1631/jzus.2005.A1206
In this digital era, we have witness the exciting convergence of content, technology, and global collaboration in the development of digital libraries. The mode of universal access for information seeking and knowledge acquisition differs greatly from the traditional ways. From the information resources point of views, the old model of “owning” a collection has given way to “sharing,” and the new emphases have shifted from possessing large “physical libraries” to “virtual libraries” digitally distributed all over the world. “Universal access” has taken on a very different meaning when one has the ability to share invaluable resources through the use of cutting edge technologies. The author has experienced much of these transformations through her own R&D activities—from the creation of interactive videodisc and multimedia CD on the First Emperor of China’s terracotta warriors and horses in the 1980s and 1990s to leading a current international digital library project, Global Memory Net (GMNet), supported by the US National Science Foundation. In presenting her vision for linking world digital resources together for universal access, she will share with the audience the latest development of Global Memory Net.
|
|
Relighting multiple color textures
DIAO Chang-yu, LU Dong-ming, LIU Gang
Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2005, 6(11): 1284-1289.
https://doi.org/10.1631/jzus.2005.A1284
With the development of digital library technology, library books made of paper can be digital released and read, and Endangered Cultural Heritages can be preserved. Traditional library’s contents and functions can be greatly enhanced by digital technologies. For these new library objects, the primary key problem is precisely reconstructing their 3D models. When constructing complete 3D models, multiple color texture maps are often necessary. A commonly encountered problem uncounted during fusing of textures from multiple color images is color distortion. Each texture of a single 3D model may be obtained under possibly different lighting conditions and color response of the camera. To remove any visible seam and improve color consistency between the textures while avoiding color distortion, we propose a new efficient algorithm to relight all the texture images globally, spread residual light difference, and recolor each image by homogeneous transformation. A relative illumination model was adopted to obtain the relighting function. We choose lαβ color space with minimal correlation between channels for many natural scenes, for calculating the relighting result. Looking into two overlapped images A and B, we can pairwise relight B into A’s luminosity condition in two steps. We first scale B’s l channel by the lA/lB ratio of the overlapped region. We can assume A and B are in a same color plane now. Then a homogeneous transformation is applied to B’s α and β channels which moves B into A’s hue and saturation condition. For multiple overlapped color textures, a patch based weighted global relighting method was proposed to minimize the total color difference. The pairwise relighting method was used between each two overlapped images, and the difference in every overlapped region after relighting was weighted and summed up to construct an energy value. We used Nelder-Mead method to find a minimal energy value and the relighting parameters for every image. After global relighting, textures become almost coherent. We simply blended the overlapped region along the texture border to remove small visual seams and get a final result. We illustrate our method by calibrating textures of a painted sculpture acquired with laser scanner. Experimental results were realistic and reliable and showed how this method can fuse multiple textures without color distortion.
|
|
Optical Character Recognition for printed Tamil text using Unicode
SEETHALAKSHMI R., SREERANJANI T.R., BALACHANDAR T., Abnikant Singh, Markandey Singh, Ritwaj Ratan, Sarvesh Kumar
Journal of Zhejiang University-SCIENCE A (Applied Physics & Engineering), 2005, 6(11): 1297-1305.
https://doi.org/10.1631/jzus.2005.A1297
Optical Character Recognition (OCR) refers to the process of converting printed Tamil text documents into software translated Unicode Tamil Text. The printed documents available in the form of books, papers, magazines, etc. are scanned using standard scanners which produce an image of the scanned document. As part of the preprocessing phase the image file is checked for skewing. If the image is skewed, it is corrected by a simple rotation technique in the appropriate direction. Then the image is passed through a noise elimination phase and is binarized. The preprocessed image is segmented using an algorithm which decomposes the scanned text into paragraphs using special space detection technique and then the paragraphs into lines using vertical histograms, and lines into words using horizontal histograms, and words into character image glyphs using horizontal histograms. Each image glyph is comprised of 32×32 pixels. Thus a database of character image glyphs is created out of the segmentation phase. Then all the image glyphs are considered for recognition using Unicode mapping. Each image glyph is passed through various routines which extract the features of the glyph. The various features that are considered for classification are the character height, character width, the number of horizontal lines (long and short), the number of vertical lines (long and short), the horizontally oriented curves, the vertically oriented curves, the number of circles, number of slope lines, image centroid and special dots. The glyphs are now set ready for classification based on these features. The extracted features are passed to a Support Vector Machine (SVM) where the characters are classified by Supervised Learning Algorithm. These classes are mapped onto Unicode for recognition. Then the text is reconstructed using Unicode fonts.
|
26 articles
|