|
|
A sustainable development OCR system in CADAL application |
HUANG Chen, ZHAO Ji-hai, HU Xiao |
Zhejiang University Libraries, Zhejiang University, Hangzhou 310027, China; Graduate School of Library and Information Science, University of Illinois at Urbana-Champaign, IL 61801, USA |
|
|
Abstract This paper briefly introduces the main ideas of a sustainable development OCR system based on open architecture techniques and then describes the construction of an optical character recognition (OCR) center built on computer clusters, for the purpose of dynamically improving the recognition precision of the digitized texts of a million volumes of books produced by the China-US Million Books Digital Library (CADAL) Project. The practice of this center will provide helpful reference for other digital library projects.
|
Received: 05 August 2005
|
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|