A Duplicate Chinese Document Image Retrieval System
Yung-Kuan Chan (National Chung Hsing University, Taiwan, R.O.C.), Yu-An Ho (National Chung Hsing University, Taiwan, R.O.C.), Hsien-Chu Wu (National Taichung Institute of Technology, Taiwan, R.O.C.) and Yen-Ping Chu (National Chung Hsing University, Taiwan, R.O.C.)
Copyright: © 2005
An optical character recognition (OCR) system lets a user feed a printed article directly into a computer file: it translates the optically scanned bitmaps of text characters into machine-readable codes, namely ASCII, Chinese GB, or Big5 codes, so that the text can then be edited with a word processor. OCR is therefore being employed by libraries to digitize and preserve their holdings. Billions of letters are sorted every day by OCR machines, which can considerably speed up mail delivery.