Korean Tripitaka; search system; Computerization; Unicode; XML; Hangul Trip
摘要
This research aims for constructing the retrieval system by digitizing a quantity of the 30 Hangul Tripitaka books in the 3rd Hangul Tripitaka Digitalization Project. By revising and digitalizing the Hangul Tripitaka which is a Korean version of the Tripitaka Korean we can input, store in database, and search the archaic documents through the Internet. Since the archaic documents of the Hangul Tripitaka includes extension characters of Chinese origin, missing characters and ,special characters, etc, we use Unicode and make the image fonts that cannot be represented by Unicode. And we apply XML for the efficient representation of document structure and the retrieval. So people can search the same contents as the archaic documents. Moreover we developed the search engine which provides the efficient and easy search method, the archaic documents saved as Unicode can access from the whole world using the Internet. The retrieval system developed in this research uses Microsoft SQL Server and IIS(Internet Information Server) on Windows 2000 Server.