數位典藏與數位人文國際研討會(第9屆)=International Conference of Digital Archives and Digital Humanities (9th)
Publication date
2018.12.18
Pages
287 - 300
Publisher
Taiwanese Association for Digital Humanities (臺灣數位人文學會)
Place of publication
Taipei City, Taiwan
Resource type
Conference paper (Proceeding Article)
Language
English
Notes
1. Morioka, Tomohiko: Ph.D., Assistant Professor, Center for Informatics in East Asian Studies, Institute for Research in Humanities, Kyoto University.
Keywords
Chinese character; glyph; linked data; database integration; dataset preservation
Abstract
This report describes an attempt to integrate the “CHISE” (“Character Information Service Environment”) character ontology and the “HNG” (“Hanzi Normative Glyphs”) database/dataset. The CHISE character ontology is a large-scale character ontology that includes 357 thousand character-objects, covering Unicode and non-Unicode characters, their glyphs, and related objects. It was developed for CHISE, a character processing system that does not depend on character codes. The CHISE framework is based on a graph storage named “CONCORD”. We developed a Web service called “EsT” (or “CHISE-wiki”) to display and edit CONCORD objects. The CHISE character ontology uses the “Multiple Granularity Hanzi Structure Model” to support various glyphs and multiple unification granularities of Chinese characters. This model works well for modern glyphs of Chinese characters. However, before we began the study to integrate CHISE and HNG, it was not clear whether the model was sufficient for premodern Chinese characters. In addition, designing reasonable unification rules for each unification granularity requires a variety of glyph examples of Chinese characters. For both reasons, the CHISE character ontology should be integrated with a glyph database and/or glyph corpus. We therefore tried to integrate HNG into the CHISE character ontology. From the HNG side, this integration is significant for another reason: the original HNG web service has been unavailable since the spring of 2015. Applying our research on the integration of CHISE and HNG, we now provide HNG search and data-browsing functions on the CHISE Web service.
Table of contents
1. INTRODUCTION
2. HNG
3. DATA STRUCTURE OF HNG
4. CHISE CHARACTER ONTOLOGY
4.1. Multiple glyph granularity
4.2. Multiple Granularity Hanzi Structure Model
5. REPRESENTATION OF HNG GLYPHS IN CHISE
5.1. Integration of HNG glyphs into the CHISE character ontology
5.2. Encoding of HNG glyph image object
6. IMPLEMENTATION
6.1. Classification of HNG glyphs
6.2. Integration without classification
6.3. Web applications