数字人文研究 ›› 2021, Vol. 1 ›› Issue (3): 83-88.

• • 上一篇    下一篇

古籍数字化关键技术评述

  

  • 出版日期:2021-09-08 发布日期:2021-12-04

Key Technologies for Digitization of Ancient Chinese Books

  • Online:2021-09-08 Published:2021-12-04

摘要:

中国历史文化典籍是中华民族的宝贵财富。在数字环境下,实现古籍的数字化整理与利用,能够为数字人文研究、历史学研究及其他人文研究提供基础性资源,也是推动中华文明创造性转化与创新性发展的重要依托。古籍的数字化整理包括纸本资源的电子化,以及在电子化文本基础上的断句、标点、词语切分等基础性加工和深层知识提取。本文对现有古籍数字化整理的技术方法与平台进行梳理与评述,分析古籍数字化整理的挑战,探讨古籍数字化整理任务的未来发展方向。

关键词:

古籍整理, 古籍数字化, 自然语言处理, 数字人文

Abstract:

Chinese historical and cultural classics are the great treasure of the Chinese nation. In the digital environment, the realization of digital documentation and utilization of ancient books can provide basic resources for digital humanities research, history research and other humanities researches, and it also serves as an important support for promoting the creative transformation and innovative development of Chinese civilization. The work of digitization includes the electronization of paper resources, as well as basic processing and deep knowledge extraction such as sentence segmentation, punctuation, and word segmentation based on electronic texts. This article reviews and comments on the existing technical methods and platforms of the digital collation of ancient books, analyzes its challenges, and discusses its future development direction.

Key words:

Collation of ancient books,  Digitization of ancient books,  Natural language processing;  Digital Humanities

中图分类号: