Durham Research Online
Chinese Text Project: a dynamic digital library of premodern Chinese

Sturgeon, Donald (2021) 'Chinese Text Project: a dynamic digital library of premodern Chinese', Digital Scholarship in the Humanities, 36 (S1), i101-i112.


This article presents technical approaches and innovations in digital library design developed during the design and implementation of the Chinese Text Project, a widely-used, large-scale full-text digital library of premodern Chinese writing. By leveraging a combination of domain-optimized Optical Character Recognition, a purpose-designed crowdsourcing system, and an Application Programming Interface (API), this project simultaneously provides a sustainable transcription system, search interface and reading environment, as well as an extensible platform for transcribing and working with premodern Chinese textual materials. By means of the API, intentionally loosely integrated text mining tools are used to extend the platform, while also being reusable independently with materials from other sources and in other languages.

Item Type: Article
Full text: (AM) Accepted Manuscript
Publisher statement: This is a pre-copyedited, author-produced PDF of an article accepted for publication in Digital Scholarship in the Humanities following peer review. The version of record: Sturgeon, Donald (2021). Chinese Text Project: a dynamic digital library of premodern Chinese. Digital Scholarship in the Humanities 36(S1), i101-i112 is available online at:
Date accepted: No date available
Date deposited: 02 November 2021
Date of first online publication: 29 August 2019
Date first made open access: 02 November 2021