Applied Linguistics (Yuyan Wenzi Yingyong)


No. 4 , Pages 16 - 24 , 2003

Standardization for Corpus Processing (Article written in Chinese)

JIN Guangiin, GUO Shulun , XIAO Hang, & ZHANG Yunfan

Abstract

This paper presents our comments on POS tag standardization and its methods. The standardization is by no means compulsory; it represents simply the output of processing and not the procedure. The main purpose for the standardization is to provide a POS tag as a norm for Chinese language processing, so that all the Chinese language processing can be normalized within this system. The characteristics for this standardization can be concluded as continuity, mono-functionality, generality and extensibility. The paper also discusses the problems of principle-setting and sub-categorization, and provides the experimental data of the coverage of the standardization-based POS tagging in corpus.

Keywords: POS tag; standardization; corpus

[Chinese Version | Index | Applied Linguistics (Yuyan Wenzi Yingyong) | Other Journals | Subscription form | Enquiry ]


Mail any comments and suggestions to hkier-journal@cuhk.edu.hk .