Abstract
This study is based on Chinese lettered chunks extracted from the 2002 People's Daily corpus, which contains about 70 million characters. The lettered chunks were first extracted automatically by program and then proofread by reviewers with master's-level training in linguistics; those containing punctuation marks (interpunctions) were then selected by program. This paper analyzes the occurrence, usage, and statistical characteristics of these punctuated lettered chunks, in order to provide a reference for standardizing Chinese lettered chunks and data for their automatic identification.
Keywords: Chinese lettered chunk; interpunction; auto-extraction
Applied Linguistics (Yuyan Wenzi Yingyong)