Applied Linguistics (Yuyan Wenzi Yingyong)


No. 1 , Pages 75 - 82 , 2003

An Automatic Tibetan Segmentation Scheme Based on Case-Auxiliary Words and Continuous Features (Article written in Chinese)

CHEN Yuzhong, LI Baoli, YU Shiwen, & LAN Cuoji

Abstract

This paper proposes a cascaded written Tibetan word segmentation scheme, which is based on case-auxiliary words and continuous features. Using inflectional information such as case-auxiliary words and continuous features and adopting a cascaded strategy are the key features of the proposed scheme. Preliminary experiments suggest that it could detect and eliminate segmentation ambiguities and deal with unknown words. The scheme has significant practical value in increasing the precision of segmentation.

Keywords: case-auxiliary words; continuous features; Tibetan word segmentation

[Chinese Version | Index | Applied Linguistics (Yuyan Wenzi Yingyong) | Other Journals | Subscription form | Enquiry ]


Mail any comments and suggestions to hkier-journal@cuhk.edu.hk .