Abstract
This paper proposes a cascaded word segmentation scheme for written Tibetan based on case-auxiliary words and continuous features. Its key features are the use of inflectional information, namely case-auxiliary words and continuous features, and a cascaded processing strategy. Preliminary experiments suggest that the scheme can detect and eliminate segmentation ambiguities and handle unknown words, which gives it significant practical value for improving segmentation precision.
Keywords: case-auxiliary words; continuous features; Tibetan word segmentation
Applied Linguistics (Yuyan Wenzi Yingyong)