An affix stripping morphological analyzer for Turkish

IASTED International Conference on Artificial Intelligence and Applications, Innsbruck, Avusturya, 16 - 18 Şubat 2004, ss.299-304

Yayın Türü: Bildiri / Tam Metin Bildiri
Basıldığı Şehir: Innsbruck
Basıldığı Ülke: Avusturya
Sayfa Sayıları: ss.299-304
Anahtar Kelimeler: natural language processing, morphology, affix stripping, Turkish
İstanbul Teknik Üniversitesi Adresli: Evet

Özet

This paper presents the design and the implementation of a morphological analyzer for Turkish. A new methodology is proposed for doing the analysis of Turkish words with an affix stripping approach and without using any lexicon. The rule-based and agglutinative structure of the language allows Turkish to be modeled with finite state machines (FSMs). In contrast to the previous works, in this study, FSMs are formed by using the morphotactic rules in reverse order. This paper describes the steps of this new methodology including the classification of the suffixes, the generation of the FSMs for each suffix class and their unification into a main machine to cooperate in the analysis.