An affix stripping morphological analyzer for Turkish


Eryigit G., Adali E.

IASTED International Conference on Artificial Intelligence and Applications, Innsbruck, Avusturya, 16 - 18 Şubat 2004, ss.299-304 identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Basıldığı Şehir: Innsbruck
  • Basıldığı Ülke: Avusturya
  • Sayfa Sayıları: ss.299-304
  • Anahtar Kelimeler: natural language processing, morphology, affix stripping, Turkish
  • İstanbul Teknik Üniversitesi Adresli: Evet

Özet

This paper presents the design and the implementation of a morphological analyzer for Turkish. A new methodology is proposed for doing the analysis of Turkish words with an affix stripping approach and without using any lexicon. The rule-based and agglutinative structure of the language allows Turkish to be modeled with finite state machines (FSMs). In contrast to the previous works, in this study, FSMs are formed by using the morphotactic rules in reverse order. This paper describes the steps of this new methodology including the classification of the suffixes, the generation of the FSMs for each suffix class and their unification into a main machine to cooperate in the analysis.