Question Similarity Detection in Turkish Using Semantic Textual Similarity Methods


Yildiz E., Findik Y.

27th Signal Processing and Communications Applications Conference (SIU), Sivas, Türkiye, 24 - 26 Nisan 2019 identifier identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Cilt numarası:
  • Doi Numarası: 10.1109/siu.2019.8806308
  • Basıldığı Şehir: Sivas
  • Basıldığı Ülke: Türkiye
  • İstanbul Teknik Üniversitesi Adresli: Evet

Özet

In this study, we evaluate the performance of various semantic textual similarity methods on question similarity detection task in Turkish. Various handcrafted features and neural models, specifically siamese recurrent networks, are studied to detect questions which have a similar meaning to given question in a dataset. Several experiments have been performed to compare the performance of features and neural methods. Our Experiments demonstrate that siamese recurrent networks significantly outperforms traditional methods which are based on handcrafted features such as word and stem matching counts, TF-IDF vectors and similarity of word embeddings. We also observed that the performance of siamese recurrent networks could be further improved by incorporating handcrafted features to the process.