Incorporating Dropped Pronouns into Coreference Resolution: The case for Turkish


Creative Commons License

Arslan T. P., Eryiğit G.

17th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2023 - Student Research Workshop, SRW 2023, Dubrovnik, Croatia, 2-4 May 2023, pp. 14-25

  • Publication Type: Conference Paper / Full-Text Paper
  • City of Publication: Dubrovnik
  • Country of Publication: Croatia
  • Pages: pp. 14-25
  • Affiliated with Istanbul Technical University: Yes

Abstract

Representation of coreferential relations is a challenging and actively studied topic for pro-drop and morphologically rich languages (PD-MRLs) due to dropped pronouns (e.g., null subjects and omitted possessive pronouns). These phenomena require a representation scheme at the morphology level and enhanced evaluation methods. In this paper, we propose a representation and evaluation scheme to incorporate dropped pronouns into coreference resolution and validate it on the Turkish language. Using the scheme, we extend the annotations on the only existing Turkish coreference dataset, which originally did not contain annotations for dropped pronouns. We provide publicly available pre- and post-processors that extend the prominent CoNLL coreference scorer so that it also covers coreferential relations arising from dropped pronouns. As a final step, the paper reports the first neural Turkish coreference resolution results in the literature. Although validated on Turkish, the proposed scheme is language-independent and may be used for other PD-MRLs.