Building the first comprehensive machine-readable Turkish sign language resource: methods, challenges and solutions


Eryiğit G., Eryiğit G., Karabuklu S., KELEPİR WOOD M., Ozkul A., Pamay Arslan T., ...More

LANGUAGE RESOURCES AND EVALUATION, vol.54, no.1, pp.97-121, 2020 (SCI-Expanded) identifier identifier

  • Publication Type: Article / Article
  • Volume: 54 Issue: 1
  • Publication Date: 2020
  • Doi Number: 10.1007/s10579-019-09465-5
  • Journal Name: LANGUAGE RESOURCES AND EVALUATION
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, FRANCIS, Aerospace Database, Applied Science & Technology Source, Communication Abstracts, Computer & Applied Sciences, EBSCO Education Source, Educational research abstracts (ERA), Humanities Abstracts, INSPEC, Linguistic Bibliography, Linguistics & Language Behavior Abstracts, Metadex, MLA - Modern Language Association Database, Civil Engineering Abstracts
  • Page Numbers: pp.97-121
  • Keywords: Turkish sign language, TiD, Parallel dependency treebank, Turkish, Machine-readable, Parallel corpus, EXPRESSION
  • Istanbul Technical University Affiliated: Yes

Abstract

This article describes the procedures employed during the development of the first comprehensive machine-readable Turkish Sign Language (TiD) resource: a bilingual lexical database and a parallel corpus between Turkish and TiD. In addition to sign language specific annotations (such as non-manual markers, classifiers and buoys) following the recently introduced TiD knowledge representation (Eryigit et al. 2016), the parallel corpus contains also annotations of dependency relations, which makes it the first parallel treebank between a sign language and an auditory-vocal language.