Automatic assessment of student rhythmic pattern imitation performances

Köktürk-Güzel B. E., Büyük O., Bozkurt B., Baysal O.

Digital Signal Processing: A Review Journal, vol.133, 2023 (SCI-Expanded) identifier identifier

  • Publication Type: Article / Article
  • Volume: 133
  • Publication Date: 2023
  • Doi Number: 10.1016/j.dsp.2022.103880
  • Journal Name: Digital Signal Processing: A Review Journal
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, Aerospace Database, Applied Science & Technology Source, Communication Abstracts, Compendex, Computer & Applied Sciences, INSPEC
  • Keywords: Online music education, Automatic music performance assessment, Comparison of regression methods, Metric learning
  • Istanbul Technical University Affiliated: Yes


© 2022 Elsevier Inc.In this study, we present an automatic performance assessment system for a student's rhythmic pattern imitation. The system compares the student's performance recording with a teacher's reference recording and assigns a grade between 1 and 4. For this task, we collected and publicly shared a data set that consists of recordings from real auditions for conservatory entrance exams in Turkey. The recordings are annotated/graded by three experts. The automatic assessment (grade assignment) task is considered as a regression problem and two approaches are tested and compared. The first approach applies classical regression methods to distance features. Distance features are computed by using several distance metrics to compute the distances between the performance and reference recording onset detection functions, binary onset vectors and onset times. The second approach applies metric learning using a Siamese neural network to directly learn the most efficient feature representations from the onset positions. We also investigate the effect of onset detection errors on automatic assessment performance. In the experiments, we observe that the assessment performance is significantly improved with an improvement in the onset detection accuracy. The best performance is achieved using the Siamese network with a mean absolute error (MAE) of 0.49. This performance is comparable to variation in expert annotations showing the potential of the system to be used in online music education. All the source codes and the data set (together with the annotations) are openly shared. We think that this completely reproducible work would potentially serve as a strong baseline for future studies on rhythmic pattern imitation tasks.