Real Time Sound Source Localization Using von-Mises ResNet


Bozkurtlar M., Yen B., Itoyama K., Nakadai K.

2024 IEEE/SICE International Symposium on System Integration, SII 2024, Ha Long, Vietnam, 8 - 11 January 2024, pp.466-471 identifier

  • Publication Type: Conference Paper / Full Text
  • Doi Number: 10.1109/sii58957.2024.10417224
  • City: Ha Long
  • Country: Vietnam
  • Page Numbers: pp.466-471
  • Istanbul Technical University Affiliated: No

Abstract

This paper addresses the task of learning periodic information using deep neural networks to achieve real-time, environment-independent sound source localization. Previous papers showed phase data is the most significant cue in sound source localization tasks and the proposed vM-B DNN was validated to be able to handle such periodic information using on synthesized data. However, they haven't shown its effectiveness and robustness in realistic use cases. This paper introduces a more complex model based on residual networks and adapts vM-B activation function for convolutional layers for use cases that require real-time predictions in dynamically changing environments.