A Fully Convolutional Encoder-Decoder Network for Moving Object Segmentation

Turker A., Ekşioğlu E. M.

16th International Conference on INnovations in Intelligent SysTems and Applications, INISTA 2022, Biarritz, France, 8 - 12 August 2022 identifier

  • Publication Type: Conference Paper / Full Text
  • Doi Number: 10.1109/inista55318.2022.9894190
  • City: Biarritz
  • Country: France
  • Keywords: background subtraction, change detection, deep learning, flux tensor, foreground segmentation, Moving object segmentation, spatiotemporal
  • Istanbul Technical University Affiliated: Yes


© 2022 IEEE.Moving object segmentation (MOS) is one of the important and well-studied computer vision problems. It is used in applications such as video surveillance systems, human tracking, self-driving cars, and video compression. Traditional approaches solve this problem by using hand-crafted features and then modeling the background by using these features. Convolutional Neural Networks (CNNs), on the other hand, have proven to be more powerful than traditional methods in extracting features. In this work, a hybrid system is presented that contains flux tensors together with 3D CNN, enhancing the performance of the algorithm on the unseen videos. 3D CNN can extract spatial and temporal features, thus exploiting motion information between adjacent frames. Motion entropy feature maps extracted by 3D CNN and the output of the flux tensor are jointly fed into an encoder-decoder network. ChangeDetection 2014 dataset is used for both training and test stages. Training and test videos are selected separately, and the networks are tested on unseen videos. Our proposed network gives promising segmentation results, which are competitive with existing methods.