DEEP LEARNING-BASED DOOR AND WINDOW DETECTION FROM BUILDING FAÇADE

Sezen G., Çakır M. A., Atik M. E., Duran Z.

2022 24th ISPRS Congress on Imaging Today, Foreseeing Tomorrow, Commission IV, Nice, Fransa, 6 - 11 Haziran 2022, cilt.43, ss.315-320

Yayın Türü: Bildiri / Tam Metin Bildiri
Cilt numarası: 43
Doi Numarası: 10.5194/isprs-archives-xliii-b4-2022-315-2022
Basıldığı Şehir: Nice
Basıldığı Ülke: Fransa
Sayfa Sayıları: ss.315-320
Anahtar Kelimeler: Deep learning, Building Facade Elements, Object Detection, YOLO, Faster R-CNN
İstanbul Teknik Üniversitesi Adresli: Evet

Özet

© 2022 G. Sezen et al.Detecting building façade elements is a crucial problem in computer vision for image interpretation. In Building Information Modeling (BIM) studies, the detection of building façade elements has an important role. BIM is a tool that allows maintaining a digital representation of all aspects of building information; therefore, it will enable the storage of almost any data related to a given structure, regarding its geometric and non-geometric aspects. Façade segmentation was first studied in the 1970s using hand-crafted expertise. Later, detection and segmentation studies emerged based on shapes of objects and parametric rules. With the developing technology, deep learning approaches in object detection studies have intensified. It is obvious that the desired analyses can be performed faster with deep learning approaches. However, deep learning methods require large training data. Algorithms that consider different situations and are suitable for real-world scenarios continue to be developed. The need in this direction continues in the literature. In this study, door and window detection was carried out with deep learning on an original data set. The algorithms used are YOLOv3, YOLOv4, YOLOv5, and Faster R-CNN. Precision, recall and mean average precision (mAP) are used as evaluation metrics. As a result of the study, precision, recall, and mAP values with YOLOv5 were obtained as 0.85, 0.72, and 0.79, respectively. With Faster R-CNN with the lowest performance, precision, recall, and mAP were obtained as 0.54, 0.63, and 0.54, respectively.