Combining texture and stereo disparity cues for real-time face detection

Jiang F., FISCHER M., Ekenel H. K. , SHI B. E.

SIGNAL PROCESSING-IMAGE COMMUNICATION, vol.28, no.9, pp.1100-1113, 2013 (SCI-Expanded) identifier identifier


Intuitively, integrating information from multiple visual cues, such as texture, stereo disparity, and image motion, should improve performance on perceptual tasks, such as object detection. On the other hand, the additional effort required to extract and represent information from additional cues may increase computational complexity. In this work, we show that using biologically inspired integrated representation of texture and stereo disparity information for a multi-view facial detection task leads to not only improved detection performance, but also reduced computational complexity. Disparity information enables us to filter out 90% of image locations as being less likely to contain faces. Performance is improved because the filtering rejects 32% of the false detections made by a similar monocular detector at the same recall rate. Despite the additional computation required to compute disparity information, our binocular detector takes only 42 ms to process a pair of 640 x 480 images, 35% of the time required by the monocular detector. We also show that this integrated detector is computationally more efficient than a detector with similar performance where texture and stereo information is processed separately. (c) 2013 Elsevier B.V. All rights reserved.