Abstract
In this chapter, we propose an ensemble of face detectors for maximizing the number of true positives found by the system. Unfortunately, combining different face detectors increases both the number of true positives and false positives. To overcome this difficulty, several methods for reducing false positives are tested and proposed. The different filtering steps are based on the characteristics of the depth map related to the subwindows of the whole image that contain the candidate faces. The most simple and easiest criteria to use, for instance, is to filter the candidate face region by considering its size in metric units.
The experimental section demonstrates that the proposed set of filtering steps greatly reduces the number of false positives without decreasing the detection rate. The proposed approach has been validated on a dataset of 549 images (each including both 2D and depth data) representing 614 upright frontal faces. The images were acquired both outdoors and indoors, with both first and second generation Kinect sensors. This was done in order to simulate a real application scenario. Moreover, for further validation and comparison with the state-of-the-art, our ensemble of face detectors is tested on the widely used BioID dataset where it obtains 100 % detection rate with an acceptable number of false positives.
A MATLAB version of the filtering steps and the dataset used in this paper will be freely available from http://www.dei.unipd.it/node/2357.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
http://www.mathworks.com/matlabcentral/fileexchange/loadFile.do?objectId = 13701&objectType = FILE.
- 2.
- 3.
The face detectors FL and RF give the positions of the eye centers as the output, while for VJ and SN the detected eye position is assumed to be a fixed position inside the face bounding box.
References
L. Nanni, A. Lumini, F. Dominio, P. Zanuttigh, Effective and precise face detection based on color and depth data. Appl. Comput. Inform. 10(1), 1–13 (2014)
C. Zhang, Z. Zhang, A survey of recent advances in face detection. Microsoft Research Technical Report, MSR-TR-2010-66, June 2010
Z. Zeng, M. Pantic, G.I. Roisman, T.S. Huang, A survey of affect recognition methods: audio, visual, and spontaneous expressions. IEEE Trans. Pattern Anal. Mach. Intell. 31(1), 39–58 (2009)
H.L. Jin, Q.S. Liu, H.Q. Lu, Face detection using one-class based support vectors, in Proceedings 6th IEEE International Conference Automatic Face Gesture Recognition, Hoboken, NJ 07030 USA, (2004), pp. 457–462
P. Viola, M.J. Jones, Rapid object detection using a boosted cascade of simple features, Proceedings of the 2001 IEEE Computer Society conference on Computer Vision and Pattern Recognition, 1, 511--518 (2001)
C. Küblbeck, A. Ernst, Face detection and track in video sequences using the modified census transformation. Image Vis. Comput. 24(6), 564–572 (2006)
C. Huang, H. Ai, Y. Li, S. Lao, High-performance rotation invariant multiview face detection. IEEE Trans. Pattern Anal. Mach. Intell. 29(4), 671–686 (2007)
J. Wu, S. Charles Brubaker, M.D. Mullin, J.M. Rehg, Fast asymmetric learning for cascade face detection. IEEE Trans. Pattern Anal. Mach. Intell. 30, 369–382 (2008)
M. Anisetti, Fast and robust face detection, in Multimedia Techniques for Device and Ambient Intelligence, Chapter 3 (Springer, US, 2009). ISBN: 978-0-387-88776-0
J. Li, Y. Zhang, Learning surf cascade for fast and accurate object detection, in IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (IEEE, 2013) Hoboken, NJ 07030 USA
M. Mathias, et al. Face detection without bells and whistles, in Computer Vision–ECCV 2014 (Springer International Publishing, 2014), Zurich, Switzerland, pp. 720–735
F. Tsalakanidou, D. Tzovaras, M.G. Strintzis, Use of depth and colour eigenfaces for face recognition. Pattern Recogn. Lett. 24(910), 1427–1435 (2003)
R.I. Hg, P. Jasek, C. Rofidal, K. Nasrollahi, T.B. Moeslund, G. Tranchet, An RGB-D database using Microsoft’s Kinect for windows for face detection, in 2012 Eighth International Conference on Signal Image Technology and Internet Based Systems (SITIS) (IEEE, 2012), Hoboken, NJ 07030 USA, pp. 42–46
M.-Y. Shieh, T.-M. Hsieh, Fast facial detection by depth map analysis. Math. Problems Eng. (2013), Article ID 694321, pp.~10, doi: 10.1155/2013/694321
J. Shotton, A. Fitzgibbon, M. Cook, T. Sharp, M. Finocchio, R. Moore, A. Kipman, A. Blake, Real-time human pose recognition in parts from single depth images. CVPR 2, 3 (2011)
R. Mattheij, E. Postma, Y. Van den Hurk, P. Spronck, Depth-based detection using Haarlike features, in Proceedings of the BNAIC 2012 Conference (Maastricht University, The Netherlands, 2012), pp. 162–169
M. Anisetti, V. Bellandi, E. Damiani, L. Arnone, B. Rat, A3FD: Accurate 3D face detection, in Signal Processing for Image Enhancement and Multimedia Processing (Springer, US, 2008), pp. 155–165
F. Jiang, M. Fischer, H.K. Ekenel, B.E. Shi, Combining texture and stereo disparity cues for real-time face detection. Signal Process. Image Commun. 28(9), 1100–1113 (2013)
G. Goswami, S. Bharadwaj, M. Vatsa, R. Singh, On RGB-D face recognition using Kinect, in 2013 IEEE Sixth International Conference on Biometrics: Theory, Applications and Systems (BTAS) (IEEE, 2013), Hoboken, NJ 07030 USA, pp. 1–6
Y. Taigman, M. Yang, M.A., Ranzato, L. Wolf, Deepface: Closing the gap to human-level performance in face verification, in 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (IEEE, 2014), Hoboken, NJ 07030 USA, pp. 1701–1708
O. Jesorsky, K. Kirchberg, R. Frischholz, in Face Detection Using the Hausdorff Distance, eds. by J. Bigun, F. Smeraldi. Audio and Video based Person Authentication—AVBPA, (Springer, 2001), Berlin, Germany, pp. 90–95
N. Markuš, M. Frljak, I.S. Pandžić, J. Ahlberg, R. Forchheimer, Fast localization of facial landmark points, in Proceedings of the Croatian Computer Vision Workshop, Zagreb, Croatia, (2014)
M. Nilsson, J. Nordberg, I. Claesson, Face detection using local SMQT features and split up SNOW classifier. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), 2, Hoboken, NJ 07030 USA, 589–592 (2007)
A. Asthana, S. Zafeiriou, S. Cheng, M. Pantic, Robust discriminative response map fitting with constrained local models, in 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (IEEE, 2013), Hoboken, NJ 07030 USA, pp. 3444–3451
D. Herrera, J. Kannala, J. Heikkilä, Joint depth and color camera calibration with distortion correction. IEEE Trans. Pattern Anal. Mach. Intell. 34, 2058–782 (2012)
D. Comaniciu, P. Meer, Mean shift: a robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 24(5), 603–619 (2002)
C. Dal Mutto, P. Zanuttigh, G.M. Cortelazzo, Fusion of geometry and color information for scene segmentation. IEEE J. Sel. Top. Signal Process. 6(5), 505–521 (2012)
X. Tan, F. Song, Z.-H. Zhou, S. Chen, Enhanced pictorial structures for precise eye localization under incontrolled conditions, in IEEE Conference on Computer Vision and Pattern Recognition 2009 (CVPR 2009), Hoboken, NJ 07030 USA, 20–25 June 2009, pp. 1621, 1628
E. Skodras, N. Fakotakis, Precise localization of eye centers in low resolution color images. Image Vision Comput. (2015). doi:10.1016/j.imavis.2015.01.006
Z. Ren, J. Meng, J. Yuan. Depth camera based hand gesture recognition and its applications in human-computer-interaction, in Proceedings of ICICS (2011), Hoboken, NJ 07030 USA, pp. 1–5
F. Dominio, M. Donadeo, P. Zanuttigh, Combining multiple depth-based descriptors for hand gesture recognition. Pattern Recogn. Lett. 50, 101–111 (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Nanni, L., Lumini, A., Minto, L., Zanuttigh, P. (2016). Face Detection Coupling Texture, Color and Depth Data. In: Kawulok, M., Celebi, M., Smolka, B. (eds) Advances in Face Detection and Facial Image Analysis. Springer, Cham. https://doi.org/10.1007/978-3-319-25958-1_2
Download citation
DOI: https://doi.org/10.1007/978-3-319-25958-1_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25956-7
Online ISBN: 978-3-319-25958-1
eBook Packages: EngineeringEngineering (R0)