Abstract
The Attention based Segmentation and Recognition (ASR) algorithm for hand postures against complex backgrounds is discussed in this chapter. The ASR algorithm can detect, segment and recognize multi-class hand postures. Visual attention, which is a cognitive process of selectively concentrating on a region of interest in visual field, helps humans to recognize objects in cluttered natural scenes. TheĀ ASR algorithm utilizes a Bayesian model of visual attention to generate a saliency map, and to detect and identify the hand region. Feature based visual attention is implemented using a combination of high level (shape, texture) and low level (color) image features. The shape and texture features are extracted from a skin similarity map, using a computational model of the ventral stream of visual cortex. The skin similarity map, which represents the similarity of each pixel to the human skin color in HSI color space, enhances the edges and shapes within the skin colored regions. The color features used are discretized chrominance components in HSI, YCbCr color spaces, and similarity-to-skin map. The hand postures are classified using shape and texture features, with a support vector machines classifier. The NUS hand posture dataset-II with 10 classes of complex background hand postures is utilized for testing the algorithm. The dataset contains hand postures from 40 subjects of different ethnicities. A total of 2,750 hand postures and 2,000 background images are available in the dataset. The hand postures vary in size and shape. The ASR algorithm is tested for hand detection and hand posture recognition using 10 fold cross-validation. The experimental results show that the algorithm has a person independent performance, and is reliable against variations in hand sizes and complex backgrounds.
Simple can be harder than complex: You have to work hard to get your thinking clean to make it simple. But itās worth it in the end because once you get there, you can move mountains
Steve Jobs.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Graph matching is considered as one of the most complex algorithms in vision based object recognition [2]. The complexity is due to the combinatorial nature of matching process.
- 2.
The dataset is available for free download: http://www.vadakkepat.com/NUS-HandSet/.
- 3.
V1, V2, V3, V4, and V5 are the visual areas in the visual cortex. V1 is the primary visual cortex. V2āV5 are the secondary visual areas, and are collectively termed as the extrastriate visual cortex.
- 4.
Reference [23] for further explanation on \(S_1\) and \(C_1\) stages (layer 1 and 2).
- 5.
The luminance color components are not utilized as these components are sensitive to skin color as well as lighting.
- 6.
The dataset consists of hand postures by 40 subjects, with different ethnic origins.
- 7.
400 images (1 image per class per subject) are considered. During the training phase the hand area is selected manually.
- 8.
The dataset is available for academic research purposes: http://www.vadakkepat.com/NUS-HandSet/.
- 9.
The dataset is available for free download: http://www.vadakkepat.com/NUS-HandSet/.
References
V. Athitsos, S. Sclaroff, Estimating 3d hand pose from a cluttered image. IEEE Conf. Comput. Vis. Pattern Recogn. 2, 432ā439 (2003)
E. Bienenstock, C. von der Malsburg, A neural network for invariant pattern recognition. Europhys. Lett. 4(1), 121ā126 (1987)
C. Bishop, Neural Networks for Pattern Recognition (Oxford, Oxford University Press, 1995)
J.M. Chaves-GonzĆ”lez, M.A. Vega-RodrĆgueza, J.A. GĆ³mez-Pulidoa, J.M. SĆ”nchez-PĆ©reza, Detecting skin in face recognition systems: a colour spaces study. Digit. Signal Process. 20(03), 806ā823 (2010)
S. Chikkerur, T. Serre, C. Tan, T. Poggio, What and where: a bayesian inference theory of attention. Vis. Res. 50(22), 2233ā2247 (2010)
P. Dayan, G.E. Hinton, R.M. Neal, The helmholtz machine. Neural Comput. 7(5), 889ā904 (1995)
L. Itti, C. Koch, Computational modelling of visual attention. Nat. Rev. Neurosci. 2(3), 194ā203 (2001)
L. Itti, C. Koch, E. Niebur, A model of saliency-based visual attention for rapid scene analysis. IEEE Trans. Pattern Anal. Mach. Intell. 20(11), 1254ā1259 (1998)
M.J. Jones, J.M. Rehg, Statistical color models with application to skin detection. IEEE Conf. Comput. Vis. Pattern Recogn. 1 (1999)
M. Kolsch, M. Turk, Robust hand detection. IEEE Conf. Autom. Face Gesture Recogn. 614ā619 (2004)
K. Murphy, Bayes net toolbox for matlab (2003), http://code.google.com/p/bnt/
E. Niebur, C. Koch, Computational architectures for attention, in The Attentive Brain, ed. by R. Parasuraman (Cambridge, MIT Press, 1998) pp. 163ā186
E.J. Ong, R. Bowden, A boosted classifier tree for hand shape detection. IEEE Conf. Autom. Face Gesture Recogn. 889ā894 (2004)
J. Pearl, Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference (Morgan Kaufmann Publishers, California, 1988)
S.L. Phung, A. Bouzerdoum, D. Chai, Skin segmentation using color pixel classification: analysis and comparison. IEEE Trans. Pattern Anal. Mach. Intell. 27(01), 148ā154 (2005)
P.K. Pisharady, Computational intelligence techniques in visual pattern recognition. Ph.D. Thesis, National University of Singapore (2011)
P.K. Pisharady, P. Vadakkepat, A.P. Loh, Graph matching based hand posture recognition using neuro-biologically inspired features. International Conference on Control, Automation, Robotics and Vision (ICARCV) 2010 (Singapore), December 2010
P.K. Pisharady, P. Vadakkepat, A.P. Loh, Attention based detection and recognition of hand postures against complex backgrounds. Int. J. Comput. Vis. 101(03), 403ā419 (2013)
P.K. Pisharady, P. Vadakkepat, A.P. Loh, Hand posture and face recognition using a fuzzy-rough approach. Int. J. Humanoid Rob. 07(03), 331ā356 (2010)
T. Poggio, E. Bizzi, Generalization in vision and motor control. Nature 431, 768ā774 (2004)
R. Rao, Bayesian inference and attentional modulation in the visual cortex. Neuro Rep. 16(16), 1843ā1848 (2005)
M. Riesenhuber, T. Poggio, Hierarchical models of object recognition in cortex. Nat. Neurosci. 2(11), 1019ā1025 (1999)
T. Serre, L. Wolf, S. Bileschi, M. Riesenhuber, T. Poggio, Robust object recognition with cortex-like mechanisms. IEEE Trans. Pattern Anal. Mach. Intell. 29(3), 411ā426 (2007)
C. Siagian, L. Itti, Rapid biologically-inspired scene classification using features shared with visual attention. IEEE Trans. Pattern Anal. Mach. Intell. 29(2), 300ā312 (2007)
J. Triesch, C. Malsburg, Sebastien marcel hand posture and gesture datasets: Jochen triesch static hand posture database (1996), http://www.idiap.ch/resource/gestures/
J. Triesch, C. Malsburg, A system for person-independent hand posture recognition against complex backgrounds. IEEE Trans. Pattern Anal. Mach. Intell. 23(12), 1449ā1453 (2001)
J. Triesch, C. Malsburg, Robust classification of hand postures against complex backgrounds. Proceedings of the Second International Conference on Automatic Face and Gesture Recognition (Killington, VT, USA), October 1996, pp. 170ā175
J.K. Tsotsos, S.M. Culhane, Y.H. Wai, W.Y.K. Lai, N. Davis, F. Nuflo, Modelling visual attention via selective tuning. Artif. Intell. 78(1ā2), 507ā545 (1995)
Y. Wu, T.S. Huang, View-independent recognition of hand postures. IEEE Conf. Comput. Vis. Pattern Recogn. 2, 88ā94 (2000)
Acknowledgments
Figures and tables in this chapter are adapted from the following article with kind permission from Springer Science+Business Media: International Journal of Computer Vision, Attention Based Detection and Recognition of Hand Postures Against Complex Backgrounds, Vol.101, Issue No.3, 2013, Page Nos. 403-419, Pramod Kumar Pisharady, Prahlad Vadakkepat and Loh Ai Poh.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
Ā© 2014 Springer Science+Business Media Singapore
About this chapter
Cite this chapter
Pisharady, P.K., Vadakkepat, P., Poh, L.A. (2014). Attention Based Segmentation and Recognition Algorithm for Hand Postures Against Complex Backgrounds. In: Computational Intelligence in Multi-Feature Visual Pattern Recognition. Studies in Computational Intelligence, vol 556. Springer, Singapore. https://doi.org/10.1007/978-981-287-056-8_8
Download citation
DOI: https://doi.org/10.1007/978-981-287-056-8_8
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-287-055-1
Online ISBN: 978-981-287-056-8
eBook Packages: EngineeringEngineering (R0)