Probabilistic Reasoning for Closed-Room People Monitoring

  • Ji Tao
  • Yap-Peng Tan
Part of the Studies in Fuzziness and Soft Computing book series (STUDFUZZ, volume 168)


In this chapter, we present a probabilistic reasoning approach to recognizing people entering and leaving a closed room by exploiting low-level visual features and high-level domain-specific knowledge. Specifically, people in the view of a monitoring camera are first detected and tracked so that their color and facial features can be extracted and analyzed. Then, recognition of people is carried out using a mapped feature similarity measure and exploiting the temporal correlation and constraints among each sequence of observations. The optimality of recognition is achieved in the sense of maximizing the joint posterior probability of the multiple observations. Experimental results of real and synthetic data are reported to show the effectiveness of the proposed approach.


people monitoring probabilistic reasoning Viterbi algorithm domain knowledge HMM 


  1. 1.
    Sage, K., Young, S. (1999) Security applications of computer vision. IEEE Aerospace and Electronic Systems Magazine, 14, 19–29CrossRefGoogle Scholar
  2. 2.
    Foresti, G.L. (1999) Object recognition and tracking for remote video surveillance. IEEE Trans. Circuits and Systems for Video Technology, 9, 1045–1062CrossRefGoogle Scholar
  3. 3. Scholar
  4. 4.
    Open source computer vision library reference manual. Intel CorporationGoogle Scholar
  5. 5.
    Viola, P., Jones, M. (2001) Robust real-time object detection. Technical Report, Compaq CRLGoogle Scholar
  6. 6.
    Lienhart, R., Maydt, J. (2002) An extended set of Haar-like features for rapid object detection. Proc. IEEE International Conf. on Image Processing (ICIP 2002), 1, 900–903Google Scholar
  7. 7.
    Nefian, A.V., Hayes III, M.H. (1999) An embedded HMM based approach for face detection and recognition. Proc. IEEE International Conf. on Acoustics, Speech, and Signal Processing (ICASSP 1999), 6, 3553–3556Google Scholar
  8. 8.
    Oliver, N.M., Rosario, B., Pentland, A.P. (2000) A Bayesian computer vision system for modeling human interactions. IEEE Trans. Pattern Analysis and Machine Intelligence, 22, 831–843CrossRefGoogle Scholar
  9. 9.
    Haritaoglu, I., Harwood, D., Davis, L.S. (2000) W 4: Real time surveillance of people and their activities. IEEE Trans. Pattern Analysis and Machine Intelligence, 22, 809–830CrossRefGoogle Scholar
  10. 10.
    CDC SARS Investigative Team, Fleischauer, A.T. (2003) Outbreak of Severe Acute Respiratory Syndrome — Worldwide. Morbidity and Mortality Weekly Report, 52, 269–272Google Scholar
  11. 11.
    Shen, W., Surette, M., Khanna, R. (1997) Evaluation of automated biometrics-based identification, and verification systems. Proc. IEEE, 85, 1464–1478CrossRefGoogle Scholar
  12. 12.
    Chellappa, R., Wilson, C.L., Sirohey, S. (1995) Human and machine recognition of faces: a survey. Proc. IEEE, 83, 705–741CrossRefGoogle Scholar
  13. 13.
    Kale, A., Rajagopalan, A., Cuntoor, N., Kruger, V. (2002) Gait-based recognition of humans using continuous HMMs. Proc. of the IEEE International Conf. on Automatic Face and Gesture Recognition, 321–326Google Scholar
  14. 14.
    Bhanu, B., Han, J. (2002) Bayesian-based performance prediction for gait recognition. Workshop on Motion and Video Computing, 145–150Google Scholar
  15. 15.
    Duda, R., Hart, P., Stork, D. (2001) Pattern classification, second edition. Wiley-InterscienceGoogle Scholar
  16. 16.
    Vega, I.R., Sarkar, S. (2003) Statistical motion model based on the change of feature relationships: human gait-based recognition. IEEE Trans. Pattern. Analysis and Machine Intelligence, 25, 1323–1328CrossRefGoogle Scholar
  17. 17.
    Lu, W., Tan, Y.-P. (2001) A color histogram based people tracking system. Proc. IEEE International Symposium on Circuits and Systems (ISCAS 2001), 2, 137–140zbMATHGoogle Scholar
  18. 18.
    Tao, J., Tan, Y.-P. (2003) Color appearance-based approach to robust tracking and recognition of multiple people, the Fourth International Conference on Information, Communications & Signal Processing and Fourth Pacific-Rim. Conference on Multimedia, 1, 95–99Google Scholar
  19. 19.
    Swain, M., Ballard, D. (1991) Color Indexing. Intl. Journal of Computer Vision, 7, 11–22CrossRefGoogle Scholar
  20. 20.
    Deng, Y., Manjunath, B.S., Kenney, C, Moore, M.S., Shin, H. (2001) An efficient color representation for image retrieval. IEEE Trans. Image Processing, 10, 140–147zbMATHCrossRefGoogle Scholar
  21. 21.
    Ferman, A.M., Tekalp, A.M., Mehrotra, R. (2002) Robust color histogram descriptors for video segment retrieval and identification. IEEE Trans. Image Processing, 11, 497–508CrossRefGoogle Scholar
  22. 22.
    Moreno, F., Tarrida, A., Andrade-Cetto, J., Sanfeliu, A. (2002) 3D real-time head tracking using color histograms and stereovision. Proc. 16th International Conference on Pattern Recognition, 368–371Google Scholar
  23. 23.
    Kalman, B.L., Kwasny, S.C., (1992) Why tanh: choosing a sigmoidal function., Proc. International Joint Conference on Neural Networks, 4, 7–11Google Scholar
  24. 24.
    Yang, J., Xu, Y., Chen, S.C. (1997) Human action learning via hidden Markov model. IEEE Trans. Systems, Man, and Cybernetics, 27, 34–44zbMATHCrossRefGoogle Scholar
  25. 25.
    Starner, T.E., Pentland, A.P. (1995) Visual recognition of American sign language using hidden Markov models. Proc. International Workshop on Automatic Face-and Gesture-Recognition, 189–194Google Scholar
  26. 26.
    Nair, V., Clark, J.J. (2002) Automated visual surveillance using hidden Markov models. Proc. the 15th Vision Interface Conference, 88–92Google Scholar
  27. 27.
    Rabiner, L.R. (1989) A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE, 77, 257–285CrossRefGoogle Scholar
  28. 28.
    Church, K. (1992) Comments on Computational Learning Model for Metrical Phonology. R. Levine, (ed.), Formal Grammar: Theory and Implementation, Vancouver Studies in Cognitive Science Series, UBC Press, 2 Google Scholar
  29. 29.
    Forney, G.D. (1973) The Viterbi algorithm. Proc. IEEE, 61, 268–278MathSciNetCrossRefGoogle Scholar
  30. 30.
    Heckerman, D. (1995) A tutorial on learning with Bayesian networks, Microsoft Research. Technical ReportGoogle Scholar
  31. 31.
    Murphy, K. (1998) A brief introduction to graphical models and Bayesian networks, available at Scholar
  32. 32.
    Siskind, J.M., Morris, Q. (1996) A maximum-likelihood approach to visual event classification. Proc. the 4th European Conference on Computer Vision, 347–360Google Scholar
  33. 33.
    McKenna, S.J., Jabri, S., Duric, Z., Rosenfeld, A., Wechsler, H. (2000) Tracking groups of people. Computer Vision and Image Understanding, 80, 42–56zbMATHCrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Ji Tao
    • 1
  • Yap-Peng Tan
    • 1
  1. 1.School of Electrical and Electronic EngineeringNanyang Technological UniversitySingapore

Personalised recommendations