Real Time Segmentation of Lip Pixels for Lip Tracker Initialization

  • Mohammad Sadeghi
  • Josef Kittler
  • Kieron Messer
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2124)


We propose a novel segmentation method for real time lip tracker initialisation which is based on a Gaussian mixture model of the pixel data. The model is built using the Predictive Validation technique advocated in [4]. In order to construct an accurate model in real time, we adopt a quasi-random image sampling technique based on Sobol sequences. We test the proposed method on a database of 145 images and demonstrate that its accuracy, even with a few number of samples, is satisfactory and significantly better than the segmentation obtained by k-means clustering. Moreover, the proposed method does not require the number of segments to be specified a priori.


gaussian mixture modelling lip tracking 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Dempster, A., Laird, N., Rubin, D.: Maximum likelyhood from incomplete data via the em algorithm. Journal of the Royal Statistical Society 39(1) (1977) 1–38zbMATHMathSciNetGoogle Scholar
  2. 2.
    Dupont, S., Luettin, J.: Audio-visual speech modeling for continuous speech recognition. IEEE Transactions on Multimedia 2(3) (Sep 2000) 141–151Google Scholar
  3. 3.
    Galambos, C., Kittler, J., Matas, J.: Sampling techniques for the progressive probabilistic Hough transform. (2001) submitted to Pattern Recognition LettersGoogle Scholar
  4. 4.
    Kittler, J., Messer, K. Sadeghi, M.: Model validation for model selection. Second Conference on Advances in Pattern Recognition. Brazil, 11–14 March (2001)Google Scholar
  5. 5.
    McGurk, H., MacDonald, J.: Hearing lips and seeing voices. Nature 264 (1976) 746–748CrossRefGoogle Scholar
  6. 6.
    Messer,Matas, J., Kittler, J., Luettin, J., Maitre, G.: Xm2vtsdb: The extended m2vts database. Second International Conference on Audio and Video-based Bio-metric Person Authentication (Mar 1999)Google Scholar
  7. 7.
    Press, W.H., Flanney, B.P., Teukolsky, S.A., Vetterling, W.T.: Numerical Recipes in C: The Art of Scientific Computing. Cambridge University Press (1992)Google Scholar
  8. 8.
    Rissanen, J.: Stochastic complexity. Journal of The Royal Statistical Society, Series 49(3) (1987) 223–239, 252–265zbMATHMathSciNetGoogle Scholar
  9. 9.
    Schwarz, G.: Estimating the dimension of a model. The Annals of Statistics 6(2) (1978) 461–464zbMATHMathSciNetCrossRefGoogle Scholar
  10. 10.
    Yang, J., Waibel, A.: A real-time face tracker. Third IEEE Workshop on Applications of Computer Vision. Sarasota, Florida, USA (1996) 142–147Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2001

Authors and Affiliations

  • Mohammad Sadeghi
    • 1
  • Josef Kittler
    • 1
  • Kieron Messer
    • 1
  1. 1.Centre for Vision, Speech and Signal Processing School of Electronics, Computing and MathematicsUniversity of SurreyGuildfordUK

Personalised recommendations