Context-Aided Human Recognition – Clustering

Song, Yang; Leung, Thomas

doi:10.1007/11744078_30

Context-Aided Human Recognition – Clustering

Yang Song¹⁹ &
Thomas Leung¹⁹

Conference paper

3217 Accesses
28 Citations

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3953))

Abstract

Context information other than faces, such as clothes, picture-taken-time and some logical constraints, can provide rich cues for recognizing people. This aim of this work is to automatically cluster pictures according to person’s identity by exploiting as much context information as possible in addition to faces. Toward that end, a clothes recognition algorithm is first developed, which is effective for different types of clothes (smooth or highly textured). Clothes recognition results are integrated with face recognition to provide similarity measurements for clustering. Picture-taken-time is used when combining faces and clothes, and the cases of faces or clothes missing are handled in a principle way. A spectral clustering algorithm which can enforce hard constraints (positive and negative) is presented to incorporate logic-based cues (e.g. two persons in one picture must be different individuals) and user feedback. Experiments on real consumer photos show the effectiveness of the algorithm.

Download to read the full chapter text

Chapter PDF

References

Fergus, R., Perona, P., Zisserman, A.: Object class recognition by unsupervised scale-invariant learning. In: CVPR (2003)
Google Scholar
Ioffe, S.: Red eye detection with machine learning. In: Proc. ICIP (2003)
Google Scholar
Ioffe, S.: Probabilistic linear discriminant analysis. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3954, pp. 531–542. Springer, Heidelberg (2006)
Chapter Google Scholar
Jordan, M.I., Jacobs, R.A.: Hierarchical mixtures of experts and the em algorithm. Neural Computation 6, 181–214 (1994)
Article Google Scholar
Leung, T.: Texton correlation for recognition. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3021, pp. 203–214. Springer, Heidelberg (2004)
Chapter Google Scholar
Lowe, D.: Object recognition from local scale-invariant features. In: ICCV (1999)
Google Scholar
Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. In: CVPR (2003)
Google Scholar
Ng, A.Y., Jordan, M.I., Weiss, Y.: On spectral clustering: Analysis and an algorithm. In: NIPS 14 (2002)
Google Scholar
Schneiderman, H., Kanade, T.: A statistical method for 3d object detection applied to faces and cars. In: Proc. CVPR (2000)
Google Scholar
Shi, J., Malik, J.: Normalized cuts and image segmentation. In: Proc. CVPR, June 1997, pp. 731–737 (1997)
Google Scholar
Sivic, J., Zisserman, A.: Video google: A text retrieval approach to object matching in videos. In: Proc. ICCV (2003)
Google Scholar
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Proc. CVPR (2001)
Google Scholar
Wagstaff, K., Cardie, C., Rogers, S., Schroedl, S.: Contrained k-means clustering with background knowledge. In: Proc. ICML (2001)
Google Scholar
Weiss, Y.: Segmentation using eigenvectors. In: Proc. ICCV (1999)
Google Scholar
Yu, S.X.: Computational Models of Perceptual Organization, Ph.d. thesis, Carnegie Mellon University (2003)
Google Scholar
Yu, S.X., Shi, J.: Grouping with bias. In: NIPS (2001)
Google Scholar
Yu, S.X., Shi, J.: Multiclass spectral clustering. In: Proc. ICCV (2003)
Google Scholar
Zhang, L., Chen, L., Li, M., Zhang, H.: Automated annotation of human faces in family albums. In: MM 2003 (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Fujifilm Software (California), Inc., 1740 Technology Drive, Suite 490, San Jose, CA, 95110, USA
Yang Song & Thomas Leung

Authors

Yang Song
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Leung
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Ljubljana, Slovenia
Aleš Leonardis
Institute for Computer Graphics and Vision, TU Graz, Inffeldgasse 16, 8010, Graz, Austria
Horst Bischof
Vision-based Measurement Group, Inst. of El. Measurement and Meas. Sign. Proc. Graz, University of Technology, Austria
Axel Pinz

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Song, Y., Leung, T. (2006). Context-Aided Human Recognition – Clustering. In: Leonardis, A., Bischof, H., Pinz, A. (eds) Computer Vision – ECCV 2006. ECCV 2006. Lecture Notes in Computer Science, vol 3953. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11744078_30

Download citation

DOI: https://doi.org/10.1007/11744078_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33836-9
Online ISBN: 978-3-540-33837-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics