Abstract
In this paper, we propose a unified scheme of subspace and distance metric learning under the Bayesian framework for image classification. According to the local distribution of data, we divide the k-nearest neighbors of each sample into the intra-class set and the inter-class set, and we aim to learn a distance metric in the embedding subspace, which can make the distances between the sample and its intra-class set smaller than the distances between it and its inter-class set. To reach this goal, we consider the intra-class distances and the inter-class distances to be from two different probability distributions respectively, and we model the goal with minimizing the overlap between two distributions. Inspired by the Bayesian classification error estimation, we formulate the objective function by minimizing the Bhattachyrra coefficient between two distributions. We further extend it with the kernel trick to learn nonlinear distance metric. The power and generality of the proposed approach are demonstrated by a series of experiments on the CMU-PIE face database, the extended YALE face database, and the COREL-5000 nature image database.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Sim, T., Baker, S., Bsat, M.: The cmu pose, illumination, and expression database. IEEE Trans. on PAMI 25(12), 1615–1618 (2003)
Lee, K.C., Ho, J., Kriegman, D.: Acquiring linear subspaces for face recognition under variable lighting. IEEE Trans. Pattern Analysis and Machine Intelligence 27(5), 1–15 (2005)
Tong, H., He, J., Li, M., Zhang, C., Ma, W.: Graph based multi-modality learning. In: Proc. ACM Multimedia (2005)
Turk, M., Pentland, A.: Eigenfaces for recognition. Journal of Cognitive Neuroscience 3(1), 72–86 (1991)
Zhao, W., Chellappa, R., Phillips, P.J.: Subspace linear discriminant analysis for face recognition. Tech. Report CAR-TR-914, University of Maryland (1999)
Belhumeur, P.N., Hespanha, J.P., Kriegman, D.J.: Eigenfaces vs. fisherfaces: Recognition using class specific linear projection. IEEE Trans. Pattern Analysis and Machine Intelligence 19(7), 711–720 (1997)
Scholkopf, B., Smola, A., Muller, K.R.: Nonlinear component analysis as a kernel eigenvalue problem. Neural Computation 10(5), 1299–1319 (1998)
Mika, S., Ratsch, G., Weston, J.: Fisher discriminant analysis with kernels. In: Proc. of Neural Networks for Signal Processing Workshop (1999)
Roweis, S.T., Saul, L.K.: Nonlinear dimensionality reduction by locally linear embedding. Sciences 290(5500), 2323–2326 (2000)
Tenenbaum, J.B., de Silva, V., Langford, J.C.: A global geometric framework for nonlinear dimensionality reduction. Sciences 290(5500), 2319–2323 (2000)
He, X.F., Niyogi, P.: Locality preserving projections. In: Advances in Neural Information Processing Systems (NIPS) (2003)
Chen, H.T., Chang, H.W., Liu, T.L.: Local discriminant embedding and its variants. In: Proc. of Int. Conf. Computer Vision and Pattern Recognition (CVPR) (2005)
Sugiyama, M.: Local fisher discriminant analysis for supervised dimensionality reduction. In: Proc. of Int. Conf. Machine Learning (ICML) (2006)
Yan, S.C., Xu, D., Zhang, B.Y., Zhang, H.J.: Graph embedding: A general framework for dimensionality reduction. In: Proc. of Int. Conf. Computer Vision and Pattern Recognition (CVPR) (2005)
Xing, E., Ng, A., Jordan, M., Russell, S.: Distance metric learning, with application to clustering with side-information. In: Advances in Neural Information Processing Systems (NIPS) (2004)
Bar-Hillel, A., Hertz, T., Shental, N., Weinshall, D.: Learning a mahalanobis metric from equivalence constrains. Journal of Machine Learning Research (2005)
Shental, N., Hertz, T., Weinshall, D., Pavel, M.: Adjustment learning and relevant component analysis. In: Europen Conf. on Computer Vision (ECCV) (2003)
Hoi, S.C., Liu, W., Lyu, M.R., Ma, W.Y.: Learning distance metrics with contextual constraints for image retrieval. In: Proc. of Int. Conf. Computer Vision and Pattern Recognition (CVPR) (2006)
Weinberger, K.Q., Blitzer, J., Saul, L.K.: Metric learning for large margin nearest neighbor classification. In: Advances in Neural Information Processing Systems (NIPS) (2005)
Torresani, L., Lee, K.C.: Large margin component analysis. In: Advances in Neural Information Processing Systems (NIPS) (2006)
Goldberger, J., Roweis, S., Hinton, G., Salakhutdinov, R.: Neighborhood component analysis. In: Advances in Neural Information Processing Systems (NIPS) (2004)
Globerson, A., Roweis, S.: Metric learning by collapsing classes. In: Advances in Neural Information Processing Systems (NIPS) (2005)
Yang, L., Jin, R., Sukthankar, R., Liu, Y.: An efficient algorithm for local distance metric learning. In: AAAI (2006)
Salakhutdinov, R., Roweis, S.T.: Adaptive over- relaxed bound optimization methods. In: Proc. of Int. Conf. Machine Learning (ICML) (2003)
Comaniciu, D., Ramesh, V., Meer, P.: Kernel-based object tracking. IEEE Trans. on Pattern Analysis and Machine Intelligence 25(5), 564–577 (2003)
Fukunaga, K.: Introduction to statistical pattern recognition. Academic Press, New York (1990)
Moghaddam, B., Jebara, T., Pentland, A.: Bayesian face recognition. Pattern Recognition 33(11), 1771–1782 (2000)
Cai, D., He, X., Han, J.: Using graph model for face analysis. Tech Report UIUCDCS-R-2636, University of UIUC (2005)
Wu, H., Lu, H.Q., Ma, S.D.: A practical svm-based algorithm for ordinal regression in image retrieval. In: Proc. ACM Multimedia (2003)
Osuna, E., Freund, R., Girosi, F.: Support vector machines: Training and applications. Tech Report, AI Lab, MIT (1997)
Chen, Y., Wang, J.Z.: Image categorization by learning and reasoning with regions. Journal of Machine Learning Research (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Liu, Q., Metaxas, D.N. (2009). Unifying Subspace and Distance Metric Learning with Bhattacharyya Coefficient for Image Classification. In: Nielsen, F. (eds) Emerging Trends in Visual Computing. ETVC 2008. Lecture Notes in Computer Science, vol 5416. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00826-9_11
Download citation
DOI: https://doi.org/10.1007/978-3-642-00826-9_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00825-2
Online ISBN: 978-3-642-00826-9
eBook Packages: Computer ScienceComputer Science (R0)