Learning and Recognition Methods for Image Search and Video Retrieval

Puthenputhussery, Ajit; Chen, Shuo; Lee, Joyoung; Spasovic, Lazar; Liu, Chengjun

doi:10.1007/978-3-319-52081-0_2

Ajit Puthenputhussery⁴,
Shuo Chen⁵,
Joyoung Lee⁴,
Lazar Spasovic⁴ &
…
Chengjun Liu⁴

Part of the book series: Intelligent Systems Reference Library ((ISRL,volume 121 ))

659 Accesses
1 Citations

Abstract

Effective learning and recognition methods play an important role in intelligent image search and video retrieval. This chapter therefore reviews some popular learning and recognition methods that are broadly applied for image search and video retrieval . First some popular deep learning methods are discussed, such as the feedforward deep neural networks , the deep autoencoders , the convolutional neural networks, and the Deep Boltzmann Machine (DBM) . Second, Support Vector Machine (SVM), which is one of the popular machine learning methods, is reviewed. In particular, the linear support vector machine, the soft-margin support vector machine, the non-linear support vector machine , the simplified support vector machine , the efficient Support Vector Machine (eSVM) , and the applications of SVM to image search and video retrieval are discussed. Finally, other popular kernel methods and new similarity measures are briefly reviewed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Alpaydin, E.: Introduction to machine learning. The MIT Press, Cambridge (2010)
Google Scholar
Baldi, P.: Autoencoders, unsupervised learning, and deep architectures. J. Mach. Learn. Res. (Proceedings of ICML unsupervised and transfer learning) 27, 37–50 (2011)
Google Scholar
Baudat, G., Anouar, F.: Generalized discriminant analysis using a kernel approach. Neural Comput. 12(10), 2385–2404 (2000)
Article Google Scholar
Belhumeur, P.N., Hespanha, J.P., Kriegman, D.J.: Eigenfaces vs. Fisherfaces: recognition using class specific linear projection. IEEE Trans. Pattern Anal. Mach. Intell. 19(7), 711–720 (1997)
Google Scholar
Bengio, Y., Yao, L., Alain, G., Vincent, P.: Generalized denoising auto-encoders as generative models. Adv, Neural Inf. Process. Syst. 26, 899–907 (2013)
Google Scholar
Beveridge, J., Givens, G., Phillips, P., Draper, B.: Factors that influence algorithm performance in the face recognition grand challenge. Comput. Vis. Image Underst. 113(6), 750–762 (2009)
Article Google Scholar
Brodatz, P.: Textures: a photographic album for artists and designers. Dover, New York (1966)
Google Scholar
Burges, C.: Simplified support vector decision rule. In: Proceedings of the Thirteenth International Conference on Machine Learning (ICML’96), Bari, Italy, July 3–6, 1996 (1996)
Google Scholar
Chambon, S., Crouzil, A.: Similarity measures for image matching despite occlusions in stereo vision. Pattern Recognit. 44(9), 2063–2075 (2011)
Article Google Scholar
Chapelle, O., Haffner, P., Vapnik, V.N.: Support vector machines for histogram-based image classification. IEEE Trans. Neural Netw. 10(5), 1055–1064 (1999)
Article Google Scholar
Chen, J., Chen, C.: Reducing SVM classification time using multiple mirror classifers. IEEE Trans. Syst. Man Cybern. 34(2), 1173–1183 (2004)
Article Google Scholar
Chen, S., Liu, C.: Eye detection using color information and a new efficient SVM. In: IEEE Fourth International Conference on Biometrics: Theory, Applications, and Systems (BATS’10), Washington DC, USA (2010)
Google Scholar
Chen, S., Liu, C.: A new efficient SVM and its application to a real-time accurate eye localization system. In: International Joint Conference on Neural Networks, San Jose, California, USA (2011)
Google Scholar
Chen, S., Liu, C.: Eye detection using discriminatory haar features and a new efficient SVM. Image Vis. Comput. 33(c), 68–77 (2015)
Google Scholar
Chen, P., Lin, C., Scholkopf, B.: A tutorial on \(\upsilon \)-support vector machines. Appl. Stoch. Models Bus. Ind. 21, 111–136 (2005)
Article MathSciNet MATH Google Scholar
Cooke, T.: Two variations on Fisher’s linear discriminant for pattern recognition. IEEE Trans. Pattern Anal. Mach. Intell. 24(2), 268–273 (2002)
Google Scholar
Corso, J.J., Alahi, A., Grauman, K., Hager, G.D., Morency, L.P., Sawhney, H., Sheikh, Y.: Video Analysis for Bodyworn Cameras in Law Enforcement. The Computing Community Consortium whitepaper (2015)
Google Scholar
Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods. Cambridge University Press, Cambridge (2000)
Google Scholar
Dhanjal, C., Gunn, S., Shawe-Taylor, J.: Efficient sparse kernel feature extraction based on partial least squares. IEEE Trans. Pattern Anal. Mach. Intell. 31(8), 1347–1361 (2009)
Google Scholar
Etemad, K., Chellappa, R.: Discriminant analysis for recognition of human face images. J. Opt. Soc. Am. A 14, 1724–1733 (1997)
Article Google Scholar
Glorot, X., Bordes, A., Bengio, Y.: Deep sparse rectifier neural networks. In: Aistats, vol. 15, p. 275 (2011)
Google Scholar
Goodfellow, I., Mirza, M., Courville, A., Bengio, Y.: Multi-prediction deep boltzmann machines. In: Advances in Neural Information Processing Systems, pp. 548–556 (2013)
Google Scholar
Goodfellow, I., Bengio, Y., Courville, A.: Deep learning (2016). URL http://www.deeplearningbook.org (Book in preparation for MIT Press)
Gundimada, S., Asari, V.: Facial recognition using multisensor images based on localized kernel eigen spaces. IEEE Trans. Image Process. 18(6), 1314–1325 (2009)
Article MathSciNet Google Scholar
Guo, G., Zhang, H.J., Li, S.Z.: Distance-from-boundary as a metric for texture image retrieval. In: IEEE International Conference on Acoustics. Speech, and Signal Processing, vol. 3, pp. 1629–1632, Washington DC, USA (2001)
Google Scholar
Haykin, S.: Neural Networks — A Comprehensive Foundation. Macmillan College Publishing Company, Inc., New York (1994)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In: 2015 IEEE International Conference on Computer Vision (ICCV), pp. 1026–1034 (2015)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016)
Google Scholar
Herbrich, R.: Learning Kernel Classifiers: Theory and Algorithms. MIT Press, Cambridge (2002)
Google Scholar
Hoi, C.H., Chan, C.H., Huang, K., Lyu, M.R., King, I.: Biased support vector machine for relevance feedback in image retrieval. In: 2004 IEEE International Joint Conference on Neural Networks, vol. 4, pp. 3189–3194 (2004)
Google Scholar
Hong, P., Tian, Q., Huang, T.S.: Incorporate support vector machines to content-based image retrieval with relevance feedback. In: 2000 International Conference on Image Processing, vol. 3, pp. 750–753 (2000)
Google Scholar
Joachims, T.: Text categorization with support vector machines: learning with many relevant features. In: 10th European Conference on Machine Learning (1999)
Google Scholar
Kirby, M., Sirovich, L.: Application of the Karhunen-Loeve procedure for the characterization of human faces. IEEE Trans. Pattern Anal. Mach. Intell. 12(1), 103–108 (1990)
Google Scholar
Kittler, J., Hatef, M., Duin, R., Matas, J.: On combining classifiers. IEEE Trans. Pattern Anal. Mach. Intell. 20(3), 226–239 (1998)
Article Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Pereira, F., Burges, C.J.C., Bottou, L., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 25, pp. 1097–1105 (2012)
Google Scholar
Lee, Y., Mangasarian, O.: RSVM: Reduced support vector machines. In: The First SIAM International Conference on Data Mining (2001)
Google Scholar
Li, J., Allinson, N., Tao, D., Li, X.: Multitraining support vector machine for image retrieval. IEEE Trans. Image Process. 15(11), 3597–3601 (2006)
Article Google Scholar
Lin, K., Lin, C.: A study on reduced support vector machine. IEEE Trans. Neural Netw. 14(6), 1449–1559 (2003)
Article Google Scholar
Liu, C.: Gabor-based kernel PCA with fractional power polynomial models for face recognition. IEEE Trans. Pattern Anal. Mach. Intell. 26(5), 572–581 (2004)
Google Scholar
Liu, C.: Capitalize on dimensionality increasing techniques for improving face recognition grand challenge performance. IEEE Trans. Pattern Anal. Mach. Intell. 28(5), 725–737 (2006)
Google Scholar
Liu, C.: The Bayes decision rule induced similarity measures. IEEE Trans. Pattern Anal. Mach. Intell. 29(6), 1086–1090 (2007)
Google Scholar
Liu, C.: Clarification of assumptions in the relationship between the bayes decision rule and the whitened cosine similarity measure. IEEE Trans. Pattern Anal. Mach. Intell. 30(6), 1116–1117 (2008)
Google Scholar
Liu, C.: Effective use of color information for large scale face verification. Neurocomputing 101, 43–51 (2013)
Article Google Scholar
Liu, C.: Discriminant analysis and similarity measure. Pattern Recognit. 47(1), 359–367 (2014)
Article Google Scholar
Liu, Z., Liu, C.: Fusion of color, local spatial and global frequency information for face recognition. Pattern Recognit. 43(8), 2882–2890 (2010)
Article MATH Google Scholar
Liu, C., Mago, V. (eds.): Cross Disciplinary Biometric Systems. Springer, New York (2012)
Google Scholar
Liu, X., Chen, W., Yuen, P., Feng, G.: Learning kernel in kernel-based LDA for face recognition under illumination variations. IEEE Signal Process. Lett. 16(12), 1019–1022 (2009)
Article Google Scholar
Ma, C., Randolph, M., Drish, J.: A support vector machines-based rejection technique for speech recognition. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 381–384 (2001)
Google Scholar
Maas, A.L., Hannun, A.Y., Ng, A.Y.: Rectifier nonlinearities improve neural network acoustic models. In: Proceedings of ICML, vol. 30 (2013)
Google Scholar
Mika, S., Ratsch, G., Weston, J., Scholkopf, B., Mller, K.R.: Fisher discriminant analysis with kernels. In: Hu, Y.H., Larsen, J., Wilson, E., Douglas, S. (eds.) Neural Networks for Signal Processing IX, pp. 41–48. IEEE (1999)
Google Scholar
Moghaddam, B.: Principal manifolds and probabilistic subspaces for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 24(6), 780–788 (2002)
Google Scholar
Nagaraja, G., Murthy, S.R., Deepak, T.: Content based video retrieval using support vector machine classification. In: 2015 IEEE International Conference on Applied and Theoretical Computing and Communication Technology, pp. 821–827 (2015)
Google Scholar
Nakajima, C., Pontil, M., Poggio, T.: People recognition and pose estimation in image sequences. In: IEEE International Joint Conference on Neural Networks, vol. 4, pp. 189–194 (2000)
Google Scholar
Nguyen, D., Ho, T.: An efficient method for simplifying support vector machines. In: International Conference on Machine Learning, Bonn, Germany (2005)
Google Scholar
OToole, A.J., Phillips, P.J., Jiang, F., Ayyad, J., Penard, N., Abdi, H.: Face recognition algorithms surpass humans matching faces across changes in illumination. IEEE Trans. Pattern Anal. Mach. Intell. 29(9), 1642–1646 (2007)
Google Scholar
Pekalska, E., Haasdonk, B.: Kernel discriminant analysis for positive definite and indefinite kernels. IEEE Trans. Pattern Anal. Mach. Intell. 31(6), 1017–1032 (2009)
Google Scholar
Ranzato, M.A., Szummer, M.: Semi-supervised learning of compact document representations with deep networks. In: Proceedings of the 25th International Conference on Machine Learning, ICML ’08, pp. 792–799 (2008)
Google Scholar
Romdhani, S., Torr, B., Scholkopf, B., Blake, A.: Computationally efficient face detection. In: IEEE International Conference on Computer Vision (2001)
Google Scholar
Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning representations by back-propagating errors. Nature 323(6088), 533–536 (1986)
Article Google Scholar
Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bernstein, M., Berg, A.C., Fei-Fei, L.: ImageNet large scale visual recognition challenge. Int. J. Comput. Vis. (IJCV) 115(3), 211–252 (2015). doi:10.1007/s11263-015-0816-y
Article MathSciNet Google Scholar
Salakhutdinov, R., Hinton, G.: An efficient learning procedure for deep boltzmann machines. Neural Comput. 24(8), 1967–2006 (2012)
Article MathSciNet MATH Google Scholar
Santhiya, G., Singaravelan, S.: Multi-SVM for enhancing image search. Int. J. Sci. Eng. Res. 4(6) (2013)
Google Scholar
Scholkopf, B., Smola, A.: Learning with Kernels: Support Vector Machines, Regularization, Optimization and Beyond. MIT Press, Cambridege (2002)
Google Scholar
Scholkopf, B., Smola, A., Muller, K.: Nonlinear component analysis as a kernel eigenvalue problem. Neural Comput. 10, 1299–1319 (1998)
Article Google Scholar
Scholkopf, B., Mika, S., Burges, C., Knirsch, P., Muller, K., Ratsch, G., Smola, A.: Input space versus feature space in kernel-based methods. IEEE Trans. Neural Netw. 10(5), 1000–1017 (1999)
Article Google Scholar
Shawe-Taylor, J., Cristianini, N.: Kernel Methods for Pattern Analysis. Cambridge University Press, Cambridge (2004)
Google Scholar
Shin, C., Kim, K., Park, M., Kim, H.: Support vector machine-based text detection in digital video. In: IEEE Workshop on Neural Networks for Signal Processing (2000)
Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
Swets, D.L., Weng, J.: Using discriminant eigenfeatures for image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 18(8), 831–836 (1996)
Google Scholar
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015)
Google Scholar
Tao, D., Tang, X., Li, X., Wu, X.: Asymmetric bagging and random subspace for support vector machines-based relevance feedback in image retrieval. IEEE Trans. Pattern Anal. Mach. intell. 28(7), 1088–1099 (2006)
Article Google Scholar
Tefas, A., Kotropoulos, C., Pitas, I.: Using support vector machines to enhance the performance of elastic graph matching for frontal face authentication. IEEE Trans. Pattern Anal. Mach. Intell. 23(7), 735–746 (2001)
Article Google Scholar
Teow, L., Loe, K.: Robust vision-based features and classification schemes for off-line handwritten digit recognition. Pattern Recognit. (2002)
Google Scholar
Thung, K., Paramesran, R., Lim, C.: Content-based image quality metric using similarity measure of moment vectors. Pattern Recognit. 45(6), 2193–2204 (2012)
Article MATH Google Scholar
Tong, S., Chang, E.: Support vector machine active learning for image retrieval. In: the Ninth ACM International Conference on Multimedia, pp. 107–118 (2001)
Google Scholar
Turk, M., Pentland, A.: Eigenfaces for recognition. J. Cogn. Neurosci. 13(1), 71–86 (1991)
Article Google Scholar
Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York, NY (1995)
Google Scholar
Vapnik, Y.N.: The Nature of Statistical Learning Theory, second edn. Springer, New York (2000)
Google Scholar
Varma, M., Babu, B.: More generality in efficient multiple kernel learning. In: Proceedings of the International Conference on Machine Learning, Montreal, Canada (2009)
Google Scholar
Vedaldi, A., Gulshan, V., Varma, M., Zisserman, A.: Multiple kernels for object detection. In: Proceedings of the International Conference on Computer Vision, Kyoto, Japan (2009)
Google Scholar
Vincent, P., Larochelle, H., Lajoie, I., Bengio, Y., Manzagol, P.A.: Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion. J. Mach. Learn. Res. 11, 3371–3408 (2010)
MathSciNet MATH Google Scholar
Wang, Z., Chen, S., Sun, T.: Multik-MHKS: a novel multiple kernel learning algorithm. IEEE Trans. Pattern Anal. Mach. Intell. 30(2), 348–353 (2008)
Google Scholar
Wright, H.: Video Analysis for Body-worn Cameras in Law Enforcement (2015). http://www.cccblog.org/2015/08/06/video-analysis-for-body-worn-cameras-in-law-enforcement
Xie, C., Kumar, V.: Comparison of kernel class-dependence feature analysis (KCFA) with kernel discriminant analysis (KDA) for face recognition. In: Proceedings of IEEE on Biometrics: Theory, Application and Systems (2007)
Google Scholar
Yang, M.H., Ahuja, N., Kriegman, D.: Face recognition using kernel Eigenfaces. In: Proc. IEEE International Conference on Image Processing, Vancouver, Canada (2000)
Google Scholar
Yang, J., Yan, R., Hauptmann, A.G.: Cross-domain video concept detection using adaptive svms. In: 15th ACM International Conference on Multimedia, pp. 188–197 (2007)
Google Scholar
Yazdi, H.S., Javidi, M., Pourreza, H.R.: SVM-based relevance feedback for semantic video retrieval. Int. J. Signal Imaging Syst. Eng. 2(3), 99–108 (2009)
Article Google Scholar
Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: European Conference on Computer Vision, pp. 818–833. Springer, New York (2014)
Google Scholar
Zhang, L., Lin, F., Zhang, B.: Support vector machine learning for image retrieval. In: 2001 International Conference on Image Processing, vol. 2, pp. 721–724 (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

New Jersey Institute of Technology, Newark, NJ, 07102, USA
Ajit Puthenputhussery, Joyoung Lee, Lazar Spasovic & Chengjun Liu
The Neat Company, Philadelphia, PA, 19103, USA
Shuo Chen

Authors

Ajit Puthenputhussery
View author publications
You can also search for this author in PubMed Google Scholar
Shuo Chen
View author publications
You can also search for this author in PubMed Google Scholar
Joyoung Lee
View author publications
You can also search for this author in PubMed Google Scholar
Lazar Spasovic
View author publications
You can also search for this author in PubMed Google Scholar
Chengjun Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Ajit Puthenputhussery , Shuo Chen or Chengjun Liu .

Editor information

Editors and Affiliations

Department of Computer Science, New Jersey Institute of Technology, University Heights, Newark, New Jersey, USA
Chengjun Liu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Puthenputhussery, A., Chen, S., Lee, J., Spasovic, L., Liu, C. (2017). Learning and Recognition Methods for Image Search and Video Retrieval. In: Liu, C. (eds) Recent Advances in Intelligent Image Search and Video Retrieval. Intelligent Systems Reference Library, vol 121 . Springer, Cham. https://doi.org/10.1007/978-3-319-52081-0_2

Download citation

DOI: https://doi.org/10.1007/978-3-319-52081-0_2
Published: 19 April 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-52080-3
Online ISBN: 978-3-319-52081-0
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics