Skip to main content

Feature Representation and Extraction for Image Search and Video Retrieval

  • Conference paper
  • First Online:
Recent Advances in Intelligent Image Search and Video Retrieval

Part of the book series: Intelligent Systems Reference Library ((ISRL,volume 121 ))

Abstract

The ever-increasing popularity of intelligent image search and video retrieval warrants a comprehensive study of the major feature representation and extraction methods often applied in image search and video retrieval. Towards that end, this chapter reviews some representative feature representation and extraction approaches, such as the Spatial Pyramid Matching (SPM) , the soft assignment coding, the Fisher vector coding , the sparse coding and its variants, the Local Binary Pattern (LBP) , the Feature Local Binary Patterns (FLBP) , the Local Quaternary Patterns (LQP), the Feature Local Quaternary Patterns (FLQP) , the Scale-invariant feature transform (SIFT) , and the SIFT variants, which are broadly applied in intelligent image search and video retrieval .

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Ahonen, T., Hadid, A., Pietikainen, M.: Face recognition with local binary patterns. In: 8th European Conference on Computer Vision, Prague, Czech Republic, pp. 469–481 (2004)

    Google Scholar 

  2. Ahonen, T., Hadid, A., Pietikainen, M.: Face description with local binary patterns: application to face recognition. IEEE Trans. Pattern Anal. Mach. Intell. 28(12), 2037–2041 (2006)

    Article  MATH  Google Scholar 

  3. Aly, A., Farag, A.: Csift: a sift descriptor with color invariant characteristics. In: IEEE International Conference on Computer Vision and Pattern Recognition, New York, NY, pp. 1978–1983 (2006)

    Google Scholar 

  4. Arandjelovic, R., Zisserman, A.: Three things everyone should know to improve object retrieval. In: IEEE International Conference on Computer Vision and Pattern Recognition, Providence, RI, pp. 2911–2918 (2012)

    Google Scholar 

  5. Banerji, S., Sinha, A., Liu, C.: New image descriptors based on color, texture, shape, and wavelets for object and scene image classification. Neurocomputing 117, 173–185 (2013)

    Article  Google Scholar 

  6. Banerji, S., Verma, A., Liu, C.: Novel color LBP descriptors for scene and image texture classification. In: 15th International Conference on Image Processing, Computer Vision, and Pattern Recognition, Las Vegas, Nevada, USA (2011)

    Google Scholar 

  7. Bay, H., Tuytelaars, T., Van Gool, L.V.: SURF: speeded up robust features. Comput. Vision Image Underst. 110(3), 346–359 (2008)

    Article  Google Scholar 

  8. Beck, A., Teboulle, M.: A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM J. Imaging Sci. 2(1), 183–202 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  9. Bo, L., Ren, X., Fox, D.: Hierarchical matching pursuit for image classification: architecture and fast algorithms. In: Advances in Neural Information Processing Systems, pp. 2115–2123 (2011)

    Google Scholar 

  10. Brown, M., Ssstrunk, S.: Multi-spectral sift for scene category recognition. In: IEEE International Conference on Computer Vision and Pattern Recognition, Providence, RI, pp. 177–184 (2011)

    Google Scholar 

  11. Chen, S., Liu, C.: Eye detection using discriminatory haar features and a new efficient svm. Image Vision Comput. 33, 68–77 (2015)

    Article  Google Scholar 

  12. Chiu, L., Chang, T.S., Chen, J.Y., Chang, N.Y.C.: Fast sift design for real-time visual feature extraction. IEEE Trans. Image Process. 22(8), 3158–3167 (2013)

    Article  Google Scholar 

  13. Csurka, G., Dance, C., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: Workshop on Statistical Learning in Computer Vision, Prague (2004)

    Google Scholar 

  14. Dong, J., Soatto, S.: Domain-size pooling in local descriptors: Dsp-sift. In: IEEE International Conference on Computer Vision and Pattern Recognition, Boston, MA (2015)

    Google Scholar 

  15. Freund, Y., Iyer, R., Schapire, R.E., Singer, Y.: An efficient boosting algorithm for combining preferences. J. Mach. Learn. Res. 4, 933–969 (2003)

    MathSciNet  MATH  Google Scholar 

  16. Gao, S., Tsang, I.W.H., Chia, L.T.: Laplacian sparse coding, hypergraph laplacian sparse coding, and applications. IEEE Trans. Pattern Anal. Mach. Intell. 35(1), 92–104 (2013)

    Article  Google Scholar 

  17. Gemert, J.C., Geusebroek, J.M., Veenman, C.J., Smeulders, A.W.: Kernel codebooks for scene categorization. In: ECCV, pp. 696–709 (2008)

    Google Scholar 

  18. van Gemert, J.C., Veenman, C.J., Smeulders, A.W.M., Geusebroek, J.M.: Visual word ambiguity. IEEE Trans. Pattern Anal. Mach. Intell. 32(7), 1271–1283 (2010)

    Article  Google Scholar 

  19. Geusebroek, J., Boomgaard, R.v.d., Smeulders, A., Geerts, H.: Color invariance. IEEE Trans. Pattern Anal. Mach. Intell. 23(12), 1338–1350 (2001)

    Google Scholar 

  20. Ghaoui, L.E., Viallon, V., Rabbani, T.: Safe feature elimination in sparse supervised learning. Technical report UC/EECS-2010-126, EECS Department, University of California at Berkeley (2010)

    Google Scholar 

  21. Griffin, G., Holub, A., Perona, P.: Caltech-256 object category dataset. Technical report, California Institute of Technology (2007)

    Google Scholar 

  22. Gu, J., Liu, C.: Local quaternary patterns and feature local quaternary patterns. In: 16th International Conference on Image Processing, Computer Vision, and Pattern Recognition, Las Vegas, Nevada, USA (2012)

    Google Scholar 

  23. Gu, J., Liu, C.: Feature local binary patterns with application to eye detection. Neurocomputing 113, 138–152 (2013)

    Article  Google Scholar 

  24. Jegou, H., Perronnin, F., Douze, M., Sanchez, J., Perez, P., Schmid, C.: Aggregating local image descriptors into compact codes. IEEE Trans. Pattern Anal. Mach. Intell. 34(9), 1704–1716 (2012)

    Article  Google Scholar 

  25. Ke, Y., Sukthankar, R.: Pca-sift: a more distinctive representation for local image descriptors. In: IEEE International Conference on Computer Vision and Pattern Recognition, Washington, DC, vol. 2, pp. 506–513 (2004)

    Google Scholar 

  26. Kubelka, P.: New contribution to the optics of intensely light-scattering materials, part i. J. Opt. Soc. Am. 38(5), 448–457 (1948)

    Article  MathSciNet  Google Scholar 

  27. Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: CVPR, pp. 2169–2178 (2006)

    Google Scholar 

  28. Lee, H., Battle, A., Raina, R., Ng, A.Y.: Efficient sparse coding algorithms. In: NIPS, pp. 801–808 (2007)

    Google Scholar 

  29. Liu, C.: Gabor-based kernel PCA with fractional power polynomial models for face recognition. IEEE Trans. Pattern Anal. Mach. Intell. 26(5), 572–581 (2004)

    Article  Google Scholar 

  30. Liu, C.: Effective use of color information for large scale face verification. Neurocomputing 101, 43–51 (2013)

    Article  Google Scholar 

  31. Liu, C., Mago, V. (eds.): Cross Disciplinary Biometric Systems. Springer, Berlin (2012)

    Google Scholar 

  32. Liu, C., Wechsler, H.: Gabor feature based classification using the enhanced Fisher linear discriminant model for face recognition. IEEE Trans. Image Process. 11(4), 467–476 (2002)

    Article  Google Scholar 

  33. Liu, Q., Puthenputhussery, A., Liu, C.: A novel inheritable color space with application to kinship verification. In: the IEEE Winter Conference on Applications of Computer Vision, Lake Placid, New York (2016)

    Google Scholar 

  34. Liu, Z., Liu, C.: Fusion of color, local spatial and global frequency information for face recognition. Pattern Recognit. 43(8), 2882–2890 (2010)

    Article  MATH  Google Scholar 

  35. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)

    Article  Google Scholar 

  36. Mairal, J., Bach, F., Ponce, J., Sapiro, G.: Online dictionary learning for sparse coding. In: ICML, p. 87 (2009)

    Google Scholar 

  37. Morel, J., Yu, G.: Asift: a new framework for fully affine invariant image comparison. SIAM J. Imaging Sci. 2(2), 438–469 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  38. Ojala, T., Pietikainen, M., Harwood, D.: Performance evaluation of texture measures with classification based on kullback discrimination of distributions. In: Proceedings of the 12th IAPR International Conference on Pattern Recognition, Jerusalem, Israel, pp. 582–585 (1994)

    Google Scholar 

  39. Ojala, T., Pietikainen, M., Harwood, D.: A comparative study of texture measures with classification based on feature distributions. Pattern Recognit. 29(1), 51–59 (1996)

    Article  Google Scholar 

  40. Ojala, T., Pietikainen, M., Maenpaa, T.: Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Anal. Mach. Intell. 24(7), 971–987 (2002)

    Article  MATH  Google Scholar 

  41. Pang, Y., Lia, W., Yuanb, Y., Panc, J.: Fully affine invariant surf for image matching. Neurocomputing 85(8), 6–10 (2012)

    Article  Google Scholar 

  42. Sanchez, J., Perronnin, F., Mensink, T., Verbeek, J.J.: Image classification with the fisher vector: theory and practice. Int. J. Comput. Vis. 105(3), 222–245 (2013)

    Article  MathSciNet  MATH  Google Scholar 

  43. Sinha, A., Banerji, S., Liu, C.: New color GPHOG descriptors for object and scene image classification. Mach. Vis. Appl. 25(2), 361–375 (2014)

    Article  Google Scholar 

  44. Verma, A., Liu, C.: Novel EFM-KNN classifier and a new color descriptor for image classification. In: 20th IEEE Wireless and Optical Communications Conference (Multimedia Services and Applications), Newark, New Jersey, USA (2011)

    Google Scholar 

  45. Verma, A., Liu, C., Jia, J.: New color SIFT descriptors for image classification with applications to biometrics. Int. J. Biom. 1(3), 56–75 (2011)

    Article  Google Scholar 

  46. Wang, J., Yang, J., Yu, K., Lv, F., Huang, T.S., Gong, Y.: Locality-constrained linear coding for image classification. In: CVPR, pp. 3360–3367 (2010)

    Google Scholar 

  47. Wang, J., Zhou, J., Liu, J., Wonka, P., Ye, J.: A safe screening rule for sparse logistic regression. In: Advances in Neural Information Processing Systems, pp. 1053–1061 (2014)

    Google Scholar 

  48. Wang, J., Zhou, J., Wonka, P., Ye, J.: Lasso screening rules via dual polytope projection. In: Advances in Neural Information Processing Systems, pp. 1070–1078 (2013)

    Google Scholar 

  49. Xiang, Z., Xu, H., Ramadge, P.: Learning sparse representations of high dimensional data on large scale dictionaries. Adv. Neural Inf. Process. Syst. 24, 900–908 (2011)

    Google Scholar 

  50. Xiang, Z.J., Ramadge, P.J.: Fast lasso screening tests based on correlations. In: 2012 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2012, pp. 2137–2140 (2012)

    Google Scholar 

  51. Xin, X., Li, Z., Katsaggelos, A.: Laplacian sift in visual search. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, Kyoto, Japan (2012)

    Google Scholar 

  52. Yan, S., Xu, D., Zhang, B., Zhang, H.J., Yang, Q., Lin, S.: Graph embedding and extensions: a general framework for dimensionality reduction. IEEE Trans. Pattern Anal. Mach. Intell. (2007)

    Google Scholar 

  53. Yang, J., Yu, K., Gong, Y., Huang, T.S.: Linear spatial pyramid matching using sparse coding for image classification. In: CVPR, pp. 1794–1801 (2009)

    Google Scholar 

  54. Zhang, S., Tian, Q., Lu, K., Huang, Q., Gao, W.: Edge-sift: discriminative binary descriptor for scalable partial-duplicate mobile search. IEEE Trans. Image Process. 29(1), 40–51 (2013)

    Google Scholar 

  55. Zheng, M., Bu, J., Chen, C., Wang, C., Zhang, L., Qiu, G., Cai, D.: Graph regularized sparse coding for image representation. IEEE Trans. Image Process. 20(5), 1327–1336 (2011)

    Article  MathSciNet  Google Scholar 

  56. Zhou, X., Yu, K., Zhang, T., Huang, T.S.: Image classification using super-vector coding of local image descriptors. In: Proceedings of the 11th European Conference on Computer Vision: Part V, pp. 141–154 (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Qingfeng Liu , Yukhe Lavinia , Abhishek Verma or Chengjun Liu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Liu, Q., Lavinia, Y., Verma, A., Lee, J., Spasovic, L., Liu, C. (2017). Feature Representation and Extraction for Image Search and Video Retrieval. In: Liu, C. (eds) Recent Advances in Intelligent Image Search and Video Retrieval. Intelligent Systems Reference Library, vol 121 . Springer, Cham. https://doi.org/10.1007/978-3-319-52081-0_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-52081-0_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-52080-3

  • Online ISBN: 978-3-319-52081-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics