Abstract
This chapter presents a computational model to address one prominent psychological behavior of human beings to recognize images. The basic pursuit of our method can be concluded as that differences among multiple images help visual recognition. Generally speaking, we propose a statistical framework to distinguish what kind of image features capture sufficient category information and what kind of image features are common ones shared in multiple classes. Mathematically, the whole formulation is subject to a generative probabilistic model. Meanwhile, a discriminative functionality is incorporated into the model to interpret the differences among all kinds of images. The whole Bayesian formulation is solved in an Expectation-Maximization paradigm. After finding those discriminative patterns among different images, we design an image categorization algorithm to interpret how these differences help visual recognition within the bag-of-feature framework. The proposed method is verified on a variety of image categorization tasks including outdoor scene images, indoor scene images as well as the airborne SAR images from different perspectives.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
This chapter is reproduced from [1], an open access article with the authors’ own copyright.
References
Deng Y, Zhao Y, Liu Y, Dai Q (2013) Differences help recognition: a probabilistic interpretation. PLoS One 8(6):e63385
Deng Y, Li Y, Qian Y, Ji X, Dai Q (2014) Visual words assignment via information-theoretic manifold embedding. To appear in IEEE Transactions on Cybernetics
Friedman J, Hastie T, Tibshirani R (2001) The elements of statistical learning, vol 1. Springer Series in Statistics, Berlin
Fukunaga K (1990) Introduction to statistical pattern recognition. Academic Press, Massachusetts
Deng Y, Dai Q, Zhang Z (2011) Graph laplace for occluded face completion and recognition. IEEE Trans Image Process 20(8):2329–2338
Lafferty JD, McCallum A, Pereira FCN (2001) Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Proceedings of the eighteenth international conference on machine learning, ser. ICML ’01. Morgan Kaufmann Publishers Inc, San Francisco, USA, pp 282–289. [Online]. Available at http://dl.acm.org/citation.cfm?id=645530.655813
Pan L, Chu W, Saragih J, De la Torre F, Xie M (2011) Fast and robust circular object detection with probabilistic pairwise voting. IEEE Signal Process Lett 18(11):639–642
Deng Y, Dai Q, Wang R, Zhang Z (2012) Commute time guided transformation for feature extraction. Comput Vision Image Underst 116(4):473–483
Liu R, Lin Z, Su Z, Tang K (2010) Feature extraction by learning Lorentzian metric tensor and its extensions. Pattern Recogn 43(10):3298–3306
Fei-Fei L, Perona P (2005) A Bayesian hierarchical model for learning natural scene categories. In: IEEE Computer Society conference on computer vision and pattern recognition, vol 2. IEEE, pp 524–531
Quattoni A, Torralba A (2009) Recognizing indoor scenes. In: IEEE conference on computer vision and pattern recognition, IEEE, pp 413–420
Libioulle C, Louis E, Hansoul S, Sandor C, Farnir F, Franchimont D, Vermeire S, Dewit O, De Vos M, Dixon A et al (2007) Novel Crohn disease locus identified by genome-wide association maps to a gene desert on 5p13. 1 and modulates expression of PTGER4. PLoS Genet 3(4):e58
Brynedal B, Duvefelt K, Jonasdottir G, Roos I, Ã…kesson E, Palmgren J, Hillert J (2007) HLA-A confers an HLA-DRB1 independent influence on the risk of multiple sclerosis. PLoS One 2(7):e664
LaFramboise T, Weir B, Zhao X, Beroukhim R, Li C, Harrington D, Sellers W, Meyerson M (2005) Allele-specific amplification in cancer revealed by SNP array analysis. PLoS Comput Biol 1(6):e65
Shoemaker B, Panchenko A (2007) Deciphering protein–protein interactions. part ii. Computational methods to predict protein and domain interaction partners. PLoS Comput Biol 3(4):e43
Borman S (2009) The expectation maximization algorithm-a short tutorial. Technical report
Bishop C (2006) Pattern recognition and machine learning, vol 4, no 4. Springer, New York
van Gemert JC, Veenman CJ, Smeulders AW, Geusebroek J-M (2010) Visual word ambiguity. IEEE Trans Pattern Anal Mach Intell 32(7):1271–1283
Yang J, Yu K, Gong Y, Huang T (2009) Linear spatial pyramid matching using sparse coding for image classification. In: IEEE conference on computer vision and pattern recognition. IEEE, pp 1794–1801
Svetlana L, Cordelia S, Jean P (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: IEEE Computer Society conference on computer vision and pattern recognition, vol 2. IEEE, pp 2169–2178
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2015 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Deng, Y. (2015). Discriminative Structure for Visual Signal Understanding. In: High-Dimensional and Low-Quality Visual Information Processing. Springer Theses. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-44526-6_5
Download citation
DOI: https://doi.org/10.1007/978-3-662-44526-6_5
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-44525-9
Online ISBN: 978-3-662-44526-6
eBook Packages: EngineeringEngineering (R0)