Abstract
Large-scale visual data understanding is a long-standing popular problem in the computer vision society. When more visual data become available, problems become more challenging to traditional approaches. In this chapter, we will briefly review three important research problems, indoor/outdoor classification, outdoor scene categorization and geometric labeling. In addition, we will provide an overview of the book and its perspective benefits to the readers.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Boyd, C.R., Tolson, M.A., Copes, W.S.: Evaluating trauma care: the triss method. J. Trauma-Inj., Infect., Crit. Care 27(4), 370–378 (1987)
Chang, C.C., Lin, C.J.: Libsvm: a library for support vector machines. ACM Trans. Intell. Syst. Technol. (TIST) 2(3), 27 (2011)
Chatzichristofis, S.A., Boutalis, Y.S.: Cedd: color and edge directivity descriptor: a compact descriptor for image indexing and retrieval. In: Computer Vision Systems, pp. 312–322. Springer (2008)
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2005, vol. 1, pp. 886–893 (2005)
Dollar, P., Wojek, C., Schiele, B., Perona, P.: Pedestrian detection: an evaluation of the state of the art. IEEE Trans. Pattern Anal. Mach. Intell. 34(4), 743–761 (2012)
Felzenszwalb, P.F., Girshick, R.B., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1627–1645 (2010)
Freund, Y., Schapire, R.E., et al.: Experiments with a new boosting algorithm. In: ICML, vol. 96, pp. 148–156 (1996)
Gupta, A., Efros, A.A., Hebert, M.: Blocks world revisited: image understanding using qualitative geometry and mechanics. In: Computer Vision ECCV 2010, pp. 482–496. Springer (2010)
Gupta, A., Hebert, M., Kanade, T., Blei, D.M.: Estimating spatial layout of rooms using volumetric reasoning about objects and surfaces. In: Advances in Neural Information Processing Systems, pp. 1288–1296 (2010)
Hoiem, D., Efros, A., Hebert, M., et al.: Geometric context from a single image. In: Tenth IEEE International Conference on Computer Vision. ICCV 2005, vol. 1, pp. 654–661. IEEE (2005)
Jégou, H., Douze, M., Schmid, C.: Improving bag-of-features for large scale image search. Int. J. Comput. Vis. 87(3), 316–336 (2010)
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference, vol. 2, pp. 2169–2178 (2006)
Li, L.J., Fei-Fei, L.: What, where and who? classifying events by scene and object recognition. In: IEEE 11th International Conference on Computer Vision. ICCV 2007, pp. 1–8. IEEE (2007)
Lienhart, R., Maydt, J.: An extended set of haar-like features for rapid object detection. In: 2002 International Conference on Image Processing. Proceedings., vol. 1, pp. I–900. IEEE (2002)
Lim, J.H., Jin, J.S.: A structured learning framework for content-based image indexing and visual query. Multimed. Syst. 10(4), 317–331 (2005)
Liu, X., Zhao, Y., Zhu, S.C.: Single-view 3D scene parsing by attributed grammar. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 684–691. IEEE (2014)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Niebles, J.C., Fei-Fei, L.: A hierarchical model of shape and appearance for human action classification. In: IEEE Conference on Computer Vision and Pattern Recognition. CVPR’07, pp. 1–8. IEEE (2007)
Niebles, J.C., Wang, H., Fei-Fei, L.: Unsupervised learning of human action categories using spatial-temporal words. Int. J. Comput. Vis. 79(3), 299–318 (2008)
Oliva, A., Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope. Int. J. Comput. Vis. 42(3), 145–175 (2001)
Quattoni, A., Torralba, A.: Recognizing indoor scenes. In: Computer vision and pattern recognition (CVPR), 2009 IEEE conference (2009)
van Gemert, J.C., Geusebroek, J.M., Veenman, C.J., Smeulders, A.W.: Kernel codebooks for scene categorization. In: Computer Vision-ECCV 2008, pp. 696–709. Springer (2008)
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, vol. 1, pp. I–511. IEEE (2001)
Wu, J., Rehg, J.M.: Centrist: a visual descriptor for scene categorization. IEEE Trans. Pattern Anal. Mach. Intell. 33(8), 1489–1501 (2011)
Xiao, J., Hays, J., Ehinger, K.A., Oliva, A., Torralba, A.: Sun database: large-scale scene recognition from abbey to zoo. In: Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference, pp. 3485–3492. IEEE (2010)
Yan, J., Zhang, X., Lei, Z., Liao, S., Li, S.Z.: Robust multi-resolution pedestrian detection in traffic scenes. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3033–3040. IEEE (2013)
Zhou, H., Yuan, Y., Shi, C.: Object tracking using sift features and mean shift. Comput. Vis. Image Underst. 113(3), 345–352 (2009)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2016 The Author(s)
About this chapter
Cite this chapter
Chen, C., Ren, Y., Kuo, CC.J. (2016). Introduction. In: Big Visual Data Analysis. SpringerBriefs in Electrical and Computer Engineering(). Springer, Singapore. https://doi.org/10.1007/978-981-10-0631-9_1
Download citation
DOI: https://doi.org/10.1007/978-981-10-0631-9_1
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-0629-6
Online ISBN: 978-981-10-0631-9
eBook Packages: EngineeringEngineering (R0)