Abstract
In this paper, we incorporate shape detection into contextual scene labeling and make use of both shape, texture, and context information in a graphical representation. We propose a candidacy graph, whose vertices are two types of recognition candidates for either a superpixel or a window patch. The superpixel candidates are generated by a discriminative classifier with textural features as well as the window proposals by a learned deformable templates model in the bottom-up steps. The contextual and competitive interactions between graph vertices, in form of probabilistic connecting edges, are defined by two types of contextual metrics and the overlapping of their image domain, respectively. With this representation, a composite clustering sampling algorithm is proposed to fast search the optimal convergence globally using the Markov Chain Monte Carlo (MCMC). Our approach is applied on both lotus hill institute (LHI) and MSRC public datasets and achieves the state-of-art results.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Laferte, J.M., Heitz, F., Perez, P., Fabre, E.: Hierarchical statistical methord for the fusion of multiresolution data. In: ICCV (1995)
Xuming, H., Zemel, R.S., Carreira-Perpinan, M.A.: Multiscale conditional random fields for image labeling. In: CVPR (2004)
Shotton, J., Winn, J., Rother, C., Criminisi, A.: Textonboost: Joint appearnce, shape and context modeling for multiclass object recognition and segmentation. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3951, pp. 1–15. Springer, Heidelberg (2006)
Gould, S., Rodgers, J., Cohen, D., Elidan, D., Koller, D.: Multi-class segmentation with relative location prior. IJCV 80, 300–316 (2008)
Galleguillos, C., Rabinovich, A., Belongie, S.: Object categorization using co-occurrence, location and appearance. In: CVPR (2008)
Rabinovich, A., Vedaldi, A., Galleguillos, C., Wiewiora, E., Belongie, S.: Object in context. In: ICCV (2007)
Bastian, L., Ales, L., Bernt, S.: Combined object categorization and segmentation with an implicit shape model. In: ECCV Workshop on Statistical Learning in Computer Vision (2004)
Zuowen, T.: Auto-context and its application for high-level vision tasks. In: CVPR (2008)
Kolmogorov, V., Zabin, R.: What energy functions can be minimized via graph cuts? PAMI 26, 147–159 (2004)
Frey, B.J., Mackay, D.: A revolution: Belief propagation in graphs with cycles. In: NIPS (1997)
Apt, K.: The essence of constraint propagation. Theoretical Computer Science 221, 179–210 (1999)
Geman, S., Geman, D.: Stochastic relaxation, gibbs distributions and the bayesian restoration of images. PAMI 6, 721–741 (1984)
Ferrari, V., Jurie, F., Schmid, C.: Accurate object detection with deformable shape models learnt from images. In: CVPR (2007)
Belongie, S., Malik, J., Puzicha, J.: Shape matching and object recognition using shape contexts. PAMI 24, 509–522 (2002)
Yinnian, W., Zhangzhang, S., Songchun, Z.: Deformable template as active basis. In: ICCV (2007)
Xiang, B., Xinggan, W., Longin, J.L., Wenyu, L., Zuowen, T.: Active Skeleton for Non-rigid Object Detection. In: ICCV (2009)
Borenstein, E., Ullman, S.: Combined top-down/bottom-up segmentation. PAMI 30, 2109–2125 (2008)
Tu, Z.W., Chen, X., Yulle, A., Zhu, S.: Image parsing: Unifying segmentation, detection, and recognition. IJCV 63 (2005)
Metropolis, N.: Equation of state calculations by fast computing machines. Journal of Chemical Physics 21, 1087–1092 (1953)
Barbu, A., Zhu, S.: Generalizing swendsen-wang for image analysis. Journal of Computational and Graphical Statistics 16, 877–900 (2007)
Yao, B., Yang, X., Zhu, S.-C.: Introduction to a large-scale general purpose ground truth database: Methodology, annotation tool and benchmarks. In: Yuille, A.L., Zhu, S.-C., Cremers, D., Wang, Y. (eds.) EMMCVPR 2007. LNCS, vol. 4679, pp. 169–183. Springer, Heidelberg (2007)
Bai, X., Sapiro, G.: Geodesic matting: A framework for fast interactive image and video segmentation and matting. IJCV 82, 113–132 (2009)
Torralba, A., Murphy, K.P., Freeman, W.T.: Sharing features: efficient boosting procedures for multiclass object detection. In: CVPR (2004)
Lin, Y., Meer, P., Foran, D.J.: Multiple class segmentation using a unified framework over mean-shift patches. In: CVPR (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhou, Q., Liu, W. (2011). Inference Scene Labeling by Incorporating Object Detection with Explicit Shape Model. In: Kimmel, R., Klette, R., Sugimoto, A. (eds) Computer Vision – ACCV 2010. ACCV 2010. Lecture Notes in Computer Science, vol 6494. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19318-7_30
Download citation
DOI: https://doi.org/10.1007/978-3-642-19318-7_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19317-0
Online ISBN: 978-3-642-19318-7
eBook Packages: Computer ScienceComputer Science (R0)