CollageParsing: Nonparametric Scene Parsing by Adaptive Overlapping Windows

Tung, Frederick; Little, James J.

doi:10.1007/978-3-319-10599-4_33

Frederick Tung¹⁹ &
James J. Little¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 8694))

Included in the following conference series:

European Conference on Computer Vision

17k Accesses
13 Citations

Abstract

Scene parsing is the problem of assigning a semantic label to every pixel in an image. Though an ambitious task, impressive advances have been made in recent years, in particular in scalable nonparametric techniques suitable for open-universe databases. This paper presents the CollageParsing algorithm for scalable nonparametric scene parsing. In contrast to common practice in recent nonparametric approaches, CollageParsing reasons about mid-level windows that are designed to capture entire objects, instead of low-level superpixels that tend to fragment objects. On a standard benchmark consisting of outdoor scenes from the LabelMe database, CollageParsing achieves state-of-the-art nonparametric scene parsing results with 7 to 11% higher average per-class accuracy than recent nonparametric approaches.

Download to read the full chapter text

Chapter PDF

Nonparametric Scene Parsing via Label Transfer

Superpixel Correspondence for Non-parametric Scene Parsing of Natural Images

Scene Parsing with Object Instance Inference Using Regions and Per-exemplar Detectors

Article 28 November 2014

Keywords

References

Alexe, B., Deselaers, T., Ferrari, V.: Measuring the objectness of image windows. IEEE Transactions on Pattern Analysis and Machine Intelligence 34(11), 2189–2202 (2012)
Article Google Scholar
Barnes, C., Shechtman, E., Finkelstein, A., Goldman, D.B.: PatchMatch: a randomized correspondence algorithm for structural image editing. In: Proc. ACM SIGGRAPH (2009)
Google Scholar
Boiman, O., Shechtman, E., Irani, M.: In defense of nearest-neighbor based image classification. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition (2008)
Google Scholar
Boykov, Y., Kolmogorov, V.: An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision. IEEE Transactions on Pattern Analysis and Machine Intelligence 26(9), 1124–1137 (2004)
Article Google Scholar
Boykov, Y., Veksler, O., Zabih, R.: Efficient approximate energy minimization via graph cuts. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(12), 1222–1239 (2001)
Article Google Scholar
Chen, X., Shrivastava, A., Gupta, A.: NEIL: extracting visual knowledge from web data. In: Proc. IEEE International Conference on Computer Vision (2013)
Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, pp. 886–893 (2005)
Google Scholar
Eigen, D., Fergus, R.: Nonparametric image parsing using adaptive neighbor sets. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition, pp. 2799–2806 (2012)
Google Scholar
Farabet, C., Couprie, C., Najman, L., LeCun, Y.: Scene parsing with multiscale feature learning, purity trees, and optimal covers. In: Proc. International Conference on Machine Learning (2012)
Google Scholar
Farhadi, A., Endres, I., Hoiem, D., Forsyth, D.: Describing objects by their attributes. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition, pp. 1778–1785 (2009)
Google Scholar
Felzenszwalb, P.F., Huttenlocher, D.P.: Efficient graph-based image segmentation. International Journal of Computer Vision 59(2), 167–181 (2004)
Article Google Scholar
Felzenszwalb, P., Girshick, R., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Transactions on Pattern Analysis and Machine Intelligence 32(9), 1627–1645 (2010)
Article Google Scholar
Gould, S., Fulton, R., Koller, D.: Decomposing a scene into geometric and semantically consistent regions. In: Proc. IEEE International Conference on Computer Vision (2009)
Google Scholar
Gould, S., Zhang, Y.: patchMatchGraph: Building a graph of dense patch correspondences for label transfer. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part V. LNCS, vol. 7576, pp. 439–452. Springer, Heidelberg (2012)
Chapter Google Scholar
Hays, J., Efros, A.A.: Scene completion using millions of photographs. In: Proc. ACM SIGGRAPH (2007)
Google Scholar
Heitz, G., Koller, D.: Learning spatial context: Using stuff to find things. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part I. LNCS, vol. 5302, pp. 30–43. Springer, Heidelberg (2008)
Chapter Google Scholar
Hou, X., Zhang, L.: Saliency detection: a spectral residual approach. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition (2007)
Google Scholar
Isola, P., Liu, C.: Scene collaging: analysis and synthesis of natural images with semantic layers. In: Proc. IEEE International Conference on Computer Vision (2013)
Google Scholar
Juneja, M., Vedaldi, A., Jawahar, C.V., Zisserman, A.: Blocks that shout: distinctive parts for scene classification. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition (2013)
Google Scholar
Kolmogorov, V., Zabih, R.: What energy functions can be minimized via graph cuts? IEEE Transactions on Pattern Analysis and Machine Intelligence 26(2), 147–159 (2004)
Article Google Scholar
Liu, C., Yuen, J., Torralba, A.: Nonparametric scene parsing via label transfer. IEEE Transactions on Pattern Analysis and Machine Intelligence 33(12), 2368–2382 (2011)
Article Google Scholar
Liu, C., Yuen, J., Torralba, A.: SIFT Flow: dense correspondence across scenes and its applications. IEEE Transactions on Pattern Analysis and Machine Intelligence 33(5), 978–994 (2011)
Article Google Scholar
Malisiewicz, T., Gupta, A., Efros, A.A.: Ensemble of Exemplar-SVMs for object detection and beyond. In: Proc. IEEE International Conference on Computer Vision, pp. 89–96 (2011)
Google Scholar
McCann, S., Lowe, D.G.: Spatially local coding for object recognition. In: Lee, K.M., Matsushita, Y., Rehg, J.M., Hu, Z. (eds.) ACCV 2012, Part I. LNCS, vol. 7724, pp. 204–217. Springer, Heidelberg (2013)
Chapter Google Scholar
Myeong, H., Chang, J.Y., Lee, K.M.: Learning object relationships via graph-based context model. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition, pp. 2727–2734 (2012)
Google Scholar
Oliva, A., Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope. International Journal of Computer Vision 42(3), 145–175 (2001)
Article MATH Google Scholar
Parikh, D., Grauman, K.: Relative attributes. In: Proc. IEEE International Conference on Computer Vision, pp. 503–510 (2011)
Google Scholar
Patterson, G., Hays, J.: SUN Attribute database: discovering, annotating, and recognizing scene attributes. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition, pp. 2751–2758 (2012)
Google Scholar
Russell, B.C., Torralba, A., Murphy, K., Freeman, W.T.: LabelMe: a database and web-based tool for image annotation. International Journal of Computer Vision 77(1-3), 157–173 (2008)
Article Google Scholar
van de Sande, K.E.A., Uijlings, J.R.R., Gevers, T., Smeulders, A.W.M.: Segmentation as selective search for object recognition. In: Proc. IEEE International Conference on Computer Vision, pp. 1879–1886 (2011)
Google Scholar
Singh, G., Košecká, J.: Nonparametric scene parsing with adaptive feature relevance and semantic context. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition, pp. 3151–3157 (2013)
Google Scholar
Singh, S., Gupta, A., Efros, A.A.: Unsupervised discovery of mid-level discriminative patches. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part II. LNCS, vol. 7573, pp. 73–86. Springer, Heidelberg (2012)
Chapter Google Scholar
Tighe, J., Lazebnik, S.: Finding things: image parsing with regions and per-exemplar detectors. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition, pp. 3001–3008 (2013)
Google Scholar
Tighe, J., Lazebnik, S.: Superparsing: scalable nonparametric image parsing with superpixels. International Journal of Computer Vision 101(2), 329–349 (2013)
Article MathSciNet Google Scholar
Tuytelaars, T., Fritz, M., Saenko, K., Darrell, T.: The NBNN kernel. In: Proc. IEEE International Conference on Computer Vision, pp. 1824–1831 (2011)
Google Scholar
Wu, J., Rehg, J.M.: CENTRIST: a visual descriptor for scene categorization. IEEE Transactions on Pattern Analysis and Machine Intelligence 33(8), 1489–1501 (2011)
Article Google Scholar
Xiao, J., Hays, J., Ehinger, K., Oliva, A., Torralba, A.: SUN database: large-scale scene recognition from abbey to zoo. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition, pp. 3485–3492 (2010)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of British Columbia, Vancouver, Canada
Frederick Tung & James J. Little

Authors

Frederick Tung
View author publications
You can also search for this author in PubMed Google Scholar
James J. Little
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, University of Toronto, 6 King’s College Road, M5H 3S5, Toronto, ON, Canada
David Fleet
Faculty of Electrical Engineering, Department of Cybernetics, Czech Technical University in Prague, Technicka 2, 166 27, Prague 6, Czech Republic
Tomas Pajdla
Max-Planck-Institut für Informatik, Campus E1 4, 66123, Saarbrücken, Germany
Bernt Schiele
ESAT - PSI, iMinds, KU Leuven, Kasteelpark Arenberg 10, Bus 2441, 3001, Leuven, Belgium
Tinne Tuytelaars

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tung, F., Little, J.J. (2014). CollageParsing: Nonparametric Scene Parsing by Adaptive Overlapping Windows. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds) Computer Vision – ECCV 2014. ECCV 2014. Lecture Notes in Computer Science, vol 8694. Springer, Cham. https://doi.org/10.1007/978-3-319-10599-4_33

Download citation

DOI: https://doi.org/10.1007/978-3-319-10599-4_33
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10598-7
Online ISBN: 978-3-319-10599-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

CollageParsing: Nonparametric Scene Parsing by Adaptive Overlapping Windows

Abstract

Chapter PDF

Similar content being viewed by others

Nonparametric Scene Parsing via Label Transfer

Superpixel Correspondence for Non-parametric Scene Parsing of Natural Images

Scene Parsing with Object Instance Inference Using Regions and Per-exemplar Detectors

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

CollageParsing: Nonparametric Scene Parsing by Adaptive Overlapping Windows

Abstract

Chapter PDF

Similar content being viewed by others

Nonparametric Scene Parsing via Label Transfer

Superpixel Correspondence for Non-parametric Scene Parsing of Natural Images

Scene Parsing with Object Instance Inference Using Regions and Per-exemplar Detectors

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation