Road Scene Segmentation from a Single Image

Alvarez, Jose M.; Gevers, Theo; LeCun, Yann; Lopez, Antonio M.

doi:10.1007/978-3-642-33786-4_28

Jose M. Alvarez^21,23,
Theo Gevers^22,23,
Yann LeCun²¹ &
…
Antonio M. Lopez²³

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7578))

Included in the following conference series:

European Conference on Computer Vision

8805 Accesses
90 Citations

Abstract

Road scene segmentation is important in computer vision for different applications such as autonomous driving and pedestrian detection. Recovering the 3D structure of road scenes provides relevant contextual information to improve their understanding.

In this paper, we use a convolutional neural network based algorithm to learn features from noisy labels to recover the 3D scene layout of a road image. The novelty of the algorithm relies on generating training labels by applying an algorithm trained on a general image dataset to classify on–board images. Further, we propose a novel texture descriptor based on a learned color plane fusion to obtain maximal uniformity in road areas. Finally, acquired (off–line) and current (on–line) information are combined to detect road areas in single images.

From quantitative and qualitative experiments, conducted on publicly available datasets, it is concluded that convolutional neural networks are suitable for learning 3D scene layout from noisy labels and provides a relative improvement of 7% compared to the baseline. Furthermore, combining color planes provides a statistical description of road areas that exhibits maximal uniformity and provides a relative improvement of 8% compared to the baseline. Finally, the improvement is even bigger when acquired and current information from a single image are combined.

Download to read the full chapter text

Chapter PDF

Image semantic segmentation with an improved fully convolutional network

Article 23 November 2019

Fusing Classification and Segmentation DCNNs for Road Feature Mining on Aerial Images

Road Segmentation from High-Fidelity Remote Sensing Images using a Context Information Capture Network

Article 15 January 2022

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Brostow, G.J., Shotton, J., Fauqueur, J., Cipolla, R.: Segmentation and Recognition Using Structure from Motion Point Clouds. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part I. LNCS, vol. 5302, pp. 44–57. Springer, Heidelberg (2008)
Chapter Google Scholar
Lookingbill, A., Rogers, J., Lieb, D., Curry, J., Thrun, S.: Reverse optical flow for self-supervised adaptive autonomous robot navigation. IJCV 74, 287–302 (2007)
Article Google Scholar
Geronimo, D., Lopez, A.M., Sappa, A.D., Graf, T.: Survey of pedestrian detection for advanced driver assistance systems. PAMI 32, 1239–1258 (2010)
Article Google Scholar
Ladicky, L., Sturgess, P., Russell, C., Sengupta, S., Bastanlar, Y., Clocksin, W., Torr, P.: Joint optimization for object class segmentation and dense stereo reconstruction. IJCV, 1–12 (2011)
Google Scholar
Sturgess, P., Alahari, K., Ladicky, L., Torr, P.H.S.: Combining appearance and structure from motion features for road scene understanding. In: BMVC 2009 (2009)
Google Scholar
Hoiem, D., Efros, A.A., Hebert, M.: Recovering surface layout from an image. IJCV 75, 151–172 (2007)
Article Google Scholar
Saxena, A., Min, S., Ng, A.Y.: Make3d: Learning 3d scene structure from a single still image. PAMI 31(5), 824–840 (2009)
Article Google Scholar
Saenko, K., Kulis, B., Fritz, M., Darrell, T.: Adapting Visual Category Models to New Domains. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 213–226. Springer, Heidelberg (2010)
Chapter Google Scholar
Duan, L., Tsang, I.W., Xu, D.: Domain transfer multiple kernel learning. PAMI 34, 465–479 (2012)
Article Google Scholar
Kulis, B., Saenko, K., Darrell, T.: What you saw is not what you get: Domain adaptation using asymmetric kernel transforms. In: CVPR 2011, pp. 1785–1792 (2011)
Google Scholar
Alvarez, J.M., Lopez, A.M.: Road detection based on illuminant invariance. IEEE Trans. on ITS 12(1), 184–193 (2011)
Google Scholar
Rasmussen, C.: Grouping dominant orientations for ill-structured road following. In: CVPR 2004 (2004)
Google Scholar
Kong, H., Audibert, J., Ponce, J.: Vanishing point detection for road detection. In: CVPR 2009, pp. 96–103 (2009)
Google Scholar
Ess, A., Mueller, T., Grabner, H., Gool, L.J.V.: Segmentation-based urban traffic scene understanding. In: BMVC 2009 (2009)
Google Scholar
Gould, S., Fulton, R., Koller, D.: Decomposing a scene into geometric and semantically consistent regions. In: ICCV 2009 (2009)
Google Scholar
LeCun, Y., Bengio, Y.: Convolutional networks for images, speech, and time-series. In: The Handbook of Brain Theory and Neural Networks. MIT Press (1995)
Google Scholar
Cecotti, H., Graser, A.: Convolutional neural networks for p300 detection with application to brain-computer interfaces. PAMI 33, 433–445 (2011)
Article Google Scholar
Turaga, S.C., Murray, J.F., Jain, V., Roth, F., Helmstaedter, M., Briggman, K., Denk, W., Seung, H.S.: Convolutional networks can learn to generate affinity graphs for image segmentation. Neural Comp. 22, 511–538 (2010)
Article MATH Google Scholar
Levinshtein, A., Stere, A., Kutulakos, K.N., Fleet, D.J., Dickinson, S.J., Siddiqi, K.: Turbopixels: Fast superpixels using geometric flows. PAMI 31 (2009)
Google Scholar
Petrou, M.: Image Processing: Dealing with Texture. Wiley (2006)
Google Scholar
van de Sande, K.E.A., Gevers, T., Snoek, C.G.M.: Evaluation of color descriptors for object and scene recognition. In: CVPR 2008, pp. 453–464 (2008)
Google Scholar
Gonzalez, R., Woods, R.: Section 10.4. In: Digital Image Processing, 2nd edn. Prentice Hall (2002)
Google Scholar
Brostow, G.J., Fauqueur, J., Cipolla, R.: Semantic object classes in video: A high-definition ground truth database. Pattern Recognition Letters (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

Courant Institute of Mathematical Sciences, New York University, New York, NY, USA
Jose M. Alvarez & Yann LeCun
Faculty of Science, University of Amsterdam, Amsterdam, The Netherlands
Theo Gevers
Computer Vision Center, Univ. Autònoma de Barcelona, Barcelona, Spain
Jose M. Alvarez, Theo Gevers & Antonio M. Lopez

Authors

Jose M. Alvarez
View author publications
You can also search for this author in PubMed Google Scholar
Theo Gevers
View author publications
You can also search for this author in PubMed Google Scholar
Yann LeCun
View author publications
You can also search for this author in PubMed Google Scholar
Antonio M. Lopez
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Microsoft Research Ltd., CB3 0FB, Cambridge, UK
Andrew Fitzgibbon
Dept. of Computer Science, University of North Carolina, 27599, Chapel Hill, NC, USA
Svetlana Lazebnik
California Institute of Technology, 91125, Pasadena, CA, USA
Pietro Perona
Institute of Industrial Science, The University of Tokyo, 153-8505, Tokyo, Japan
Yoichi Sato
INRIA, 38330, Montbonnot, France
Cordelia Schmid

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Alvarez, J.M., Gevers, T., LeCun, Y., Lopez, A.M. (2012). Road Scene Segmentation from a Single Image. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds) Computer Vision – ECCV 2012. ECCV 2012. Lecture Notes in Computer Science, vol 7578. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33786-4_28

Download citation

DOI: https://doi.org/10.1007/978-3-642-33786-4_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33785-7
Online ISBN: 978-3-642-33786-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Road Scene Segmentation from a Single Image

Abstract

Chapter PDF

Similar content being viewed by others

Image semantic segmentation with an improved fully convolutional network

Fusing Classification and Segmentation DCNNs for Road Feature Mining on Aerial Images

Road Segmentation from High-Fidelity Remote Sensing Images using a Context Information Capture Network

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Road Scene Segmentation from a Single Image

Abstract

Chapter PDF

Similar content being viewed by others

Image semantic segmentation with an improved fully convolutional network

Fusing Classification and Segmentation DCNNs for Road Feature Mining on Aerial Images

Road Segmentation from High-Fidelity Remote Sensing Images using a Context Information Capture Network

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation