Skip to main content

Learning to Select Long-Track Features for Structure-From-Motion and Visual SLAM

  • Conference paper
  • First Online:
Pattern Recognition (GCPR 2016)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 9796))

Included in the following conference series:

Abstract

With the emergence of augmented reality platforms, Structure-From-Motion or visual SLAM approaches have regained in importance in order to deliver the next generation of immersive 3D experiences. As a new quality is achieved by deployment on mobile devices, computational efficiency plays an important role. In this work, we aim to reduce complexity by limiting the number of features without sacrificing quality. We select a subset of image features, using a learning based approach. A random forest is trained to pick 2D image features which are likely to be significant for a 3D reconstruction. Additionally, we aim for an objective that selects long track features, so that they can be “re-used” in multiple frames. We evaluate our feature selection technique on real world sequences and show a significant reduction of image features and the resulting decreased computation time is not effecting the accuracy of the 3D reconstruction.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    We acknowledge Hartmann et al. for providing their PARK datasets.

References

  1. Breiman, L., Friedman, J., Olshen, R., Stone, C.: Classification and Regression Trees. Wadsworth and Brooks, Monterey (1984)

    MATH  Google Scholar 

  2. Gionis, A., Indyk, P., Motwani, R.: Similarity search in high dimensions via hashing. In: Proceedings of the 25th International Conference on Very Large Data Bases. VLDB 1999, pp. 518–529. Morgan Kaufmann Publishers Inc., San Francisco (1999). http://dl.acm.org/citation.cfm?id=645925.671516

  3. Hartley, R.I., Zisserman, A.: Multiple View Geometry in Computer Vision, 2nd edn. Cambridge University Press, Cambridge (2004). ISBN: 0521540518

    Book  MATH  Google Scholar 

  4. Hartmann, W., Havlena, M., Schindler, K.: Predicting matchability. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 9–16, June 2014

    Google Scholar 

  5. Hauagge, D., Snavely, N.: Image matching using local symmetry features. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 206–213, June 2012

    Google Scholar 

  6. Kalman, R.: On the general theory of control systems. IRE Trans. Autom. Control 4(3), 110–110 (1959)

    Article  Google Scholar 

  7. Khan, N., McCane, B., Mills, S.: Feature set reduction for image matching in large scale environments. In: Proceedings of the 27th Conference on Image and Vision Computing New Zealand, IVCNZ 2012, pp. 67–72. ACM, New York (2012). http://doi.acm.org/10.1145/2425836.2425852

  8. Klein, G., Murray, D.: Parallel tracking and mapping for small AR workspaces. In: Proceedings of the Sixth IEEE and ACM International Symposium on Mixed and Augmented Reality (ISMAR’07), Nara, Japan, November 2007

    Google Scholar 

  9. Leutenegger, S., Chli, M., Siegwart, R.: BRISK: binary robust invariant scalable keypoints. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 2548–2555, November 2011

    Google Scholar 

  10. Lowe, D.G.: Distinctive image features from scale-invariant keypoints, vol. 60, pp. 91–110. Kluwer Academic Publishers, Hingham, November 2004. http://dx.doi.org/10.1023/B:VISI.0000029664.99615.94

  11. Rosten, E., Drummond, T.W.: Machine learning for high-speed corner detection. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006, Part I. LNCS, vol. 3951, pp. 430–443. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  12. Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to sift or surf. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 2564–2571, November 2011

    Google Scholar 

  13. Silpa-Anan, C., Hartley, R.: Optimised KD-trees for fast image descriptor matching. In: IEEE Conference on Computer Vision and Pattern Recognition, 2008. CVPR 2008, pp. 1–8, June 2008

    Google Scholar 

  14. Snavely, N., Seitz, S.M., Szeliski, R.: Photo tourism: exploring photo collections in 3D. ACM Trans. Graph. 25(3), 835–846 (2006)

    Article  Google Scholar 

  15. Stefan, W.: Random-forests (2012). https://github.com/stefan-w/random-forests

  16. Strecha, C., von Hansen, W., Van Gool, L., Fua, P., Thoennessen, U.: On benchmarking camera calibration and multi-view stereo for high resolution imagery. In: IEEE Conference on Computer Vision and Pattern Recognition, 2008. CVPR 2008, pp. 1–8, June 2008

    Google Scholar 

  17. Sweeney, C.: Theia Multiview Geometry Library: Tutorial & Reference. University of California Santa Barbara

    Google Scholar 

  18. Triggs, B., McLauchlan, P.F., Hartley, R.I., Fitzgibbon, A.W.: Bundle adjustment – a modern synthesis. In: Triggs, B., Zisserman, A., Szeliski, R. (eds.) ICCV-WS 1999. LNCS, vol. 1883, pp. 298–372. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  19. Vedaldi, A., Fulkerson, B.: VLFeat: An open and portable library of computer vision algorithms (2008). http://www.vlfeat.org

  20. Zhang, G., Vela, P.A.: Good features to track for visual SLAM, June 2015

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jonas Scheer .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Scheer, J., Fritz, M., Grau, O. (2016). Learning to Select Long-Track Features for Structure-From-Motion and Visual SLAM. In: Rosenhahn, B., Andres, B. (eds) Pattern Recognition. GCPR 2016. Lecture Notes in Computer Science(), vol 9796. Springer, Cham. https://doi.org/10.1007/978-3-319-45886-1_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-45886-1_33

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-45885-4

  • Online ISBN: 978-3-319-45886-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics