Learning to Select Long-Track Features for Structure-From-Motion and Visual SLAM

Scheer, Jonas; Fritz, Mario; Grau, Oliver

doi:10.1007/978-3-319-45886-1_33


Part of the book series: Lecture Notes in Computer Science (LNIP, volume 9796)

Included in the following conference series:

German Conference on Pattern Recognition

Abstract

With the emergence of augmented reality platforms, Structure-From-Motion and visual SLAM approaches have regained importance as a means of delivering the next generation of immersive 3D experiences. As these approaches are now deployed on mobile devices, computational efficiency plays an important role. In this work, we aim to reduce complexity by limiting the number of features without sacrificing quality. We select a subset of image features using a learning-based approach: a random forest is trained to pick 2D image features that are likely to be significant for a 3D reconstruction. Additionally, we aim for an objective that selects long-track features, so that they can be “re-used” in multiple frames. We evaluate our feature selection technique on real-world sequences and show a significant reduction in the number of image features; the resulting decrease in computation time does not affect the accuracy of the 3D reconstruction.
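The selection idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes each 2D feature is described by a small numeric vector, that a feature counts as "long-track" when its track spans at least `min_len` frames, and that the forest consists of depth-1 trees (stumps). All names, the threshold, and the toy data are hypothetical and chosen only to show the flow from track-length labels to a ranked feature subset.

```python
import random

def label_long_tracks(track_lengths, min_len=3):
    """Label a feature 1 if its track spans at least min_len frames (assumed threshold)."""
    return [1 if n >= min_len else 0 for n in track_lengths]

def gini(labels):
    """Gini impurity of a list of 0/1 labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2.0 * p * (1.0 - p)

def fit_stump(X, y, rng):
    """Fit a depth-1 tree on one randomly chosen descriptor dimension."""
    dim = rng.randrange(len(X[0]))
    best = None
    for t in sorted({row[dim] for row in X}):
        left = [lab for row, lab in zip(X, y) if row[dim] <= t]
        right = [lab for row, lab in zip(X, y) if row[dim] > t]
        impurity = len(left) * gini(left) + len(right) * gini(right)
        if best is None or impurity < best[0]:
            p_left = sum(left) / len(left)  # left always holds the row with value t
            p_right = sum(right) / len(right) if right else p_left
            best = (impurity, t, p_left, p_right)
    return (dim,) + best[1:]

def fit_forest(X, y, n_trees=25, seed=0):
    """Train each stump on a bootstrap sample of the training features."""
    rng = random.Random(seed)
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(X)) for _ in X]
        forest.append(fit_stump([X[i] for i in idx], [y[i] for i in idx], rng))
    return forest

def predict_score(forest, row):
    """Average, over all stumps, of the predicted long-track probability."""
    votes = [p_l if row[dim] <= t else p_r for dim, t, p_l, p_r in forest]
    return sum(votes) / len(votes)

def select_features(forest, X, keep):
    """Return the indices of the `keep` highest-scoring features."""
    ranked = sorted(range(len(X)), key=lambda i: predict_score(forest, X[i]), reverse=True)
    return ranked[:keep]

# Toy data (hypothetical): one 2-D descriptor and one observed track length per feature.
descriptors = [[0.9, 0.8], [0.2, 0.1], [0.7, 0.9], [0.1, 0.3], [0.8, 0.7], [0.3, 0.2]]
track_lengths = [5, 1, 6, 2, 4, 1]

labels = label_long_tracks(track_lengths)
forest = fit_forest(descriptors, labels)
scores = [predict_score(forest, d) for d in descriptors]
selected = select_features(forest, descriptors, keep=3)
print(selected)
```

In a reconstruction pipeline, only the features at the `selected` indices would be passed on to matching and bundle adjustment. A practical system would likely use deeper trees and train on features from separate sequences rather than scoring the training set itself.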

Notes

1. We acknowledge Hartmann et al. for providing their PARK datasets.

References

Breiman, L., Friedman, J., Olshen, R., Stone, C.: Classification and Regression Trees. Wadsworth and Brooks, Monterey (1984)
Gionis, A., Indyk, P., Motwani, R.: Similarity search in high dimensions via hashing. In: Proceedings of the 25th International Conference on Very Large Data Bases. VLDB 1999, pp. 518–529. Morgan Kaufmann Publishers Inc., San Francisco (1999). http://dl.acm.org/citation.cfm?id=645925.671516
Hartley, R.I., Zisserman, A.: Multiple View Geometry in Computer Vision, 2nd edn. Cambridge University Press, Cambridge (2004). ISBN: 0521540518
Hartmann, W., Havlena, M., Schindler, K.: Predicting matchability. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 9–16, June 2014
Hauagge, D., Snavely, N.: Image matching using local symmetry features. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 206–213, June 2012
Kalman, R.: On the general theory of control systems. IRE Trans. Autom. Control 4(3), 110–110 (1959)
Khan, N., McCane, B., Mills, S.: Feature set reduction for image matching in large scale environments. In: Proceedings of the 27th Conference on Image and Vision Computing New Zealand, IVCNZ 2012, pp. 67–72. ACM, New York (2012). http://doi.acm.org/10.1145/2425836.2425852
Klein, G., Murray, D.: Parallel tracking and mapping for small AR workspaces. In: Proceedings of the Sixth IEEE and ACM International Symposium on Mixed and Augmented Reality (ISMAR’07), Nara, Japan, November 2007
Leutenegger, S., Chli, M., Siegwart, R.: BRISK: binary robust invariant scalable keypoints. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 2548–2555, November 2011
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004). http://dx.doi.org/10.1023/B:VISI.0000029664.99615.94
Rosten, E., Drummond, T.W.: Machine learning for high-speed corner detection. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006, Part I. LNCS, vol. 3951, pp. 430–443. Springer, Heidelberg (2006)
Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 2564–2571, November 2011
Silpa-Anan, C., Hartley, R.: Optimised KD-trees for fast image descriptor matching. In: IEEE Conference on Computer Vision and Pattern Recognition, 2008. CVPR 2008, pp. 1–8, June 2008
Snavely, N., Seitz, S.M., Szeliski, R.: Photo tourism: exploring photo collections in 3D. ACM Trans. Graph. 25(3), 835–846 (2006)
Stefan, W.: Random-forests (2012). https://github.com/stefan-w/random-forests
Strecha, C., von Hansen, W., Van Gool, L., Fua, P., Thoennessen, U.: On benchmarking camera calibration and multi-view stereo for high resolution imagery. In: IEEE Conference on Computer Vision and Pattern Recognition, 2008. CVPR 2008, pp. 1–8, June 2008
Sweeney, C.: Theia Multiview Geometry Library: Tutorial & Reference. University of California Santa Barbara
Triggs, B., McLauchlan, P.F., Hartley, R.I., Fitzgibbon, A.W.: Bundle adjustment – a modern synthesis. In: Triggs, B., Zisserman, A., Szeliski, R. (eds.) ICCV-WS 1999. LNCS, vol. 1883, pp. 298–372. Springer, Heidelberg (2000)
Vedaldi, A., Fulkerson, B.: VLFeat: An open and portable library of computer vision algorithms (2008). http://www.vlfeat.org
Zhang, G., Vela, P.A.: Good features to track for visual SLAM, June 2015

Author information

Authors and Affiliations

Intel Visual Computing Institute, Saarbrücken, Germany
Jonas Scheer & Oliver Grau
Max-Planck Institute for Informatics, Saarbrücken, Germany
Mario Fritz

Corresponding author

Correspondence to Jonas Scheer.

Editor information

Editors and Affiliations

University of Hannover, Hannover, Germany
Bodo Rosenhahn
Max Planck Institute for Informatics, Saarbrücken, Germany
Bjoern Andres

Cite this paper

Scheer, J., Fritz, M., Grau, O. (2016). Learning to Select Long-Track Features for Structure-From-Motion and Visual SLAM. In: Rosenhahn, B., Andres, B. (eds) Pattern Recognition. GCPR 2016. Lecture Notes in Computer Science, vol 9796. Springer, Cham. https://doi.org/10.1007/978-3-319-45886-1_33

DOI: https://doi.org/10.1007/978-3-319-45886-1_33
Published: 27 August 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-45885-4
Online ISBN: 978-3-319-45886-1
eBook Packages: Computer Science, Computer Science (R0)
