Abstract
Visual odometry is a popular area of computer vision that has seen a paradigm shift towards direct methods, where whole image alignment is used to determine camera poses. Current methods not robust to lighting changes in the scene and rely on standard feature-based methods for multi-camera systems. Binary descriptors are an option for alleviating both problems, but current methods do not scale well to larger and more robust descriptors. We present a method for performing direct tracking with binary descriptors of any size by approximating the gradient and descent direction with Hamming weights. We also present alternative methods that approximate the entire descriptor by its Hamming weights. Our results show improved accuracy compared to tracking on intensity alone, and our primary method improves significantly upon similar methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Engel, J., Schöps, T., Cremers, D.: LSD-SLAM: large-scale direct monocular SLAM. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8690, pp. 834–849. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10605-2_54
Mur-Artal, R., Tardós, J.D.: ORB-SLAM2: an open-source SLAM system for monocular, stereo, and RGB-D cameras. IEEE Trans. Rob. 33(5), 1255–1262 (2017)
Forster, C., et al.: SVO: semidirect visual odometry for monocular and multicamera systems. IEEE Trans. Rob. 33(2), 249–265 (2017)
Engel, J., Koltun, V., Cremers, D.: Direct sparse odometry. IEEE Trans. Pattern Anal. Mach. Intell. 40(3), 611–625 (2018)
Torr, P.H.S., Zisserman, A.: Feature based methods for structure and motion estimation. In: Triggs, B., Zisserman, A., Szeliski, R. (eds.) IWVA 1999. LNCS, vol. 1883, pp. 278–294. Springer, Heidelberg (2000). https://doi.org/10.1007/3-540-44480-7_19
Davison, A.J., et al.: MonoSLAM: real-time single camera SLAM. IEEE Trans. Pattern Anal. Mach. Intell. 29(6), 1052–1067 (2007)
Lucas, B.D., Kanade, T.: An iterative image registration technique with an application to stereo vision, p. 674 (1981)
Premaratne, P., Ajaz, S., Premaratne, M.: Hand gesture tracking and recognition system for control of consumer electronics. In: Huang, D.-S., Gan, Y., Gupta, P., Gromiha, M.M. (eds.) ICIC 2011. LNCS (LNAI), vol. 6839, pp. 588–593. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-25944-9_76
Premaratne, P., Ajaz, S., Premaratne, M.: Hand gesture tracking and recognition system using Lucas-Kanade algorithms for control of consumer electronics. Neurocomputing 116, 242–249 (2013)
Baker, S., Matthews, I.: Lucas-kanade 20 years on: a unifying framework. Int. J. Comput. Vis. 56(3), 221–255 (2004)
Newcombe, R.A., Lovegrove, S.J., Davison, A.J.: DTAM: dense tracking and mapping in real-time. In: 2011 International Conference on Computer Vision. IEEE (2011)
Demetz, O.: Feature invariance versus change estimation in variational motion estimation. Dissertation, Saarländische Universitäts-und Landesbibliothek (2015)
Calonder, M., et al.: BRIEF: computing a local binary descriptor very fast. IEEE Trans. Pattern Anal. Mach. Intell. 34(7), 1281–1298 (2012)
Rublee, E., et al.: ORB: an efficient alternative to SIFT or SURF. In: ICCV, vol. 11. no. 1 (2011)
Leutenegger, S., Chli, M., Siegwart, R.: BRISK: binary robust invariant scalable keypoints. In: 2011 IEEE International Conference on Computer Vision (ICCV). IEEE (2011)
Liu, C., Yuen, J., Torralba, A.: Sift flow: dense correspondence across scenes and its applications. IEEE Trans. Pattern Anal. Mach. Intell. 33(5), 978–994 (2011)
Zabih, R., Woodfill, J.: Non-parametric local transforms for computing visual correspondence. In: Eklundh, J.-O. (ed.) ECCV 1994. LNCS, vol. 801, pp. 151–158. Springer, Heidelberg (1994). https://doi.org/10.1007/BFb0028345
Silveira, G., Malis, E.: Real-time visual tracking under arbitrary illumination changes. In: 2007 IEEE Conference on Computer Vision and Pattern Recognition. IEEE (2007)
Bristow, H., Lucey, S.: In defense of gradient-based alignment on densely sampled sparse features. In: Hassner, T., Liu, C. (eds.) Dense Image Correspondences for Computer Vision, pp. 135–152. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-23048-1_7
Sevilla-Lara, L., Learned-Miller, E.: Distribution fields for tracking. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition. IEEE (2012)
Sevilla-Lara, L., Sun, D., Learned-Miller, E.G., Black, M.J.: Optical flow estimation with channel constancy. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 423–438. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10590-1_28
Crivellaro, A., Lepetit, V.: Robust 3D tracking with descriptor fields. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2014)
Alismail, H., et al.: Direct visual odometry in low light using binary descriptors. IEEE Rob. Autom. Lett. 2(2), 444–451 (2017)
Park, S., Schöps, T., Pollefeys, M.: Illumination change robustness in direct visual slam. In: 2017 IEEE International Conference on Robotics and Automation (ICRA). IEEE (2017)
Peris, M., et al.: Towards a simulation driven stereo vision system. In: Proceedings of the 21st International Conference on Pattern Recognition (ICPR 2012). IEEE (2012)
Geiger, A., Lenz, P., Urtasun, R.: Are we ready for autonomous driving? The kitti vision benchmark suite. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition. IEEE (2012)
Acknowledgements
This research has been conducted with the support of the Australian Government Research Training Program Scholarship.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Halloran, B., Premaratne, P., Vial, P.J., Kadhim, I. (2019). Single and Multi-channel Direct Visual Odometry with Binary Descriptors. In: Huang, DS., Huang, ZK., Hussain, A. (eds) Intelligent Computing Methodologies. ICIC 2019. Lecture Notes in Computer Science(), vol 11645. Springer, Cham. https://doi.org/10.1007/978-3-030-26766-7_9
Download citation
DOI: https://doi.org/10.1007/978-3-030-26766-7_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-26765-0
Online ISBN: 978-3-030-26766-7
eBook Packages: Computer ScienceComputer Science (R0)