Skip to main content
Log in

Multi-frame super-resolution reconstruction based on global motion estimation using a novel CNN descriptor

  • Published:
Optoelectronics Letters Aims and scope Submit manuscript

Abstract

In this paper, we introduce a novel feature descriptor based on deep learning that trains a model to match the patches of images on scenes captured under different viewpoints and lighting conditions for Multi-frame super-resolution. The patch matching of images capturing the same scene in varied circumstances and diverse manners is challenging. We develop a model which maps the raw image patch to a low dimensional feature vector. As our experiments show, the proposed approach is much better than state-of-the-art descriptors and can be considered as a direct replacement of SURF. The results confirm that these techniques further improve the performance of the proposed descriptor. Then we propose an improved Random Sample Consensus algorithm for removing false matching points. Finally, we show that our neural network based image descriptor for image patch matching outperforms state-of-the-art methods on a number of benchmark datasets and can be used for image registration with high quality in multi-frame super-resolution reconstruction.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Hyde R. Eyeglass, SPIE 4849, 28 (2002).

    ADS  Google Scholar 

  2. Bay H., Ess A., Tuytelaars T. and Van Gool L., Computer Vision and image Understanding 110, 346 (2008).

    Article  Google Scholar 

  3. Brown M., Hua G. and Winder S., IEEE Transactions on Pattern Analysis and Machine Intelligence 33, 43 (2010).

    Article  Google Scholar 

  4. Trzcinski T., Christoudias M. and Lepetit V., IEEE Transactions on Pattern Analysis and Machine Intelligence 37, 597 (2015).

    Article  Google Scholar 

  5. Trzcinski T., Christoudias M., Fua P. and Lepetit, V., Boosting Binary Key-Point Descriptors, IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2874 (2013).

    Google Scholar 

  6. Russakovsky O., Deng J., Su H., Krause J., Satheesh S., Ma S., Huang Z., Karpathy A., Khosla A., Bernstein M., Berg A. and Fei-Fei L., International Journal Of Computer Vision 115, 211 (2015).

    Article  MathSciNet  Google Scholar 

  7. Fischer P., Dosovitskiy A. and Brox T., Descriptor Matching with Convolutional Neural Networks: a Comparison to SIFT, arXiv:1405.5769, 2014.

    Google Scholar 

  8. Simo-Serra E., Trulls E., Ferraz L., Kokkinos I., Fua P. and Moreno-Noguer F., Discriminative Learning of Deep Convolutional Feature Point Descriptors, IEEE International Conference on Computer Vision, 118 (2015).

    Google Scholar 

  9. Han X., Leung T., Jia Y., Sukthankar R. and Berg A., Matchnet: Unifying Feature and Metric Learning for Patch-Based Matching, IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 3279 (2015).

    Google Scholar 

  10. Yi K., Trulls E., Lepetit V. and Fua P., LIFT: Learned Invariant Feature Transform, European Conference Computer Vision, 467 (2016).

    Google Scholar 

  11. Tian Y., Fan B., Wu F., L2-Net: Deep Learning of Discriminative Patch Descriptor in Euclidean Space, IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 6128 (2017).

    Google Scholar 

  12. Chen M., Wang C. and Qin H., Computer Aided Geometric Design 62, 192 (2018).

    Article  MathSciNet  Google Scholar 

  13. Brown L., ACM Computing Surveys 24, 325 (1992).

    Article  Google Scholar 

  14. Zitova B. and Flusser J., Image and Vision Computing 21, 977 (2003).

    Article  Google Scholar 

  15. Lucas B. and Kanade T., An Iterative Image Registration Technique with an Application to Stereo Vision, The 7th International Joint Conference on Artificial Intelligence, 674 (1981).

    Google Scholar 

  16. Harris C. and Stephens M., A Combined Corner and Edge Detector, The 4th Alvey Vision Conference, 10 (1988).

    Google Scholar 

  17. Lowe D., International Journal of Computer Vision 60, 91 (2004).

    Article  Google Scholar 

  18. Keren D., Peleg S. and Brada R., Image Sequence Enhancement Using Subpixel Displacements. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 742 (1988).

    Google Scholar 

  19. Irani M. and Peleg S., CVGIP: Graphical Models & Image Processing 53, 231 (1991).

    Google Scholar 

  20. Schultz R. and Stevenson R., IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society 5, 996 (1996).

    Article  Google Scholar 

  21. Baker S. and Kanade T., IEEE Transactions on Pattern Analysis and Machine Intelligence 24, 1167 (2002).

    Article  Google Scholar 

  22. Liao R., Tao X., Li, R., Video Super-Resolution via Deep Draft-Ensemble Learning, IEEE International Conference on Computer Vision, 531 (2015).

    Google Scholar 

  23. Kappeler A., Yoo S., Dai Q. and Katsaggelos A., IEEE Transactions on Computational Imaging 2, 109 (2016).

    Article  MathSciNet  Google Scholar 

  24. Caballero J., Ledig C., Aitken A., Acosta A., Totz J., Wang Z. and Shi W., Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation, IEEE Computer Vision and Pattern Recognition, 2848 (2017).

    Google Scholar 

  25. Tao X., Gao H., Liao R., Wang J. and Jia J., Detail-Revealing Deep Video Super-Resolution, IEEE International Conference on Computer Vision, 4482 (2017).

    Google Scholar 

  26. Ren S., He K., Girshick R. and Sun J., IEEE Transactions on Pattern Analysis and Machine Intelligence 39, 1137 (2017).

    Article  Google Scholar 

  27. Fischler M. and Bolles R., Communications of the ACM 24, 381 (1981).

    Article  MathSciNet  Google Scholar 

  28. Verdie Y., Yi K., Fua P. and Lepetit V., TILDE: A Temporally Invariant Learned Detector, IEEE Conference on Computer Vision and Pattern Recognition, 5279 (2015).

    Google Scholar 

  29. Strecha C., Hansen W., Van Gool L., Fua P. and Thoennessen, U., On Benchmarking Camera Calibration and Multi-View Stereo for High Resolution Imagery, IEEE Conference on Computer Vision and Pattern Recognition, 1 (2008).

    Google Scholar 

  30. Rublee E., Rabaud V., Konolidge K. and Bradski G., ORB: An Efficient Alternative to SIFT or SURF, International Conference on Computer Vision, 2564 (2011).

    Google Scholar 

  31. Balntas V., Johns E., Tang L. and Mikolajczyk K., PN-Net: Conjoined Triple Deep Network for Learning Local Image Descriptors, arXiv:1601.05030, 2016.

    Google Scholar 

  32. Han X., Leung T., Jia Y., Sukthankar R. and Berg A., MatchNet: Unifying Feature and Metric Learning for Patch-Based Matching, IEEE Conference on Computer Vision and Pattern Recognition, 3279 (2015).

    Google Scholar 

  33. Wang Z., Bovik A., Sheikh H. and Simoncelli E., IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society 13, 600 (2004).

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hui Kang  (康慧).

Additional information

This work has been supported by the National Natural Science Foundation of China (No.61603105), the Fundamental Research Funds for the Central Universities (No.2015ZM128), and the Science and Technology Program of Guangzhou in China (Nos.201707010054 and 201704030072). This paper was presented in part at the Chinese Conference on Pattern Recognition and Computer Vision, Guangzhou, 2018. This paper was recommended by the program committee.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Gao, Hx., Xie, W., Kang, H. et al. Multi-frame super-resolution reconstruction based on global motion estimation using a novel CNN descriptor. Optoelectron. Lett. 15, 468–475 (2019). https://doi.org/10.1007/s11801-019-8208-0

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11801-019-8208-0

Document code

Navigation