Multi-frame super-resolution reconstruction based on global motion estimation using a novel CNN descriptor

Gao, Hong-xia; Xie, Wang; Kang, Hui; Lin, Guo-yuan

doi:10.1007/s11801-019-8208-0

Multi-frame super-resolution reconstruction based on global motion estimation using a novel CNN descriptor

Published: 21 November 2019

Volume 15, pages 468–475, (2019)
Cite this article

Optoelectronics Letters Aims and scope Submit manuscript

Hong-xia Gao (高红霞)¹,
Wang Xie (谢旺)¹,
Hui Kang (康慧)² &
…
Guo-yuan Lin (林国远)¹

72 Accesses
2 Citations
3 Altmetric
Explore all metrics

Abstract

In this paper, we introduce a novel feature descriptor based on deep learning that trains a model to match the patches of images on scenes captured under different viewpoints and lighting conditions for Multi-frame super-resolution. The patch matching of images capturing the same scene in varied circumstances and diverse manners is challenging. We develop a model which maps the raw image patch to a low dimensional feature vector. As our experiments show, the proposed approach is much better than state-of-the-art descriptors and can be considered as a direct replacement of SURF. The results confirm that these techniques further improve the performance of the proposed descriptor. Then we propose an improved Random Sample Consensus algorithm for removing false matching points. Finally, we show that our neural network based image descriptor for image patch matching outperforms state-of-the-art methods on a number of benchmark datasets and can be used for image registration with high quality in multi-frame super-resolution reconstruction.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Image Registration Based on Patch Matching Using a Novel Convolutional Descriptor

GeoDesc: Learning Local Descriptors by Integrating Geometry Constraints

Robust dense correspondence using deep convolutional features

Article 09 May 2019

References

Hyde R. Eyeglass, SPIE 4849, 28 (2002).
ADS Google Scholar
Bay H., Ess A., Tuytelaars T. and Van Gool L., Computer Vision and image Understanding 110, 346 (2008).
Article Google Scholar
Brown M., Hua G. and Winder S., IEEE Transactions on Pattern Analysis and Machine Intelligence 33, 43 (2010).
Article Google Scholar
Trzcinski T., Christoudias M. and Lepetit V., IEEE Transactions on Pattern Analysis and Machine Intelligence 37, 597 (2015).
Article Google Scholar
Trzcinski T., Christoudias M., Fua P. and Lepetit, V., Boosting Binary Key-Point Descriptors, IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2874 (2013).
Google Scholar
Russakovsky O., Deng J., Su H., Krause J., Satheesh S., Ma S., Huang Z., Karpathy A., Khosla A., Bernstein M., Berg A. and Fei-Fei L., International Journal Of Computer Vision 115, 211 (2015).
Article MathSciNet Google Scholar
Fischer P., Dosovitskiy A. and Brox T., Descriptor Matching with Convolutional Neural Networks: a Comparison to SIFT, arXiv:1405.5769, 2014.
Google Scholar
Simo-Serra E., Trulls E., Ferraz L., Kokkinos I., Fua P. and Moreno-Noguer F., Discriminative Learning of Deep Convolutional Feature Point Descriptors, IEEE International Conference on Computer Vision, 118 (2015).
Google Scholar
Han X., Leung T., Jia Y., Sukthankar R. and Berg A., Matchnet: Unifying Feature and Metric Learning for Patch-Based Matching, IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 3279 (2015).
Google Scholar
Yi K., Trulls E., Lepetit V. and Fua P., LIFT: Learned Invariant Feature Transform, European Conference Computer Vision, 467 (2016).
Google Scholar
Tian Y., Fan B., Wu F., L2-Net: Deep Learning of Discriminative Patch Descriptor in Euclidean Space, IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 6128 (2017).
Google Scholar
Chen M., Wang C. and Qin H., Computer Aided Geometric Design 62, 192 (2018).
Article MathSciNet Google Scholar
Brown L., ACM Computing Surveys 24, 325 (1992).
Article Google Scholar
Zitova B. and Flusser J., Image and Vision Computing 21, 977 (2003).
Article Google Scholar
Lucas B. and Kanade T., An Iterative Image Registration Technique with an Application to Stereo Vision, The 7th International Joint Conference on Artificial Intelligence, 674 (1981).
Google Scholar
Harris C. and Stephens M., A Combined Corner and Edge Detector, The 4th Alvey Vision Conference, 10 (1988).
Google Scholar
Lowe D., International Journal of Computer Vision 60, 91 (2004).
Article Google Scholar
Keren D., Peleg S. and Brada R., Image Sequence Enhancement Using Subpixel Displacements. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 742 (1988).
Google Scholar
Irani M. and Peleg S., CVGIP: Graphical Models & Image Processing 53, 231 (1991).
Google Scholar
Schultz R. and Stevenson R., IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society 5, 996 (1996).
Article Google Scholar
Baker S. and Kanade T., IEEE Transactions on Pattern Analysis and Machine Intelligence 24, 1167 (2002).
Article Google Scholar
Liao R., Tao X., Li, R., Video Super-Resolution via Deep Draft-Ensemble Learning, IEEE International Conference on Computer Vision, 531 (2015).
Google Scholar
Kappeler A., Yoo S., Dai Q. and Katsaggelos A., IEEE Transactions on Computational Imaging 2, 109 (2016).
Article MathSciNet Google Scholar
Caballero J., Ledig C., Aitken A., Acosta A., Totz J., Wang Z. and Shi W., Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation, IEEE Computer Vision and Pattern Recognition, 2848 (2017).
Google Scholar
Tao X., Gao H., Liao R., Wang J. and Jia J., Detail-Revealing Deep Video Super-Resolution, IEEE International Conference on Computer Vision, 4482 (2017).
Google Scholar
Ren S., He K., Girshick R. and Sun J., IEEE Transactions on Pattern Analysis and Machine Intelligence 39, 1137 (2017).
Article Google Scholar
Fischler M. and Bolles R., Communications of the ACM 24, 381 (1981).
Article MathSciNet Google Scholar
Verdie Y., Yi K., Fua P. and Lepetit V., TILDE: A Temporally Invariant Learned Detector, IEEE Conference on Computer Vision and Pattern Recognition, 5279 (2015).
Google Scholar
Strecha C., Hansen W., Van Gool L., Fua P. and Thoennessen, U., On Benchmarking Camera Calibration and Multi-View Stereo for High Resolution Imagery, IEEE Conference on Computer Vision and Pattern Recognition, 1 (2008).
Google Scholar
Rublee E., Rabaud V., Konolidge K. and Bradski G., ORB: An Efficient Alternative to SIFT or SURF, International Conference on Computer Vision, 2564 (2011).
Google Scholar
Balntas V., Johns E., Tang L. and Mikolajczyk K., PN-Net: Conjoined Triple Deep Network for Learning Local Image Descriptors, arXiv:1601.05030, 2016.
Google Scholar
Han X., Leung T., Jia Y., Sukthankar R. and Berg A., MatchNet: Unifying Feature and Metric Learning for Patch-Based Matching, IEEE Conference on Computer Vision and Pattern Recognition, 3279 (2015).
Google Scholar
Wang Z., Bovik A., Sheikh H. and Simoncelli E., IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society 13, 600 (2004).
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Automation Science and Engineering, South China University of Technology, Guangzhou, 510640, China
Hong-xia Gao (高红霞), Wang Xie (谢旺) & Guo-yuan Lin (林国远)
Guangdong Polytechnic Normal University, Guangzhou, 510665, China
Hui Kang (康慧)

Authors

Hong-xia Gao (高红霞)
View author publications
You can also search for this author in PubMed Google Scholar
Wang Xie (谢旺)
View author publications
You can also search for this author in PubMed Google Scholar
Hui Kang (康慧)
View author publications
You can also search for this author in PubMed Google Scholar
Guo-yuan Lin (林国远)
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hui Kang (康慧).

Additional information

This work has been supported by the National Natural Science Foundation of China (No.61603105), the Fundamental Research Funds for the Central Universities (No.2015ZM128), and the Science and Technology Program of Guangzhou in China (Nos.201707010054 and 201704030072). This paper was presented in part at the Chinese Conference on Pattern Recognition and Computer Vision, Guangzhou, 2018. This paper was recommended by the program committee.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gao, Hx., Xie, W., Kang, H. et al. Multi-frame super-resolution reconstruction based on global motion estimation using a novel CNN descriptor. Optoelectron. Lett. 15, 468–475 (2019). https://doi.org/10.1007/s11801-019-8208-0

Download citation

Received: 31 December 2018
Revised: 20 March 2019
Published: 21 November 2019
Issue Date: November 2019
DOI: https://doi.org/10.1007/s11801-019-8208-0

Document code

A

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multi-frame super-resolution reconstruction based on global motion estimation using a novel CNN descriptor

Abstract

Access this article

Similar content being viewed by others

Image Registration Based on Patch Matching Using a Novel Convolutional Descriptor

GeoDesc: Learning Local Descriptors by Integrating Geometry Constraints

Robust dense correspondence using deep convolutional features

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Document code

Navigation

Multi-frame super-resolution reconstruction based on global motion estimation using a novel CNN descriptor

Abstract

Access this article

Similar content being viewed by others

Image Registration Based on Patch Matching Using a Novel Convolutional Descriptor

GeoDesc: Learning Local Descriptors by Integrating Geometry Constraints

Robust dense correspondence using deep convolutional features

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Document code

Search

Navigation