A Mask R-CNN Model with Improved Region Proposal Network for Medical Ultrasound Image

  • Jun Liu
  • PengFei Li
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10955)


In medical ultrasound image processing, a region of interest (ROI) often has to be selected before segmentation to obtain good segmentation accuracy. With the development of deep learning, object detection can select the ROI automatically. Models that combine object detection with image segmentation have also been proposed, such as Mask R-CNN, an end-to-end image segmentation model. However, the ROIs selected by Mask R-CNN do not meet the needs of medical image segmentation, because its region proposal network (RPN) layer is inherited from Faster R-CNN, an object detection framework. Segmentation needs a region that covers the whole object area, including the details of its edges, and this information strongly influences the subsequent segmentation. This paper therefore improves the anchor selection criteria in the RPN layer so that the improved RPN layer is better suited to image segmentation tasks. Experimental results show that, with appropriately selected parameters, the improved model achieves higher segmentation accuracy.
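
The abstract does not state the modified anchor criterion, so the following is only an illustrative sketch of the kind of change it describes, not the authors' method. It labels an anchor with the intersection-over-union (IoU) thresholds used in Faster R-CNN (0.7 for positive, 0.3 for negative; the rule that also marks the highest-IoU anchor per object as positive is omitted for brevity). The `coverage` function and the `min_coverage` parameter are hypothetical additions: they require a positive anchor to contain most of the ground-truth box, which is one possible way to favor regions that keep the object's edges.

```python
# Boxes are (x1, y1, x2, y2) tuples in pixel coordinates.

def box_area(box):
    """Area of a box; degenerate boxes have zero area."""
    x1, y1, x2, y2 = box
    return max(0.0, x2 - x1) * max(0.0, y2 - y1)


def intersection_area(a, b):
    """Area of the overlap between boxes a and b."""
    overlap = (max(a[0], b[0]), max(a[1], b[1]),
               min(a[2], b[2]), min(a[3], b[3]))
    return box_area(overlap)


def iou(anchor, gt):
    """Standard intersection-over-union between an anchor and a ground-truth box."""
    inter = intersection_area(anchor, gt)
    union = box_area(anchor) + box_area(gt) - inter
    return inter / union if union > 0 else 0.0


def coverage(anchor, gt):
    """Hypothetical criterion: fraction of the ground-truth box inside the anchor."""
    gt_area = box_area(gt)
    return intersection_area(anchor, gt) / gt_area if gt_area > 0 else 0.0


def label_anchor(anchor, gt, iou_pos=0.7, iou_neg=0.3, min_coverage=None):
    """Return 1 (positive), 0 (negative), or -1 (ignored during RPN training).

    With min_coverage=None this is the plain IoU rule. Setting min_coverage
    is an assumption made for illustration: a high-IoU anchor that cuts off
    part of the object is then ignored instead of being used as a positive.
    """
    score = iou(anchor, gt)
    if score < iou_neg:
        return 0
    if score >= iou_pos and (min_coverage is None
                             or coverage(anchor, gt) >= min_coverage):
        return 1
    return -1
```

For a ground-truth box `(10, 10, 50, 50)`, the anchor `(12, 12, 48, 48)` has an IoU of 0.81 and is positive under the plain rule even though it clips every edge of the object. With `min_coverage=0.95` it is ignored, while the slightly larger anchor `(8, 8, 52, 52)`, which contains the whole object, remains positive.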


Keywords: Image segmentation, Deep learning, Mask R-CNN, Ultrasound image, Machine learning



This work was supported by the National Natural Science Foundation of China (Grant Nos. 31201121, 61373109 and 61403287), the Natural Science Foundation of Hubei Province (Grant No. 2014CFB288) and the Open Foundation of the Hubei Province Key Laboratory of Intelligent Information Processing and Real-time Industrial System (Grant Nos. ZNSS2013A0001 and ZNSS2013A004).



Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  1. College of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan, China
  2. Hubei Province Key Laboratory of Intelligent Information Processing and Real-Time Industrial System, Wuhan, China
