Advertisement

Analysis of Basic-SegNet Architecture with Variations in Training Options

  • Ganesh R. PadalkarEmail author
  • Madhuri B. Khambete
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 940)

Abstract

Deep learning techniques are becoming popular for vision-based automation applications. Recently, various deep convolutional neural network architectures have been evolved for image classification, object detection and semantic image segmentation tasks. SegNet is one of the successful encoder-decoder convolution architectures, implemented for semantic image segmentation. We simulated Basic-SegNet architecture using MATLAB R2017b. SegNet architecture is built layer by layer without using any pre-trained model. Multi-class images from Pattern Analysis, Statistical Modelling and Computational Learning Visual Object Classes 2012 database, are used to train the architecture. The segmentation results obtained on test images are evaluated by calculating accuracy, intersection of union, boundary F1 measure and execution time. These evaluation parameters are computed over database as well as for individual object class. Training options like learning rate and its schedule, filter size, number of filters and number of epochs are varied to analyze their effects on the performance of architecture. This research work is focused on analysis of Basic-SegNet architecture with variations in training options.

Keywords

CONV-Convolution layer ReLu-Rectified linear units Pool-max pooling layer Un-pool- Un-pooling layer DAG-Directional acyclic graph 

References

  1. 1.
    Farabet, C., Couprie, C., Najman, L., LeCun, Y.: Learning hierarchical features for scene labelling. IEEE Trans. Pattern Anal. Mach. Intell. 35(8), 1915–1929 (2013)CrossRefGoogle Scholar
  2. 2.
    Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)Google Scholar
  3. 3.
    Yosinski, J., Clune, J., Bengio, Y., Lipson, H.: How transferable are features in deep neural networks? In: Advances in Neural Information Processing Systems, pp. 3320–3328 (2014)Google Scholar
  4. 4.
    Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440 (2015)Google Scholar
  5. 5.
    Badrinarayanan, V., Kendall, A., Cipolla, R.: Segnet: a deep convolutional encoder-decoder architecture for image segmentation, arXiv preprint arXiv:1511.00561 (2015)
  6. 6.
    Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition, arXiv preprint arXiv:1409.1556 (2014)
  7. 7.
    Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015)Google Scholar
  8. 8.
    He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)Google Scholar
  9. 9.
    Graves, A., Fernández, S., Schmidhuber, J.: Multidimensional recurrent neural networks, CoRR, vol. abs/0705.2011 (2007)Google Scholar
  10. 10.
    Chen, L.-C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: Semantic image segmentation with deep convolutional nets and fully connected crfs, arXiv preprint arXiv:1412.7062 (2014)
  11. 11.
    Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs, arXiv preprint arXiv:1606.00915 (2016)
  12. 12.
    Zheng, S., Jayasumana, S., Romera-Paredes, B., Vineet, V., Su, Z., Du, D., Huang, C., Torr, P.H.: Conditional random fields as recurrent neural networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1529–1537 (2015)Google Scholar
  13. 13.
    Visin, F., Ciccone, M., Romero, A., Kastner, K., Cho, K., Bengio, Y., Matteucci, M., Courville, A.: Reseg: A recurrent neural network-based model for semantic segmentation. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, June 2016Google Scholar
  14. 14.
    Byeon, W., Breuel, T.M., Raue, F., Liwicki, M.: Scene labeling with LSTM recurrent neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3547–3555 (2015)Google Scholar
  15. 15.
    Pinheiro, P.H., Collobert, R.: Recurrent convolutional neural networks for scene labelling. In: ICML, pp. 82–90 (2014)Google Scholar
  16. 16.
    Yu, F., Koltun, V.: Multi-scale context aggregation by dilated convolutions, arXiv reprint arXiv:1511.07122 (2015)
  17. 17.

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  1. 1.MKSSS Cummins College of Engineering for WomenPuneIndia

Personalised recommendations