A Novel Automatic CNN Architecture Design Approach Based on Genetic Algorithm

Ahmed, Amr AbdelFatah; Darwish, Saad M. Saad; El-Sherbiny, Mohamed M.

doi:10.1007/978-3-030-31129-2_43

Amr AbdelFatah Ahmed¹⁷,
Saad M. Saad Darwish¹⁸ &
Mohamed M. El-Sherbiny¹⁹

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1058))

Included in the following conference series:

International Conference on Advanced Intelligent Systems and Informatics

2689 Accesses
9 Citations

Abstract

The deep “Convolutional Neural Networks (CNNs)” gained a grand success on a broad of computer vision tasks. However, CNN structures training consumes a massive computing resources amount. The researchers in this field are concerned on designing CNN structures to maximize the performance and accuracy. The main design methods are human hand-crafted fixed model structures and automatic generated models. We proposed an automatic CNN structure design approach based on genetic algorithm that concerned with generating light weight CNN structures. We also introduce a chromosome novel representation for the structure of CNN. Unlike existing approaches, the proposed methodology is designed to work on limited computing assets with achieving high accuracy. It utilizes advanced training methods to decrease the overhead on the computing resources that are involved in the process. Our experimental results denote the proposed model effectiveness over the related work methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Sermanet, P., Chintala, S., LeCun, Y.: Convolutional neural networks applied to house numbers digit classification. In: International Conference on Pattern Recognition, pp. 10–13. Tsukuba, Japan (2012)
Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations ICLR, pp. 1–14. San Diego, USA (2015)
Google Scholar
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: The IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9. Boston, USA (2015)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: The IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778. Las Vegas, USA (2016)
Google Scholar
Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., Adam, H.: Mobilenets: Efficient Convolutional Neural Networks for Mobile Vision Applications. arXiv preprint arXiv:1704.04861 (2017)
Iandola, F.N., Han, S., Moskewic, M.W., Ashraf, K., Dally, W.J., Keutzer, K.: Squeezenet: Alexnet-level Accuracy with 50x Fewer Parameters and < 0.5 MB Model Size. arXiv preprint arXiv:1602.07360 (2016)
Zhang, X., Zhou, X., Lin, M., Sun, J.: ShuffleNet: an extremely efficient convolutional neural network for mobile devices. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6848–6856. Utah, USA (2018)
Google Scholar
Xie, L., Yuille, A.L.: Genetic CNN. In: The International Conference on Computer Vision ICCV, pp. 1388–1397. Venice, Italy (2017)
Google Scholar
Baldominos, A., Saez, Y., Isasi, P.: Evolutionary convolutional neural networks: an application to handwriting recognition. Int. J. Neurocomput. 283, 38–52 (2018)
Article Google Scholar
Suganuma, M., Shirakawa, S., Nagao, T.: A genetic programming approach to designing convolutional neural network architectures. In: Genetic and Evolutionary Computation Conference, pp. 497–504. ACM, Berlin (2017)
Google Scholar
Zoph, B., Le, Q.V.: Neural architecture search with reinforcement learning. In: International Conference on Learning Representations ICLR, pp. 1–16. Toulon, France (2017)
Google Scholar
Zoph, B., Vasudevan, V., Shlens, J., Le, Q.V.: Learning transferable architectures for scalable image recognition. In: The IEEE Conference on Computer Vision and Pattern Recognition CVPR, pp. 8697–8710. Utah, USA (2018)
Google Scholar
Zhong, Z., Yan, J., Liu, C.L.: Practical block-wise neural network architecture generation. In: The IEEE Conference on Computer Vision and Pattern Recognition CVPR, pp. 2423–2432. Utah, USA (2018)
Google Scholar
Baker, B., Gupta, O., Naik, N., Raskar, R.: Designing neural network architectures using reinforcement learning. In: International Conference on Learning Representations ICLR, pp. 1–18. Toulon, France (2017)
Google Scholar
Cai, H., Chen, T., Zhang, W., Yu, Y., Wang, J.: Efficient architecture search by network transformation. In: International Conference on Artificial Intelligence AAAI, pp. 2787–2794. Louisiana, USA (2018)
Google Scholar
Pham, H., Guan, M., Zoph, B., Le, Q., Dean, J.: Efficient neural architecture search via parameter sharing. In: International Conference on Machine Learning, pp. 4092–4101. Stockholm, Sweden, (2018)
Google Scholar
Real, E., Aggarwal, A., Huang, Y., Le, Q.V.: Regularized Evolution for Image Classifier Architecture Search. arXiv preprint arXiv:1802.01548 (2018)
Liu, H., Simonyan, K., Yang, Y.: DARTS: differentiable architecture search. In: International Conference on Learning Representations, New Orleans (2019). https://openreview.net/forum?id=S1eYHoC5FX
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. Int. J. Mach. Learn. Res. 15, 1929–1958 (2014)
MathSciNet MATH Google Scholar
Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing ınternal covariate shift. In: The 32nd International Conference on Machine Learning, PMLR, pp. 448–456. Lille, France (2015)
Google Scholar
Yosinski, J., Clune, J., Bengio, Y., Lipson, H.: How transferable are features in deep neural networks? In: Advances in Neural Information Processing Systems, pp. 3320–3328. Quebec, Canada (2014)
Google Scholar
Zhou, Z.H.: Ensemble Methods: Foundations and Algorithms. Chapman & Hall/CRC, Boca Raton (2012)
Google Scholar
Deng, L.: The MNIST database of handwritten digit images for machine learning research. IEEE Sig. Process. Mag. 29(6), 141–142 (2012)
Article Google Scholar
Krizhevsky, A., Hinton, G.: Learning Multiple Layers of Features from Tiny Images. Technical Report, University of Toronto (2009)
Google Scholar
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: International Conference on Learning Representations ICLR, pp. 1–15. San Diego, USA (2015)
Google Scholar
Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT press, Cambridge (2016)
Google Scholar
Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., Chen, L.C.: Mobilenetv2: inverted residuals and linear bottlenecks. In: The IEEE Conference on Computer Vision and Pattern Recognition, pp. 4510–4520. Utah, USA (2018)
Google Scholar
Freeman, I., Roese-Koerner, L., Kummert, A.: Effnet: an efficient structure for convolutional neural networks. In: The 25th IEEE International Conference on Image Processing (ICIP), pp. 6–10, Athens, Greece (2018)
Google Scholar
Chen, H.Y., Su, C.Y.: An enhanced hybrid MobileNet. In: The 9th International Conference on Awareness Science and Technology (iCAST), pp. 308–312, Fukuoka, Japan (2018)
Google Scholar
Xie, S., Zheng, H., Liu, C., Lin, L.: SNAS: Stochastic Neural Architecture Search. arXiv preprint arXiv:1812.09926 (2018)

Download references

Author information

Authors and Affiliations

Department of Computer Engineering, Alexandria High Institute of Engineering and Technology, Alexandria, Egypt
Amr AbdelFatah Ahmed
Department of Information Technology, Institute of Graduate Studies and Research, Alexandria University, Alexandria, Egypt
Saad M. Saad Darwish
Department of Material Science, Institute of Graduate Studies and Research, Alexandria University, Alexandria, Egypt
Mohamed M. El-Sherbiny

Authors

Amr AbdelFatah Ahmed
View author publications
You can also search for this author in PubMed Google Scholar
Saad M. Saad Darwish
View author publications
You can also search for this author in PubMed Google Scholar
Mohamed M. El-Sherbiny
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Amr AbdelFatah Ahmed .

Editor information

Editors and Affiliations

Cairo University, Giza, Egypt
Aboul Ella Hassanien
The British University in Dubai, Dubai, United Arab Emirates
Khaled Shaalan
Ain Shams University, Cairo, Egypt
Mohamed Fahmy Tolba

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ahmed, A.A., Darwish, S.M.S., El-Sherbiny, M.M. (2020). A Novel Automatic CNN Architecture Design Approach Based on Genetic Algorithm. In: Hassanien, A., Shaalan, K., Tolba, M. (eds) Proceedings of the International Conference on Advanced Intelligent Systems and Informatics 2019. AISI 2019. Advances in Intelligent Systems and Computing, vol 1058. Springer, Cham. https://doi.org/10.1007/978-3-030-31129-2_43

Download citation

DOI: https://doi.org/10.1007/978-3-030-31129-2_43
Published: 02 October 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-31128-5
Online ISBN: 978-3-030-31129-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics