A Novel Diminish Smooth L1 Loss Model with Generative Adversarial Network

Sutanto, Arief Rachman; Kang, Dae-Ki

doi:10.1007/978-3-030-68449-5_36

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 12615)

Included in the following conference series:

International Conference on Intelligent Human Computer Interaction

Abstract

Training a GAN can be regarded as a game in which the generator network and the discriminator network compete against each other until they reach a state in which neither can improve as long as its opponent does not change. Meanwhile, each gradient descent step chooses a direction that reduces the defined loss, so the loss function plays a key role in the performance of the model. Choosing the right loss function helps the model focus on the correct set of features in the data and converge faster to a better optimum. In this work, we propose a novel loss function scheme, namely Diminish Smooth L1 loss. We improve a robust L1 loss called Smooth L1 loss by lowering its threshold so that the network can converge to a lower minimum. In our experiments on several benchmark datasets, our approach often outperforms previous approaches.
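
As a rough illustration of the threshold idea, the sketch below implements the standard Smooth L1 loss with an adjustable threshold, here called `beta` (a name assumed for this example, not taken from the chapter). Errors smaller than `beta` are penalized quadratically and larger errors linearly. The exact Diminish Smooth L1 formulation and the threshold values used by the authors are not given in this abstract, so lowering `beta` here only approximates the idea.

```python
def smooth_l1(pred, target, beta=1.0):
    """Mean Smooth L1 loss over two equal-length sequences of floats.

    `beta` is the threshold between the quadratic and the linear region.
    It is a hypothetical parameter name chosen for this sketch.
    """
    if beta <= 0:
        raise ValueError("beta must be positive")
    if len(pred) != len(target) or len(pred) == 0:
        raise ValueError("pred and target must be non-empty and equal in length")

    total = 0.0
    for p, t in zip(pred, target):
        diff = abs(p - t)
        if diff < beta:
            # Quadratic region: behaves like a scaled L2 loss near zero.
            total += 0.5 * diff * diff / beta
        else:
            # Linear region: behaves like L1 loss, offset so the two
            # pieces meet continuously at diff == beta.
            total += diff - 0.5 * beta
    return total / len(pred)


# The same residual of 0.5 under the usual threshold and a lowered one.
loss_default = smooth_l1([0.0], [0.5], beta=1.0)   # quadratic region
loss_lowered = smooth_l1([0.0], [0.5], beta=0.25)  # linear region
```

With `beta=1.0` the residual of 0.5 falls in the quadratic region, where the per-element gradient shrinks as the error shrinks. With `beta=0.25` the same residual is still in the linear region, so its gradient keeps a constant magnitude until the error drops below the lower threshold.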

Acknowledgment

This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF), funded by the Ministry of Education (NRF-2018R1D1A1A02050166), and by an Institute for Information and Communications Technology Promotion (IITP), South Korea, grant funded by the Korea Government (MSIT) (No. 2018-0-00245, Development of prevention technology against AI dysfunction induced by deception attack).

Author information

Authors and Affiliations

Department of Computer Engineering, Dongseo University, 47 Jurye-ro, Sasang-gu, Busan, 47011, Republic of Korea
Arief Rachman Sutanto
Division of Information and Communication Engineering, Dongseo University, 47 Jurye-ro, Sasang-gu, Busan, 47011, Republic of Korea
Dae-Ki Kang

Corresponding author

Correspondence to Dae-Ki Kang.

Editor information

Editors and Affiliations

Woosong University, Daejeon, Korea (Republic of)
Madhusudan Singh
Dongseo University, Busan, Korea (Republic of)
Dae-Ki Kang
Keimyung University, Daegu, Korea (Republic of)
Jong-Ha Lee
Indian Institute of Information Technology, Allahabad, India
Uma Shanker Tiwary
Hankuk University of Foreign Studies, Yongin, Korea (Republic of)
Dhananjay Singh
Pukyong National University, Busan, Korea (Republic of)
Wan-Young Chung

About this paper

Cite this paper

Sutanto, A.R., Kang, DK. (2021). A Novel Diminish Smooth L1 Loss Model with Generative Adversarial Network. In: Singh, M., Kang, DK., Lee, JH., Tiwary, U.S., Singh, D., Chung, WY. (eds) Intelligent Human Computer Interaction. IHCI 2020. Lecture Notes in Computer Science, vol 12615. Springer, Cham. https://doi.org/10.1007/978-3-030-68449-5_36

DOI: https://doi.org/10.1007/978-3-030-68449-5_36
Published: 06 February 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-68448-8
Online ISBN: 978-3-030-68449-5
eBook Packages: Computer Science, Computer Science (R0)
