Empirical Study on the Distribution of Object-Oriented Metrics in Software Systems

Muthukumaran, K.; Bhanu Murthy, N. L.; Sarguna Janani, P.

doi:10.1007/978-3-030-30275-7_23

K. Muthukumaran⁹,
N. L. Bhanu Murthy¹⁰ &
P. Sarguna Janani¹⁰

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1078))

Included in the following conference series:

International Conference on Information and Software Technologies

907 Accesses

Abstract

We attempt to model the probability distribution of object-oriented software metrics. We employ 5 distribution models to find out the distributions the metrics follow. We use AIC, BIC and RMSE as goodness-of-fit measures. Though the past studies have shown that the software projects frequently follow power law, having a Pareto distribution, we seek to study more number of software systems and distribution models to infer more generalizable results, since they occasionally seem to follow Log-normal or Gamma distribution as well. Apart from these three models we have also considered Weibull distribution and Generalized Pareto Distribution (GPD). In this study, we have made an attempt to answer the hypothesis that the object-oriented software metrics follow a particular distribution by comparing various distributions applied over a large number of projects using a recognized statistical framework.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Adamic, L.A., Huberman, B.A.: Zipf’s law and the internet. Glottometrics 3(1), 143–150 (2002)
Google Scholar
Akaike, H.: A new look at the statistical model identification. IEEE Trans. Autom. Control 19(6), 716–723 (1974)
Article MathSciNet Google Scholar
Andersson, C., Runeson, P.: A replicated quantitative analysis of fault distributions in complex software systems. IEEE Trans. Soft. Eng. 33(5), 273–286 (2007)
Article Google Scholar
Baxter, G., Frean, M., Noble, J., Rickerby, M., Smith, H., Visser, M., Melton, H., Tempero, E.: Understanding the shape of Java software. ACM Sigplan Not. 41, 397–412 (2006)
Article Google Scholar
Booch, G.: Object-oriented development. IEEE Trans. Software Eng. 2, 211–221 (1986)
Article Google Scholar
Chidamber, S.R., Kemerer, C.F.: A metrics suite for object oriented design. IEEE Trans. Software Eng. 20(6), 476–493 (1994)
Article Google Scholar
Concas, G., Marchesi, M., Murgia, A., Tonelli, R., Turnu, I.: On the distribution of bugs in the eclipse system. IEEE Trans. Software Eng. 37(6), 872–877 (2011)
Article Google Scholar
Concas, G., Marchesi, M., Pinna, S., Serra, N.: Power-laws in a large object-oriented software system. IEEE Trans. Software Eng. 33(10), 687–708 (2007)
Article Google Scholar
D’Ambros, M., Lanza, M., Robbes, R.: Evaluating defect prediction approaches: a benchmark and an extensive comparison. Empirical Soft. Eng. 17(4–5), 531–577 (2012)
Article Google Scholar
Demšar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)
MathSciNet MATH Google Scholar
Fenton, N.E., Ohlsson, N.: Quantitative analysis of faults and failures in a complex software system. IEEE Trans. Softw. Eng. 26(8), 797–814 (2000)
Article Google Scholar
Gibrat, R.: Les inégalités économiques: applications: aux inégalitês des richesses, à la concentration des entreprises, aux populations des villes, aux statistiques des familles, etc: d’une loi nouvelle: la loi de l’effet proportionnel. Librairie du Recueil Sirey (1931)
Google Scholar
Grbac, T.G., Huljenić, D.: On the probability distribution of faults in complex software systems. Inf. Softw. Technol. 58, 250–258 (2015)
Article Google Scholar
Herraiz, I., Rodriguez, D., Harrison, R.: On the statistical distribution of object-oriented system properties. In: 2012 3rd International Workshop on Emerging Trends in Software Metrics (WETSoM), pp. 56–62. IEEE (2012)
Google Scholar
Huang, C.Y., Kuo, C.S., Luan, S.P.: Evaluation and application of bounded generalized pareto analysis to fault distributions in open source software. IEEE Trans. Reliab. 63(1), 309–319 (2013)
Article Google Scholar
Jureczko, M.: Significance of different software metrics in defect prediction. Soft. Eng. Int. J. 1(1), 86–95 (2011)
Google Scholar
Jureczko, M., Madeyski, L.: Towards identifying software project clusters with regard to defect prediction. In: Proceedings of the 6th International Conference on Predictive Models in Software Engineering, p. 9. ACM (2010)
Google Scholar
Kolassa, C., Riehle, D., Salim, M.A.: A model of the commit size distribution of open source. In: van Emde Boas, P., Groen, F.C.A., Italiano, G.F., Nawrocki, J., Sack, H. (eds.) SOFSEM 2013. LNCS, vol. 7741, pp. 52–66. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-35843-2_6
Chapter Google Scholar
Kumar, L., Misra, S., Rath, S.K.: An empirical analysis of the effectiveness of software metrics and fault prediction model for identifying faulty classes. Comput. Stand. & Interfaces 53, 1–32 (2017)
Article Google Scholar
Lessmann, S., Baesens, B., Mues, C., Pietsch, S.: Benchmarking classification models for software defect prediction: a proposed framework and novel findings. IEEE Trans. Soft. Eng. 34(4), 485–496 (2008)
Article Google Scholar
Li, P.L., Shaw, M., Herbsleb, J., Ray, B., Santhanam, P.: Empirical evaluation of defect projection models for widely-deployed production software systems. ACM SIGSOFT Soft. Eng. Notes 29, 263–272 (2004)
Article Google Scholar
Louridas, P., Spinellis, D., Vlachos, V.: Power laws in software. ACM Trans. Soft. Eng. Methodol. (TOSEM) 18(1), 2 (2008)
Google Scholar
Meyer, B.: Object-Oriented Software Construction, vol. 2. Prentice hall, New York (1988)
Google Scholar
Misra, S., Adewumi, A., Fernandez-Sanz, L., Damasevicius, R.: A suite of object oriented cognitive complexity metrics. IEEE Access 6, 8782–8796 (2018)
Article Google Scholar
Mitzenmacher, M.: A brief history of generative models for power law and lognormal distributions. Internet Math. 1(2), 226–251 (2004)
Article MathSciNet Google Scholar
Murgia, A., Concas, G., Marchesi, M., Tonelli, R., Turnu, I.: An analysis of bug distribution in object oriented systems. arXiv preprint arXiv:0905.3296 (2009)
Newman, M.E.: Power laws, pareto distributions and Zipf’s law. Contemp. Phys. 46(5), 323–351 (2005)
Article Google Scholar
Pani, F.E., Concas, G.: Stochastic models of software development activities. In: Proceedings of WSEAS International Conference. Recent Advances in Computer Engineering Series, No. 7 (2012)
Google Scholar
Pareto, V., Page, A.N.: Translation of manuale di economia politica (“manual of political economy”). AM Kelley, New York (1971)
Google Scholar
Pickands III, J.: Statistical inference using extreme order statistics. Ann. Stat. 3, 119–131 (1975)
Article MathSciNet Google Scholar
Shatnawi, R., Althebyan, Q.: An empirical study of the effect of power law distribution on the interpretation of OO metrics. ISRN Soft. Eng. 2013 (2013)
Google Scholar
Shriram, C., Muthukumaran, K., Bhanu Murthy, N.: Empirical study on the distribution of bugs in software systems. Int. J. Soft. Eng. Knowl. Eng. 28(01), 97–122 (2018)
Article Google Scholar
Shukla, S., Radhakrishnan, T., Muthukumaran, K., Neti, L.B.M.: Multi-objective cross-version defect prediction. Soft Comput. 22(6), 1959–1980 (2018)
Article Google Scholar
Vose, D.: Fitting distributions to data (2010)
Google Scholar
Wheeldon, R., Counsell, S.: Power law distributions in class relationships. In: 2003 Proceedings of 3rd IEEE International Workshop on Source Code Analysis and Manipulation, pp. 45–54. IEEE (2003)
Google Scholar
Zhang, H.: On the distribution of software faults. IEEE Trans. Soft. Eng. 34(2), 301 (2008)
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Computing and Information Technology, Manipal University Jaipur, Jaipur, India
K. Muthukumaran
Department of Computer Science and Information Systems, BITS Pilani Hyderabad Campus, Hyderabad, India
N. L. Bhanu Murthy & P. Sarguna Janani

Authors

K. Muthukumaran
View author publications
You can also search for this author in PubMed Google Scholar
N. L. Bhanu Murthy
View author publications
You can also search for this author in PubMed Google Scholar
P. Sarguna Janani
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to K. Muthukumaran .

Editor information

Editors and Affiliations

Kaunas University of Technology, Kaunas, Lithuania
Robertas Damaševičius
Kaunas University of Technology, Kaunas, Lithuania
Giedrė Vasiljevienė

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Muthukumaran, K., Bhanu Murthy, N.L., Sarguna Janani, P. (2019). Empirical Study on the Distribution of Object-Oriented Metrics in Software Systems. In: Damaševičius, R., Vasiljevienė, G. (eds) Information and Software Technologies. ICIST 2019. Communications in Computer and Information Science, vol 1078. Springer, Cham. https://doi.org/10.1007/978-3-030-30275-7_23

Download citation

DOI: https://doi.org/10.1007/978-3-030-30275-7_23
Published: 03 October 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-30274-0
Online ISBN: 978-3-030-30275-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics