Abstract
Since information systems have become the heartbeat of many organizations, the investment in software is growing rapidly and consuming then a significant portion of the company budget. In this context, both the software engineering practitioners and researchers are more interested than ever about accurately estimating the effort and the quality of software product under development. Accurate estimates are desirable but no technique has demonstrated to be successful at effectively and reliably estimating software development effort. In this paper, we propose the use of an optimal trees ensemble (OTE) to predict the software development effort. The ensemble employed is built by combining only the top ranked trees, one by one, from a set of random forests. Each included tree must decrease the unexplained variance of the ensemble for software development effort estimation (SDEE). The effectiveness of the OTE model is compared with other techniques such as regression trees, random forest, RBF neural networks, support vector regression and multiple linear regression in terms of the mean magnitude relative error (MMRE), MdMRE and Pred(l) obtained on five well known datasets namely: ISBSG R8, COCOMO, Tukutuku, Desharnais and Albrecht. According to the results obtained from the experiments, it is shown that the proposed ensemble of optimal trees outperformed almost all the other techniques. Also, OTE model outperformed statistically the other techniques at least in one dataset.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kusumoto, S., Matukawa, F., Inoue, K., Hanabusa, S., Maegawa, Y.: Estimating effort by use case points: method, tool and case study. In: Proceedings of the 10th International Symposium on Software Metrics, 2004, Chicago, Illinois, USA, pp. 292–299 (2004)
de A. AraĂºjo, R., Oliveira, A.L.I., Meira, S.: A class of hybrid multilayer perceptrons for software development effort estimation problems. Expert. Syst. Appl. 90, 1–12 (2017)
Zakrani, A., Idri, A.: Applying radial basis function neural networks based on fuzzy clustering to estimate web applications effort. Int. Rev. Comput. Softw. 5(5), 516–524 (2010)
Zakrani, A., Namir, A., Hain, M.: Investigating the use of random forest in software cost estimation. Procedia Comput. Sci. 148, 343–352 (2019)
Idri, A., Abnane, I.: Fuzzy analogy based effort estimation: an empirical comparative study. In: 17th IEEE International Conference on Computer and Information Technology, CIT 2017. Institute of Electrical and Electronics Engineers Inc. (2017)
Oliveira, A.L.I.: Estimation of software project effort with support vector regression. Neurocomputing 69(13–15), 1749–1753 (2006)
Sehra, S.K., et al.: Research patterns and trends in software effort estimation. Inf. Softw. Technol. 91, 1–21 (2017)
Elish, M.O.: Improved estimation of software project effort using multiple additive regression trees. Expert Syst. Appl. 36(7), 10774–10778 (2009)
Idri, A., Hosni, M., Abran, A.: Improved estimation of software development effort using Classical and Fuzzy Analogy ensembles. Appl. Soft Comput. J. 49, 990–1019 (2016)
Zakrani, A., Moutachaouik, H., Namir, A.: An ensemble of optimal trees for software development effort estimation. In: The 3rd International Conference on Advanced Information Technology, Services and System, Mohammedia, Morocco (2018)
Khan, Z., et al.: An ensemble of optimal trees for class membership probability estimation. In: Wilhelm, A., Kestler, H. (eds.) Analysis of Large and Complex Data, pp. 395–409. Springer, Cham (2014)
Boehm, B.W.: Software Engineering Economics, p. 768. Prentice Hall PTR, Englewood Cliffs, NJ (1981)
Breiman, L., et al.: Classification and Regression Trees. Wadsworth, Belmont (1984)
ISBSG, International Software Benchmarking Standards Group, Data Release 8 Repository, Data Release 8 Repository (2003). http://www.isbsg.org
Amazal, F.A., Idri, A., Abran, A.: Software development effort estimation using classical and fuzzy analogy: a cross-validation comparative study. Int. J. Comput. Intell. Appl. 13(3), 1450013 (2014)
Albrecht, A.J., Gaffney Jr., J.E.: Software function, source lines of code, and development effort prediction: a software science validation. IEEE Trans. Softw. Eng. SE-9(6), 639–648 (1983)
Idri, A., Abnane, I., Abran, A.: Evaluating Pred(p) and standardized accuracy criteria in software development effort estimation. J. Softw.: Evol. Process 30(4), e1925 (2018)
Desharnais, J.-M.: Analyse statistique de la productivitie des projets informatique a partie de la technique des point des fonction. University of Montreal (1989)
Foss, T., et al.: A simulation study of the model evaluation criterion MMRE. IEEE Trans. Software Eng. 29(11), 985–995 (2003)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Zakrani, A., Idri, A., Hain, M. (2020). Software Effort Estimation Using an Optimal Trees Ensemble: An Empirical Comparative Study. In: Bouhlel, M., Rovetta, S. (eds) Proceedings of the 8th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT’18), Vol.1. SETIT 2018. Smart Innovation, Systems and Technologies, vol 146. Springer, Cham. https://doi.org/10.1007/978-3-030-21005-2_7
Download citation
DOI: https://doi.org/10.1007/978-3-030-21005-2_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-21004-5
Online ISBN: 978-3-030-21005-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)