Abstract
CONTEXT: Studies have shown contradictory results on the effectiveness of using a moving window of only the most recent projects for effort estimation, compared to using the full history of past data. Moving windows improved the accuracy of effort estimates for a single-company subset of the ISBSG dataset (www.isbsg.org), but not for three single-company subsets of the Finnish dataset (www.4sumpartners.com). The contradiction may be caused by different characteristics of the data sets: in particular, they differ noticeably in heterogeneity of industry sector. GOAL: To investigate the effect on estimation accuracy of differences in the characteristics of the data sets. METHOD: Conduct an experiment with a virtual data set, composed from the three subsets of the Finnish dataset. The composite data set is similar to the ISBSG subset in that it includes data from multiple industry sectors; the largest group of projects in both data sets comes from the same industry sector; and in both data sets the projects are concentrated in a similar number of years. RESULTS: The conclusions is the same as in the past study using the individual Finnish subsets: in the composite data set, moving windows are of no help. CONCLUSIONS: In this instance, increased heterogeneity of projects does not explain the contradiction. It is still not clear when windows may be helpful. Practitioners and researchers should not assume automatically that only the most recent data is best for effort estimation.
Notes
- 1.
The results are graphed for all window sizes. The tables only show every twentieth window size, due to space limitations. This is sufficient to show the essential trends.
References
Amasaki, S., Lokan, C.: The effect of moving windows on software effort estimation: comparative study with CART. In: Proceedings of IWESEP 2014, pp. 1–6. IEEE (2014)
Amasaki, S., Lokan, C.: A Replication of comparative study of moving windows on linear regression and estimation by analogy. In: Proceedings of PROMISE 2015, pp. 1–10. ACM, New York, October 2015
Amasaki, S., Lokan, C.: On the effectiveness of weighted moving windows: experiment on linear regression based software effort estimation. J. Softw. Evol. Process 27(7), 488–507 (2015)
Amasaki, S., Lokan, C.: Evaluation of moving window policies with CART. In: Proceedings of IWESEP 2016, pp. 24–29. IEEE (2016)
Kitchenham, B.A., Mendes, E.: Why comparative effort prediction studies may be invalid. In: Proceedings of PROMISE 2009, p. 4. ACM (2009)
Lokan, C., Mendes, E.: Investigating the use of chronological splitting to compare software cross-company and single-company effort predictions. In: Proceedings of EASE 2008 (2008)
Lokan, C., Mendes, E.: Applying moving windows to software effort estimation. In: Proceedings of ESEM 2009, pp. 111–122 (2009)
Lokan, C., Mendes, E.: Investigating the use of duration-based moving windows to improve software effort prediction. In: Proceedings of APSEC 2012, pp. 818–827 (2012)
Lokan, C., Mendes, E.: Investigating the use of duration-based moving windows to improve software effort prediction: a replicated study. Inf. Softw. Technol. 56(9), 1063–1075 (2014)
Lokan, C., Mendes, E.: Investigating the use of moving windows to improve software effort prediction: a replicated study. Empir. Softw. Eng. 22(2), 1–52 (2016)
Shepperd, M., MacDonell, S.: Evaluating prediction systems in software project estimation. Inf. Softw. Technol. 54(8), 820–827 (2012)
Acknowledgment
This work was supported by JSPS KAKENHI Grant #15K15975.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Amasaki, S., Lokan, C. (2017). A Virtual Study of Moving Windows for Software Effort Estimation Using Finnish Datasets. In: Felderer, M., Méndez Fernández, D., Turhan, B., Kalinowski, M., Sarro, F., Winkler, D. (eds) Product-Focused Software Process Improvement. PROFES 2017. Lecture Notes in Computer Science(), vol 10611. Springer, Cham. https://doi.org/10.1007/978-3-319-69926-4_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-69926-4_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-69925-7
Online ISBN: 978-3-319-69926-4
eBook Packages: Computer ScienceComputer Science (R0)