Skip to main content

Using Support Vector Regression for Web Development Effort Estimation

  • Conference paper
Software Process and Product Measurement (IWSM 2009)

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 5891))

Included in the following conference series:

Abstract

The objective of this paper is to investigate the use of Support Vector Regression (SVR) for Web development effort estimation when using a cross-company data set. Four kernels of SVR were used, linear, polynomial, Gaussian and sigmoid and two preprocessing strategies of the variables were applied, namely normalization and logarithmic. The hold-out validation process was carried out for all the eight configurations using a training set and a validation set from the Tukutuku data set. Our results suggest that the predictions obtained with linear kernel applying a logarithmic transformation of variables (LinLog) are significantly better than those obtained with the other configurations. In addition, SVR has been compared with the traditional estimation techniques, such as Manual StepWise Regression, Case-Based Reasoning, and Bayesian Networks. Our results suggest that SVR with LinLog configuration can provide significantly superior prediction accuracy than other techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abrahao, S.M., Mendes, E., Gomez, J., Insfran, E.: A Model-Driven Measurement Procedure for Sizing Web Applications: Design, Automation and Validation. In: Engels, G., Opdyke, B., Schmidt, D.C., Weil, F. (eds.) MODELS 2007. LNCS, vol. 4735, pp. 467–481. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  2. Bailey, J.W., Basili, V.R.: A meta model for software development resource expenditure. In: Proceedings of the Fifth International Conference on Software Engineering, San Diego, California, USA, pp. 107–116 (1981)

    Google Scholar 

  3. Baresi, L., Morasca, S.: Three Empirical Studies on Estimating the Design Effort of Web Applications. Transactions on Software Engineering and Methodology 16(4)

    Google Scholar 

  4. Braga, P.L., Oliveira, A.L.I., Meira, S.R.L.: Software Effort Estimation using Machine Learning Techniques with Robust Confidence Intervals. In: HIS 2007, pp. 352–357 (2007)

    Google Scholar 

  5. Brieman, L., Friedman, J., Olshen, R., Stone, C.: Classification and Regression Trees. Wadsworth Inc., Belmont (1984)

    Google Scholar 

  6. Christodoulou, S.P., Zafiris, P.A., Papatheodorou, T.S.: WWW2000: The Developer’s view and a practitioner’s approach to Web Engineering. In: Proc. ICSE Workshop on Web Engineering, Limerick, Ireland, pp. 75–92 (2000)

    Google Scholar 

  7. Chulani, S., Boehm, B., Steece, B.: Bayesian Analysis of Empirical Software Engineering Cost Models. IEEE Transactions on Software Engineering 25, 573–583 (1999)

    Article  Google Scholar 

  8. Conte, S.D., Dunsmore, H.E., Shen, V.Y.: Software Engineering Metrics and Models. Benjamin-Cummins (1986)

    Google Scholar 

  9. Costagliola, G., Di Martino, S., Ferrucci, F., Gravino, C., Tortora, G., Vitiello, G.: Effort estimation modeling techniques: a case study for web applications. In: Procs. Intl. Conference on Web Engineering (ICWE 2006), pp. 9–16 (2006)

    Google Scholar 

  10. Desharnais, J.M.: Analyse statistique de la productivitie des projets in 834 formatique a partie de la technique des point des fonction, Ph.D. thesis, 835 Unpublished Masters Thesis, University of Montreal (1989)

    Google Scholar 

  11. Di Martino, S., Ferrucci, F., Gravino, C., Mendes, E.: Comparing Size Measures for Predicting Web Application Development Effort: A Case Study. In: Proceedings of Empirical Software Engineering and Measurement, pp. 324–333. IEEE Press, Los Alamitos (2007)

    Google Scholar 

  12. Kitchenham, B.A.: A Procedure for Analyzing Unbalanced Datasets. IEEE Transactions on Software Engineering 24(4), 278–301 (1998)

    Article  Google Scholar 

  13. Kitchenham, B., Pickard, L.M., MacDonell, S.G., Shepperd, M.J.: What accuracy statistics really measure. IEE Proceedings Software 148(3), 81–85 (2001)

    Article  Google Scholar 

  14. Kitchenham, B.A., Mendes, E.: A Comparison of Cross-company and Single-company Effort Estimation Models for Web Applications. In: Procs. EASE 2004, pp. 47–55 (2004)

    Google Scholar 

  15. Kitchenham, B.A., Mendes, E., Travassos, G.: A Systematic Review of Cross- and Within-company Cost Estimation Studies. In: Proceedings of Empirical Assessment in Software Engineering, pp. 89–98 (2006)

    Google Scholar 

  16. Kitchenham, B., Mendes, E.: Travassos, Cross versus Within-Company Cost Estimation Studies: A systematic Review. IEEE Transactions on Software Engineering 33(5) (2007)

    Google Scholar 

  17. Jeffery, R., Ruhe, M., Wieczorek, I.: Using public domain metrics to estimate software development effort. In: Proceedings Metrics 2001, London, pp. 16–27 (2001)

    Google Scholar 

  18. Joachims, T.: Making large-Scale SVM Learning Practical. In: Schölkopf, B., Burges, C., Smola, A. (eds.) Advances in Kernel Methods - Support Vector Learning. MIT-Press, Cambridge (1999)

    Google Scholar 

  19. Mattera, D., Haykin, S.: Support vector machines for dynamic reconstruction of a chaotic system. In: Scholkopf, B., Burges, C.J.C., Smola, A.J. (eds.) Advances in Kernel Methods— Support Vector Learning, pp. 211–242. MIT Press, Cambridge (1999)

    Google Scholar 

  20. Maxwell, K.: Applied Statistics for Software Managers. Software Quality Institute Series. Prentice Hall, Englewood Cliffs (2002)

    Google Scholar 

  21. Mendes, E.: The Use of Bayesian Networks for Web Effort Estimation: Further Investigation. In: Proceedings of International Conference on Web Engineering (2008)

    Google Scholar 

  22. Mendes, E., Kitchenham, B.A.: Further Comparison of Cross-company and Within-company Effort Estimation Models for Web Applications. In: Proc. IEEE Metrics, pp. 348–357 (2004)

    Google Scholar 

  23. Mendes, E., Counsell, S.: Web Development Effort Estimation using Analogy. In: Proc. 2000 Australian Software Engineering Conference, pp. 203–212 (2000)

    Google Scholar 

  24. Mendes, E., Mosley, N., Counsell, S.: Investigating Web Size Metrics for Early Web Cost Estimation. Journal of Systems and Software 77(2), 157–172 (2005)

    Article  Google Scholar 

  25. Mendes, E., Mosley, N.: Bayesian Network Models for Web Effort Prediction: A Comparative Study. IEEE Transactions on Software Engineering 34(6), 723–737 (2008)

    Article  Google Scholar 

  26. Mendes, E., Mosley, N., Counsell, S.: Web Effort Estimation. In: Mendes, E., Mosley, N. (eds.) Web Engineering. Springer, Heidelberg (2005)

    Google Scholar 

  27. Mendes, E., Mosley, N., Counsell, S.: Early Web Size Measures and Effort Prediction for Web Costimation. In: Proceedings of the IEEE Metrics Symposium, pp. 18–29 (2003)

    Google Scholar 

  28. Mendes, E., Mosley, N., Counsell, S.: Comparison of Length, complexity and functionality as size measures for predicting Web design and authoring effort. IEE Proc. Software 149(3), 86–92 (2002)

    Article  Google Scholar 

  29. Mendes, E., Counsell, S., Mosley, N., Triggs, C., Watson, I.: A Comparative Study of Cost Estimation Models for Web Hypermedia Applications. Empirical Software Engineering 8(23), 163–196 (2003)

    Article  Google Scholar 

  30. Mendes, E., Mosley, N., Counsell, S.: Web metrics - Metrics for estimating effort to design and author Web applications. IEEE MultiMedia, 50–57 (January-March 2001)

    Google Scholar 

  31. Mendes, E., Martino, S.D., Ferrucci, F., Gravino, C.: Cross-company vs. single-company web effort models using the Tukutuku database: An extended study. Journal of System & Software 81(5), 673–690 (2008)

    Article  Google Scholar 

  32. Oliveira, A.L.I.: Estimation of software project effort with support vector regression. Neurocomputing 69(13-15), 1749–1753 (2006)

    Article  Google Scholar 

  33. Reifer, D.J.: Web Development: Estimating Quick-to-Market Software. IEEE Software, 57–64 (November-December 2000)

    Google Scholar 

  34. Reifer, D.J.: Ten deadly risks in Internet and intranet software development. IEEE Software, 12–14 (March-April 2002)

    Google Scholar 

  35. Ruhe, M., Jeffery, R., Wieczorek, I.: Cost estimation for Web applications. In: Proc. ICSE 2003, pp. 285–294 (2003)

    Google Scholar 

  36. Scholkopf, B.: Support Vector Learning. R. Oldenbourg Verlag, Munchen. Doktorarbeit, TU Berlin (1997), http://www.kernel-machines.org

  37. Scholkopf, B., Smola, A.J.: Learning with Kernels. MIT Press, Cambridge (2002)

    Google Scholar 

  38. Smola, A.J., Schölkopf, B.: A tutorial on support vector regression. Statistics and Computing 14(3), 199–222 (2004)

    Article  MathSciNet  Google Scholar 

  39. Shepperd, M.J., Kadoda, G.: Using Simulation to Evaluate Prediction Techniques. In: Proceedings IEEE Metrics 2001, London, UK, pp. 349–358 (2001)

    Google Scholar 

  40. Vapnik, V., Lerner, A.: Pattern recognition using generalized portrait method. Automation and Remote Control 24, 774–780 (1963)

    Google Scholar 

  41. Vapnik, V., Chervonenkis, A.: A note on one class of perceptrons. Automatics and Remote Control, 25 (1964)

    Google Scholar 

  42. Vapnik, V.: The nature of Statistical Learning Theory. Springer, Heidelberg (1995)

    MATH  Google Scholar 

  43. Vapnik, V.: Statistical Learning Theory. Wiley, Chichester (1998)

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Corazza, A., Di Martino, S., Ferrucci, F., Gravino, C., Mendes, E. (2009). Using Support Vector Regression for Web Development Effort Estimation. In: Abran, A., Braungarten, R., Dumke, R.R., Cuadrado-Gallego, J.J., Brunekreef, J. (eds) Software Process and Product Measurement. IWSM 2009. Lecture Notes in Computer Science, vol 5891. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-05415-0_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-05415-0_19

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-05414-3

  • Online ISBN: 978-3-642-05415-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics