Skip to main content

Statistical Regression Analysis

  • Chapter
  • First Online:
Software Project Effort Estimation
  • 3060 Accesses

Abstract

In this chapter, we briefly introduce effort estimation based on statistical regression analysis. Regression analysis represents a data-driven, model-based, parametric estimation method that implements the define-your-own-model approach. In other words, in this approach an effort estimation model is created “from scratch” using quantitative project data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Chapter 3 provides an overview of the most common effort factors used in the context of software effort estimation.

  2. 2.

    If the functional form is not known and cannot be parameterized in terms of any “basis” function, then methods known as nonparametric regression should be applied. Nonparametric regression constructs regression models according to information derived from the data. Please refer to work of Green and Silverman (1994) for more information on available nonparametric regression analysis techniques.

  3. 3.

    There are a number of imputation techniques available. Please refer to Rubin and Little (1983) for brief introduction to the problem of missing data and to Schafer (1997) for an overview of techniques for dealing with missing data.

References

  • Bailey J, Basili V (1981) A meta-model for software development resource expenditures. In: Proceedings of the 5th international conference on software engineering, 9–12 March, San Diego, CA, USA, pp 107–116 (Reprinted in Kemerer CF (1997), Software project management readings and cases, The McGraw-Hill Companies, pp 30–40)

    Google Scholar 

  • Fitzmaurice G (2002) Sample size and power: how big is big enough? Nutrition 18(3):289–290

    Article  Google Scholar 

  • Green SB (1991) How many subjects does it take to do a regression analysis? Multivar Behav Res 26(3):499–510

    Article  Google Scholar 

  • Green PJ, Silverman BW (1994) Nonparametric regression and generalized linear models: a roughness penalty approach, vol 58, Chapman & Hall/CRC monographs on statistics and applied probability. Chapman and Hall, London

    Book  MATH  Google Scholar 

  • Greene WH (2011) Econometric analysis, 7th edn. Prentice-Hall, New York

    Google Scholar 

  • MacKinnon JG (1983) Model specification tests against non-nested alternatives. Econom Rev 2(1):85–110

    Article  MATH  MathSciNet  Google Scholar 

  • Maxwell SE (2000) Sample size and multiple regression analysis. Psychol Methods 5(4):434–458

    Article  MathSciNet  Google Scholar 

  • Mendes E, Mosley N (2008) Bayesian network models for web effort prediction: a comparative study. IEEE Trans Softw Eng 34(6):723–737

    Article  Google Scholar 

  • Pesaran MH, Deaton AS (1978) Testing non-nested regression models. Econometrica 46(3):677–694

    Article  MATH  MathSciNet  Google Scholar 

  • Rousseeuw PJ, Leroy AM (2003) Robust regression and outlier detection. Wiley, Hoboken, NJ

    Google Scholar 

  • Pesaran MH, Weeks M (1999) Non-nested hypothesis testing: an overview. Technical Report 9918, Faculty of Economics, University of Cambridge, Cambridge, MA

    Google Scholar 

  • Schafer JL (1997) Analysis of incomplete multivariate data, Monographs on statistics and applied probability. Chapman & Hall/CRC, London

    Book  MATH  Google Scholar 

  • Rubin DB, Little RJA (1983) Incomplete data. In: Kotz S, Johnson NL (eds) Encyclopedia of statistical sciences, vol 4. Wiley, New York, pp 46–53

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Further Reading

Further Reading

  • L. Schroeder, D. L. Sjoquist, and P. E. Setphan (1996), Understanding Regression Analysis: An Introductory Guide. Sage Publications, Inc.

    The book provides a basic introduction to regression analysis. It aims at beginners in this topic and provides background knowledge for understanding regression analysis methods and application results presented in the literature.

  • N. R. Draper and H. Smith (1998), Applied Regression Analysis. 3rd Edition. John Wiley & Sons, New York, NY, USA.

    This book provides a comprehensive view on applied regression analysis. It covers linear and parametric regression in detail and provides many other useful references.

  • P.J. Green and B.W. Silverman (1994), Nonparametric Regression and Generalized Linear Models: A roughness penalty approach. 1st Edition. Monographs on Statistics and Applied Probability, Chapman & Hall/CRC.

    This book describes nonparametric regression and Generalized Linear Models.

  • J. Miller, J. Daly, M. Wood, M. Roper, and A. Brooks (1997), “Statistical Power and Its Subcomponents—Missing and Misunderstood Concepts in Empirical Software Engineering Research,” Information and Software Technology, vol. 39, no. 4, pp. 285–295.

    Authors discuss typical misconceptions of power analysis in the context of empirical software engineering.

  • Y. Miyazaki, M. Terakado, K. Ozaki, and H. Nozaki (1994), “Robust Regression for Developing Software Estimation Models,” Journal of Systems and Software, vol. 27, pp. 3–16.

    Authors discuss different types of robust regression analysis for the purpose of developing software effort estimation models. Example analyses include the least-squares of balanced relative errors (LBRS) method that minimizes the sum of squares of balanced relative error or least-squares of inverted balanced relative errors (LIRS) that minimizes the sum of squares of inverted balanced relative error.

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Trendowicz, A., Jeffery, R. (2014). Statistical Regression Analysis. In: Software Project Effort Estimation. Springer, Cham. https://doi.org/10.1007/978-3-319-03629-8_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-03629-8_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-03628-1

  • Online ISBN: 978-3-319-03629-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics