Skip to main content

Descriptive Statistics

  • Chapter
  • First Online:
Market Research

Part of the book series: Springer Texts in Business and Economics ((STBE))

Abstract

We first provide an overview of market research’s workflow. We then discuss efficient strategies to help you structure your project’s database, as well as enter, clean, and easily check the collected data for inconsistencies. In addition, we provide easy strategies that allow you to handle missing data observations before we describe the most common and useful univariate and bivariate descriptive graphs and statistics. Thereafter, we take you through the basics of Stata, including its toolbar and shortcuts to frequently used commands, and provide useful tips on how to create and interpret descriptive graphs and table outputs. A range of descriptive statistics is illustrated and applied in Stata, including bar charts, histograms, box plots, pie charts, frequency tables, scatter graphs, crosstabs, and correlation tables, all of which are useful for differently scaled variables. We make use of a case study for an easy and meaningful interpretation of the graphs and table outputs. We conclude with recommendations for further readings and a case study with review questions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 159.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Alternatively, you could also choose one of the many control system versions, including Subversion, Git, and Mecurial, which enable simple branching and project management. These systems work well with version control in centralized and in distributed environments.

  2. 2.

    There are multivariate techniques that consider three, or more, variables simultaneously in order to detect outliers. See Hair et al. (2010) for an introduction, and Agarwal (2013) for a more detailed methodological discussion.

  3. 3.

    For more information on missing data, see https://www.iriseekhout.com

  4. 4.

    The mode is another measure. However, unlike the median and mean, it is ill-defined, because it can take on multiple values. Consequently, we do not discuss the mode.

  5. 5.

    A similar type of chart is the line chart . In a line chart, measurement points are ordered (typically by their x-axis value) and joined with straight line segments.

  6. 6.

    Note that the terms n−1 in the numerator and denominator cancel each other and are therefore not shown here.

  7. 7.

    In Stata, this is best done using the rowmean command. For example, egen commitment = rowmean (com1 com2 com3). This command automatically calculates the mean over the number of nonmissing responses.

  8. 8.

    The logarithm is calculated as follows: If x = y b, then y = log b (x) where x is the original variable, b the logarithm’s base, and y the exponent. For example, log 10 of 100 is 2. Logarithms cannot be calculated for negative values (such as household debt) and for the value of zero. In Stata, you can generate a log-transformed variable by typing: gen loginc = log(income), whereby loginc refers to the newly created log-transformed variable and income refers to the income variable.

  9. 9.

    If you open Stata in the Windows or Linux operating systems, the toolbar looks a bit different, but is structured along the same lines as discussed in this chapter.

  10. 10.

    http://www.stata.com/manuals14/ddatatypes.pdf

  11. 11.

    http://www.stata.com/manuals14/u.pdf

  12. 12.

    http://www.stata.com/manuals14/dformat.pdf

  13. 13.

    Note an ordinary year has 52 weeks and 1 day, while a leap year has 52 weeks and 2 days. This is because 1 week comprises part of 2016 and part of 2017.

References

  • Agarwal, C. C. (2013). Outlier analysis. New York: Springer.

    Book  Google Scholar 

  • Agresti, A., & Finlay, B. (2014). Statistical methods for the social sciences (4th ed.). London: Pearson.

    Google Scholar 

  • Barchard, K. A., & Pace, L. A. (2011). Preventing human error: The impact of data entry methods on data accuracy and statistical results. Computers in Human Behavior, 27(5), 1834–1839.

    Article  Google Scholar 

  • Barchard, K. A., & Verenikina, Y. (2013). Improving data accuracy: Electing the best data checking technique. Computers in Human Behavior, 29(50), 1917–1912.

    Article  Google Scholar 

  • Baumgartner, H., & Steenkamp, J.-B. E. M. (2001). Response styles in marketing research: A cross-national investigation. Journal of Marketing Research, 38(2), 143–156.

    Article  Google Scholar 

  • Carpenter, J., & Kenward, M. (2013). Multiple imputation and its application. New York: Wiley.

    Book  Google Scholar 

  • Cohen, J. (1988). Statistical power analysis for the behavioral sciences (2nd ed.). Hillsdale: Lawrence Erlbaum Associates.

    Google Scholar 

  • Drolet, A. L., & Morrison, D. G. (2001). Do we really need multiple-item measures in service research? Journal of Service Research, 3(3), 196–204.

    Article  Google Scholar 

  • Eekhout, I., de Vet, H. C. W., Twisk, J. W. R., Brand, J. P. L., de Boer, M. R., & Heymans, M. W. (2014). Missing data in a multi-item instrument were best handled by multiple imputation at the item score level. Journal of Clinical Epidemiology, 67(3), 335–342.

    Article  Google Scholar 

  • Gladwell, M. (2008). Outliers: The story of success. New York: Little, Brown, and Company.

    Google Scholar 

  • Graham, J. W. (2012). Missing data: Analysis and design. Berlin et al.: Springer.

    Book  Google Scholar 

  • Hair, J. F., Jr., Black, W. C., Babin, B. J., & Anderson, R. E. (2010). Multivariate data analysis. A global perspective (7th ed.). Upper Saddle River: Pearson.

    Google Scholar 

  • Harzing, A. W. (2005). Response styles in cross-national survey research: A 26-country study. International Journal of Cross Cultural Management, 6(2), 243–266.

    Article  Google Scholar 

  • Johnson, T., Kulesa, P., Lic, I., Cho, Y. I., & Shavitt, S. (2005). The relation between culture and response styles. Evidence from 19 countries. Journal of Cross-Cultural Psychology, 36(2), 264–277.

    Article  Google Scholar 

  • Krippendorff, K. (2012). Content analysis: An introduction to its methodology. Thousand Oaks: Sage.

    Google Scholar 

  • Little, R. J. A. (1998). A test of missing completely at random for multivariate data with missing values. Journal of the American Statistical Association, 83(404), 1198–1202.

    Article  Google Scholar 

  • Paulsen, A., Overgaard, S., & Lauritsen, J. M. (2012). Quality of data entry using single entry, double entry and automated forms processing – An example based on a study of patient-reported outcomes. PloS One, 7(4), e35087.

    Article  Google Scholar 

  • Rubin, D. B. (1987). Multiple imputation for nonresponse in surveys. New York: Wiley.

    Book  Google Scholar 

  • Sarstedt, M., Diamantopoulos, A., Salzberger, T., & Baumgartner, P. (2016). Selecting single items to measure doubly-concrete constructs: A cautionary tale. Journal of Business Research, 69(8), 3159–3167.

    Article  Google Scholar 

  • Schafer, J. L. (1997). Analysis of incomplete multivariate data. London: Chapman & Hall.

    Book  Google Scholar 

  • White, I. R., Royston, P., & Wood, A. M. (2011). Multiple imputation using chained equations: Issues and guidance for practice. Statistics in Medicine, 30(4), 377–399.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Mooi, E., Sarstedt, M., Mooi-Reci, I. (2018). Descriptive Statistics. In: Market Research. Springer Texts in Business and Economics. Springer, Singapore. https://doi.org/10.1007/978-981-10-5218-7_5

Download citation

Publish with us

Policies and ethics