Descriptive Statistics

Mooi, Erik; Sarstedt, Marko; Mooi-Reci, Irma

doi:10.1007/978-981-10-5218-7_5

Erik Mooi⁴,
Marko Sarstedt⁵ &
Irma Mooi-Reci⁶

Part of the book series: Springer Texts in Business and Economics ((STBE))

158k Accesses
1 Citations

Abstract

We first provide an overview of market research’s workflow. We then discuss efficient strategies to help you structure your project’s database, as well as enter, clean, and easily check the collected data for inconsistencies. In addition, we provide easy strategies that allow you to handle missing data observations before we describe the most common and useful univariate and bivariate descriptive graphs and statistics. Thereafter, we take you through the basics of Stata, including its toolbar and shortcuts to frequently used commands, and provide useful tips on how to create and interpret descriptive graphs and table outputs. A range of descriptive statistics is illustrated and applied in Stata, including bar charts, histograms, box plots, pie charts, frequency tables, scatter graphs, crosstabs, and correlation tables, all of which are useful for differently scaled variables. We make use of a case study for an easy and meaningful interpretation of the graphs and table outputs. We conclude with recommendations for further readings and a case study with review questions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 159.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Alternatively, you could also choose one of the many control system versions, including Subversion, Git, and Mecurial, which enable simple branching and project management. These systems work well with version control in centralized and in distributed environments.
2.
There are multivariate techniques that consider three, or more, variables simultaneously in order to detect outliers. See Hair et al. (2010) for an introduction, and Agarwal (2013) for a more detailed methodological discussion.
3.
For more information on missing data, see https://www.iriseekhout.com
4.
The mode is another measure. However, unlike the median and mean, it is ill-defined, because it can take on multiple values. Consequently, we do not discuss the mode.
5.
A similar type of chart is the line chart . In a line chart, measurement points are ordered (typically by their x-axis value) and joined with straight line segments.
6.
Note that the terms n−1 in the numerator and denominator cancel each other and are therefore not shown here.
7.
In Stata, this is best done using the rowmean command. For example, egen commitment = rowmean (com1 com2 com3). This command automatically calculates the mean over the number of nonmissing responses.
8.
The logarithm is calculated as follows: If x = y ^b, then y = log _b (x) where x is the original variable, b the logarithm’s base, and y the exponent. For example, log 10 of 100 is 2. Logarithms cannot be calculated for negative values (such as household debt) and for the value of zero. In Stata, you can generate a log-transformed variable by typing: gen loginc = log(income), whereby loginc refers to the newly created log-transformed variable and income refers to the income variable.
9.
If you open Stata in the Windows or Linux operating systems, the toolbar looks a bit different, but is structured along the same lines as discussed in this chapter.
10.
http://www.stata.com/manuals14/ddatatypes.pdf
11.
http://www.stata.com/manuals14/u.pdf
12.
http://www.stata.com/manuals14/dformat.pdf
13.
Note an ordinary year has 52 weeks and 1 day, while a leap year has 52 weeks and 2 days. This is because 1 week comprises part of 2016 and part of 2017.

References

Agarwal, C. C. (2013). Outlier analysis. New York: Springer.
Book Google Scholar
Agresti, A., & Finlay, B. (2014). Statistical methods for the social sciences (4^th ed.). London: Pearson.
Google Scholar
Barchard, K. A., & Pace, L. A. (2011). Preventing human error: The impact of data entry methods on data accuracy and statistical results. Computers in Human Behavior, 27(5), 1834–1839.
Article Google Scholar
Barchard, K. A., & Verenikina, Y. (2013). Improving data accuracy: Electing the best data checking technique. Computers in Human Behavior, 29(50), 1917–1912.
Article Google Scholar
Baumgartner, H., & Steenkamp, J.-B. E. M. (2001). Response styles in marketing research: A cross-national investigation. Journal of Marketing Research, 38(2), 143–156.
Article Google Scholar
Carpenter, J., & Kenward, M. (2013). Multiple imputation and its application. New York: Wiley.
Book Google Scholar
Cohen, J. (1988). Statistical power analysis for the behavioral sciences (2^nd ed.). Hillsdale: Lawrence Erlbaum Associates.
Google Scholar
Drolet, A. L., & Morrison, D. G. (2001). Do we really need multiple-item measures in service research? Journal of Service Research, 3(3), 196–204.
Article Google Scholar
Eekhout, I., de Vet, H. C. W., Twisk, J. W. R., Brand, J. P. L., de Boer, M. R., & Heymans, M. W. (2014). Missing data in a multi-item instrument were best handled by multiple imputation at the item score level. Journal of Clinical Epidemiology, 67(3), 335–342.
Article Google Scholar
Gladwell, M. (2008). Outliers: The story of success. New York: Little, Brown, and Company.
Google Scholar
Graham, J. W. (2012). Missing data: Analysis and design. Berlin et al.: Springer.
Book Google Scholar
Hair, J. F., Jr., Black, W. C., Babin, B. J., & Anderson, R. E. (2010). Multivariate data analysis. A global perspective (7^th ed.). Upper Saddle River: Pearson.
Google Scholar
Harzing, A. W. (2005). Response styles in cross-national survey research: A 26-country study. International Journal of Cross Cultural Management, 6(2), 243–266.
Article Google Scholar
Johnson, T., Kulesa, P., Lic, I., Cho, Y. I., & Shavitt, S. (2005). The relation between culture and response styles. Evidence from 19 countries. Journal of Cross-Cultural Psychology, 36(2), 264–277.
Article Google Scholar
Krippendorff, K. (2012). Content analysis: An introduction to its methodology. Thousand Oaks: Sage.
Google Scholar
Little, R. J. A. (1998). A test of missing completely at random for multivariate data with missing values. Journal of the American Statistical Association, 83(404), 1198–1202.
Article Google Scholar
Paulsen, A., Overgaard, S., & Lauritsen, J. M. (2012). Quality of data entry using single entry, double entry and automated forms processing – An example based on a study of patient-reported outcomes. PloS One, 7(4), e35087.
Article Google Scholar
Rubin, D. B. (1987). Multiple imputation for nonresponse in surveys. New York: Wiley.
Book Google Scholar
Sarstedt, M., Diamantopoulos, A., Salzberger, T., & Baumgartner, P. (2016). Selecting single items to measure doubly-concrete constructs: A cautionary tale. Journal of Business Research, 69(8), 3159–3167.
Article Google Scholar
Schafer, J. L. (1997). Analysis of incomplete multivariate data. London: Chapman & Hall.
Book Google Scholar
White, I. R., Royston, P., & Wood, A. M. (2011). Multiple imputation using chained equations: Issues and guidance for practice. Statistics in Medicine, 30(4), 377–399.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Management and Marketing, University of Melbourne, Parkville, Victoria, Australia
Erik Mooi
Chair of Marketing, Otto-von-Guericke-University, Magdeburg, Sachsen-Anhalt, Germany
Marko Sarstedt
School of Social and Political Sciences, University of Melbourne, Parkville, Victoria, Australia
Irma Mooi-Reci

Authors

Erik Mooi
View author publications
You can also search for this author in PubMed Google Scholar
Marko Sarstedt
View author publications
You can also search for this author in PubMed Google Scholar
Irma Mooi-Reci
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Mooi, E., Sarstedt, M., Mooi-Reci, I. (2018). Descriptive Statistics. In: Market Research. Springer Texts in Business and Economics. Springer, Singapore. https://doi.org/10.1007/978-981-10-5218-7_5

Download citation

DOI: https://doi.org/10.1007/978-981-10-5218-7_5
Published: 02 November 2017
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-5217-0
Online ISBN: 978-981-10-5218-7
eBook Packages: Business and ManagementBusiness and Management (R0)

Publish with us

Policies and ethics