Abstract
Univariate analysis of data means to deal with each variable separately. How to choose the method to examine the variables highly depends upon the so-called “scale type” of each variable. Therefore, it is important to assign the correct scale to the variables in the first place, before starting the analysis. In this chapter, the reader is familiarized with the concept of discrete and continuous variables. A procedure for determining the correct scale type is outlined in detail.
Based on the decision of the scale type of a variable in the next step, the information must be extracted and represented in a well comprehensible form. To do so, values are classified or binned. The different methods offered by the IBM SPSS Modeler are explained in this chapter. Additionally, the concept of SuperNodes is being outlined in the case when normalizing variables.
After finishing this chapter, the reader is able to …
-
1.
Explain in detail the necessity and the characteristics of different scale types as well as to keep hold of the big picture
-
2.
Create diagrams of frequency distributions to assess the shape and to determine outliers after assigning the correct scale type to a variable
-
3.
Describe the necessity to use reclassification or binning procedures to determine frequencies of values in specific intervals and finally
-
4.
Use SuperNodes to transform variables and compact streams
So the successful reader is familiar with the statistical theory of assigning correct scale types to variables and after that to select and apply correct methods to show, determine, and assess frequency distributions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Literature
Aczel, A. D., & Sounderpandian, J. (2009). Complete business statistics (7th ed.). Boston: McGraw-Hill Higher Education.
Anderson, D. R., Sweeney, D. J., & Williams, T. A. (2014). Essentials of statistics for business and economics, 7th edn.
Box, G. E. P., & Cox, D. R. (1964). An analysis of transformations. Journal of the Royal Statistical Society Series B (Methodological), 26(2), 211–252.
Chapman, C., & Feit, E. M. (2015). R for marketing research and analytics, use R! Berlin: Springer.
Fox, J., & Weisberg, S. (2015). Package ‘car’. Accessed June 5, 2015, from http://cran.r-project.org/web/packages/car/car.pdf
IBM. (2014). SPSS modeler 16 source, process, and output nodes. Available at: ftp://public.dhe.ibm.com/software/analytics/spss/documentation/modeler/16.0/en/modeler_nodes_general.pdf
Schulz, L. O., Bennett, P. H., Ravussin, E., Kidd, J. R., Kidd, K. K., Esparza, J., et al. (2006). Effects of traditional and western environments on prevalence of type 2 diabetes in Pima Indians in Mexico and the USA. Diabetes Care, 29(8), 1866–1871.
Tukey, J. W. (1957). On the comparative anatomy of transformations. The Annals of Mathematical Statistics, 28(3), 602–632.
Zimmerman, D. W. (1998). Invalidation of parametric and nonparametric statistical tests by concurrent violation of two assumptions. The Journal of Experimental Education, 67(1), 55–68.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Wendler, T., Gröttrup, S. (2016). Univariate Statistics. In: Data Mining with SPSS Modeler. Springer, Cham. https://doi.org/10.1007/978-3-319-28709-6_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-28709-6_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-28707-2
Online ISBN: 978-3-319-28709-6
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)