Abstract
Categorical data analysis is typically based on two- or higher dimensional contingency tables, cross-tabulating the co-occurrences of levels of nominal and/or ordinal data. In order to explain these, statisticians typically look for (conditional) independence structures using common methods such as independence tests and log-linear models. One idea behind the use of visualization techniques is to use the human visual system to detect structures in the data that may not be obvious fromsolely numeric output (e.g., test statistics). Whether the task is purely exploratory or modelbased, techniques such as mosaic, sieve, and association plots offer good support for visualization. Mosaic and sieve plots in particular have been extended over the last two decades, and implementations exist in many statistical environments.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agresti, A. (2002). Categorical Data Analysis. Wiley, Hoboken, NJ, 2nd edn.
Andersen, E.B. (1991). The Statistical Analysis of Categorical Data. Springer, Berlin, 2nd edn.
Bickel, P.J., Hammel, E.A. and O’Connell, J.W. (1975). Sex bias in graduate admissions: Data from Berkeley. Science, 187:398–403.
Brewer, C.A. (1999). Color use guidelines for data representation. In Proceedings of the Section on Statistical Graphics, American Statistical Association, Alexandria, VA, pp 55–60.
Cleveland, W.S. and McGill, R. (1983). A color-caused optical illusion on a statistical graph. The American Statistician, 37:101–105.
Cohen, A. (1980). On the graphical display of the significant components in a two-way contingency table. Communications in Statistics – Theory and Methods, A9:1025–1041.
Commission Internationale de l’Éclairage (2004). Colorimetry. Publication CIE 15:2004, Vienna, Austria, 3rd edn. ISBN 3-901-90633-9.
Ernst, M.D. (2004). Permutation methods: A basis for exact inference. Statistical Science, 19:676–685.
Friendly, M. (1994). Mosaic displays for multi-way contingency tables. Journal of the American Statistical Association, 89:190–200.
Friendly, M. (1999). Extending mosaic displays: Marginal, conditional, and partial views of categorical data. Journal of Computational and Graphical Statistics, 8(3):373–395.
Friendly, M. (2000). Visualizing Categorical Data. SAS Institute, Carey, NC.
Haberman, S.J. (1974). Log-linear models for frequency tables with ordered classifications. Biometrics, 30:689–700.
Harrower, M.A. and Brewer, C.A. (2003). ColorBrewer.org: An online tool for selecting color schemes for maps. The Cartographic Journal, 40:27–37.
Hartigan, J.A. and Kleiner, B. (1981). Mosaics for contingency tables. In W.F. Eddy (ed) Computer Science and Statistics: Proceedings of the 13th Symposium on the Interface, Springer, New York, pp 268–273.
Hartigan, J.A. and Kleiner, B. (1984). A mosaic of television ratings. The American Statistician, 38:32–35.
Hofmann, H. (2001). Generalized odds ratios for visual modelling. Journal of Computational and Graphical Statistics, 10:1–13.
Hummel, J. (1996). Linked bar charts: Analysing categorical data graphically. Computational Statistics, 11:23–33.
Ihaka, R. (2003). Colour for presentation graphics. In K. Hornik, F. Leisch and A. Zeileis (eds), Proceedings of the 3rd International Workshop on Distributed Statistical Computing, Vienna, Austria. ISSN 1609-395X.
Insightful Inc. (2005). S-PLUS 7. Seattle, WA.
Koch, G. and Edwards, S. (1988). Clinical efficiency trials with categorical data. In K.E. Peace (ed), Biopharmaceutical Statistics for Drug Development, pp 403–451. Marcel Dekker, New York.
Ligges, U. and Mächler, M. (2003). scatterplot3d – an R package for visualizing multivariate data. Journal of Statistical Software, 8(11):1–20.
Mazanec, J.A. and Strasser, H. (2000). A Nonparametric Approach to Perceptions-based Market Segmentation: Foundations. Springer, Berlin.
Meyer, D., Zeileis, A. and Hornik, K. (2003). Visualizing independence using extended association plots. In K. Hornik, F. Leisch and A. Zeileis (eds), Proceedings of the 3rd International Workshop on Distributed Statistical Computing, Vienna, Austria. ISSN 1609-395X.
Meyer, D., Zeileis, A. and Hornik, K. (2006). The strucplot framework: Visualizing multi-way contingency tables with vcd. Journal of Statistical Software, 17(3):1–48.
Meyer, D., Zeileis, A. and Hornik, K. (2006). vcd: Visualizing Categorical Data. R package version 1.0-6.
Mollon, J. (1995). Seeing color. In T. Lamb and J. Bourriau (eds), Colour: Art and Science. Cambridge University Press, Cambridge.
Moretti, G. and Lyons, P. (2002). Tools for the selection of colour palettes. In Proceedings of the New Zealand Symposium On Computer-Human Interaction (SIGCHI 2002), University of Waikato, New Zealand.
Munsell, A.H. (1905). A Color Notation. Munsell Color Company, Boston, MA.
Patefield, W.M. (1981). An efficient method of generating r×c tables with given row and column totals. Applied Statistics, 30:91–97. Algorithm AS 159.
Pesarin, F. (2001). Multivariate Permutation Tests. Wiley, Chichester, UK.
Poynton, C. (2000). Frequently-asked questions about color. http://www.poynton.com/ColorFAQ.html. Accessed 2006-09-14.
SAS Institute Inc. (2005). SAS/STAT Version 9. Cary, NC.
R Development Core Team (2006). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-00-3.
Riedwyl, H. and Schüpbach, M. (1994). Parquet diagram to plot contingency tables. In F. Faulbaum (ed), Softstat ’93: Advances in Statistical Software, pp 293–299, Gustav Fischer, New York.
Strasser, H. and Weber, C. (1999). On the asymptotic theory of permutation statistics. Mathematical Methods of Statistics, 8:220–250.
Theus, M. (2003). Interactive data visualization using Mondrian. Journal of Statistical Software, 7(11):1–9.
Theus, M. and Lauer, S.R.W. (1999). Visualizing loglinear models. Journal of Computational and Graphical Statistics, 8(3):396–412.
Unwin, A.R., Hawkins, G., Hofmann, H. and Siegl, B. (1996). Interactive graphics for data sets with missing values – MANET. Journal of Computational and Graphical Statistics, 4(6):113–122.
Wing, J.K. (1962). Institutionalism in mental hospitals. British Journal of Social Clinical Psychology, 1:38–51.
Young, F.W. (1996). ViSta: The visual statistics system. Technical Report 94–1(c), UNC L.L. Thurstone Psychometric Laboratory Research Memorandum.
Zeileis, A., Meyer, D. and Hornik, K. (2007). Residual-based shadings for visualizing (conditional) independence. Journal of Computational and Graphical Statistics, 16(3):507–525.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Meyer, D., Zeileis, A., Hornik, K. (2008). Visualizing Contingency Tables. In: Handbook of Data Visualization. Springer Handbooks Comp.Statistics. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-33037-0_23
Download citation
DOI: https://doi.org/10.1007/978-3-540-33037-0_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33036-3
Online ISBN: 978-3-540-33037-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)