Skip to main content

Visual Representation of Data Including Graphical Exploratory Data Analysis

  • Chapter
Methods of Environmental Data Analysis

Part of the book series: Environmental Management Series ((EMANS))

Abstract

Reliance on simple number summaries, such as correlation coefficients, without plotting the data used to derive the coefficients, can lead one to misinterpret their real meaning. 1–3 An excellent example of this is shown in Fig. 1, which shows a series of plots of data sets, all of which have correlation coefficients, some of which apparently indicate reasonably strong correlation. However, the plots reveal clearly the dangers of relying only on number summaries. They also demonstrate the value of graphical displays in understanding data behaviour and identifying influential observations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 74.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Chambers, J.M., Cleveland, W.S., Kleiner, B. & Tukey, P., Graphical Methods for Data Analysis. Duxbury Press, Boston, MA, 1983, pp. 76–80.

    Google Scholar 

  2. Tiley, P.F., The misuse of correlation coefficients. Chemistry in Britain. (Feb. 1985) 162–3.

    Google Scholar 

  3. Tukey, J.W., Exploratory Data Analysis. Addison-Wesley, Reading, MA, 1977, Chapter 5.

    Google Scholar 

  4. Cleveland, W.S., Dauconis, P. & McGill, R., Variables on scatterplots look more highly correlated when the scales are increased. Science, 216 (1982) 1138–41.

    Article  Google Scholar 

  5. Dauconis, P., Theories of data anlaysis: From magical thinking through classical statistics. In Exploring Data Tables, Trends, and Shapes, ed. D.C. Hoaglin, F. Mosteller & J.W. Tukey. John Wiley & Sons, New York, 1985, Chapter 1.

    Google Scholar 

  6. Chambers, J.M., Cleveland, W.S., Kleiner, B. & Tukey, P., Graphical Methods of Data Analysis. Duxbury Press, Boston, MA, 1983, Chapter 8.

    Google Scholar 

  7. Chernoff, H., Graphical representations as a discipline. In Graphical Representation of Multivariate Data, ed. P.C. Wang. Academic Press, New York, 1978, pp. 1–11.

    Google Scholar 

  8. Ross, L. & Lepper, M.R., The perseverance of beliefs: empirical and normative considerations. In New Directions for Methodology of Behavioural Sciences: Fallible Judgement in Behavioral Research. Jossey-Bass, San Francisco, 1980.

    Google Scholar 

  9. Kahneman, D., Slovic, P. & Tversky, A. (ed.), Judgement under Uncertainty: Heuristics and Biases. Cambridge University Press, Cambridge, 1982.

    Google Scholar 

  10. Schmid, C.F., Statistical Graphics. John Wiley & Sons, New York, 1983.

    Google Scholar 

  11. Tukey, J.W., Exploratory Data Analysis. Addison-Wesley, Reading, MA, 1977.

    Google Scholar 

  12. Tukey, J.W. & Mosteller, F.W., Data Analysis and Regression: A Second Course in Statistics. Addison-Wesley, Reading, MA, 1977.

    Google Scholar 

  13. Hoaglin, D.C., Mosteller, F. & Tukey, J.W., Understanding Robust and Exploratory Data Analysis. John Wiley & Sons, New York, 1983, pp. 1–4.

    Google Scholar 

  14. Ames, A.E. & Szonyi, G., How to avoid lying with statistics. In Chemometrics: Theory and Applications. ACS Symposium Series 52, American Chemical Society, Washington, DC, 1977, Chapter II.

    Google Scholar 

  15. Fisher, R.A., On the mathematical foundations of theoretical statistics. Phil. Trans. Royal Soc., 222A (1922) 322.

    Google Scholar 

  16. Gibbons, J.D., Nonparametric Methods for Quantitative Analysis. Holt, Rinehart and Winston, New York, 1976, pp. 56–77.

    Google Scholar 

  17. Chambers, J.M., Cleveland, W.S., Kleiner, B. & Tukey, P.A., Graphical Methods for Data Analysis. Wadsworth International Group, Belmont, CA. 1983, pp. 19–21.

    Google Scholar 

  18. Velleman, P.F. & Hoaglin, D.C., Applications, Basics and Computing of Exploratory Data Analysis. Duxbury Press, Boston, MA, 1981. Chapter I.

    Google Scholar 

  19. Emerson, J.D. & Hoaglin. D.C., Stem-and-leaf displays. In Understanding Robust and Exploratory Data Analysis, ed. D.C. Hoaglin, F. Mosteller & J.W. Tukey. John Wiley & Sons, New York, 1983, Chapter I.

    Google Scholar 

  20. McGill, R., Tukey, J.W. & Larsen, W.A., Variations of box plots. The American Statistician, 32 (1978) 12–16.

    Google Scholar 

  21. Mosteller, F. & Tukey, J.W., Data Analysis and Regression: A Second Course in Statistics, Addison-Wesley, Reading, MA, 1977, Chapter 3.

    Google Scholar 

  22. Velleman, P.F. & Hoaglin, D.C., Applications, Basics and Computing of Exploratory Data Analysis. Duxbury Press, Boston, MA, 1981, Chapter 3.

    Google Scholar 

  23. Emerson, J.D. & Strenio, J., Boxplots and batch comparison. In Understanding Robust and Exploratory Data Analysis, ed. D.C. Hoaglin, F. Mosteller & J.W. Tukey. John Wiley & Sons, New York, 1983, Chapter 3.

    Google Scholar 

  24. Tukey, J.W., Exploratory Data Analysis, Addison-Wesley, Reading, MA. 1977, Chapter 2.

    Google Scholar 

  25. Frigge, M., Hoaglin, D.C. & Iglewicz, B., Some implementations of the boxplot. The American Statistician, 43 (1989) 50–4.

    Google Scholar 

  26. Minitab, Inc. Minitab Reference Manual, release 6, January 1988. Minitab, Inc., State College, PA, pp. 234–6.

    Google Scholar 

  27. Hoaglin, D.C., Letter values: a set of order statistics. In Understanding Robust and Exploratory Data Analysis, ed. D.C. Hoaglin, F. Mosteller & J.W. Tuke John Wiley & Sons, New York, 1983, Chapter

    Google Scholar 

  28. Velleman, P.F. & Hoaglin, D.C., Applications, Basics and Computing of Exploratory Data Analysis. Duxbury Press, Boston, MA, 1981, Chapter 2.

    Google Scholar 

  29. Hoaglin, D.C. & Tukey, J.W., Checking the shape of discrete distributions. In Exploring Data Tables, Trends, and Shapes, ed. D.C. Hoaglin, F. Mosteller & J.W. Tukey, John Wiley & Sons, New York, 1985, Chapter 9.

    Google Scholar 

  30. Ord, J.K., Graphical methods for a class of discrete distributions. J. Roy. Statist. Assoc. A, 130 (1967) 232–8.

    Article  Google Scholar 

  31. Hoaglin, D.C.,A Poissonness plot. The American Statistician, 34 (1980) 146–9.

    Google Scholar 

  32. Southwood, T.R.E., Ecological Methods with Particular Reference to the Study of Insect Populations. Chapman and Hall, London, 1978, Chapter 2.

    Google Scholar 

  33. Chambers, J.M., Cleveland, W.S., Kleiner, B. & Tukey, P.A., Graphical Methods for Data Analysis. Wadsworth International Group, Belmont, CA, 1983, Chapter 6.

    Google Scholar 

  34. Hoaglin, D.C., In Exploring Data Tables, Trends, and Shapes, ed. D.C. Hoaglin, F. Mosteller & J.W. Tukey. John Wiley & Sons, New York, 1985, pp.437–8.

    Google Scholar 

  35. Statistical Graphics Corporation. STATGRAPHICS Statistical Graphics System. User’s Guide, Version 2.6. STSC, Inc., Rockville, MA, 1987, pp. 11–13 to ll–14.

    Google Scholar 

  36. Velleman, P.F. & Hoaglin, D.C., Applications, Basics, and Computing of Exploratory Data Analysis. Duxbury Press, Boston, MA, 1981, Chapter 9.

    Google Scholar 

  37. Hoaglin, D.C., Using quantiles to study shape. In Exploring Data Tables, Trends, and Shapes, ed. D.C. Hoaglin, F. Mosteller & J.W. Tukey, John Wiley & Sons, New York, 1985, Chapter 10.

    Google Scholar 

  38. Hoaglin, D.C., Summarizing shape numerically: the g-and-h distributions. In Exploring Data Tables, Trends, and Shapes, ed. D.C. Hoaglin, F. Mosteller & J.W. Tukey, John Wiley & Sons, New York, 1985, Chapter II.

    Google Scholar 

  39. Deming, W.E., Statistical Adjustment of Data. John Wiley & Sons, New York, 1943, pp. 178–82.

    Google Scholar 

  40. Mandel, J., The Statistical Analysis of Experimental Data. John Wiley & Sons, New York, 1964, pp. 288–92.

    Google Scholar 

  41. Theil, H., A rank invariant method of linear and polynomial regression analysis, parts I, II and III. Proc. Kon. Nederl. Akad. Wetensch., A, 53 (1950) 386–92, 521–5, 397–412.

    Google Scholar 

  42. Tukey, J.W., Exploratory Data Analysis, Limited Preliminary Edition, Addison-Wesley, Reading, MA, 1970.

    Google Scholar 

  43. Rousseeuw, P.J. & Leroy, A.M., Robust Regression and Outlier Detection. John Wiley & Sons, New York, 1987, Chapter 2.

    Book  Google Scholar 

  44. Lancaster, J.F. & Quade, D., A nonparametric test for linear regression based on combining Kendall’s tau with the sign test. J. Amer. Statist. Assoc., 80 (1985) 393–7.

    Article  Google Scholar 

  45. Thompson, J.M., The use of a robust resistant regression method for personal monitor validation with decay of trapped materials during storage. Analytica Chimica Acta, 186 (1986), 205–12.

    Article  Google Scholar 

  46. Cleveland, W.S. & McGill, R., The many faces of a scatterplot. J. Amer. Statist. Assoc., 79 (1984) 807–22.

    Article  Google Scholar 

  47. Tukey, J.W. & Tukey, P.A., Some graphics for studying four-dimensional data. Computer Science and Statistics: Proceedings of the 14th. Symposium on the Interface. Springer-Verlag, New York, 1983, pp. 60–6.

    Google Scholar 

  48. Chambers, J.M., Cleveland, W.S., Kleiner, B. & Tukey, P.A., Graphical Methods for Data Analysis. Duxbury Press, Belmont, CA, 1983, pp. 110–21.

    Google Scholar 

  49. Goodall, C., Examining residuals. In Understanding Robust and Exploratory Data Analysis, ed. D.C. Hoaglin, F. Mosteller & J.W. Tukey. John Wiley & Sons, New York, 1983, Chapter 7.

    Google Scholar 

  50. Atkinson, A.C., Plots, Transformations and Regression. An Introduction to Graphical Methods of Diagnostic Regression Analysis. Oxford University Press, Oxford, 1985.

    Google Scholar 

  51. Hampel, F.R., Ronchetti, E.M., Rousseeuw, P.J. & Stahel, W.A. Robust Statistics. The Approach Based on Influence Functions. John Wiley & Sons, New York, 1986, Chapter 1.

    Google Scholar 

  52. Rousseeuw, P.J & Leroy, A.M., Robust Regression and Outlier Detection. John Wiley & Sons, New York, 1987, Chapter 1.

    Google Scholar 

  53. Thompson, J.M., Sithamparanadarajah, R., Robinson, J.S. & Stephen, W.I.. Occupational exposure to nitrous oxide, halothane and isopropanol in operating theatres. Health & Hygiene, 8 (1987) 60–8.

    Google Scholar 

  54. Krzanowski, W.J., Principles of Multivariate Analysis. A User’s Perspective. Oxford University Press, Oxford, 1988, Chapter 2.

    Google Scholar 

  55. Kosslyn, S.M., Graphics and human information processing. A review of five books. J. Amer. Statist. Assoc., 80 (1985) 499–512.

    Article  Google Scholar 

  56. Tukey, P.A. & Tukey, J.W., Graphical display of data sets in 3 or more dimensions. In Interpreting Multivariate Data, ed. V. Barnett. John Wiley & Sons, New York, 1981, Chapters 10, 11 & 12.

    Google Scholar 

  57. Chambers, J.M., Cleveland, W.S., Kleiner, B. & Tukey, P.A., Graphical Methods for Data Analysis. Duxbury Press, Boston, MA, 1983, Chapter 5.

    Google Scholar 

  58. Davis, J.C., Statistics and Data Analysis in Geology. John Wiley & Sons, New York, 1986, p. 488.

    Google Scholar 

  59. Becker, R.A. & Cleveland, W.S., Brushing scatterplots. Technometrics, 29 (1987) 127–42.

    Article  Google Scholar 

  60. Mead, G.A., The sorted binary plot: A new technique for exploratory data analysis. Technometrics, 31 (1989) 61–7.

    Article  Google Scholar 

  61. Seber, G.A.F., Multivariate Observations. John Wiley & Sons, New York, 1984, Chapter 4.

    Book  Google Scholar 

  62. Newton, C.M., Graphics: From alpha to omega in data analysis. In Graphical Representation of Multivariate Data. ed. P.C. Wang. Academic Press. New York, 1978, pp. 59–92.

    Google Scholar 

  63. Kleiner, B. & Hartigan, J.A., Representing points in many dimensions by trees and castles. J. Amer. Statist. Assoc., 76 (1981) 260–76.

    Article  Google Scholar 

  64. Anderson, E., A semigraphical method for the analysis of complex problems. Technometrics, 2 (1960) 387–91.

    Article  Google Scholar 

  65. Andrews, D.F., Plots of high-dimensional data. Biometrics, 28 (1972) 125–36.

    Article  Google Scholar 

  66. Chernoff, H., Using faces to represent points in K-dimensional space graphically. J. Amer. Statist. Assoc., 68 (1973) 361–8.

    Article  Google Scholar 

  67. Flury, B. & Riedwyl, H., Multivariate Statistics. A Practical Approach. Chapman and Hall, London, 1988. Chapter 4.

    Book  Google Scholar 

  68. Flury, B., Construction of an asymmetrical face to represent multivariate data graphically. Technische Bericht No. 3, Institut für Mathematische Statistik und Versicherungslehre, Universität Bern, 1980.

    Google Scholar 

  69. Flury, B. & Riedwyl, H., Some applications of asymmetrical faces. Technische Bericht No. 11. Institut für Mathematische Statistik und Versichungslehre, Universität Bern, 1983.

    Google Scholar 

  70. Schüpbach, M., ASYMFACE: Asymmetrical faces on IBM and Olivetti PC Technische Bericht No. 16. Institut für Mathematische Statistik und Versichungslehre, Universität Bern, 1984.

    Google Scholar 

  71. Schüpbach, M., ASYMFACE: Asymmetrical faces in Turbo Pascal. Technische Bericht No. 25. Institut für Mathematische Statistik und Versichungslehre, Universität Bern, 1987.

    Google Scholar 

  72. du Toit, S.H.C., Steyn, A.W.G. & Stumpf, R.H., Graphical Exploratory Data Analysis. Springer-Verlag, New York, 1986, Chapter 4.

    Book  Google Scholar 

  73. Tufte, E.R., The Visual Display of Quantitative Information. Graphics Press, Cheshire, CT, 1983.

    Google Scholar 

  74. Schmid, C.F., Statistical Graphics. John Wiley & Sons, New York, 1983, pp. 188–90.

    Google Scholar 

  75. Dickenson, G.C., Statistical Mapping and the Presentation of Statistics, 2nd edn. Edward Arnold, London, 1973, Chapter 5.

    Google Scholar 

  76. Rock, N.M.S., Numerical Geology. A Source Guide, Glossary and Selective Bibliography to Geological Uses of Computers and Statistics. Springer-Verlag. Berlin, 1988, Topic 21, pp. 275–96.

    Google Scholar 

  77. Mandel, R., The world according to Micros. Byte, 15 (1990) 256–67.

    Google Scholar 

  78. Monmonier, M.S., Computer-Assisted Cartography: Principles and Prospects. Prentice-Hall, Englewood-Cliffs, NJ, 1982, p. 90.

    Google Scholar 

  79. Seber, G.A.F., Multivariate Observations. John Wiley & Sons, New York, 1984. Chapter 7.

    Book  Google Scholar 

  80. Cleveland, W.S. & McGill, M.E. (ed.), Dynamic Graphics for Statistics. Wadsworth, Belmont, CA, 1989.

    Google Scholar 

  81. Davis, J.C., Statistics and Data Analysis in Geology. John Wiley & Sons, New York, 1973.

    Google Scholar 

  82. Eddy, W.F. (ed.), Computer Science and Statistics: Proceedings of the 13th Symposium on the Interface. Springer-Verlag, New York. 1981.

    Google Scholar 

  83. Green. W.R., Computer-aided Data Analysis. A Practical Guide. John Wiley & Sons, New York, 1985.

    Google Scholar 

  84. Haining, R., Spatial Data Analysis in the Social and Environmental Sciences. Cambridge University Press, Cambridge, 1990.

    Book  Google Scholar 

  85. Isaaks, E.H. & Srivastava, R.M., An Introduction to Applied Geostatistics. Oxford University Press, New York, 1989.

    Google Scholar 

  86. Ripley, B.D., Spatial Statistics. J. Wiley & Sons, New York, 1981.

    Book  Google Scholar 

  87. Ripley, B.D., Statistical Inference for Spatial Processes. Cambridge University Press, Cambridge, 1988.

    Book  Google Scholar 

  88. Upton, G. & Fingleton, B., Spatial Data Analysis by Example. Volume 1, Point Pattern and Quantitative Data and Volume 2, Categorical and Directional Data. John Wiley & Sons, New York, 1985, 1989.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1992 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Thompson, J.M. (1992). Visual Representation of Data Including Graphical Exploratory Data Analysis. In: Hewitt, C.N. (eds) Methods of Environmental Data Analysis. Environmental Management Series. Springer, Dordrecht. https://doi.org/10.1007/978-94-011-2920-6_6

Download citation

  • DOI: https://doi.org/10.1007/978-94-011-2920-6_6

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-94-010-5296-2

  • Online ISBN: 978-94-011-2920-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics