Skip to main content

Correlation Analysis of Paired Ranking Data

  • Chapter
  • First Online:
  • 3271 Accesses

Part of the book series: Frontiers in Probability and the Statistical Sciences ((FROPROSTAS))

Abstract

A ranking represents the order of preference one has with respect to a set of t objects. If we label the objects by the integers 1 to t, a ranking can then be thought of as a permutation of the integers \((1,2,\ldots,t)\). We may denote such a permutation by μ = (μ(1), μ(2), , μ(t))′ which may also be conceptualized as a point in t-dimensional space. It is natural to measure the spread between two individual permutations μ, ν by means of a distance function.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   109.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   139.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   139.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Bibliography

  • Alvo, M. (1998). On non-parametric measures of correlation for directional data. Environmetrics, 9, 645–656.

    Article  Google Scholar 

  • Alvo, M., & Cabilio, P. (1992). Correlation methods for incomplete rankings. (Technical Report 200) Laboratory for Research in Statistics and Probability: Carleton University and University of Ottawa.

    Google Scholar 

  • Alvo, M., & Cabilio, P. (1993). Tables of critical values of rank tests for trend when the data is incomplete. (Technical Report 230) Laboratory for Research in Statistics and Probability: Carleton University and University of Ottawa.

    Google Scholar 

  • Alvo, M., & Cabilio, P. (1994). Rank test of trend when data are incomplete. Environmetrics, 5, 21–27.

    Article  Google Scholar 

  • Alvo, M., & Cabilio, P. (1995a). Rank correlation methods for missing data. Canadian Journal of Statistics, 23, 345–358.

    Article  MATH  MathSciNet  Google Scholar 

  • Alvo, M., & Park, J. (2002). Multivariate non-parametric tests of trend when the data are incomplete. Statistics and Probability Letters, 57, 281–290.

    Article  MATH  MathSciNet  Google Scholar 

  • Alvo, M., & Smrz, P. (2005). An arc model for ranking data. Journal of Statistical Research, 39, 43–54.

    MathSciNet  Google Scholar 

  • Breckling, J. (1989). The analysis of directional time series. Lecture Notes in Statistics (Vol. 61). Berlin: Springer.

    Google Scholar 

  • Cabilio, P., & Tilley, J. (1999). Power calculations for tests of trend with missing observations. Environmetrics, 10, 803–816.

    Article  Google Scholar 

  • Daniels, H. (1950). Rank correlation and population models. Journal of the Royal Statistical Society Series B, 12, 171–181.

    MATH  MathSciNet  Google Scholar 

  • Diaconis, P., & Graham, R. (1977). Spearman’s footrule as a measure of disarray. Journal of the Royal Statistical Society Series B, 39, 262–268.

    MATH  MathSciNet  Google Scholar 

  • Downs, T. (1973). Rotational angular correlations. New York: Wiley.

    Google Scholar 

  • Hájek, J., & Sidak, Z. (1967). Theory of rank tests. New York: Academic.

    MATH  Google Scholar 

  • Hoeffding, W. (1951). A combinatorial central limit theorem. Annals of Mathematical Statistics, 22, 558–566.

    Article  MATH  MathSciNet  Google Scholar 

  • Johnson, R., & Wehrly, T. (1977). Measures and models for angular correlation and angular-linear correlation. Journal of the Royal Statistical Society Series B, 39, 222–229.

    MATH  MathSciNet  Google Scholar 

  • Jupp, P., & Mardia, K. (1989). A unified view of the theory of directional statistics, 1975–1988. International Statistical Review, 57, 261–294.

    Article  MATH  Google Scholar 

  • Kendall, M., & Gibbons, J. (1990). Rank correlation methods. London: Edward Arnold.

    MATH  Google Scholar 

  • Kidwell, P., Lebanon, G., & Cleveland, W. S. (2008). Visualizing incomplete and partially ranked data. IEEE Transactions on Visualization and Computer Graphics, 14(6), 1356–1363.

    Article  Google Scholar 

  • Lehmann, E. (1975). Nonparametrics: Statistical methods based on ranks. New York: McGraw-Hill.

    MATH  Google Scholar 

  • Mardia, K. (1975). Statistics of directional data. Journal of the Royal Statistical Society Series B, 37, 349–393.

    MATH  MathSciNet  Google Scholar 

  • Mardia, K. (1976). Linear-circular correlation coefficients and rhythnometry. Biometrika, 63, 403–405.

    Article  MathSciNet  Google Scholar 

  • Quade, D. (1972). Average internal rank correlation. (Technical report) Amsterdam: Mathematical Centre.

    Google Scholar 

  • Yu, P. L. H., Lam, K. F., & Alvo, M. (2002). Nonparametric rank test for independence in opinion surveys. Austrian Journal of Statistics, 31, 279–290.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Chapter Notes

Chapter Notes

In this chapter, the traditional rank correlation has been extended to include incomplete rankings. This was made possible using the notion of compatibility which was developed by Alvo and Cabilio in a series of papers. Cabilio and Tilley (1999) report the results of a simulation study where they considered linear, quadratic, and square root trends. They observed that when there were no missing observations, the Spearman statistic was more powerful than Kendall’s. In the incomplete case, however, the new Kendall statistic has superior power for more patterns.

The calculation of the exact variance of \(\mathcal{A}_{K}^{{\ast}}\) under H 2, in Theorem 3.4, is more involved, and the reader is referred to Alvo and Cabilio (1992), where it is shown that

$$\displaystyle{V ar\left (\mathcal{A}_{K}^{{\ast}}\right ) = \frac{\kappa _{1}\kappa _{2}} {9t\left (t - 1\right )}\left [\frac{\left (2t + k_{1} + 3\right )\left (2t + k_{2} + 3\right )} {2} + \frac{\left (t^{2} - k_{1} - 2\right )\left (t^{2} - k_{2} - 2\right )} {\left (t - 2\right )} \right ].}$$

An important application of the results presented above which do not discard missing data is in tests of trend where k 2 = t and k 1 < t. It is seen that in this context, the superiority of the extended Spearman statistic is established through the calculation of its asymptotic relative efficiency relative to the “naive” statistic. (Alvo and Cabilio 1994) applied these methods to test for trend in precipitation data for St John and Fredericton (NB) and showed that the extended statistic based on Spearman distance is more sensitive in detecting trends than the statistic which ignores the missing observations. Tables of selected critical values of \(\mathcal{A}_{S}^{{\ast}}\) and \(\mathcal{A}_{K}^{{\ast}}\) for the trend case when k ≥ t∕2 have been developed for both hypotheses (Alvo and Cabilio 1993). The results of this section have been extended to the case of ties (Yu et al. 2002) and applied to deal with tests of independence in opinion surveys. A further extension to assess trend in proportions appears in Chap. 7.

Alvo and Smrz (2005) proposed an arc model which serves as a good approximation to Kendall distance.

Although not considered in this book, Alvo and Park (2002) were concerned with multivariate tests of trend when the data are partially incomplete. Such is the case in environmental studies when pH data for one or more lakes are often recorded over regular time intervals and examined for monotone increasing or decreasing trends in order to test for trend in acidification. In monitoring recovering patients, one looks for trends in their vital signs which are often multivariate data in nature. There may be as many as 20–30 blood constituents measured weekly over a period of several months or years. In those case, the use of separate tests on each constituent is inefficient.

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer Science+Business Media New York

About this chapter

Cite this chapter

Alvo, M., Yu, P.L.H. (2014). Correlation Analysis of Paired Ranking Data. In: Statistical Methods for Ranking Data. Frontiers in Probability and the Statistical Sciences. Springer, New York, NY. https://doi.org/10.1007/978-1-4939-1471-5_3

Download citation

Publish with us

Policies and ethics