ClasSi: Measuring Ranking Quality in the Presence of Object Classes with Similarity Information

Ivanescu, Anca Maria; Wichterich, Marc; Seidl, Thomas

doi:10.1007/978-3-642-28320-8_16

Anca Maria Ivanescu²³,
Marc Wichterich²³ &
Thomas Seidl²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7104))

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

1458 Accesses
1 Citations

Abstract

The quality of rankings can be evaluated by computing their correlation to an optimal ranking. State of the art ranking correlation coefficients like Kendall’s τ and Spearman’s ρ do not allow for the user to specify similarities between differing object classes and thus treat the transposition of objects from similar classes the same way as that of objects from dissimilar classes. We propose ClasSi, a new ranking correlation coefficient which deals with class label rankings and employs a class distance function to model the similarities between the classes. We also introduce a graphical representation of ClasSi akin to the ROC curve which describes how the correlation evolves throughout the ranking.

The authors gratefully acknowledge the financial support of the Deutsche Forschungsgemeinschaft (DFG) within the Collaborative Research Center (SFB) 686 “Model-Based Control of Homogenized Low-Temperature Combustion” and DFG grant SE 1039/1-3.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ferri, C., Hernández-Orallo, J., Salido, M.A.: Volume under the ROC surface for multi-class problems. In: Lavrač, N., Gamberger, D., Todorovski, L., Blockeel, H. (eds.) ECML 2003. LNCS (LNAI), vol. 2837, pp. 108–120. Springer, Heidelberg (2003)
Chapter Google Scholar
Flach, P., Blockeel, H., Ferri, C., Hernández-Orallo, J., Struyf, J.: Decision support for data mining; introduction to ROC analysis and its applications. In: Data Mining and Decision Support: Integration and Collaboration, pp. 81–90. Kluwer Academic Publishers (2003)
Google Scholar
Goodman, L.A., Kruskal, W.H.: Measures of association for cross classifications. Journal of the American Statistical Association 49(268), 732–764 (1954)
MATH Google Scholar
Hand, D.J., Till, R.J.: A simple generalisation of the area under the ROC curve for multiple class classification problems. Machine Learning 45, 171–186 (2001)
Article MATH Google Scholar
Hassan, M.R., Ramamohanarao, K., Karmakar, C.K., Hossain, M.M., Bailey, J.: A Novel Scalable Multi-Class ROC for Effective Visualization and Computation. In: Zaki, M.J., Yu, J.X., Ravindran, B., Pudi, V. (eds.) PAKDD 2010. LNCS, vol. 6118, pp. 107–120. Springer, Heidelberg (2010)
Chapter Google Scholar
Kendall, M.: A new measure of rank correlation. Biometrika 30(1-2), 81–89 (1938)
Article MATH Google Scholar
Kendall, M., Gibbons, J.D.: Rank Correlation Methods. Edward Arnold (1990)
Google Scholar
Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press (2008)
Google Scholar
van Rijsbergen, C.J.: Information Retrieval, 2nd edn. Butterworths, London (1979)
MATH Google Scholar
Somers, R.H.: A new asymmetric measure of association for ordinal variables. American Sociological Review 27(6), 799–811 (1962)
Article Google Scholar
Spearman, C.: The proof and measurement of association between two things. The American Journal of Psychology 100, 441–471 (1987)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Data Management and Data Exploration Group, RWTH Aachen University, Informatik 9, 52056, Aachen, Germany
Anca Maria Ivanescu, Marc Wichterich & Thomas Seidl

Authors

Anca Maria Ivanescu
View author publications
You can also search for this author in PubMed Google Scholar
Marc Wichterich
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Seidl
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Engineering and Information Technology, University of Technology Sydney, Broadway, PO Box 123, NSW 2007, Sydney, Australia
Longbing Cao
Shenzhen Institute of Advanced Technology (SIAT), Chinese Academy of Sciences, 518055, Shenzhen, China
Joshua Zhexue Huang & Jun Luo &
The University of Melbourne, VIC 3010, Melbourne, Australia
James Bailey
The University of Auckland, Auckland, New Zealand
Yun Sing Koh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ivanescu, A.M., Wichterich, M., Seidl, T. (2012). ClasSi: Measuring Ranking Quality in the Presence of Object Classes with Similarity Information. In: Cao, L., Huang, J.Z., Bailey, J., Koh, Y.S., Luo, J. (eds) New Frontiers in Applied Data Mining. PAKDD 2011. Lecture Notes in Computer Science(), vol 7104. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28320-8_16

Download citation

DOI: https://doi.org/10.1007/978-3-642-28320-8_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28319-2
Online ISBN: 978-3-642-28320-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics