Partition Measures for Data Mining

Yager, Ronald R.

doi:10.1007/978-3-642-05177-7_15

Ronald R. Yager⁵

Part of the book series: Studies in Computational Intelligence ((SCI,volume 262))

2178 Accesses
1 Citations

Abstract

We investigate a number of measures associated with partitions. The first of these is congruence measures, which are used to calculate the similarity between two partitions. We provide a number of examples of this type of measure. Another class of measures we investigate are prognostication measures. This measure, closely related to a concept of containment between partitions, are useful in indicating how well knowledge of an objects class in one partition predicts its class in a second partitioning. Finally we introduce a measure of the non-specificity of a partition. This measures a feature of a partition related to the generality of the constituent classes of the partition. A common task in machine learning is developing rules that allow us to predict the class of an object based upon the value of some features of the object. The more narrowly we categorize the features in the rules the better we can predict an objects classification. However counterbalancing this is the fact that to many narrow feature categories are difficult for human experts to cognitively manage, this introduces a fundamental issue in data mining. We shown how the combined use of our measures prognostication and non-specificity allow us navigate this issue.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Michalski, R.S., Stepp, R.E.: Learning from observation: Conceptual clustering. In: Michalski, R.S., Carbonell, J.G., Mitchell, T.M. (eds.) Machine Learning: An Artificial Intelligence Approach. Morgan Kaufmann, San Mateo (1983)
Google Scholar
Michalski, R.S., Stepp, R.E.: Automated construction of classifications: Conceptual clustering versus numerical taxonomy. IEEE Transactions on Pattern Analysis and Machine Intelligence 5, 396–410 (1983)
Article Google Scholar
Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann, San Francisco (2001)
Google Scholar
Pawlak, Z.: Rough Sets- Theoretical Aspects of Reasoning About Data. Kluwer, Hingham (1991)
MATH Google Scholar
Zadeh, L.A.: Similarity relations and fuzzy orderings. Information Sciences 3, 177–200 (1971)
Article MATH MathSciNet Google Scholar
Hillier, F.S., Lieberman, G.J.: Introduction to Operations Research. McGraw Hill, New York (2005)
Google Scholar
Goldberg, D.E.: Genetic Algorithms in Search Optimization and Machine Learning. Addison-Wesley, Reading (1989)
MATH Google Scholar
Yager, R.R.: On measures of specificity. In: Kaynak, O., Zadeh, L.A., Turksen, B., Rudas, I.J. (eds.) Computational Intelligence: Soft Computing and Fuzzy-Neuro Integration with Applications, pp. 94–113. Springer, Berlin (1998)
Google Scholar
Klir, G.J.: Uncertainty and Information. John Wiley & Sons, New York (2006)
Google Scholar
Miller, G.A.: The magical number seven, plus or minus two: Some limitations on our capacity for processing information. Psychological Review 63, 81–97 (1956)
Article Google Scholar
Yager, R.R., Petry, F.E.: Evidence resolution using concept hierarchies. IEEE Transactions on Fuzzy Systems 16, 299–308 (2008)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Machine Intelligence Institute, Iona College, New Rochelle, NY, 10801
Ronald R. Yager

Authors

Ronald R. Yager
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Computer Science, Polish Academy of Sciences, ul.Ordona 21, 01-237, Warsaw, Poland
Jacek Koronacki & Sławomir T. Wierzchoń &
Woodward Hall 430C University of North Carolina, 9201 University City Blvd., N.C. 28223, Charlotte, USA
Zbigniew W. Raś
Systems Research Institute, Polish Academy of Sciences, ul.Newelska 6, 01-447, Warsaw, 01-447
Janusz Kacprzyk

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Yager, R.R. (2010). Partition Measures for Data Mining. In: Koronacki, J., Raś, Z.W., Wierzchoń, S.T., Kacprzyk, J. (eds) Advances in Machine Learning I. Studies in Computational Intelligence, vol 262. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-05177-7_15

Download citation

DOI: https://doi.org/10.1007/978-3-642-05177-7_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-05176-0
Online ISBN: 978-3-642-05177-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics