Abstract
Chapter 6 is intended to give the reader a first advice on which similarity measure is suitable for which task, when designing a CBR system. For this purpose, it contains the basic material necessary for understanding and building simple similarity measures. Similarity assessment is a substep of the retrieve process step and is responsible for identifying potentially useful cases from the case base. Because usefulness depends on the user’s context there is no universal method to define similarity. In order to understand the most suitable methods, this chapter gives a general overview of different concepts connected with similarity, including mathematical models of similarity. We discuss the difference between relational and functional similarity, and the important nearest neighbour concept, then list a variety of elementary measures that differ only partially but are used frequently. Some of the measures discussed are counting measures, structure-oriented, and transformational. These approaches are unified by introducing the local-global principle from case representations for similarity measures too. This principle relates object representations and measures. In constructing a similarity measure, a basic difficulty is finding the values for weights. The approach presented formalizes weights on the basis of their predictive power. This approach is also used for defining the independence of attributes. The problem of dependent attributes is approached by introducing virtual attributes. This chapter assumes understanding of Part I and Chap. 5, Case Representations. Similarity is also discussed in Chap. 7, where complex topics are discussed. In later chapters we will occasionally mention similarity when describing topics such as fuzzy sets, random variables and texts.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Bock WJ (1973) Philosophical foundations of classical evolutionary classifications. Syst Zool 22:375–392
Fisher DH (1987) Knowledge acquisition via incremental conceptional clustering. Mach Learn 2:139–172
Hall M, Frank E, Holmes G et al. (2009) The WEKA data mining software: an update. SIGKDD Explor Newsl 11(1):10–18
Levenshtein VI (1966) Binary codes capable of correcting deletions, insertions and reversals. Sov Phys Dokl 10(8):707–710
Richter MM (2007) Similarity. In: Perner P (ed) Case-based reasoning for signals and imaging. Springer, Berlin, pp 25–90
Ruhe G (2010) Product release planning: methods, tools and applications. CRC Press, Boca Raton
Santini S, Jain R (1999) Similarity measures. IEEE Trans Pattern Anal Mach Intell 21:871–883
Tversky A (1977) Features of similarity. Psychol Rev 84:327–352
Tversky A, Gati I (1978) Studies of similarity. In: Rosch E, Lloyd BB (eds) Cognition and categorization. Erlbaum, Hillsdale, pp 79–98
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Richter, M.M., Weber, R.O. (2013). Basic Similarity Topics. In: Case-Based Reasoning. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40167-1_6
Download citation
DOI: https://doi.org/10.1007/978-3-642-40167-1_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40166-4
Online ISBN: 978-3-642-40167-1
eBook Packages: Computer ScienceComputer Science (R0)