Similarity and Dissimilarity Measures for Mixed Feature-Type Symbolic Data

Ichino, Manabu; Umbleja, Kadri

doi:10.1007/978-3-319-73906-9_12

Manabu Ichino⁴ &
Kadri Umbleja⁵

Part of the book series: Springer Proceedings in Mathematics & Statistics ((PROMS,volume 227))

Included in the following conference series:

Convegno della Società Italiana di Statistica

967 Accesses
1 Citations

Abstract

This paper presents some preliminary results for the similarity and dissimilarity measures based on the Cartesian System Model (CSM) that is a mathematical model to manipulate mixed feature-type symbolic data. We define the notion of concept size for the description of each object in the feature space. By extending the notion to the concept sizes of the Cartesian join and the Cartesian meet of the descriptions for objects, we can obtain various similarity and dissimilarity measures. We present especially asymmetric and symmetric similarity measures useful for pattern recognition problems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Johnson, S.C.: Hierarchical clustering schemes. Psychometrika 32(3), 241–254 (1967)
Article MATH Google Scholar
Hubert, L.: Some extensions of Johnson’s hierarchical clustering algorithms. Psychometrika 37(3) 261–27 L. 4 (1972)
Google Scholar
Tversky, A.: Features of similarity. Psychol. Rev. 84(4) (1977)
Google Scholar
Michalski, R., Stepp, R.: Learning from observation: Conceptual clustering. In: Michalski, R.S., Carbonell, J.G., Mitchel, T.M. (eds.) Machine Learning, An Artificial Intelligence Approach, vol. II, pp. 331–363. TIOGA Publishing Co., Palo Alto (1983)
Google Scholar
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. 31(3), 264–323 (1999)
Article Google Scholar
Bock, H.-H., Diday, E.: Analysis of Symbolic Data. Exploratory Methods for Extracting Statistical Information from Complex Data. Springer, Berlin, Heidelberg (2000)
MATH Google Scholar
Billard, L., Diday, E.: Symbolic Data Analysis: Conceptual Statistics and Data Mining. Wiley, Chichester (2007)
MATH Google Scholar
Diday, E., Noirhomme-Fraiture, M.: Symbolic Data Analysis and the SODAS Software. Wiley, Chichester (2008)
MATH Google Scholar
De Carvalho, F.D.A.T., De Souza, M.C.R.: Unsupervised pattern recognition models for mixed feature-type data. Pattern Recognit. Lett. 31, 430–443 (2010)
Google Scholar
Ichino, M., Yaguchi, H.: Generalized Minkowski metrics for mixed feature-type data analysis. IEEE Trans. Syst. Man Cybern. 24(4), 698–708 (1994)
Article MathSciNet MATH Google Scholar
Guru, D.S., Kiranagi, B.B., Nagabhushan, P.: Multivalued type proximity measure and concept of mutual similarity value useful for clustering symbolic patterns. Pattern Recognit. 25, 1203–1213 (2004)
Article Google Scholar
Ichino, M.: The quantile method of symbolic principal component analysis 4, 184–198 (2011)
Google Scholar
Ono, Y., Ichino, M.: A new feature selection method based on geometrical thickness. Int. J. Off. Stat. 1(2), 19–38 (1998)
Google Scholar

Download references

Acknowledgements

The authors thank anonymous referees for their helpful comments. This work was supported by JSPS KAKENHI (Grants–in–Aid for Scientific Research) Grant Number 25330268.

Author information

Authors and Affiliations

Tokyo Denki University, Tokyo, Japan
Manabu Ichino
Tallinn University of Technology, Tallinn, Estonia
Kadri Umbleja

Authors

Manabu Ichino
View author publications
You can also search for this author in PubMed Google Scholar
Kadri Umbleja
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Manabu Ichino .

Editor information

Editors and Affiliations

Dipartimento di Scienze Economiche e Statistiche, Università degli Studi di Salerno, Fisciano, Salerno, Italy
Cira Perna
Dipartimento di Economia e Management, Università degli Studi di Pisa, Pisa, Italy
Monica Pratesi
Toulouse School of Economics, University of Toulouse, Toulouse Cedex 6, France
Anne Ruiz-Gazen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ichino, M., Umbleja, K. (2018). Similarity and Dissimilarity Measures for Mixed Feature-Type Symbolic Data. In: Perna, C., Pratesi, M., Ruiz-Gazen, A. (eds) Studies in Theoretical and Applied Statistics. SIS 2016. Springer Proceedings in Mathematics & Statistics, vol 227. Springer, Cham. https://doi.org/10.1007/978-3-319-73906-9_12

Download citation

DOI: https://doi.org/10.1007/978-3-319-73906-9_12
Published: 02 April 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-73905-2
Online ISBN: 978-3-319-73906-9
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics