Skip to main content

Similarity and Dissimilarity Measures for Mixed Feature-Type Symbolic Data

  • Conference paper
  • First Online:
Studies in Theoretical and Applied Statistics (SIS 2016)

Part of the book series: Springer Proceedings in Mathematics & Statistics ((PROMS,volume 227))

Included in the following conference series:

Abstract

This paper presents some preliminary results for the similarity and dissimilarity measures based on the Cartesian System Model (CSM) that is a mathematical model to manipulate mixed feature-type symbolic data. We define the notion of concept size for the description of each object in the feature space. By extending the notion to the concept sizes of the Cartesian join and the Cartesian meet of the descriptions for objects, we can obtain various similarity and dissimilarity measures. We present especially asymmetric and symmetric similarity measures useful for pattern recognition problems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Johnson, S.C.: Hierarchical clustering schemes. Psychometrika 32(3), 241–254 (1967)

    Article  MATH  Google Scholar 

  2. Hubert, L.: Some extensions of Johnson’s hierarchical clustering algorithms. Psychometrika 37(3) 261–27 L. 4 (1972)

    Google Scholar 

  3. Tversky, A.: Features of similarity. Psychol. Rev. 84(4) (1977)

    Google Scholar 

  4. Michalski, R., Stepp, R.: Learning from observation: Conceptual clustering. In: Michalski, R.S., Carbonell, J.G., Mitchel, T.M. (eds.) Machine Learning, An Artificial Intelligence Approach, vol. II, pp. 331–363. TIOGA Publishing Co., Palo Alto (1983)

    Google Scholar 

  5. Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. 31(3), 264–323 (1999)

    Article  Google Scholar 

  6. Bock, H.-H., Diday, E.: Analysis of Symbolic Data. Exploratory Methods for Extracting Statistical Information from Complex Data. Springer, Berlin, Heidelberg (2000)

    MATH  Google Scholar 

  7. Billard, L., Diday, E.: Symbolic Data Analysis: Conceptual Statistics and Data Mining. Wiley, Chichester (2007)

    MATH  Google Scholar 

  8. Diday, E., Noirhomme-Fraiture, M.: Symbolic Data Analysis and the SODAS Software. Wiley, Chichester (2008)

    MATH  Google Scholar 

  9. De Carvalho, F.D.A.T., De Souza, M.C.R.: Unsupervised pattern recognition models for mixed feature-type data. Pattern Recognit. Lett. 31, 430–443 (2010)

    Google Scholar 

  10. Ichino, M., Yaguchi, H.: Generalized Minkowski metrics for mixed feature-type data analysis. IEEE Trans. Syst. Man Cybern. 24(4), 698–708 (1994)

    Article  MathSciNet  MATH  Google Scholar 

  11. Guru, D.S., Kiranagi, B.B., Nagabhushan, P.: Multivalued type proximity measure and concept of mutual similarity value useful for clustering symbolic patterns. Pattern Recognit. 25, 1203–1213 (2004)

    Article  Google Scholar 

  12. Ichino, M.: The quantile method of symbolic principal component analysis 4, 184–198 (2011)

    Google Scholar 

  13. Ono, Y., Ichino, M.: A new feature selection method based on geometrical thickness. Int. J. Off. Stat. 1(2), 19–38 (1998)

    Google Scholar 

Download references

Acknowledgements

The authors thank anonymous referees for their helpful comments. This work was supported by JSPS KAKENHI (Grants–in–Aid for Scientific Research) Grant Number 25330268.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Manabu Ichino .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ichino, M., Umbleja, K. (2018). Similarity and Dissimilarity Measures for Mixed Feature-Type Symbolic Data. In: Perna, C., Pratesi, M., Ruiz-Gazen, A. (eds) Studies in Theoretical and Applied Statistics. SIS 2016. Springer Proceedings in Mathematics & Statistics, vol 227. Springer, Cham. https://doi.org/10.1007/978-3-319-73906-9_12

Download citation

Publish with us

Policies and ethics