Abstract
Human knowledge develops through complex relationships between categories. In the era of Big Data, the concept of categorization implies data summarization in a limited number of well-separated groups that must be maximally and internally homogeneous at the same time. This proposal exploits archetypal analysis capabilities by finding a set of extreme points that can summarize entire data sets in homogeneous groups. The archetypes are then used to identify the best prototypes according to Rosch’s definition. Finally, in the geometric approach to cognitive science, the Voronoi tessellation based on the prototypes is used to define categorization. An example using a well-known wine dataset by Forina et al. illustrates the procedure.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Adams, B., Raubal, M.: A metric conceptual space algebra. In: Spatial Information Theory. Springer (2009)
Aitchison, J., Barceló-Vidal, C., Martín-Fernández, J., Pawlowsky-Glahn, V.: Logratio analysis and compositional distance. Math. Geol. 32(3), 271–275 (2000)
Barsalou, L.W.: Deriving categories to achieve goals. In: Psychology of Learning and Motivation. Elsevier (1991)
Bien, J., Tibshirani, R.: Prototype selection for interpretable classification. Ann. Appl. Stat. 2403–2424 (2011)
Bauckhage, C., Thurau, C.: Making archetypal analysis practical. In: DAGM-Symposium, pp. 272–281 (2009)
Chang, F., Lin, C.C., Lu, C.J.: Adaptive prototype learning algorithms: theoretical and experimental studies. J. Mach. Learn. Res. 7, 2125–2148 (2006)
Cowan, N.: The magical mystery four: how is working memory capacity limited, and why? Curr. Dir. Psychol. Sci. 19(1), 51–57 (2010)
Cutler, A., Breiman, L.: Archetypal analysis. Technometrics 36(4), 338–347 (1994)
D’Esposito, M., Palumbo, F., Ragozini, G.: Interval archetypes: a new tool for interval data analysis. Stat. Anal. Data Min. 5(4), 322–335 (2012)
D’Esposito, M.R., Palumbo, F., Ragozini, G.: Archetypal Symbolic Objects, pp. 41–49. Springer, Berlin (2013)
Diday, E.: Optimization in non-hierarchical clustering. Pattern Recognit. 6(1), 17–33 (1974)
Diday, E.: Categorization in symbolic data analysis. In: Cohen, H., Lefebvre, C. (eds.) Handbook of Categorization in Cognitive Science, pp. 845–867. Elsevier Science Ltd, Oxford (2005)
Edelsbrunner, H.: Algorithms in Combinatorial Geometry, vol. 10. Springer-Verlag, Berlin (1987)
Eugster, M., Leisch, F., Seth, S.: Archetypes: archetypal analysis. In: R Package Version, pp. 2–2 (2014)
Fordellone, M., Palumbo, F.: Prototypes definition through consensus analysis between fuzzy c-means and archetypal analysis. Ital. J. Appl. Stat. 26(2), 141–162 (2014)
Forina, M., Armanino, C., Castino, M., Ubigli, M.: Multivariate data analysis as a discriminating method of the origin of wines. Vitis 25, 189–201 (1986)
Frigui, H., Nasraoui, O.: Unsupervised learning of prototypes and attribute weights. Pattern Recognit. 37(3), 567–581 (2004)
Gärdenfors, P.: Conceptual Spaces: The Geometry of Thought. A Bradford Book, vol. 3, p. 16. MIT Press (2000)
Gärdenfors, P.: Concept Learning and Nonmonotonic Reasoning, pp. 823–843. Elsevier (2005)
Grolemund, G., Wickham, H.: A cognitive interpretation of data analysis. Int. Stat. Rev. 82(2), 184–204 (2014)
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer (2011)
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. (CSUR) 31(3), 264–323 (1999)
Lakoff, G.: Cognitive semantics. Mean. Ment. Represent. 8(1), 119–154. https://escholarship.org/uc/item/04086580 (1988)
Lesot, M., Kruse, R.: Typicality degrees and fuzzy prototypes for clustering. In: Advances in Data Analysis, pp. 107–114. Springer (2007)
Lesot, M., Rifqi, M., Bouchon-Meunier, B.: Fuzzy prototypes: from a cognitive view to a machine learning principle. In: Fuzzy Sets and Their Extensions: Representation, Aggregation and Models, pp. 431–452. Springer (2008)
Okabe, A., Boots, B., Sugihara, K.: Nearest neighbourhood operations with generalized voronoi diagrams: a review. Int. J. Geogr. Inf. Syst. 8(1), 43–71 (1994)
Papadopoulou, E., Lee, D.T.: The Hausdorff Voronoi diagram of polygonal objects: a divide and conquer approach. Int. J. Comput. Geom. Appl. 14(06), 421–452 (2004)
Porzio, G.C., Ragozini, G., Vistocco, D.: On the use of archetypes as benchmarks. Appl. Stoch. Model. Bus. Ind. 25, 419–437 (2008)
Ragozini, G., Palumbo, F., D’Esposito, M.R.: Archetypal analysis for data-driven prototype identification. Stat. Anal. Data Min.: ASA Data Sci. J. (2016)
Rifqi, M.: Constructing prototypes from large databases. In: International conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems. IPMU’96, Granada, Spain. https://hal.archives-ouvertes.fr/hal-01075383 (1996)
Robert, S.: Categorization, reasoning and memory from a neo-logical point of view. In: Cohen, H., Lefebvre, C. (eds.) Handbook of Categorization in Cognitive Science, pp. 700-718. Elsevier (2005)
Rosch, E.: Natural categories. Cogn. Psychol. 4(3), 328–350 (1973)
Rosch, E.: Prototype classification and logical classification: the two systems. In: New Trends in Conceptual Representation: Challenges to Piaget’s Theory, pp. 73–86 (1983)
Siemens, G.: Connectivism: A Learning Theory for the Digital Age. http://er.dut.ac.za/handle/123456789/69 (2014)
Watson, D.F.: Computing the n-dimensional delaunay tessellation with application to Voronoi polytopes. Comput. J. 24(2), 167–172 (1981)
Zenker, F., Gärdenfors, P. (ed.): Applications of Conceptual Spaces: The Case for Geometric Knowledge Representation. Springer, Berlin; Synthese Library: Studies in Epistemology, Logic, Methodology, and Philosophy of Science. 24(2), 2–10. Springer (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Santelli, F., Palumbo, F., Ragozini, G. (2019). Statistical Archetypal Analysis for Cognitive Categorization. In: Petrucci, A., Racioppi, F., Verde, R. (eds) New Statistical Developments in Data Science. SIS 2017. Springer Proceedings in Mathematics & Statistics, vol 288. Springer, Cham. https://doi.org/10.1007/978-3-030-21158-5_7
Download citation
DOI: https://doi.org/10.1007/978-3-030-21158-5_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-21157-8
Online ISBN: 978-3-030-21158-5
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)