Abstract
Originally developed in bioinformatics, sequence analysis is being increasingly used in social sciences for the study of life-course processes. The methodology generally employed consists in computing dissimilarities between the trajectories and, if typologies are sought, in clustering the trajectories according to their similarities or dissemblances. The choice of an appropriate dissimilarity measure is a major issue when dealing with sequence analysis for life sequences. Several dissimilarities are available in the literature, but neither of them succeeds to become indisputable. In this paper, instead of deciding upon one dissimilarity measure, we propose to use an optimal convex combination of different dissimilarities. The optimality is automatically determined by the clustering procedure and is defined with respect to the within-class variance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Elzinga, C.H.: Sequence similarity: a nonaligning technique. Sociological Methods & Research 3270(1), 3–29 (2003)
Robette, N.: Explorer et décrire les parcours de vie: les typologies de trajectoires. CEPED (“Les Clefs pour”), Université Paris Descartes (2011)
Fénelon, J.-P., Grelet, Y., Houzel, Y.: The sequence of steps in the analysis of youth trajectories. European Journal of Economic and Social Systems 14(1), 27–36 (2000)
Olteanu, M., Villa-Vialaneix, N., Cottrell, M.: On-line relational SOM for dissimilarity data. In: Estevez, P.A., Principe, J.C., Zegers, P. (eds.) Advances in Self-Organizing Maps. AISC, vol. 198, pp. 13–22. Springer, Heidelberg (2013)
Needleman, S., Wunsch, C.: A general method applicable to the search for similarities in the amino acid sequence of two proteins. Journal of Molecular Biology 48(3), 443–453 (1970)
Abbott, A., Forrest, J.: Optimal matching methods for historical sequences. Journal of Interdisciplinary History 16, 471–494 (1986)
Abbott, A., Tsay, A.: Sequence analysis and optimal matching methods in sociology. Review and prospect. Sociological Methods and Research 29(1), 3–33 (2000)
Wu, L.: Some comments on “sequence analysis and optimal matching methods in sociology, review and prospect”. Sociological Methods and Research 29(1), 41–64 (2000)
Müller, N.S., Gabadinho, A., Ritschard, G., Studer, M.: Extracting knowledge from life courses: Clustering and visualization. In: Song, I.-Y., Eder, J., Nguyen, T.M. (eds.) DaWaK 2008. LNCS, vol. 5182, pp. 176–185. Springer, Heidelberg (2008)
Lesnard, L.: Setting cost in optimal matching to uncover contempo-raneous socio-temporal patterns. Sociological Methods et Research 38(3), 389–419 (2010)
Elzinga, C.H.: Sequence analysis: metric representations of categorical time series. Sociological Methods and Research (2006)
Kohonen, T.: Self-Organizing Maps, 3rd edn., vol. 30. Springer, Heidelberg (2001)
Massoni, S., Olteanu, M., Rousset, P.: Career-path analysis using optimal matching and self-organizing maps. In: Príncipe, J.C., Miikkulainen, R. (eds.) WSOM 2009. LNCS, vol. 5629, pp. 154–162. Springer, Heidelberg (2009)
Conan-Guez, B., Rossi, F., El Golli, A.: Fast algorithm and implementation of dissimilarity self-organizing maps. Neural Networks 19(6-7), 855–863 (2006)
Mac Donald, D., Fyfe, C.: The kernel self organising map. In: Proceedings of 4th International Conference on Knowledge-Based Intelligence Engineering Systems and Applied Technologies, pp. 317–320 (2000)
Hammer, B., Hasenfuss, A., Strickert, M., Rossi, F.: Topographic processing of relational data. In: Proceedings of the 6th Workshop on Self-Organizing Maps (WSOM 2007), Bielefeld, Germany (September 2007) (to be published)
Olteanu, M., Villa-Vialaneix, N., Cierco-Ayrolles, C.: Multiple kernel self-organizing maps. In: Volume Proceedings of ESANN (2013)
Rakotomamonjy, A., Bach, F.R., Canu, S., Grandvalet, Y.: Simplemkl. Journal of Machine Learning Research 9, 2491–2521 (2008)
R Development Core Team: R: A Language and Environment for Statistical Computing, Vienna, Austria (2012) ISBN 3-900051-07-0
Gabadinho, A., Ritschard, G., Müller, N., Studer, M.: Analyzing and visualizing state sequences in r with traminer. Journal of Statistical Software 40(4), 1–37 (2011)
Pölzlbauer, G.: Survey and comparison of quality measures for self-organizing maps. In: Volume Proceedings of the Fifth Workshop on Data Analysis (WDA 2004), pp. 67–82. Elfa Academic Press (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Massoni, S., Olteanu, M., Villa-Vialaneix, N. (2013). Which Dissimilarity Is to Be Used When Extracting Typologies in Sequence Analysis? A Comparative Study. In: Rojas, I., Joya, G., Gabestany, J. (eds) Advances in Computational Intelligence. IWANN 2013. Lecture Notes in Computer Science, vol 7902. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38679-4_5
Download citation
DOI: https://doi.org/10.1007/978-3-642-38679-4_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38678-7
Online ISBN: 978-3-642-38679-4
eBook Packages: Computer ScienceComputer Science (R0)