Abstract
Missing values can occur frequently in many real world situations. Such is the case of multi-way data applications, where objects are usually represented by arrays of 2 or more dimensions e.g. biomedical signals that can be represented as time-frequency matrices. This lack of attributes tends to influence the analysis of the data. In classification tasks for example, the performance of classifiers is usually deteriorated. Therefore, it is necessary to address this problem before classifiers are built. Although the absence of values is common in these types of data sets, there are just a few studies to tackle this problem for classification purposes. In this paper, we study two approaches to overcome the missing values problem in dissimilarity-based classification of multi-way data. Namely, imputation by factorization, and a modification of the previously proposed Continuous Multi-way Shape measure for comparing multi-way objects.
Chapter PDF
Similar content being viewed by others
References
Mortensen, P.P., Bro, R.: Real time monitoring and chemical profiling of a cultivation process. Chemometr. Intell. Lab. 84(1-2), 106–113 (2005)
Møller, J., Parolari, G., Gabba, L., Christensen, J., Skibsted, L.: Evaluated surface autofluorescence spectroscopy in order to measure age-related quality index of parma ham during processing. J. Agr. Food Chem. 51, 1224–1230 (2003)
Tomasi, G., Bro, R.: PARAFAC and missing values. Chemometr. Intell. Lab. 75, 163–180 (2005)
Kroonenberg, P.M.: Applied Multiway Data Analysis. John Wiley & Sons (2008)
Acar, E., Dunlavy, D.M., Kolda, T.G., Mrup, M.: Scalable tensor factorizations for incomplete data. Chemometr. Intell. Lab. 106(1), 41–56 (2011)
Walczak, B., Massart, D.: Dealing with missing data: Part I. Chemometr. Intell. Lab. 58, 15–27 (2001)
Smilde, A.K., Bro, R., Geladi, P.: Multi-way Analysis. Applications in the chemical sciences. John Wiley & Sons, Inc. (2004)
Pekalska, E., Duin, R.P.W.: The Dissimilarity Representation For Pattern Recognition. Foundations and Applications. World Scientific (2005)
Porro-Muñoz, D., Duin, R.P.W., Talavera, I., Orozco-Alzate, M.: Classification of three-way data by the dissimilarity representation. Signal Processing 91(11), 2520–2529 (2011)
Millán-Giraldo, M., Duin, R.P.W., Sánchez, J.S.: Dissimilarity-based classification of data with missing attributes. In: Proc. of CIP 2010, pp. 293–298 (2010)
Porro-Muñoz, D., Duin, R.P.W., Orozco-Alzate, M., Talavera Bustamante, I.: Continuous multi-way shape measure for dissimilarity representation. In: Alvarez, L., Mejail, M., Gomez, L., Jacobo, J. (eds.) CIARP 2012. LNCS, vol. 7441, pp. 430–437. Springer, Heidelberg (2012)
Lathauwer, L., De Moor, B.: From matrix to tensor: Multilinear algebra and signal processing. In: Proc. of the 4th International Conference on Mathematics in Signal Processing, Warwick, UK, vol. I, pp. 1–11 (1996)
Gonzalez, R.C., Woods, R.E.: Digital Image Processing, 3rd edn. Prentice-Hall, Inc., Upper Saddle River (2006)
Harshman, R.: Foundations of the Parafac procedure: models and conditions for an explanation multi-modal factor analysis. UCLA Working Papers in Phonetics, Los Angeles 16, 1–84 (1970)
Acar, E., Bro, R., Schmidt, B.: New exploratory clustering tool. Journal of Chemomometrics 22, 91–100 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Porro-Muñoz, D., Duin, R.P.W., Talavera, I. (2013). Missing Values in Dissimilarity-Based Classification of Multi-way Data. In: Ruiz-Shulcloper, J., Sanniti di Baja, G. (eds) Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. CIARP 2013. Lecture Notes in Computer Science, vol 8258. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41822-8_27
Download citation
DOI: https://doi.org/10.1007/978-3-642-41822-8_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41821-1
Online ISBN: 978-3-642-41822-8
eBook Packages: Computer ScienceComputer Science (R0)