Abstract
The aim of Symbolic Data Analysis (SDA) is to provide a set of techniques to summarize large data sets into smaller ones called symbolic data tables. This paper considers a kind of symbolic data called Interval-Valued Data (IVD) which stores data intrinsic variability and/or uncertainty from the original data set. Recent works have been proposed to fit the classic linear regression model to symbolic data. However, those works do not consider the presence of symbolic data outliers. Generally, most specialists treat outliers as errors and discard them. Nevertheless, a single interval-data outlier holds significant information which should not be discarded or ignored. This work introduces a prediction method for IVD based on the symmetrical linear regression (SLR) analysis whose response model is less susceptible to the IVD outliers. The model considers a symmetrical distribution for error which allows to the model possibility of applying regular statistical hypothesis tests.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Diday, E., Noirhomme-Fraiture, M.: Symbolic Data Analysis and the SODAS Software. Wiley, West Sussex (2008)
Billard, L., Diday, E.: Regression Analysis for Interval-Valued Data. In: 7th Conf. of Int. Fed. of Classif. Soc., pp. 369–374. Springer, Belgium (2000)
Billard, L., Diday, E.: Symbolic Regression Analysis. In: 8th Conf. of Int. Fed. of Classif. Soc., pp. 281–288. Springer, Poland (2002)
Billard, L., Diday, E.: Symbolic Data Analysis: Conceptual Statistics and Data Mining. Wiley, West Sussex, England (2006)
Lima Neto, E.A., De Carvalho, F.A.T.: Centre and Range method for fitting a linear regression model to symbolic interval data. CSDA 52, 1500–1515 (2008)
Montgomery, D.C., Peck, E.A.: Introduction to Linear Regression Analysis. Wiley, New York (1982)
Fang, K.T., Kotz, S., Ng, K.W.: Symmetric Multivariate and Related Distributions. Chapman and Hall, London (1990)
Cysneiros, F.J.A., Paula, G.A.: Restricted methods in symmetrical linear regression models. Comp. Stat. and Data Analysis 49, 689–708 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Domingues, M.A.O., de Souza, R.M.C.R., Cysneiros, F.J.A. (2009). A Symmetrical Model Applied to Interval-Valued Data Containing Outliers with Heavy-Tail Distribution. In: Köppen, M., Kasabov, N., Coghill, G. (eds) Advances in Neuro-Information Processing. ICONIP 2008. Lecture Notes in Computer Science, vol 5507. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03040-6_3
Download citation
DOI: https://doi.org/10.1007/978-3-642-03040-6_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03039-0
Online ISBN: 978-3-642-03040-6
eBook Packages: Computer ScienceComputer Science (R0)