Abstract
Missing values are a data quality problem affecting almost every type of real world datasets. Since poor data quality has a direct impact on organisational success, there is a dire need to eradicate missing values as a way to minimise costs and increase efficiency in companies. There are different methods to deal with missing values including the Imputation Methods, which try to compute an accurate estimation of missing values using the rest of the information available. This article devises the potential of Semantic Technologies towards the solution of the limitations of current Imputation Methods by proposing alternative approaches.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Esnaola-Gonzalez, I., Bermúdez, J., Fernandez, I., Arnaiz, A.: EEPSA as a core ontology for energy efficiency and thermal comfort in buildings. Appl. Ontol. 16(2), 193–228 (2021). https://doi.org/10.3233/AO-210245
Farhangfar, A., Kurgan, L.A., Pedrycz, W.: A novel framework for imputation of missing values in databases. IEEE Trans. Syst. Man Cybern.-Part A: Syst. Hum. 37(5), 692–709 (2007)
Friedman, T., Smith, M.: Measuring the business value of data quality. Tech. Rep. G00218962 (2011). https://www.data.com/export/sites/data/common/assets/pdf/DS_Gartner.pdf
Fürber, C., Hepp, M.: Using semantic web resources for data quality management. In: Cimiano, P., Pinto, H.S. (eds.) EKAW 2010. LNCS (LNAI), vol. 6317, pp. 211–225. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-16438-5_15
Garciarena, U., Santana, R.: An extensive analysis of the interaction between missing data types, imputation methods, and supervised classifiers. Expert Syst. Appl. 89, 52–65 (2017)
Haller, A., et al.: The modular SSN ontology: a joint W3C and OGC standard specifying the semantics of sensors, observations, sampling, and actuation. Semant. Web 10(1), 9–32 (2019). https://doi.org/10.3233/SW-180320
Kontokostas, D., Zaveri, A., Auer, S., Lehmann, J.: TripleCheckMate: a tool for crowdsourcing the quality assessment of linked data. In: Klinov, P., Mouromtsev, D. (eds.) KESW 2013. CCIS, vol. 394, pp. 265–272. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-41360-5_22
Little, R.J., Rubin, D.B.: Statistical Analysis with Missing Data, vol. 793. Wiley, Hoboken (2002). https://doi.org/10.1002/9781119013563
Luengo, J., García, S., Herrera, F.: On the choice of the best imputation methods for missing values considering three groups of classification methods. Knowl. Inf. Syst. 32(1), 77–108 (2012). https://doi.org/10.1007/s10115-011-0424-2
Marcus, G.: The next decade in AI: four steps towards robust artificial intelligence. arXiv preprint arXiv:2002.06177 (2020)
Rasmussen, M.H., Lefrançois, M., Schneider, G., Pauwels, P.: BOT: the building topology ontology of the W3C linked building data group. Semant. Web 12, 143–161 (2021). https://doi.org/10.3233/SW-200385
Redman, T.C.: The impact of poor data quality on the typical enterprise. Commun. ACM 41(2), 79–82 (1998)
Schmidt, J., Marques, M.R., Botti, S., Marques, M.A.: Recent advances and applications of machine learning in solid-state materials science. npj Comput. Mater. 5(1), 1–36 (2019)
Acknowledgement
This work has received funding from ELKARTEK project KK-2020/00049 3KIA of the Basque Government, REACT project from the European Union’s Horizon 2020 research and innovation programme under grant agreement no. 824395, FEDER/TIN2016-78011-C4-2-R, TIN2016-78365-R (Spanish Ministry of Economy, Industry and Competitiveness) and IT-1244-19 (Basque Government). Unai Garciarena holds a predoctoral grant from the University of the Basque Country (ref. PIF16/238).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Esnaola-Gonzalez, I., Garciarena, U., Bermúdez, J. (2021). Semantic Technologies Towards Missing Values Imputation. In: Fujita, H., Selamat, A., Lin, J.CW., Ali, M. (eds) Advances and Trends in Artificial Intelligence. Artificial Intelligence Practices. IEA/AIE 2021. Lecture Notes in Computer Science(), vol 12798. Springer, Cham. https://doi.org/10.1007/978-3-030-79457-6_16
Download citation
DOI: https://doi.org/10.1007/978-3-030-79457-6_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-79456-9
Online ISBN: 978-3-030-79457-6
eBook Packages: Computer ScienceComputer Science (R0)