Skip to main content

Semantic Technologies Towards Missing Values Imputation

  • Conference paper
  • First Online:
Advances and Trends in Artificial Intelligence. Artificial Intelligence Practices (IEA/AIE 2021)

Abstract

Missing values are a data quality problem affecting almost every type of real world datasets. Since poor data quality has a direct impact on organisational success, there is a dire need to eradicate missing values as a way to minimise costs and increase efficiency in companies. There are different methods to deal with missing values including the Imputation Methods, which try to compute an accurate estimation of missing values using the rest of the information available. This article devises the potential of Semantic Technologies towards the solution of the limitations of current Imputation Methods by proposing alternative approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Esnaola-Gonzalez, I., Bermúdez, J., Fernandez, I., Arnaiz, A.: EEPSA as a core ontology for energy efficiency and thermal comfort in buildings. Appl. Ontol. 16(2), 193–228 (2021). https://doi.org/10.3233/AO-210245

    Article  Google Scholar 

  2. Farhangfar, A., Kurgan, L.A., Pedrycz, W.: A novel framework for imputation of missing values in databases. IEEE Trans. Syst. Man Cybern.-Part A: Syst. Hum. 37(5), 692–709 (2007)

    Article  Google Scholar 

  3. Friedman, T., Smith, M.: Measuring the business value of data quality. Tech. Rep. G00218962 (2011). https://www.data.com/export/sites/data/common/assets/pdf/DS_Gartner.pdf

  4. Fürber, C., Hepp, M.: Using semantic web resources for data quality management. In: Cimiano, P., Pinto, H.S. (eds.) EKAW 2010. LNCS (LNAI), vol. 6317, pp. 211–225. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-16438-5_15

    Chapter  Google Scholar 

  5. Garciarena, U., Santana, R.: An extensive analysis of the interaction between missing data types, imputation methods, and supervised classifiers. Expert Syst. Appl. 89, 52–65 (2017)

    Article  Google Scholar 

  6. Haller, A., et al.: The modular SSN ontology: a joint W3C and OGC standard specifying the semantics of sensors, observations, sampling, and actuation. Semant. Web 10(1), 9–32 (2019). https://doi.org/10.3233/SW-180320

    Article  Google Scholar 

  7. Kontokostas, D., Zaveri, A., Auer, S., Lehmann, J.: TripleCheckMate: a tool for crowdsourcing the quality assessment of linked data. In: Klinov, P., Mouromtsev, D. (eds.) KESW 2013. CCIS, vol. 394, pp. 265–272. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-41360-5_22

    Chapter  Google Scholar 

  8. Little, R.J., Rubin, D.B.: Statistical Analysis with Missing Data, vol. 793. Wiley, Hoboken (2002). https://doi.org/10.1002/9781119013563

    Book  MATH  Google Scholar 

  9. Luengo, J., García, S., Herrera, F.: On the choice of the best imputation methods for missing values considering three groups of classification methods. Knowl. Inf. Syst. 32(1), 77–108 (2012). https://doi.org/10.1007/s10115-011-0424-2

    Article  Google Scholar 

  10. Marcus, G.: The next decade in AI: four steps towards robust artificial intelligence. arXiv preprint arXiv:2002.06177 (2020)

  11. Rasmussen, M.H., Lefrançois, M., Schneider, G., Pauwels, P.: BOT: the building topology ontology of the W3C linked building data group. Semant. Web 12, 143–161 (2021). https://doi.org/10.3233/SW-200385

    Article  Google Scholar 

  12. Redman, T.C.: The impact of poor data quality on the typical enterprise. Commun. ACM 41(2), 79–82 (1998)

    Article  Google Scholar 

  13. Schmidt, J., Marques, M.R., Botti, S., Marques, M.A.: Recent advances and applications of machine learning in solid-state materials science. npj Comput. Mater. 5(1), 1–36 (2019)

    Article  Google Scholar 

Download references

Acknowledgement

This work has received funding from ELKARTEK project KK-2020/00049 3KIA of the Basque Government, REACT project from the European Union’s Horizon 2020 research and innovation programme under grant agreement no. 824395, FEDER/TIN2016-78011-C4-2-R, TIN2016-78365-R (Spanish Ministry of Economy, Industry and Competitiveness) and IT-1244-19 (Basque Government). Unai Garciarena holds a predoctoral grant from the University of the Basque Country (ref. PIF16/238).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Iker Esnaola-Gonzalez .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Esnaola-Gonzalez, I., Garciarena, U., Bermúdez, J. (2021). Semantic Technologies Towards Missing Values Imputation. In: Fujita, H., Selamat, A., Lin, J.CW., Ali, M. (eds) Advances and Trends in Artificial Intelligence. Artificial Intelligence Practices. IEA/AIE 2021. Lecture Notes in Computer Science(), vol 12798. Springer, Cham. https://doi.org/10.1007/978-3-030-79457-6_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-79457-6_16

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-79456-9

  • Online ISBN: 978-3-030-79457-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics