Skip to main content

A Model for Enriching Multilingual Wikipedias Using Infobox and Wikidata Property Alignment

  • Conference paper
  • First Online:
Semantic Technology (JIST 2014)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8943))

Included in the following conference series:

Abstract

Wikipedia supports a large converged data with millions of contributions in more than 287 languages currently. Its content changes rapidly and continuously every hour with thousands of edits which trigger many challenges for Wikipedia in controlling, associating and balancing article content among language editions. This paper provides some processes to enrich Wikipedia content, which will retrieve semantic relations based on alignment between infobox properties and Wikidata properties in various languages. Then, the outcomes mainly contribute these semantic relations back to Wikidata and Wikipedias, especially ones are based on the Latin alphabet. The case study will offer a specific case about aligning biological infoboxes and detecting missing interwiki links of biological species at Vietnamese Wikipedia and English Wikipedia.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Mendes, P.N., Jakob, M., Bizer, C.: In: DBPedia: A Multilingual Cross-Domain Knowledge Base. LREC, pp. 1813–1817. European Language Resources Association ELRA (2012)

    Google Scholar 

  2. Cabrio, E., Cojan, J., Gandon, F., Hallili, A.: Querying multilingual DBpedia with QAKiS. In: Cimiano, P., Fernández, M., Lopez, V., Schlobach, S., Völker, J. (eds.) ESWC 2013. LNCS, vol. 7955, pp. 194–198. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  3. Morsey, M., Lehmann, J., Auer, S., Stadler, C., Hellman, S.: DBPedia and the Live Extraction of Structured Data from Wikipedia. Electronic Library and Information Systems 46(2), 157–181 (2012)

    Article  Google Scholar 

  4. Kim, E.-K., Weild, M., Choi, K.-S.: Metadata synchronization between bilingual resources: case study in wikipedia. In: International Workshop on the Multilingual Semantic Web (2010)

    Google Scholar 

  5. Vrandečić, D., Krötzsch, M.: Wikidata: A Free Collaborative Knowledge Base. Communications of the ACM (2014, to appear)

    Google Scholar 

  6. Nastase, V., Strube, M.: Decoding Wikipedia Categories for Knowledge Acquisition. AAAI (2008)

    Google Scholar 

  7. Lehmann, J., Isele, R., Jakob, M., Jentzsch, A., Kontokostas, D., Mendes, P.N., Hellmann, S., Morsey, M., van Kleef, P., Auer, S., Bizer, C.: DBpedia - A Large-scale, Multilingual Knowledge Base Extracted from Wikipedia. IOS Press (2012)

    Google Scholar 

  8. Nguyen, T., Moreira, V., Nguyen, H., Nguyen, H., Freire, J.: Multilingual schema matching for Wikipedia infoboxes. Proceedings of the VLDB Endowment 5(2), 133–144 (2011)

    Article  Google Scholar 

  9. Palmero Aprosio, A., Giuliano, C., Lavelli, A.: Towards an automatic creation of localized versions of DBpedia. In: Alani, H., Kagal, L., Fokoue, A., Groth, P., Biemann, C., Parreira, J.X., Aroyo, L., Noy, N., Welty, C., Janowicz, K. (eds.) ISWC 2013, Part I. LNCS, vol. 8218, pp. 494–509. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  10. Ponzetto, S.P., Navigli, R.: Large scale taxonomy mapping for restructuring and integrating wikipedia. In: IJCAT 2009 (2009)

    Google Scholar 

  11. Chernov, S., Iofciu, T., Nejdl, W., Zhou, X.: Extracting semantic relationships between wikipedia categories. In: ESWC, June 2006

    Google Scholar 

  12. Adar, E., Skinner, M., Weld, D.S.: Information arbitrage across multi-lingual wikipedia. In: WSDM 2009 Proceedings of the Second ACM International Conference on Web Search and Data Mining, pp. 94–103 (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Thang Hoang Ta .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Ta, T.H., Anutariya, C. (2015). A Model for Enriching Multilingual Wikipedias Using Infobox and Wikidata Property Alignment. In: Supnithi, T., Yamaguchi, T., Pan, J., Wuwongse, V., Buranarach, M. (eds) Semantic Technology. JIST 2014. Lecture Notes in Computer Science(), vol 8943. Springer, Cham. https://doi.org/10.1007/978-3-319-15615-6_25

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-15615-6_25

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-15614-9

  • Online ISBN: 978-3-319-15615-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics