Abstract
Wikipedia hosts a large, converged body of data, with millions of contributions in more than 287 languages. Its content changes rapidly and continuously, with thousands of edits every hour, which creates many challenges in controlling, linking, and balancing article content across language editions. This paper proposes processes for enriching Wikipedia content by retrieving semantic relations through the alignment of infobox properties with Wikidata properties in various languages. The outcomes contribute these semantic relations back to Wikidata and to the Wikipedias, especially those written in the Latin alphabet. A case study covers aligning biological infoboxes and detecting missing interwiki links for biological species between the Vietnamese Wikipedia and the English Wikipedia.
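The idea behind the case study can be illustrated with a minimal sketch. It is not the authors' implementation, which the abstract does not describe. It assumes that species articles in both editions expose a Latin scientific name in their infobox, and that a pair of articles sharing that name but lacking an interwiki link is a candidate for a missing link. The function names, the input dictionaries, and the sample data are all hypothetical.

```python
from collections import defaultdict


def normalize(name: str) -> str:
    """Lowercase and collapse whitespace so 'Panthera  Leo' matches 'panthera leo'."""
    return " ".join(name.lower().split())


def find_missing_links(vi_species, en_species, existing_links):
    """Propose (vi_title, en_title) pairs that share a scientific name but are not linked.

    vi_species / en_species: dict of article title -> scientific name taken
    from the infobox (hypothetical, pre-extracted input).
    existing_links: set of (vi_title, en_title) pairs that are already linked.
    """
    # Index English articles by normalized scientific name.
    en_by_name = defaultdict(list)
    for title, name in en_species.items():
        en_by_name[normalize(name)].append(title)

    candidates = []
    for vi_title, name in vi_species.items():
        matches = en_by_name.get(normalize(name), [])
        # Only propose a link when exactly one English article matches.
        if len(matches) == 1 and (vi_title, matches[0]) not in existing_links:
            candidates.append((vi_title, matches[0]))
    return sorted(candidates)


# Hypothetical sample data for illustration only.
vi = {"Sư tử": "Panthera leo", "Hổ": "Panthera tigris", "Báo hoa mai": "Panthera pardus"}
en = {"Lion": "Panthera leo", "Tiger": "panthera  tigris", "Jaguar": "Panthera onca"}
linked = {("Sư tử", "Lion")}

missing = find_missing_links(vi, en, linked)
print(missing)  # [('Hổ', 'Tiger')]
```

Matching on the Latin scientific name is attractive for species because it is language-independent, which is one reason Latin-alphabet editions are the easiest targets. Ambiguous matches are skipped here rather than guessed, so a human editor would still need to review them.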
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Ta, T.H., Anutariya, C. (2015). A Model for Enriching Multilingual Wikipedias Using Infobox and Wikidata Property Alignment. In: Supnithi, T., Yamaguchi, T., Pan, J., Wuwongse, V., Buranarach, M. (eds) Semantic Technology. JIST 2014. Lecture Notes in Computer Science, vol 8943. Springer, Cham. https://doi.org/10.1007/978-3-319-15615-6_25
DOI: https://doi.org/10.1007/978-3-319-15615-6_25
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-15614-9
Online ISBN: 978-3-319-15615-6
eBook Packages: Computer Science (R0)