Extracting Wikipedia Data to Enrich Spatial Information

  • Conference paper
Innovations for Community Services (I4CS 2017)

Part of the book series: Communications in Computer and Information Science (CCIS, volume 717)

Abstract

Freely available geo data allow developers to create new kinds of services related to the user’s location. Even though current geo data sources offer high coverage and quality, they do not contain all the information that new services require, because they usually focus on object geometries and object types. Important attributes are often missing. For example, city entries mainly contain the city name and border, but not the name of the mayor, the amount of taxes, the year of foundation, the number of districts, etc. These data are available in online encyclopedias such as Wikipedia, but there is no obvious way to relate the two sources. Our objective was thus to create an automatic import that takes Wikipedia articles describing geo objects and extracts all relevant data. To extract processable values, we identify property types such as dates, money values, powers, heights, sizes, etc. This makes it possible to use these data for further computation, e.g., to search for maxima, compute averages and sums, or formulate comparative conditions in queries.
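
The idea of typed property values can be illustrated with a minimal sketch. It is not the paper's implementation: the unit table, the function names `parse_value` and `largest_by`, the input format (ISO dates, a number followed by a unit), and the sample objects are all assumptions made for illustration.

```python
import re
from datetime import date

# Hypothetical unit table: unit -> (property type, factor to a base unit).
UNITS = {
    "m": ("length", 1.0),
    "km": ("length", 1000.0),
    "km2": ("area", 1.0),
    "EUR": ("money", 1.0),
    "MW": ("power", 1.0),
}

# A number (optionally with thousands separators and decimals) plus a unit.
NUMBER_WITH_UNIT = re.compile(r"(\d[\d,]*(?:\.\d+)?)\s*([A-Za-z]\w*)")


def parse_value(text):
    """Return (property_type, value); fall back to ("text", text)."""
    text = text.strip()
    try:
        return ("date", date.fromisoformat(text))
    except ValueError:
        pass  # not an ISO date, try number + unit next
    match = NUMBER_WITH_UNIT.fullmatch(text)
    if match and match.group(2) in UNITS:
        kind, factor = UNITS[match.group(2)]
        number = float(match.group(1).replace(",", ""))
        return (kind, number * factor)
    return ("text", text)


def largest_by(objects, prop):
    """Name of the object with the largest numeric value for `prop`."""
    typed = {
        name: parse_value(props[prop])
        for name, props in objects.items()
        if prop in props
    }
    numeric = {
        name: value
        for name, (kind, value) in typed.items()
        if kind not in ("text", "date")
    }
    return max(numeric, key=numeric.get)


# Hypothetical geo objects with raw property strings.
cities = {
    "City A": {"area": "120 km2", "founded": "1237-10-28"},
    "City B": {"area": "95.5 km2", "mayor": "Jane Doe"},
}
print(largest_by(cities, "area"))
```

Once values carry a type and a normalized number, comparisons such as a maximum search no longer depend on how the raw string was written.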

Author information

Corresponding author

Correspondence to Jörg Roth.

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Roth, J. (2017). Extracting Wikipedia Data to Enrich Spatial Information. In: Eichler, G., Erfurth, C., Fahrnberger, G. (eds) Innovations for Community Services. I4CS 2017. Communications in Computer and Information Science, vol 717. Springer, Cham. https://doi.org/10.1007/978-3-319-60447-3_1

  • DOI: https://doi.org/10.1007/978-3-319-60447-3_1

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-60446-6

  • Online ISBN: 978-3-319-60447-3

  • eBook Packages: Computer Science (R0)
