The BioKET Biodiversity Data Warehouse: Data and Knowledge Integration and Extraction

  • Conference paper
Advances in Intelligent Data Analysis XIII (IDA 2014)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 8819)

Abstract

Biodiversity datasets are generally stored in different formats, which makes it difficult for biologists to combine and integrate them in order to retrieve useful information, for example to classify specimens efficiently. In this paper, we present BioKET, a data warehouse that consolidates heterogeneous data sources stored in different formats. For the time being, the scope of BioKET is botanical. Among other things, we had to list all the existing botanical ontologies and relate terms in BioKET to terms in these ontologies. We demonstrate the usefulness of such a resource by applying FIST, a combined biclustering and conceptual association rule extraction method, to a dataset extracted from BioKET in order to analyze the risk status of plants endemic to Laos. In addition, BioKET may be interfaced with other resources, such as GeoCAT, to provide a powerful analysis tool for biodiversity data.

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Inthasone, S., Pasquier, N., Tettamanzi, A.G.B., da Costa Pereira, C. (2014). The BioKET Biodiversity Data Warehouse: Data and Knowledge Integration and Extraction. In: Blockeel, H., van Leeuwen, M., Vinciotti, V. (eds) Advances in Intelligent Data Analysis XIII. IDA 2014. Lecture Notes in Computer Science, vol 8819. Springer, Cham. https://doi.org/10.1007/978-3-319-12571-8_12

  • DOI: https://doi.org/10.1007/978-3-319-12571-8_12

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-12570-1

  • Online ISBN: 978-3-319-12571-8

  • eBook Packages: Computer Science, Computer Science (R0)
