Skip to main content

Natural Language Technology for Information Integration in Business Intelligence

  • Conference paper
Book cover Business Information Systems (BIS 2007)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4439))

Included in the following conference series:

Abstract

Business intelligence requires the collecting and merging of information from many different sources, both structured and unstructured, in order to analyse for example financial risk, operational risk factors, follow trends and perform credit risk management. While traditional data mining tools make use of numerical data and cannot easily be applied to knowledge extracted from free text, traditional information extraction is either not adapted for the financial domain, or does not address the issue of information integration: the merging of information from different kinds of sources. We describe here the development of a system for content mining using domain ontologies, which enables the extraction of relevant information to be fed into models for analysis of financial and operational risk and other business intelligence applications such as company intelligence, by means of the XBRL standard. The results so far are of extremely high quality, due to the implementation of primarily high-precision rules.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ahmad, K., Gillam, L., Cheng, D.: Sentiments on a grid: Analysis of streaming news and views. In: 5th Language Resources and Evaluation Conference (2006)

    Google Scholar 

  2. Appelt, D.E., et al.: Description of the JV-FASTUS system as used for MUC-5. In: Proceedings of the Fourth Message Understanding Conference MUC-5, pp. 221–235. Morgan Kaufmann, San Francisco (1993)

    Chapter  Google Scholar 

  3. Baumgartner, R., et al.: Web data extraction for business intelligence: the lixto approach. In: Proc. of BTW (2005)

    Google Scholar 

  4. Chinchor, N.: Muc-4 evaluation metrics. In: Proceedings of the Fourth Message Understanding Conference, pp. 22–29 (1992)

    Google Scholar 

  5. Cunningham, H., et al.: GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications. In: Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics (ACL’02) (2002)

    Google Scholar 

  6. Cunningham, H., Maynard, D., Tablan, V.: JAPE: a Java Annotation Patterns Engine (2nd edn.). Research Memorandum CS–00–10, Department of Computer Science, University of Sheffield (November 2000)

    Google Scholar 

  7. Declerck, T., Krieger, H.: Translating XBRL into Description Logic: an approach using Protege, Sesame and OWL. In: Proceedings of Business Information Systems (BIS), Klagenfurt, Germany (2006)

    Google Scholar 

  8. Ellingsworth, M., Sullivan, D.: Text mining improves business intelligence and predictive modeling in insurance. DM Review Magazine (2003)

    Google Scholar 

  9. Nie, J.-Y., Paradis, F., Tajarobi, A.: Discovery of business opportunities on the internet with information extraction. In: Workshop on Multi-Agent Information Retrieval and Recommender Systems (IJCAI), Edinburgh, Scotland, pp. 47–54 (2005)

    Google Scholar 

  10. Fornasari, F., et al.: Xbrl web-based business intelligence services. In: Cunningham, P., Cunningham, M. (eds.) Innovation and the Knowledge Economy: Issues, Applications, Case Studies. Proceedings of eChallenge 2005, IOS Press, Amsterdam (2005)

    Google Scholar 

  11. Gaizauskas, R., Wilks, Y.: Information Extraction: Beyond Document Retrieval. Journal of Documentation 54(1), 70–105 (1998)

    Article  Google Scholar 

  12. Jacobs, P.S., Rau, L.F.: Scisor: Extracting information from on-line news. Communications of the ACM 33(11), 88–97 (1990)

    Article  Google Scholar 

  13. Maynard, D., Bontcheva, K., Cunningham, H.: Towards a semantic extraction of Named Entities. In: Recent Advances in Natural Language Processing, Bulgaria (2003)

    Google Scholar 

  14. Maynard, D., Peters, W., Li, Y.: Metrics for evaluation of ontology-based information extraction. In: WWW 2006 Workshop on “Evaluation of Ontologies for the Web” (EON), Edinburgh, Scotland (2006)

    Google Scholar 

  15. Maynard, D., et al.: Rapid customisation of an Information Extraction system for surprise languages. Special issue of ACM Transactions on Asian Language Information Processing: Rapid Development of Language Capabilities: The Surprise Languages (2003)

    Google Scholar 

  16. Maynard, D., et al.: Named Entity Recognition from Diverse Text Types. In: Recent Advances in Natural Language Processing 2001 Conference, Tzigov Chark, Bulgaria, pp. 257–274 (2001), http://gate.ac.uk/sale/ranlp2001/maynard-etal.pdf

  17. Maynard, D., et al.: Ontology-based information extraction for market monitoring and technology watch. In: ESWC Workshop “End User Apects of the Semantic Web”, Heraklion, Crete (2005)

    Google Scholar 

  18. Montes, J.: Consumer entertainment software - industry trends. In: Stanford-Smith, B., Chozza, E. (eds.) E-Work and E-Commerce, IOS Press, Amsterdam (2001)

    Google Scholar 

  19. Popov, B., et al.: KIM – Semantic Annotation Platform. In: Natural Language Engineering (2004)

    Google Scholar 

  20. van Rijsbergen, C.J.: Information Retrieval. Butterworths, London (1979)

    Google Scholar 

  21. Wilks, Y., Catizone, R.: Can We Make Information Extraction More Adaptive? In: Pazienza, M.T. (ed.) SCIE 1999. LNCS (LNAI), vol. 1714, pp. 1–16. Springer, Heidelberg (1999)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Witold Abramowicz

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer Berlin Heidelberg

About this paper

Cite this paper

Maynard, D., Saggion, H., Yankova, M., Bontcheva, K., Peters, W. (2007). Natural Language Technology for Information Integration in Business Intelligence. In: Abramowicz, W. (eds) Business Information Systems. BIS 2007. Lecture Notes in Computer Science, vol 4439. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72035-5_28

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-72035-5_28

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-72034-8

  • Online ISBN: 978-3-540-72035-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics