Skip to main content

Big Scale Text Analytics and Smart Content Navigation

  • Conference paper
  • First Online:
Enabling Real-Time Business Intelligence (BIRTE 2014, BIRTE 2013)

Abstract

Identifying and exploring relevant content in growing document collections is a challenge for researchers, users, and system providers alike. Supporting this is crucial for companies offering knowledge in the form of documents as their core product. Our demo shows an intelligent way of doing guided research in big text collections, using the collection of the major scientific publisher Springer SBM as an example data set. We use the SAP HANA platform for flexible text analysis, ad-hoc calculations and data linkage, in order to enhance the experience of users navigating and exploring publications. We integrate unstructured data (textual documents) and structured data (document metadata and web server logs), and provide interactive filters in order to enable a responsive user experience while searching for relevant content. With HANA, we are able to implement this functionality over big data on a single machine by making use of HANA’s SQL data store and the built-in application server.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 34.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 44.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Published by Springer Science+Business Media [2].

References

  1. Plattner, H., Zeier, A.: In-Memory Data Management: An Inflection Point for Enterprise Applications. Springer, Berlin (2011)

    Google Scholar 

  2. SpringerLink Corpus (2013). http://link.springer.com

  3. Wikipedia Encyclopedia API (2013). https://www.mediawiki.org/wiki/API

Download references

Acknowledgements

The authors would like to thank the whole Strategic Projects Team SAP/Walldorf, especially Spyridon Antonopoulos, Jens Böning, Fredrick Chew, Enno Folkerts, Christian Heller, Nick Lanham, Andrew McCormick-Smith, Martin Sommer, Frederik Transier, and Patrick Zamzow. Furthermore, we thank Springer for their support.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Karsten Schmidt .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Schmidt, K., Bächle, S., Scholl, P., Nold, G. (2015). Big Scale Text Analytics and Smart Content Navigation. In: Castellanos, M., Dayal, U., Pedersen, T., Tatbul, N. (eds) Enabling Real-Time Business Intelligence. BIRTE BIRTE 2014 2013. Lecture Notes in Business Information Processing, vol 206. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-46839-5_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-46839-5_12

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-46838-8

  • Online ISBN: 978-3-662-46839-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics