Skip to main content

Enhanced Entity Extraction Using Big Data Mechanics

  • Conference paper
  • First Online:
International Conference on Advanced Computing Networking and Informatics

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 870))

Abstract

With the advancements of new technologies, a large volume of digital data is getting generated every second from various internal and external sources like social networking, organizations and any business applications. Big data refers to enormous digital data that are high in volume, velocity, varieties. The traditional conventional approach fails to handle large data sets using their tools and techniques. Big data proved to be an effective mean for collecting, analyzing and processing data despite their size and data formats structured, semi-structured or unstructured. Large set of information and data are produced from different organizations and social activities. Text mining or text analytics plays a significant role in deriving relevant information from text in digital environment. Text mining includes technique like entity extraction which automatically extracts structured information from unstructured or semi-structured documents. This paper details how entity extraction is useful in processing human language texts by using natural language processing. Entity extraction based on method like part-of-speech tagging which helps in determining the noun, verb, adverb and adjectives associated with a sentence. Enhanced entity extraction method will be mainly useful for filtering entities based on their part-of-speeches by removing any ambiguities. Entity extraction focuses on relevant parts of a document and represents them in a structured manner.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. M.K. Shilpa, Big data and methodology-a review. Int. J. Adv. Res. Comput. Sci. Software Eng. 3(10), 991–995 (2013)

    Google Scholar 

  2. M. song et al., PKDE4J: Entity and relation extraction for public knowledge discovery. J. Biomed. Inf. 57, 320–332 (2015)

    Article  Google Scholar 

  3. N. Ranjan, A. Gupta, I. Dhumale, Text analytics and classification techniques for text document. IJDR 5, 5953–5955 (2015)

    Google Scholar 

  4. D.D.A. Bui, G. Del Fiol, S. Jonnalagadda, PDF text classification to leverage information extraction from publication reports. J Biomed. Inf. 61, 141–148 (2016)

    Article  Google Scholar 

  5. C. Bøhn, K. Nørvåg, Extracting named entities and synonyms from wikipedia. 2010 24th IEEE International Conference on Advanced Information Networking and Applications (AINA). IEEE (2010)

    Google Scholar 

  6. S.R. Kundeti, J. Vijayananda, S. Mujjiga, M. Kalyan, Clinical named entity recognition: challenges and opportunities. 2016 IEEE International Conference on Big Data (Big Data). IEEE (2016)

    Google Scholar 

  7. R. Sharnagat, Named Entity Recognition: A Literature Survey. Center For Indian Language Technology (2014)

    Google Scholar 

  8. N. Patil, N. Patil, B.V. Pawar, Survey of named entity recognition systems with respect to Indian and foreign languages. Int. J. Comput. Appl. 134(16) (2016)

    Article  Google Scholar 

  9. P. Sutheebanjard, W. Premchaiswadi, Thai personal named entity extraction without using word segmentation or POS tagging. Natural Language Processing, 2009, SNLP’09, Eighth International Symposium on. IEEE (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Adyasha Dash .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Dash, A., Pandey, M., Rautaray, S. (2019). Enhanced Entity Extraction Using Big Data Mechanics. In: Kamal, R., Henshaw, M., Nair, P. (eds) International Conference on Advanced Computing Networking and Informatics. Advances in Intelligent Systems and Computing, vol 870. Springer, Singapore. https://doi.org/10.1007/978-981-13-2673-8_8

Download citation

  • DOI: https://doi.org/10.1007/978-981-13-2673-8_8

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-2672-1

  • Online ISBN: 978-981-13-2673-8

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics