Abstract
With the advancements of new technologies, a large volume of digital data is getting generated every second from various internal and external sources like social networking, organizations and any business applications. Big data refers to enormous digital data that are high in volume, velocity, varieties. The traditional conventional approach fails to handle large data sets using their tools and techniques. Big data proved to be an effective mean for collecting, analyzing and processing data despite their size and data formats structured, semi-structured or unstructured. Large set of information and data are produced from different organizations and social activities. Text mining or text analytics plays a significant role in deriving relevant information from text in digital environment. Text mining includes technique like entity extraction which automatically extracts structured information from unstructured or semi-structured documents. This paper details how entity extraction is useful in processing human language texts by using natural language processing. Entity extraction based on method like part-of-speech tagging which helps in determining the noun, verb, adverb and adjectives associated with a sentence. Enhanced entity extraction method will be mainly useful for filtering entities based on their part-of-speeches by removing any ambiguities. Entity extraction focuses on relevant parts of a document and represents them in a structured manner.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
M.K. Shilpa, Big data and methodology-a review. Int. J. Adv. Res. Comput. Sci. Software Eng. 3(10), 991–995 (2013)
M. song et al., PKDE4J: Entity and relation extraction for public knowledge discovery. J. Biomed. Inf. 57, 320–332 (2015)
N. Ranjan, A. Gupta, I. Dhumale, Text analytics and classification techniques for text document. IJDR 5, 5953–5955 (2015)
D.D.A. Bui, G. Del Fiol, S. Jonnalagadda, PDF text classification to leverage information extraction from publication reports. J Biomed. Inf. 61, 141–148 (2016)
C. Bøhn, K. Nørvåg, Extracting named entities and synonyms from wikipedia. 2010 24th IEEE International Conference on Advanced Information Networking and Applications (AINA). IEEE (2010)
S.R. Kundeti, J. Vijayananda, S. Mujjiga, M. Kalyan, Clinical named entity recognition: challenges and opportunities. 2016 IEEE International Conference on Big Data (Big Data). IEEE (2016)
R. Sharnagat, Named Entity Recognition: A Literature Survey. Center For Indian Language Technology (2014)
N. Patil, N. Patil, B.V. Pawar, Survey of named entity recognition systems with respect to Indian and foreign languages. Int. J. Comput. Appl. 134(16) (2016)
P. Sutheebanjard, W. Premchaiswadi, Thai personal named entity extraction without using word segmentation or POS tagging. Natural Language Processing, 2009, SNLP’09, Eighth International Symposium on. IEEE (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Dash, A., Pandey, M., Rautaray, S. (2019). Enhanced Entity Extraction Using Big Data Mechanics. In: Kamal, R., Henshaw, M., Nair, P. (eds) International Conference on Advanced Computing Networking and Informatics. Advances in Intelligent Systems and Computing, vol 870. Springer, Singapore. https://doi.org/10.1007/978-981-13-2673-8_8
Download citation
DOI: https://doi.org/10.1007/978-981-13-2673-8_8
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-2672-1
Online ISBN: 978-981-13-2673-8
eBook Packages: EngineeringEngineering (R0)