ProMine: A Text Mining Solution for Concept Extraction and Filtering

Gillani, Saira; Kő, Andrea

doi:10.1007/978-3-319-28917-5_3

ProMine: A Text Mining Solution for Concept Extraction and Filtering

Saira Gillani⁵ &
Andrea Kő⁵

Chapter
First Online: 20 April 2016

1231 Accesses
5 Citations

Part of the book series: Knowledge Management and Organizational Learning ((IAKM,volume 2))

Abstract

Due to the on-going economic crisis, the management of organizational knowledge is becoming more and more important. This knowledge resides in organizational processes. The extraction of this hidden knowledge from the business processes and the usage of this knowledge for domain ontology development is a major challenge. This chapter presents ProMine, a text mining ontology extraction tool that extracts deep representations from the business processes. ProMine extracts new domain related concepts and proposes a new filtering mechanism based on a new hybrid similarity measure to filter most relevant concepts. The tool is evaluated through a case study of the insurance domain. The results showed that ProMine performance is good and it generates many new concepts against each business process.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

1.
EUREKA_HU_12-1-2012-0039, supported by the Research and Technology Innovation Fund, New Széchenyi Plan, Hungary.

References

Auer, S. (2005). Powl–a web based platform for collaborative semantic web development. Paper presented at the Proceedings of the Workshop Scripting for the Semantic Web.
Google Scholar
Barforush, A. A., & Rahnama, A. (2012). Ontology learning: Revisted. Journal of Web Engineering, 11(4), 269–289.
Google Scholar
Bekkerman, R., El-Yaniv, R., Tishby, N., & Winter, Y. (2001). On feature distributional clustering for text categorization. Paper presented at the Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.
Google Scholar
Buitelaar, P., & Sacaleanu, B. (2001). Ranking and selecting synsets by domain relevance. Paper presented at the Proceedings of WordNet and Other Lexical Resources: Applications, Extensions and Customizations, NAACL 2001 Workshop.
Google Scholar
Cimiano, P., & Völker, J. (2005). Text2Onto. Natural language processing and information systems. Paper presented at the 10th International Conference on Applications of Natural Language to Information Systems, NLDB 2005, Alicante, Spain, June 15–17, 2005. Proceedings, of Lecture Notes in Computer Science (Edited by: Montoyo A, Muñoz R, Métais E).
Google Scholar
Dagan, I., Pereira, F., & Lee, L. (1994). Similarity-based estimation of word cooccurrence probabilities. Paper presented at the Proceedings of the 32nd annual meeting on Association for Computational Linguistics.
Google Scholar
Euzenat, J., & Shvaiko, P. (2007). Ontology matching (Vol. 333). Berlin: Springer.
Google Scholar
Farquhar, A., Fikes, R., & Rice, J. (1997). The ontolingua server: A tool for collaborative ontology construction. International Journal of Human-Computer Studies, 46(6), 707–727.
Article Google Scholar
Formica, A. (2008). Concept similarity in formal concept analysis: An information content approach. Knowledge-Based Systems, 21(1), 80–87.
Article Google Scholar
Gacitua, R., Sawyer, P., & Rayson, P. (2008). A flexible framework to experiment with ontology learning techniques. Knowledge-Based Systems, 21(3), 192–199.
Article Google Scholar
George, P., Vangelis, K., Anastasia, K., Georgios, P., & Constantine, S. D. (2009, June). Semi-automated ontology learning: The boemie approach. In Proceedings of the First ESWC Workshop on Inductive Reasoning and Machine Learning on the Semantic Web, Heraklion, Greece.
Google Scholar
Ghadfi, S., Béchet, N., & Berio, G. (2014). Building ontologies from textual resources: A pattern based improvement using deep linguistic information. Paper presented at the Proceedings of the 5th Workshop on Ontology and Semantic Web Patterns (WOP2014), Riva del Garda, Italy.
Google Scholar
Gillani, S. A., & Kő, A. (2014). Process-based knowledge extraction in a public authority: A text mining approach. In Electronic government and the information systems perspective (pp. 91–103). Cham: Springer International Publishing.
Google Scholar
Gruber, T. R. (1993). A translation approach to portable ontology specifications. Knowledge Acquisition, 5(2), 199–220.
Article Google Scholar
Guo, W., & Diab, M. (2012). A simple unsupervised latent semantics based approach for sentence similarity. Paper presented at the Proceedings of the First Joint Conference on Lexical and Computational Semantics-Volume 1: Proceedings of the Main Conference and the Shared Task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation.
Google Scholar
Islam, N., Siddiqui, M. S., & Shaikh, Z. (2010). TODE: A Dot Net based tool for ontology development and editing. Paper presented at the 2nd International Conference on Computer Engineering and Technology (ICCET).
Google Scholar
Jiang, X., & Tan, A. H. (2010). CRCTOL: A semantic‐based domain ontology learning system. Journal of the American Society for Information Science and Technology, 61(1), 150–168.
Article Google Scholar
Kang, Y.-B., Haghighi, P. D., & Burstein, F. (2014). CFinder: An intelligent key concept finder from text for ontology development. Expert Systems with Applications, 41(9), 4494–4504.
Article Google Scholar
Landauer, T. K., Foltz, P. W., & Laham, D. (1998). An introduction to latent semantic analysis. Discourse Processes, 25(2–3), 259–284.
Article Google Scholar
Lindén, K., & Piitulainen, J. O. (2004). Discovering synonyms and other related words. Paper presented at the Proceedings of COLING 2004 CompuTerm 2004: 3rd International Workshop on Computational Terminology.
Google Scholar
Lund, K., & Burgess, C. (1996, April). Hyperspace analogue to language (HAL): A general model semantic representation. Brain and Cognition, 30(3), 5–5. 525 B ST, STE 1900, San Diego, CA 92101-4495: Academic press Inc JNL-COMP Subscriptions.
Google Scholar
Luong, H., Wang, Q., & Gauch, S. (2012). Ontology learning using word net lexical expansion and text mining. INTECH Open Access Publisher.
Google Scholar
Maedche, A., & Staab, S. (2000). The text-to-onto ontology learning environment. Paper presented at the Software Demonstration at ICCS-2000-Eight International Conference on Conceptual Structures.
Google Scholar
Maedche, A., & Staab, S. (2004). Ontology learning. In Handbook on ontologies (pp. 173–190). Berlin Heidelberg: Springer.
Chapter Google Scholar
Meng, L., Huang, R., & Gu, J. (2013). A review of semantic similarity measures in wordnet. International Journal of Hybrid Information Technology, 6(1), 1–12.
Google Scholar
Miller, G. A. (1995). WordNet: A lexical database for English. Communications of the ACM, 38(11), 39–41.
Article Google Scholar
Nagar, A., & Al-Mubaid, H. (2008). A new path length measure based on go for gene similarity with evaluation using sgd pathways. Paper presented at the 21st IEEE International Symposium on Computer-Based Medical Systems, 2008. CBMS’08.
Google Scholar
Nie, X., & Zhou, J. (2008). A domain adaptive ontology learning framework. Paper presented at the IEEE International Conference on Networking, Sensing and Control, 2008. ICNSC 2008.
Google Scholar
Noy, N. F., & Musen, M. A. (2003). The PROMPT suite: Interactive tools for ontology merging and mapping. International Journal of Human-Computer Studies, 59(6), 983–1024.
Article Google Scholar
Noy, N. F., Sintek, M., Decker, S., Crubézy, M., Fergerson, R. W., & Musen, M. A. (2001). Creating semantic web contents with protege-2000. IEEE Intelligent Systems, 16(2), 60–71.
Article Google Scholar
Park, J., Cho, W., & Rho, S. (2010). Evaluating ontology extraction tools using a comprehensive evaluation framework. Data and Knowledge Engineering, 69(10), 1043–1061.
Article Google Scholar
Pedersen, T., Pakhomov, S. V., Patwardhan, S., & Chute, C. G. (2007). Measures of semantic similarity and relatedness in the biomedical domain. Journal of Biomedical Informatics, 40(3), 288–299.
Article Google Scholar
Pirró, G. (2009). A semantic similarity metric combining features and intrinsic information content. Data and Knowledge Engineering, 68(11), 1289–1308.
Article Google Scholar
Qin, P., Lu, Z., Yan, Y., & Wu, F. (2009). A new measure of word semantic similarity based on wordnet hierarchy and dag theory. Paper presented at the International Conference on Web Information Systems and Mining, 2009. WISM 2009.
Google Scholar
Rada, R., Mili, H., Bicknell, E., & Blettner, M. (1989). Development and application of a metric on semantic nets. IEEE Transactions on Systems, Man and Cybernetics, 19(1), 17–30.
Article Google Scholar
Raunich, S., & Rahm, E. (2011). ATOM: Automatic target-driven ontology merging. Paper presented at the IEEE 27th International Conference on Data Engineering (ICDE), 2011.
Google Scholar
Resnik, P. (1995, August 20–25). Using information content to evaluate semantic similarity in a taxonomy. In Proceedings of the 14th International Joint Conference on Artificial Intelligence (pp. 448–453). Montreal, QC, Canada.
Google Scholar
Saleena, B., & Srivatsa, S. (2015). Using concept similarity in cross ontology for adaptive e-Learning systems. Journal of King Saud University-Computer and Information Sciences, 27(1), 1–12.
Article Google Scholar
Salton, G., & Michael, J. (1983). Introduction to modern information retrieval (pp. 24–51). New York: McGraw-Hill.
Google Scholar
Sánchez, D., Batet, M., & Isern, D. (2011). Ontology-based information content computation. Knowledge-Based Systems, 24(2), 297–303.
Article Google Scholar
Santoso, H. A., Haw, S.-C., & Abdul-Mehdi, Z. T. (2011). Ontology extraction from relational database: Concept hierarchy as background knowledge. Knowledge-Based Systems, 24(3), 457–464.
Article Google Scholar
Schutz, A., & Buitelaar, P. (2005). Relext: A tool for relation extraction from text in ontology extension. In The semantic web–ISWC 2005 (pp. 593–606). Berlin Heidelberg: Springer.
Chapter Google Scholar
Slimani, T. (2013). Description and evaluation of semantic similarity measures approaches. International Journal of Computer Applications, 80(10), 0975–8887.
Article Google Scholar
Sure, Y., Angele, J., & Staab, S. (2002). OntoEdit: Guiding ontology development by methodology and inferencing. In Proceedings of the International Conference on Ontologies, Databases and Applications of SEmantics ODBASE 2002. Irvine, CA: University of California.
Google Scholar
Sussna, M. J. (1997). Text retrieval using inference in semantic metanetworks.
Google Scholar
Wang, G., Yu, Y., & Zhu, H. (2007). Pore: Positive-only relation extraction from wikipedia text. Berlin: Springer.
Google Scholar
Wu, X., & Bolivar, A. (2008). Keyword extraction for contextual advertisement. Paper presented at the Proceedings of the 17th International Conference on World Wide Web.
Google Scholar
Wu, Z., & Palmer, M. (1994). Verbs semantics and lexical selection. Paper presented at the Proceedings of the 32nd Annual Meeting on Association for Computational Linguistics.
Google Scholar
Yang, Y., & Pedersen, J. O. (1997). A comparative study on feature selection in text categorization. Paper presented at the ICML.
Google Scholar
Zablith, F. (2008). Dynamic ontology evolution. International Semantic Web Conference (ISWC) Doctoral Consortium, Karlsruhe, Germany.
Google Scholar
Zouaq, A. (2011). An overview of shallow and deep natural language processing for ontology learning. Ontology Learning and Knowledge Discovery Using the Web: Challenges and Recent Advances, 2, 16–37.
Article Google Scholar
Zouaq, A., Gasevic, D., & Hatala, M. (2011). Towards open ontology learning and filtering. Information Systems, 36(7), 1064–1081.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Corvinus University of Budapest, Budapest, Hungary
Saira Gillani & Andrea Kő

Authors

Saira Gillani
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Kő
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Saira Gillani .

Editor information

Editors and Affiliations

Corvinno Ltd., Budapest, Hungary
András Gábor
Corvinus University of Budapest, Budapest, Hungary
Andrea Kő

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Gillani, S., Kő, A. (2016). ProMine: A Text Mining Solution for Concept Extraction and Filtering. In: Gábor, A., Kő, A. (eds) Corporate Knowledge Discovery and Organizational Learning. Knowledge Management and Organizational Learning, vol 2. Springer, Cham. https://doi.org/10.1007/978-3-319-28917-5_3

Download citation

DOI: https://doi.org/10.1007/978-3-319-28917-5_3
Published: 20 April 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-28915-1
Online ISBN: 978-3-319-28917-5
eBook Packages: Business and ManagementBusiness and Management (R0)

Publish with us

Policies and ethics