Abstract
Due to the on-going economic crisis, the management of organizational knowledge is becoming more and more important. This knowledge resides in organizational processes. The extraction of this hidden knowledge from the business processes and the usage of this knowledge for domain ontology development is a major challenge. This chapter presents ProMine, a text mining ontology extraction tool that extracts deep representations from the business processes. ProMine extracts new domain related concepts and proposes a new filtering mechanism based on a new hybrid similarity measure to filter most relevant concepts. The tool is evaluated through a case study of the insurance domain. The results showed that ProMine performance is good and it generates many new concepts against each business process.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
EUREKA_HU_12-1-2012-0039, supported by the Research and Technology Innovation Fund, New Széchenyi Plan, Hungary.
References
Auer, S. (2005). Powl–a web based platform for collaborative semantic web development. Paper presented at the Proceedings of the Workshop Scripting for the Semantic Web.
Barforush, A. A., & Rahnama, A. (2012). Ontology learning: Revisted. Journal of Web Engineering, 11(4), 269–289.
Bekkerman, R., El-Yaniv, R., Tishby, N., & Winter, Y. (2001). On feature distributional clustering for text categorization. Paper presented at the Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.
Buitelaar, P., & Sacaleanu, B. (2001). Ranking and selecting synsets by domain relevance. Paper presented at the Proceedings of WordNet and Other Lexical Resources: Applications, Extensions and Customizations, NAACL 2001 Workshop.
Cimiano, P., & Völker, J. (2005). Text2Onto. Natural language processing and information systems. Paper presented at the 10th International Conference on Applications of Natural Language to Information Systems, NLDB 2005, Alicante, Spain, June 15–17, 2005. Proceedings, of Lecture Notes in Computer Science (Edited by: Montoyo A, Muñoz R, Métais E).
Dagan, I., Pereira, F., & Lee, L. (1994). Similarity-based estimation of word cooccurrence probabilities. Paper presented at the Proceedings of the 32nd annual meeting on Association for Computational Linguistics.
Euzenat, J., & Shvaiko, P. (2007). Ontology matching (Vol. 333). Berlin: Springer.
Farquhar, A., Fikes, R., & Rice, J. (1997). The ontolingua server: A tool for collaborative ontology construction. International Journal of Human-Computer Studies, 46(6), 707–727.
Formica, A. (2008). Concept similarity in formal concept analysis: An information content approach. Knowledge-Based Systems, 21(1), 80–87.
Gacitua, R., Sawyer, P., & Rayson, P. (2008). A flexible framework to experiment with ontology learning techniques. Knowledge-Based Systems, 21(3), 192–199.
George, P., Vangelis, K., Anastasia, K., Georgios, P., & Constantine, S. D. (2009, June). Semi-automated ontology learning: The boemie approach. In Proceedings of the First ESWC Workshop on Inductive Reasoning and Machine Learning on the Semantic Web, Heraklion, Greece.
Ghadfi, S., Béchet, N., & Berio, G. (2014). Building ontologies from textual resources: A pattern based improvement using deep linguistic information. Paper presented at the Proceedings of the 5th Workshop on Ontology and Semantic Web Patterns (WOP2014), Riva del Garda, Italy.
Gillani, S. A., & Kő, A. (2014). Process-based knowledge extraction in a public authority: A text mining approach. In Electronic government and the information systems perspective (pp. 91–103). Cham: Springer International Publishing.
Gruber, T. R. (1993). A translation approach to portable ontology specifications. Knowledge Acquisition, 5(2), 199–220.
Guo, W., & Diab, M. (2012). A simple unsupervised latent semantics based approach for sentence similarity. Paper presented at the Proceedings of the First Joint Conference on Lexical and Computational Semantics-Volume 1: Proceedings of the Main Conference and the Shared Task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation.
Islam, N., Siddiqui, M. S., & Shaikh, Z. (2010). TODE: A Dot Net based tool for ontology development and editing. Paper presented at the 2nd International Conference on Computer Engineering and Technology (ICCET).
Jiang, X., & Tan, A. H. (2010). CRCTOL: A semantic‐based domain ontology learning system. Journal of the American Society for Information Science and Technology, 61(1), 150–168.
Kang, Y.-B., Haghighi, P. D., & Burstein, F. (2014). CFinder: An intelligent key concept finder from text for ontology development. Expert Systems with Applications, 41(9), 4494–4504.
Landauer, T. K., Foltz, P. W., & Laham, D. (1998). An introduction to latent semantic analysis. Discourse Processes, 25(2–3), 259–284.
Lindén, K., & Piitulainen, J. O. (2004). Discovering synonyms and other related words. Paper presented at the Proceedings of COLING 2004 CompuTerm 2004: 3rd International Workshop on Computational Terminology.
Lund, K., & Burgess, C. (1996, April). Hyperspace analogue to language (HAL): A general model semantic representation. Brain and Cognition, 30(3), 5–5. 525 B ST, STE 1900, San Diego, CA 92101-4495: Academic press Inc JNL-COMP Subscriptions.
Luong, H., Wang, Q., & Gauch, S. (2012). Ontology learning using word net lexical expansion and text mining. INTECH Open Access Publisher.
Maedche, A., & Staab, S. (2000). The text-to-onto ontology learning environment. Paper presented at the Software Demonstration at ICCS-2000-Eight International Conference on Conceptual Structures.
Maedche, A., & Staab, S. (2004). Ontology learning. In Handbook on ontologies (pp. 173–190). Berlin Heidelberg: Springer.
Meng, L., Huang, R., & Gu, J. (2013). A review of semantic similarity measures in wordnet. International Journal of Hybrid Information Technology, 6(1), 1–12.
Miller, G. A. (1995). WordNet: A lexical database for English. Communications of the ACM, 38(11), 39–41.
Nagar, A., & Al-Mubaid, H. (2008). A new path length measure based on go for gene similarity with evaluation using sgd pathways. Paper presented at the 21st IEEE International Symposium on Computer-Based Medical Systems, 2008. CBMS’08.
Nie, X., & Zhou, J. (2008). A domain adaptive ontology learning framework. Paper presented at the IEEE International Conference on Networking, Sensing and Control, 2008. ICNSC 2008.
Noy, N. F., & Musen, M. A. (2003). The PROMPT suite: Interactive tools for ontology merging and mapping. International Journal of Human-Computer Studies, 59(6), 983–1024.
Noy, N. F., Sintek, M., Decker, S., Crubézy, M., Fergerson, R. W., & Musen, M. A. (2001). Creating semantic web contents with protege-2000. IEEE Intelligent Systems, 16(2), 60–71.
Park, J., Cho, W., & Rho, S. (2010). Evaluating ontology extraction tools using a comprehensive evaluation framework. Data and Knowledge Engineering, 69(10), 1043–1061.
Pedersen, T., Pakhomov, S. V., Patwardhan, S., & Chute, C. G. (2007). Measures of semantic similarity and relatedness in the biomedical domain. Journal of Biomedical Informatics, 40(3), 288–299.
Pirró, G. (2009). A semantic similarity metric combining features and intrinsic information content. Data and Knowledge Engineering, 68(11), 1289–1308.
Qin, P., Lu, Z., Yan, Y., & Wu, F. (2009). A new measure of word semantic similarity based on wordnet hierarchy and dag theory. Paper presented at the International Conference on Web Information Systems and Mining, 2009. WISM 2009.
Rada, R., Mili, H., Bicknell, E., & Blettner, M. (1989). Development and application of a metric on semantic nets. IEEE Transactions on Systems, Man and Cybernetics, 19(1), 17–30.
Raunich, S., & Rahm, E. (2011). ATOM: Automatic target-driven ontology merging. Paper presented at the IEEE 27th International Conference on Data Engineering (ICDE), 2011.
Resnik, P. (1995, August 20–25). Using information content to evaluate semantic similarity in a taxonomy. In Proceedings of the 14th International Joint Conference on Artificial Intelligence (pp. 448–453). Montreal, QC, Canada.
Saleena, B., & Srivatsa, S. (2015). Using concept similarity in cross ontology for adaptive e-Learning systems. Journal of King Saud University-Computer and Information Sciences, 27(1), 1–12.
Salton, G., & Michael, J. (1983). Introduction to modern information retrieval (pp. 24–51). New York: McGraw-Hill.
Sánchez, D., Batet, M., & Isern, D. (2011). Ontology-based information content computation. Knowledge-Based Systems, 24(2), 297–303.
Santoso, H. A., Haw, S.-C., & Abdul-Mehdi, Z. T. (2011). Ontology extraction from relational database: Concept hierarchy as background knowledge. Knowledge-Based Systems, 24(3), 457–464.
Schutz, A., & Buitelaar, P. (2005). Relext: A tool for relation extraction from text in ontology extension. In The semantic web–ISWC 2005 (pp. 593–606). Berlin Heidelberg: Springer.
Slimani, T. (2013). Description and evaluation of semantic similarity measures approaches. International Journal of Computer Applications, 80(10), 0975–8887.
Sure, Y., Angele, J., & Staab, S. (2002). OntoEdit: Guiding ontology development by methodology and inferencing. In Proceedings of the International Conference on Ontologies, Databases and Applications of SEmantics ODBASE 2002. Irvine, CA: University of California.
Sussna, M. J. (1997). Text retrieval using inference in semantic metanetworks.
Wang, G., Yu, Y., & Zhu, H. (2007). Pore: Positive-only relation extraction from wikipedia text. Berlin: Springer.
Wu, X., & Bolivar, A. (2008). Keyword extraction for contextual advertisement. Paper presented at the Proceedings of the 17th International Conference on World Wide Web.
Wu, Z., & Palmer, M. (1994). Verbs semantics and lexical selection. Paper presented at the Proceedings of the 32nd Annual Meeting on Association for Computational Linguistics.
Yang, Y., & Pedersen, J. O. (1997). A comparative study on feature selection in text categorization. Paper presented at the ICML.
Zablith, F. (2008). Dynamic ontology evolution. International Semantic Web Conference (ISWC) Doctoral Consortium, Karlsruhe, Germany.
Zouaq, A. (2011). An overview of shallow and deep natural language processing for ontology learning. Ontology Learning and Knowledge Discovery Using the Web: Challenges and Recent Advances, 2, 16–37.
Zouaq, A., Gasevic, D., & Hatala, M. (2011). Towards open ontology learning and filtering. Information Systems, 36(7), 1064–1081.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Gillani, S., Kő, A. (2016). ProMine: A Text Mining Solution for Concept Extraction and Filtering. In: Gábor, A., Kő, A. (eds) Corporate Knowledge Discovery and Organizational Learning. Knowledge Management and Organizational Learning, vol 2. Springer, Cham. https://doi.org/10.1007/978-3-319-28917-5_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-28917-5_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-28915-1
Online ISBN: 978-3-319-28917-5
eBook Packages: Business and ManagementBusiness and Management (R0)