Skip to main content

ProMine: A Text Mining Solution for Concept Extraction and Filtering

  • Chapter
  • First Online:

Part of the book series: Knowledge Management and Organizational Learning ((IAKM,volume 2))

Abstract

Due to the on-going economic crisis, the management of organizational knowledge is becoming more and more important. This knowledge resides in organizational processes. The extraction of this hidden knowledge from the business processes and the usage of this knowledge for domain ontology development is a major challenge. This chapter presents ProMine, a text mining ontology extraction tool that extracts deep representations from the business processes. ProMine extracts new domain related concepts and proposes a new filtering mechanism based on a new hybrid similarity measure to filter most relevant concepts. The tool is evaluated through a case study of the insurance domain. The results showed that ProMine performance is good and it generates many new concepts against each business process.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    EUREKA_HU_12-1-2012-0039, supported by the Research and Technology Innovation Fund, New Széchenyi Plan, Hungary.

References

  • Auer, S. (2005). Powl–a web based platform for collaborative semantic web development. Paper presented at the Proceedings of the Workshop Scripting for the Semantic Web.

    Google Scholar 

  • Barforush, A. A., & Rahnama, A. (2012). Ontology learning: Revisted. Journal of Web Engineering, 11(4), 269–289.

    Google Scholar 

  • Bekkerman, R., El-Yaniv, R., Tishby, N., & Winter, Y. (2001). On feature distributional clustering for text categorization. Paper presented at the Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.

    Google Scholar 

  • Buitelaar, P., & Sacaleanu, B. (2001). Ranking and selecting synsets by domain relevance. Paper presented at the Proceedings of WordNet and Other Lexical Resources: Applications, Extensions and Customizations, NAACL 2001 Workshop.

    Google Scholar 

  • Cimiano, P., & Völker, J. (2005). Text2Onto. Natural language processing and information systems. Paper presented at the 10th International Conference on Applications of Natural Language to Information Systems, NLDB 2005, Alicante, Spain, June 15–17, 2005. Proceedings, of Lecture Notes in Computer Science (Edited by: Montoyo A, Muñoz R, Métais E).

    Google Scholar 

  • Dagan, I., Pereira, F., & Lee, L. (1994). Similarity-based estimation of word cooccurrence probabilities. Paper presented at the Proceedings of the 32nd annual meeting on Association for Computational Linguistics.

    Google Scholar 

  • Euzenat, J., & Shvaiko, P. (2007). Ontology matching (Vol. 333). Berlin: Springer.

    Google Scholar 

  • Farquhar, A., Fikes, R., & Rice, J. (1997). The ontolingua server: A tool for collaborative ontology construction. International Journal of Human-Computer Studies, 46(6), 707–727.

    Article  Google Scholar 

  • Formica, A. (2008). Concept similarity in formal concept analysis: An information content approach. Knowledge-Based Systems, 21(1), 80–87.

    Article  Google Scholar 

  • Gacitua, R., Sawyer, P., & Rayson, P. (2008). A flexible framework to experiment with ontology learning techniques. Knowledge-Based Systems, 21(3), 192–199.

    Article  Google Scholar 

  • George, P., Vangelis, K., Anastasia, K., Georgios, P., & Constantine, S. D. (2009, June). Semi-automated ontology learning: The boemie approach. In Proceedings of the First ESWC Workshop on Inductive Reasoning and Machine Learning on the Semantic Web, Heraklion, Greece.

    Google Scholar 

  • Ghadfi, S., Béchet, N., & Berio, G. (2014). Building ontologies from textual resources: A pattern based improvement using deep linguistic information. Paper presented at the Proceedings of the 5th Workshop on Ontology and Semantic Web Patterns (WOP2014), Riva del Garda, Italy.

    Google Scholar 

  • Gillani, S. A., & Kő, A. (2014). Process-based knowledge extraction in a public authority: A text mining approach. In Electronic government and the information systems perspective (pp. 91–103). Cham: Springer International Publishing.

    Google Scholar 

  • Gruber, T. R. (1993). A translation approach to portable ontology specifications. Knowledge Acquisition, 5(2), 199–220.

    Article  Google Scholar 

  • Guo, W., & Diab, M. (2012). A simple unsupervised latent semantics based approach for sentence similarity. Paper presented at the Proceedings of the First Joint Conference on Lexical and Computational Semantics-Volume 1: Proceedings of the Main Conference and the Shared Task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation.

    Google Scholar 

  • Islam, N., Siddiqui, M. S., & Shaikh, Z. (2010). TODE: A Dot Net based tool for ontology development and editing. Paper presented at the 2nd International Conference on Computer Engineering and Technology (ICCET).

    Google Scholar 

  • Jiang, X., & Tan, A. H. (2010). CRCTOL: A semantic‐based domain ontology learning system. Journal of the American Society for Information Science and Technology, 61(1), 150–168.

    Article  Google Scholar 

  • Kang, Y.-B., Haghighi, P. D., & Burstein, F. (2014). CFinder: An intelligent key concept finder from text for ontology development. Expert Systems with Applications, 41(9), 4494–4504.

    Article  Google Scholar 

  • Landauer, T. K., Foltz, P. W., & Laham, D. (1998). An introduction to latent semantic analysis. Discourse Processes, 25(2–3), 259–284.

    Article  Google Scholar 

  • Lindén, K., & Piitulainen, J. O. (2004). Discovering synonyms and other related words. Paper presented at the Proceedings of COLING 2004 CompuTerm 2004: 3rd International Workshop on Computational Terminology.

    Google Scholar 

  • Lund, K., & Burgess, C. (1996, April). Hyperspace analogue to language (HAL): A general model semantic representation. Brain and Cognition, 30(3), 5–5. 525 B ST, STE 1900, San Diego, CA 92101-4495: Academic press Inc JNL-COMP Subscriptions.

    Google Scholar 

  • Luong, H., Wang, Q., & Gauch, S. (2012). Ontology learning using word net lexical expansion and text mining. INTECH Open Access Publisher.

    Google Scholar 

  • Maedche, A., & Staab, S. (2000). The text-to-onto ontology learning environment. Paper presented at the Software Demonstration at ICCS-2000-Eight International Conference on Conceptual Structures.

    Google Scholar 

  • Maedche, A., & Staab, S. (2004). Ontology learning. In Handbook on ontologies (pp. 173–190). Berlin Heidelberg: Springer.

    Chapter  Google Scholar 

  • Meng, L., Huang, R., & Gu, J. (2013). A review of semantic similarity measures in wordnet. International Journal of Hybrid Information Technology, 6(1), 1–12.

    Google Scholar 

  • Miller, G. A. (1995). WordNet: A lexical database for English. Communications of the ACM, 38(11), 39–41.

    Article  Google Scholar 

  • Nagar, A., & Al-Mubaid, H. (2008). A new path length measure based on go for gene similarity with evaluation using sgd pathways. Paper presented at the 21st IEEE International Symposium on Computer-Based Medical Systems, 2008. CBMS’08.

    Google Scholar 

  • Nie, X., & Zhou, J. (2008). A domain adaptive ontology learning framework. Paper presented at the IEEE International Conference on Networking, Sensing and Control, 2008. ICNSC 2008.

    Google Scholar 

  • Noy, N. F., & Musen, M. A. (2003). The PROMPT suite: Interactive tools for ontology merging and mapping. International Journal of Human-Computer Studies, 59(6), 983–1024.

    Article  Google Scholar 

  • Noy, N. F., Sintek, M., Decker, S., Crubézy, M., Fergerson, R. W., & Musen, M. A. (2001). Creating semantic web contents with protege-2000. IEEE Intelligent Systems, 16(2), 60–71.

    Article  Google Scholar 

  • Park, J., Cho, W., & Rho, S. (2010). Evaluating ontology extraction tools using a comprehensive evaluation framework. Data and Knowledge Engineering, 69(10), 1043–1061.

    Article  Google Scholar 

  • Pedersen, T., Pakhomov, S. V., Patwardhan, S., & Chute, C. G. (2007). Measures of semantic similarity and relatedness in the biomedical domain. Journal of Biomedical Informatics, 40(3), 288–299.

    Article  Google Scholar 

  • Pirró, G. (2009). A semantic similarity metric combining features and intrinsic information content. Data and Knowledge Engineering, 68(11), 1289–1308.

    Article  Google Scholar 

  • Qin, P., Lu, Z., Yan, Y., & Wu, F. (2009). A new measure of word semantic similarity based on wordnet hierarchy and dag theory. Paper presented at the International Conference on Web Information Systems and Mining, 2009. WISM 2009.

    Google Scholar 

  • Rada, R., Mili, H., Bicknell, E., & Blettner, M. (1989). Development and application of a metric on semantic nets. IEEE Transactions on Systems, Man and Cybernetics, 19(1), 17–30.

    Article  Google Scholar 

  • Raunich, S., & Rahm, E. (2011). ATOM: Automatic target-driven ontology merging. Paper presented at the IEEE 27th International Conference on Data Engineering (ICDE), 2011.

    Google Scholar 

  • Resnik, P. (1995, August 20–25). Using information content to evaluate semantic similarity in a taxonomy. In Proceedings of the 14th International Joint Conference on Artificial Intelligence (pp. 448–453). Montreal, QC, Canada.

    Google Scholar 

  • Saleena, B., & Srivatsa, S. (2015). Using concept similarity in cross ontology for adaptive e-Learning systems. Journal of King Saud University-Computer and Information Sciences, 27(1), 1–12.

    Article  Google Scholar 

  • Salton, G., & Michael, J. (1983). Introduction to modern information retrieval (pp. 24–51). New York: McGraw-Hill.

    Google Scholar 

  • Sánchez, D., Batet, M., & Isern, D. (2011). Ontology-based information content computation. Knowledge-Based Systems, 24(2), 297–303.

    Article  Google Scholar 

  • Santoso, H. A., Haw, S.-C., & Abdul-Mehdi, Z. T. (2011). Ontology extraction from relational database: Concept hierarchy as background knowledge. Knowledge-Based Systems, 24(3), 457–464.

    Article  Google Scholar 

  • Schutz, A., & Buitelaar, P. (2005). Relext: A tool for relation extraction from text in ontology extension. In The semantic web–ISWC 2005 (pp. 593–606). Berlin Heidelberg: Springer.

    Chapter  Google Scholar 

  • Slimani, T. (2013). Description and evaluation of semantic similarity measures approaches. International Journal of Computer Applications, 80(10), 0975–8887.

    Article  Google Scholar 

  • Sure, Y., Angele, J., & Staab, S. (2002). OntoEdit: Guiding ontology development by methodology and inferencing. In Proceedings of the International Conference on Ontologies, Databases and Applications of SEmantics ODBASE 2002. Irvine, CA: University of California.

    Google Scholar 

  • Sussna, M. J. (1997). Text retrieval using inference in semantic metanetworks.

    Google Scholar 

  • Wang, G., Yu, Y., & Zhu, H. (2007). Pore: Positive-only relation extraction from wikipedia text. Berlin: Springer.

    Google Scholar 

  • Wu, X., & Bolivar, A. (2008). Keyword extraction for contextual advertisement. Paper presented at the Proceedings of the 17th International Conference on World Wide Web.

    Google Scholar 

  • Wu, Z., & Palmer, M. (1994). Verbs semantics and lexical selection. Paper presented at the Proceedings of the 32nd Annual Meeting on Association for Computational Linguistics.

    Google Scholar 

  • Yang, Y., & Pedersen, J. O. (1997). A comparative study on feature selection in text categorization. Paper presented at the ICML.

    Google Scholar 

  • Zablith, F. (2008). Dynamic ontology evolution. International Semantic Web Conference (ISWC) Doctoral Consortium, Karlsruhe, Germany.

    Google Scholar 

  • Zouaq, A. (2011). An overview of shallow and deep natural language processing for ontology learning. Ontology Learning and Knowledge Discovery Using the Web: Challenges and Recent Advances, 2, 16–37.

    Article  Google Scholar 

  • Zouaq, A., Gasevic, D., & Hatala, M. (2011). Towards open ontology learning and filtering. Information Systems, 36(7), 1064–1081.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Saira Gillani .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Gillani, S., Kő, A. (2016). ProMine: A Text Mining Solution for Concept Extraction and Filtering. In: Gábor, A., Kő, A. (eds) Corporate Knowledge Discovery and Organizational Learning. Knowledge Management and Organizational Learning, vol 2. Springer, Cham. https://doi.org/10.1007/978-3-319-28917-5_3

Download citation

Publish with us

Policies and ethics