Skip to main content

KEA-Based Document Tagging for Project Recommendation and Analysis

  • Conference paper
  • First Online:

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 708))

Abstract

This paper proposes an innovative approach in managing project-related documents, project domain analysis, and recommendation of open areas from current project document pool. Using keyterm extraction technique, documents are tagged under appropriate categories and subcategories for better management of project documents. Hence, this tagged document serves as a reference for the students who are planning to take up new projects. The system generates various reports for statistical analysis of projects carried out in each research domain. These statistics benefit users to get an overview of the trends of project works done over the past few years. There are also reports illustrating the number of open areas over respective academic years. The open areas are identified and listed for the students. This novel approach would help the students who are seeking new project. Our system helps the students, faculty, and other academicians to get involved in ongoing projects and also to obtain ideas in their respective research domain. We have modified the stemming method in basic keyterm extraction algorithm (KEA) by adding Porter stemmer rather than Lovins stemming method, and our experimental results confirm that our modified keyterm extraction method outperforms the KEA method while tagging English documents.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Witten, I.H., Paynter, G.W., Frank, E., Gutwin, C., Nevill-Manning, C.G.: KEA: practical automatic keyphrase extraction. In Proceedings of the Fourth ACM Conference on Digital Libraries, pp. 254–255. ACM (1999)

    Google Scholar 

  2. Han, P., Shen, S., Wang, D., Liu, Y.: The influence of word normalization in English document clustering. In 2012 IEEE International Conference on Computer Science and Automation Engineering (CSAE), vol. 2, pp. 116–120. IEEE (2012)

    Google Scholar 

  3. Menon, R.R.K., Kini, N.V.: Harnessing the discriminatory strength of dictionaries. IEEE (2016)

    Google Scholar 

  4. Sathyadevan, S., Athira, U.: Improved document classification through enhanced naive Bayes algorithm. IEEE (2014)

    Google Scholar 

  5. Jivani, A.G.: A comparative study of stemming algorithms. Int. J. Comp. Tech. Appl. 2(6), 1930–1938 (2011)

    Google Scholar 

  6. Rose, J.D.: An efficient association rule based hierarchical algorithm for text clustering. Int. J. Adv. Eng. Tech/Vol. VII/Issue I/Jan.-March 751, 753 (2016)

    Google Scholar 

  7. Patil, L.H., Atique, M.: A novel approach for feature selection method TF-IDF in document clustering. In: 2013 IEEE 3rd International Advance Computing Conference (IACC). IEEE (2013)

    Google Scholar 

  8. Pillai, P.G., Narayanan, J.: Question categorization using SVM based on different term weighting methods. Int. J. Comput. Sci. Eng. (IJCSE) 4(05) (2012)

    Google Scholar 

  9. Mii, M., Lazi, M., Proti, J.: A software tool that helps teachers in handling, processing and understanding the results of massive exams. In: Proceedings of the Fifth Balkan Conference in Informatics. ACM (2012)

    Google Scholar 

  10. JFreeChart project (2012). http://www.jfree.org/

  11. Dynamics Reports. http://www.dynamicreports.org/

  12. Han, J., Kamber, M., Pei, J.: Data Mining Concepts and Techniques, 3rd edn (2011)

    Google Scholar 

Download references

Acknowledgements

We would like to express our sincere gratitude to the faculty of Department of Computer Science and Applications of Amrita Vishwa Vidyapeetham, Amritapuri, for providing help and guidance.

Our sincere thanks to Dr. M. R. Kaimal, Chairman, Computer Science Department, Amrita Vishwa Vidyapeetham, Amritapuri, for his prompt support.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to M. G. Thushara .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Thushara, M.G., Sreeremya, S.A., Smitha, S. (2018). KEA-Based Document Tagging for Project Recommendation and Analysis. In: Sa, P., Bakshi, S., Hatzilygeroudis, I., Sahoo, M. (eds) Recent Findings in Intelligent Computing Techniques . Advances in Intelligent Systems and Computing, vol 708. Springer, Singapore. https://doi.org/10.1007/978-981-10-8636-6_30

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-8636-6_30

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-8635-9

  • Online ISBN: 978-981-10-8636-6

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics