Similarity Matching of Computer Science Unit Outlines in Higher Education

Langan, Gaurav; Montgomery, James; Garg, Saurabh

doi:10.1007/978-3-319-50127-7_12

Gaurav Langan²¹,
James Montgomery²¹ &
Saurabh Garg²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9992))

Included in the following conference series:

Australasian Joint Conference on Artificial Intelligence

3092 Accesses

Abstract

With the globalisation of education, students may undertake higher education courses anywhere in the world. Yet there is variation between different universities’ offerings. Even though web search engines can help one to locate potentially similar courses or subjects offered by different universities, judging the degree of similarity between each of them is currently a manual process in which a student or staff member has to go through subject/unit descriptions within a course to understand the different topics taught. In this paper, we study the application of text mining to evaluate the similarity or overlap between different units and propose a system that can help students and staff to make these judgements. The unit or course descriptions are generally short, containing 100–200 words, and exhibit very wide diversity in the ways they are written. Experimental results using data from Australian and international universities demonstrate the accuracy of the proposed system in calculating the similarity between different computing units.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Notes

1.
The full list is available upon request.

References

Bird, S.: NLTK: the natural language toolkit. In: Proceedings of the COLING/ACL on interactive presentation sessions, pp. 69–72. Association for Computational Linguistics (2006)
Google Scholar
Buckland, M.K., Gey, F.C.: The relationship between recall and precision. JASIS 45(1), 12–19 (1994)
Article Google Scholar
Damashek, M., et al.: Gauging similarity with n-grams: language-independent categorization of text. Science 267(5199), 843–848 (1995)
Article Google Scholar
Fellbaum, C.: WordNet. Wiley Online Library (1998)
Google Scholar
Luan, J.: Data mining and knowledge management in higher education-potential applications (2002)
Google Scholar
Medelyan, O., Milne, D., Legg, C., Witten, I.H.: Mining meaning from Wikipedia. Int. J. Hum. Comput. Stud. 67(9), 716–754 (2009)
Article Google Scholar
Mihalcea, R., Corley, C., Strapparava, C.: Corpus-based and knowledge-based measures of text semantic similarity. In: AAAI, vol. 6, pp. 775–780 (2006)
Google Scholar
Milne, D., Witten, I.: An open-source toolkit for mining Wikipedia. Artif. Intell. 194, 222–239 (2013)
Article MathSciNet Google Scholar
Romero, C., Ventura, S.: Educational data mining: a review of the state of the art. IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. 40(6), 601–618 (2010)
Article Google Scholar
Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Commun. ACM 18(11), 613–620 (1975)
Article MATH Google Scholar
Tang, C., Lau, R.W., Li, Q., Yin, H., Li, T., Kilis, D.: Personalized courseware construction based on web data mining. In: Proceedings of the First International Conference on Web Information Systems Engineering, vol. 2, pp. 204–211. IEEE (2000)
Google Scholar
Wikipedia: Main page (2015). https://en.wikipedia.org/
Wu, Z., Palmer, M.: Verbs semantics and lexical selection. In: Proceedings of the 32nd Annual Meeting on Association for Computational Linguistics, pp. 133–138. Association for Computational Linguistics (1994)
Google Scholar
Zhang, L., Liu, X., Liu, X.: Personalized instructing recommendation system based on web mining. In: The 9th International Conference for Young Computer Scientists, 2008. ICYCS 2008, pp. 2517–2521. IEEE (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Engineering & ICT, Unversity of Tasmania, Hobart, Australia
Gaurav Langan, James Montgomery & Saurabh Garg

Authors

Gaurav Langan
View author publications
You can also search for this author in PubMed Google Scholar
James Montgomery
View author publications
You can also search for this author in PubMed Google Scholar
Saurabh Garg
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to James Montgomery .

Editor information

Editors and Affiliations

University of Tasmania, Hobart, Australia
Byeong Ho Kang
Auckland University of Technology, Auckland, New Zealand
Quan Bai

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Langan, G., Montgomery, J., Garg, S. (2016). Similarity Matching of Computer Science Unit Outlines in Higher Education. In: Kang, B.H., Bai, Q. (eds) AI 2016: Advances in Artificial Intelligence. AI 2016. Lecture Notes in Computer Science(), vol 9992. Springer, Cham. https://doi.org/10.1007/978-3-319-50127-7_12

Download citation

DOI: https://doi.org/10.1007/978-3-319-50127-7_12
Published: 29 November 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-50126-0
Online ISBN: 978-3-319-50127-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics