Mining Prerequisite Relationships Among Learning Objects

De Medio, Carlo; Gasparetti, Fabio; Limongelli, Carla; Sciarrone, Filippo; Temperini, Marco

doi:10.1007/978-3-319-40542-1_35

Carlo De Medio^11,12,
Fabio Gasparetti¹¹,
Carla Limongelli¹¹,
Filippo Sciarrone¹¹ &
…
Marco Temperini¹²

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 618))

Included in the following conference series:

International Conference on Human-Computer Interaction

2040 Accesses
1 Citations

Abstract

The process of carefully choosing and sequencing a set of Learning Objects (LOs) to build a course may reveal to be quite a challenging task. In this work we focus on an aspect of such challenge, related to the verification and respect of the relationships of pedagogical dependence that holds between two LOs added to a course (meaning that if a given LO has another one as “pre-requisite”, then any sequencing of the LOs in the course will need to have the latter LO taken by the learners before of the former). An innovative Machine learning-based approach for the identification of these kinds of relationships is proposed.

You have full access to this open access chapter, Download conference paper PDF

Discovering Prerequisite Relationships Among Learning Objects: A Coursera-Driven Approach

Constructing an Educational Knowledge Graph with Concepts Linked to Wikipedia

Article 30 September 2021

Exploring knowledge graphs for the identification of concept prerequisites

Article Open access 12 December 2019

Keywords

1 Introduction

In the case of online courses, a Learning Object (LO) can be seen as a digital object that is used for achieving a desired learning outcome or educational goal. With the ever-increasing use of learning management systems (LMS), repositories of LOs to be considered in specialized training are getting popular and heterogeneous w.r.t. covered disciplines. They encourage the instructors to adopt (and adapt) such LOs while building their education courses. Popular examples are Connexion^{Footnote 1}, Ariadne^{Footnote 2} and Merlot^{Footnote 3}. Autonomous crawling techniques can also help building these repositories by sifting through hypertext resources on the web [1, 2].

Several factors inhibit a more widespread use of such paradigm of course development. Often LOs follow poorly, or not at all, the expected standardized meta-tag or classification scheme, and badly needed references to other LOs are missing. This could negatively impair the vision of LOs as resources that, once created, can be quickly retrieved and used several times in different contexts, compensating the high cost of production.

Promising research activities are studying ontologies and Semantic Web technologies, allowing to address these issues, and capable to support the development of next generation LO repositories [3]; yet, creating education ontologies remains a time-consuming and error-prone task.

On the other side of the same coin, building an e-learning course by a sequence of LOs, i.e. by selecting didactic resources and designing their organization in the course, is a multi-level, multi-faceted and iterative process, in which different skills and knowledge are required. In this kind of task, recommender and filtering tools can be of substantial help [4–7].

In our approach the sequencing of LOs in the course can still be managed by the instructors, basing on their taste and preferences, yet they can be also helped by a set of suggestions, related to the pre-requisite relationships holding among the LOs selected for the course. Such relationships can be automatically computed and provide the instructor with significant help and guidance. We show a light-weight formalization of the LO, and how it can be represented by a set of WikiPedia articles; then we show how such set of topics can help deciding on the dependence relationship holding between two LOs. In this endeavor we exploit the classification in categories available for the WikiPedia articles, and obtain interesting results for our framework, in terms of precision and recall of the dependence relationships.

2 Related Works

Wikipedia offers a quantity of high quality content resources in terms of presentation [8]. The openness, easy availability, and freshness of data make Wikipedia of interest in a variety of research activities, such as natural language processing and translation tools. Links, categories and information in templates provide structured content, which can be retrieved from raw XML dumps or Application Program Interface calls.

While some attempts aim at incorporating selected Wikipedia content into the curriculum as a collaborative environment [9] or for categorizing learning resources [10], to our knowledge our approach is novel w.r.t. inferring dependency relationships between LOs.

An interesting case-based reasoning approach, following a self-directed learning paradigm in assisting users to build sequences of elements out of user-defined libraries, is proposed in [11].

An evaluation of the hypotheses that motivated this research has been previously discussed in the following works: [12–14].

3 Mining Prerequisites

The current proposal consists in a traditional Machine Learning (ML) approach [15] applied to a dataset of LOs by performing a comparative analysis of several features of the LOs. The dataset is composed by LOs coming from five web-based courses we managed, on a wide variety of subject matters.

The presented approach is implemented in a software system that supports the following process. Firstly the set of LOs is textually analyzed, and each LO is associated to a Wikipedia page (topic): the set of topics is considered representative of the set of LOs. Then, the fact that a LO is represented by a topic allows to quantify the values of a set of features of the LO, by computing them on the associated topic.

We define the features according with peculiar aspects of the representative topics such as content length, generality, or specialization. Namely, given two learning objects $LO_i$ and $LO_j$, we have: (1) the two average lengths of the text of the Wikipedia topics associated to the pair defined in terms of words obtained by a text tokenization process, (2) the number of links in the first section of the Wikipedia topics, (3) the average number of links in the topics associated to the LOs, (4) the number of distinct nouns in the LOs extracted by a part-of-speech tagger, (5) the intersection of the two sets of nouns extracted from the two LOs, (6) similar to the features #1 but limited to the first section of Wikipedia and (7) the intersection between the set of nouns used in links to other topics in the topics associated to $LO_i$, and the nouns extracted from $LO_j$.

So then, the topics are analyzed and the related LOs features computed. Finally the dependency relation between two LOs is inferred taking their features under consideration: this computation is obtained by feeding the features into a ML-based classifier.

4 Empirical Evaluation

In data mining, a decision tree is a predictive model that can be used to represent both classifiers and regression models. J48 is the implementation of C4.5 algorithm [16] developed by J. Ross Quinlan. C4.5 algorithm produces decision tree classification for a given dataset by recursive division of the data and the tree is grown using Depth-first strategy. Pruning methods have been introduced to reduce the complexity of tree structure without decreasing the accuracy of classification. Subtree raising is the followed pruning support procedure, that is, moving nodes upwards toward the root of tree and also replacing other nodes on the same way [17].

JRip is the propositional rule learner based on the Repeated Incremental Pruning to Produce Error Reduction (RIPPER) [18]. Starting with the less prevalent classes, the algorithm iteratively grows and prunes rules until there are no positive examples left. It tries every potential value of each attribute and selects the condition with highest information gain. The minimum description length is considered as stopping criterion when new conditions are sequentially added to a rule.

These two ML algorithms have been considered for the classification task, where the following measures can be defined:

tp: the number of identified dependencies that are also expected in the test set;
fp: the number of dependencies returned by the classifier but missing in the test set;
fn: the number of expected dependencies that the classifier misses to identify.

and, consequently, the performances can be evaluated with the standard measures of Precision (Pr) and Recall (Re).

$$\begin{aligned} Pr = \frac{tp}{tp+fp}\qquad \qquad \qquad \quad Re =\frac{tp}{tp+fn} \end{aligned}$$

that is, the precision and the recall.

Five course materials with various levels of difficulty, conveying different random topics, e.g., scientific, archaeological, cinematography and art; have been considered for the evaluation. A domain expert manually identified the expected dependencies among LOs.

The average precision (Pr) reaches 0.828 and 0.736, for J48 and JRip, respectively. The recall (Re) values range from 0.811 (J48) and 0.756 (JRip). Each approach is validated following a 10-fold cross-validation. The outcomes prove that the hypothesis of a classifier trained on features extracted from two LOs has the chance to correctly identifying prerequisites among them.

5 Conclusions

We have presented and evaluated a Machine learning-based approach for mining prerequisite relations between learning objects. It can be used in a more comprehensive approach for helping teachers in searching relevant content and assisting them during the course development.

In our future work, we plan to continue evaluating the precision of the proposed approach in different domains of interest. In some circumstances (e.g., Mathematics and Statistics courses), the semantic annotation does not successfully associate relevant topics to the learning objects. Alternative approaches must be considered in order to overcome this issue and categorize the features exatracted from the LOs [19]. Preferences of teachers manifested through the course development can also be studied and combined, for example by monitoring the browsing behaviour on learning objects represented by hypertext resources [12].

Notes

1.
Connexions is a Learning Object Repository, available at http://www.cnx.org (Accessed 27 January 2016).
2.
Ariadne Foundation, available at http://ariadne-eu.org (Accessed 27 January 2016).
3.
Merlot is a Learning Object Repository, available at http://www.merlot.org (Accessed 27 January 2016).

References

Gasparetti, F., Micarelli, A.: Adaptive web search based on a colony of cooperative distributed agents. In: Klusch, M., Omicini, A., Ossowski, S., Laamanen, H. (eds.) CIA 2003. LNCS (LNAI), vol. 2782, pp. 168–183. Springer, Heidelberg (2003)
Chapter Google Scholar
Micarelli, A., Gasparetti, F.: Adaptive focused crawling. In: Brusilovsky, P., Kobsa, A., Nejdl, W. (eds.) Adaptive Web 2007. LNCS, vol. 4321, pp. 231–262. Springer, Heidelberg (2007)
Chapter Google Scholar
Raju, P., Ahmed, V.: Enabling technologies for developing next-generation learning object repository for construction. Autom. Constr. 22, 247–257 (2012). Planning Future Cities-Selected papers from the 2010 eCAADe Conference
Article Google Scholar
Limongelli, C., Sciarrone, F., Starace, P., Temperini, M.: An ontology-driven olap system to help teachers in the analysis of web learning object repositories. Inf. Syst, Manag. 27(3), 198–206 (2010)
Article Google Scholar
Limongelli, C., Lombardi, M., Marani, A., Sciarrone, F., Temperini, M.: A recommendation module to help teachers build courses through the moodle learning management system. New Rev. Hypermedia Multimedia 22, 58–82 (2015)
Article Google Scholar
Limongelli, C., Sciarrone, F., Temperini, M.: A social network-based teacher model to support course construction. Comput. Hum. Behav. 51, 1077–1085 (2015)
Article Google Scholar
Revilla Muñoz, O., Alpiste Penalba, F., Fernández Sánchez, J.: The skills, competences, and attitude toward information and communications technology recommender system: an online support program for teachers with personalized recommendations. New Rev. Hypermedia Multimedia 22, 83–110 (2015)
Article Google Scholar
Mesgari, M., Okoli, C., Mehdi, M., Nielsen, F.Å., Lanamki, A.: The sum of all human knowledge : a systematic review ofscholarly research on the content of wikipedia. J. Assoc Inf. Sci. Technol. 66(2), 219–245 (2015)
Article Google Scholar
Forte, A., Bruckman, A.: From wikipedia to the classroom: exploring online publication and learning. In: Proceedings of the 7th International Conference on Learning Sciences, ICLS 2006, pp.182–188. International Society of the Learning Sciences (2006)
Google Scholar
Meyer, M., Rensing, C., Steinmetz, R.: Categorizing Learning Objects Based On Wikipedia as Substitute Corpus. CEUR Workshop Proceedings, September 2007
Google Scholar
Gasparetti, F., Micarelli, A., Sciarrone, F.: A web-based training system for business letter writing. Knowl. Based Syst. 22(4), 287–291 (2009)
Article Google Scholar
Gasparetti, F., Micarelli, A., Sansonetti, G.: Exploiting web browsing activities for user needs identification. In: 2014 International Conference on Computational Science and Computational Intelligence (CSCI), vol. 2, pp. 86–89, March 2014
Google Scholar
Gasparetti, F., Limongelli, C., Sciarrone, F.: A content-based approach for supporting teachers in discovering dependency relationships between instructional units in distance learning environments. In: Stephanidis, C. (ed.) HCII 2015 Posters. CCIS, vol. 529, pp. 241–246. Springer, Heidelberg (2015)
Chapter Google Scholar
Medio, C.D., Gasparetti, F., Limongelli, C., Sciarrone, F., Temperini, M.: Automatic extraction of prerequisites among learning objects using wikipedia-based content analysis. In: Proceedings of the 13th International Conference on Intelligent Tutoring Systems, ITS 2016. Springer (2016)
Google Scholar
Mitchell, T.M.: Machine Learning, 1st edn. McGraw-Hill Inc., New York (1997)
MATH Google Scholar
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann Series in Data Management Systems, 2nd edn. Morgan Kaufmann Publishers Inc., San Francisco (2005)
MATH Google Scholar
Zhao, Y., Zhang, Y.: Comparison of decision tree methods for finding active objects. Adv. Space Res. 41(12), 1955–1959 (2008)
Article Google Scholar
Leon, F., Aignatoaiei, B., Zaharia, M.: Performance analysis of algorithms for protein structure classification. In: 20th International Workshop on Database and Expert Systems Application, 2009, DEXA 2009, pp. 203–207, August 2009
Google Scholar
Gentili, G., Marinilli, M., Micarelli, A., Sciarrone, F.: Text categorization in an intelligent agent for filtering information on the web. IJPRAI 15(3), 527–549 (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Engineering, Roma Tre University, Via della Vasca Navale 79, 00146, Rome, Italy
Carlo De Medio, Fabio Gasparetti, Carla Limongelli & Filippo Sciarrone
Department of Computer, Control and Management Engineering, Sapienza University, Via Ariosto, 25, 00184, Rome, Italy
Carlo De Medio & Marco Temperini

Authors

Carlo De Medio
View author publications
You can also search for this author in PubMed Google Scholar
Fabio Gasparetti
View author publications
You can also search for this author in PubMed Google Scholar
Carla Limongelli
View author publications
You can also search for this author in PubMed Google Scholar
Filippo Sciarrone
View author publications
You can also search for this author in PubMed Google Scholar
Marco Temperini
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Fabio Gasparetti .

Editor information

Editors and Affiliations

Found. for Res. & Tec. - Hellas (FORTH), University of Crete, Heraklion, Greece
Constantine Stephanidis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

De Medio, C., Gasparetti, F., Limongelli, C., Sciarrone, F., Temperini, M. (2016). Mining Prerequisite Relationships Among Learning Objects. In: Stephanidis, C. (eds) HCI International 2016 – Posters' Extended Abstracts. HCI 2016. Communications in Computer and Information Science, vol 618. Springer, Cham. https://doi.org/10.1007/978-3-319-40542-1_35

Download citation

DOI: https://doi.org/10.1007/978-3-319-40542-1_35
Published: 22 June 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-40541-4
Online ISBN: 978-3-319-40542-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Mining Prerequisite Relationships Among Learning Objects

Abstract

Similar content being viewed by others

Discovering Prerequisite Relationships Among Learning Objects: A Coursera-Driven Approach

Constructing an Educational Knowledge Graph with Concepts Linked to Wikipedia

Exploring knowledge graphs for the identification of concept prerequisites

Keywords

1 Introduction

2 Related Works

3 Mining Prerequisites

4 Empirical Evaluation

5 Conclusions

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Mining Prerequisite Relationships Among Learning Objects

Abstract

Similar content being viewed by others

Discovering Prerequisite Relationships Among Learning Objects: A Coursera-Driven Approach

Constructing an Educational Knowledge Graph with Concepts Linked to Wikipedia

Exploring knowledge graphs for the identification of concept prerequisites

Keywords

1 Introduction

2 Related Works

3 Mining Prerequisites

4 Empirical Evaluation

5 Conclusions

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation