Predicting Student Performance from Combined Data Sources

Wolff, Annika; Zdrahal, Zdenek; Herrmannova, Drahomira; Knoth, Petr

doi:10.1007/978-3-319-02738-8_7

Predicting Student Performance from Combined Data Sources

Annika Wolff³,
Zdenek Zdrahal³,
Drahomira Herrmannova³ &
…
Petr Knoth³

Chapter
First Online: 01 January 2013

3990 Accesses
15 Citations

Part of the book series: Studies in Computational Intelligence ((SCI,volume 524))

Abstract

This chapter will explore the use of predictive modeling methods for identifying students who will benefit most from tutor interventions. This is a growing area of research and is especially useful in distance learning where tutors and students do not meet face to face. The methods discussed will include decision-tree classification, support vector machine (SVM), general unary hypotheses automaton (GUHA), Bayesian networks, and linear and logistic regression. These methods have been trialed through building and testing predictive models using data from several Open University (OU) modules. The Open University offers a good test-bed for this work, as it is one of the largest distance learning institutions in Europe. The chapter will discuss how the predictive capacity of the different sources of data changes as the course progresses. It will also highlight the importance of understanding how a student’s pattern of behavior changes during the course.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Abbreviations

ANOVA:: Analysis of variance
CMS:: Course management system
CS:: Course signals
GUHA:: General unary hypotheses automaton
MOOC:: Massive open online course
OU:: Open university
SVM:: Support vector machine
TMA:: Tutor marked assessment
VLE:: Virtual learning environment

References

Kabra, R.R., Bichkar, R.S.: Performance prediction of engineering students using decision trees. Int. J. Comput. Appl. 36(11), 8–12 (2011)
Google Scholar
Baradwaj, B., Pal, S.: Mining educational data to analyze student’s performance. Int. J. Adv. Comput. Sci. Appl. 2(6), 63–69 (2011)
Google Scholar
Pandey, M., Sharma, V.K.: A decision tree algorithm pertaining to the student performance analysis and prediction. Int. J. Comput. Appl. 61(13), 1–5 (2013)
Google Scholar
Baepler, P., Murdoch, C.J.: Academic analytics and data mining in higher education. Int. J. Sch. Teach. Learn. 4(2), 1–9 (2010)
Google Scholar
Arnold, K.E., Pistilli, M.D.: Course signals at purdue: using learning analytics to increase student success. In: 2nd International Conference on Learning Analytics and Knowledge, pp. 267–270. ACM, New York (2012)
Google Scholar
Pistilli, M.D., Arnold, K.E.: Purdue signals: mining real-time academic data to enhance student success. About Campus 15(3), 22–24 (2010)
Article Google Scholar
Peng, H., Long, F., Ding, C.: Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans. Pattern Anal. Mach. Intell. 27(8), 1226–1238 (2005)
Article Google Scholar
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993)
Google Scholar
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. SIGKDD Explor. 11(1), 10–18 (2009)
Article Google Scholar
Hájek, P., Holeňa, M., Rauch, J.: The GUHA method and its meaning for data mining. J. Comput. Syst. Sci. 76(1), 34–48 (2010)
Google Scholar
Rauch, J.: GUHA method and the LISp-miner system. In: Observational Calculi and Association Rules. Studies of Computational Intelligence, vol. 469, pp. 233–260. Springer, Heidelberg (2013)
Google Scholar
Koller, D., Friedman, F.: Probabilistic Graphical Models. MIT Press, Cambridge (2009)
MATH Google Scholar
Bishop, C. M.: A new framework for machine learning. In: Zurada, J.M., Yen, G.G., Wang, J. (eds.) Computational Intelligence: Research Frontiers, IEEE World Congress on Computational Intelligence. LNCS, vol. 5050, pp. 1–24. Springer, Heidelberg (2008)
Google Scholar
Minka, T., Winn, J., Guiver, J., Knowles, D.: Infer.NET 2.5, Microsoft Research, Cambridge (2012)
Google Scholar

Download references

Acknowledgments

We would like to acknowledge the help and support of JISC and the contribution from Microsoft Research.

Author information

Authors and Affiliations

Knowledge Media Institute, The Open University, Milton Keynes, MK7, 6AA, UK
Annika Wolff, Zdenek Zdrahal, Drahomira Herrmannova & Petr Knoth

Authors

Annika Wolff
View author publications
You can also search for this author in PubMed Google Scholar
Zdenek Zdrahal
View author publications
You can also search for this author in PubMed Google Scholar
Drahomira Herrmannova
View author publications
You can also search for this author in PubMed Google Scholar
Petr Knoth
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Annika Wolff .

Editor information

Editors and Affiliations

Escuela Superior de Ingeniería Mecánica y Eléctrica, Zacatenco (ESIME-Z), World Outreach Light to the Nations Ministries (WOLNM), Instituto Politécnico Nacional (IPN), Gustavo A. Madero, Mexico City, Distrito Federal, Mexico
Alejandro Peña-Ayala

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Wolff, A., Zdrahal, Z., Herrmannova, D., Knoth, P. (2014). Predicting Student Performance from Combined Data Sources. In: Peña-Ayala, A. (eds) Educational Data Mining. Studies in Computational Intelligence, vol 524. Springer, Cham. https://doi.org/10.1007/978-3-319-02738-8_7

Download citation

DOI: https://doi.org/10.1007/978-3-319-02738-8_7
Published: 07 November 2013
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-02737-1
Online ISBN: 978-3-319-02738-8
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics