Abstract
The purpose of using web usage mining methods in the area of learning management systems is to reveal the knowledge hidden in the log files of their web and database servers. By applying data mining methods to these data, interesting patterns concerning the users’ behaviour can be identified. They help us to find the most effective structure of the e-learning courses, optimize the learning content, recommend the most suitable learning path based on student’s behaviour, or provide more personalized environment. We prepare six datasets of different quality obtained from logs of the learning management system and pre-processed in different ways. We use three datasets with identified users’ sessions based on 15, 30 and 60 minute session timeout threshold and three another datasets with the same thresholds including reconstructed paths among course activities. We try to assess the impact of different session timeout thresholds with or without paths completion on the quantity and quality of the sequence rule analysis that contribute to the representation of the learners’ behavioural patterns in learning management system. The results show that the session timeout threshold has significant impact on quality and quantity of extracted sequence rules. On the contrary, it is shown that the completion of paths has neither significant impact on quantity nor quality of extracted rules.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ba-Omar, H., Petrounias, I., Anwar, F.: A Framework for Using Web Usage Mining to Personalise E-learning. In: Seventh IEEE International Conference on Advanced Learning Technologies, ICALT 2007, pp. 937–938 (2007)
Crespo Garcia, R.M., Kloos, C.D.: Web Usage Mining in a Blended Learning Context: A Case Study. In: Eighth IEEE International Conference on Advanced Learning Technologies, ICALT 2008, pp. 982–984 (2008)
Chitraa, V., Davamani, A.S.: A Survey on Preprocessing Methods for Web Usage Data. International Journal of Computer Science and Information Security 7 (2010)
Marquardt, C.G., Becker, K., Ruiz, D.D.: A Pre-processing Tool for Web Usage Mining in the Distance Education Domain. In: Proceedings of International Database Engineering and Applications Symposium, IDEAS 2004, pp. 78–87 (2004)
Romero, C., Ventura, S., Garcia, E.: Data Mining in Course Management Systems: Moodle Case Study and Tutorial. Comput. Educ. 51, 368–384 (2008)
Falakmasir, M.H., Habibi, J.: Using Educational Data Mining Methods to Study the Impact of Virtual Classroom in E-Learning. In: Baker, R.S.J.d., Merceron, A., Pavlik, P.I.J. (eds.) 3rd International Conference on Educational Data Mining, Pittsburgh, pp. 241–248 (2010)
Bing, L.: Web Data Mining. Exploring Hyperlinks, Contents and Usage Data. Springer, Heidelberg (2006)
Munk, M., Kapusta, J., Svec, P.: Data Pre-processing Evaluation for Web Log Mining: Reconstruction of Activities of a Web Visitor. Procedia Computer Science 1, 2273–2280 (2010)
Romero, C., Espejo, P.G., Zafra, A., Romero, J.R., Ventura, S.: Web Usage Mining for Predicting Final Marks of Students that Use Moodle Courses. Computer Applications in Engineering Education 26 (2010)
Raju, G.T., Satyanarayana, P.S.: Knowledge Discovery from Web Usage Data: a Complete Preprocessing Methodology. IJCSNS International Journal of Computer Science and Network Security 8 (2008)
Spiliopoulou, M., Mobasher, B., Berendt, B., Nakagawa, M.: A Framework for the Evaluation of Session Reconstruction Heuristics in Web-Usage Analysis. INFORMS J. on Computing 15, 171–190 (2003)
Bayir, M.A., Toroslu, I.H., Cosar, A.: A New Approach for Reactive Web Usage Data Processing. In: Proceedings of 22nd International Conference on Data Engineering Workshops, pp. 44–44 (2006)
Zhang, H., Liang, W.: An Intelligent Algorithm of Data Pre-processing in Web Usage Mining. In: Proceedings of the World Congress on Intelligent Control and Automation (WCICA), pp. 3119–3123 (2004)
Cooley, R., Mobasher, B., Srivastava, J.: Data Preparation for Mining World Wide Web Browsing Patterns. Knowledge and Information Systems 1, 5–32 (1999)
Yan, L., Boqin, F., Qinjiao, M.: Research on Path Completion Technique in Web Usage Mining. In: International Symposium on Computer Science and Computational Technology, ISCSCT 2008, vol. 1, pp. 554–559 (2008)
Yan, L., Boqin, F.: The Construction of Transactions for Web Usage Mining. In: International Conference on Computational Intelligence and Natural Computing, CINC 2009, vol. 1, pp. 121–124 (2009)
Huynh, T.: Empirically Driven Investigation of Dependability and Security Issues in Internet-Centric Systems. Department of Electrical and Computer Engineering. University of Alberta, Edmonton (2010)
Huynh, T., Miller, J.: Empirical Observations on the Session Timeout Threshold. Inf. Process. Manage. 45, 513–528 (2009)
Catledge, L.D., Pitkow, J.E.: Characterizing Browsing Strategies in the World-Wide Web. Comput. Netw. ISDN Syst. 27, 1065–1073 (1995)
Huntington, P., Nicholas, D., Jamali, H.R.: Website Usage Metrics: A Re-assessment of Session Data. Inf. Process. Manage. 44, 358–372 (2008)
Meiss, M., Duncan, J., Goncalves, B., Ramasco, J.J., Menczer, F.: What’s in a Session: Tracking Individual Behavior on the Web. In: Proceedings of the 20th ACM Conference on Hypertext and Hypermedia. ACM, Torino (2009)
Huang, X., Peng, F., An, A., Schuurmans, D.: Dynamic Web Log Session Identification with Statistical Language Models. J. Am. Soc. Inf. Sci. Technol. 55, 1290–1303 (2004)
Goseva-Popstojanova, K., Mazimdar, S., Singh, A.D.: Empirical Study of Session-Based Workload and Reliability for Web Servers. In: Proceedings of the 15th International Symposium on Software Reliability Engineering. IEEE Computer Society, Los Alamitos (2004)
Tian, J., Rudraraju, S., Zhao, L.: Evaluating Web Software Reliability Based on Workload and Failure Data Extracted from Server Logs. IEEE Transactions on Software Engineering 30, 754–769 (2004)
Chen, Z., Fowler, R.H., Fu, A.W.-C.: Linear Time Algorithms for Finding Maximal Forward References. In: Proceedings of the International Conference on Information Technology: Computers and Communications. IEEE Computer Society, Los Alamitos (2003)
Borbinha, J., Baker, T., Mahoui, M., Jo Cunningham, S.: A comparative transaction log analysis of two computing collections. In: Borbinha, J.L., Baker, T. (eds.) ECDL 2000. LNCS, vol. 1923, pp. 418–423. Springer, Heidelberg (2000)
Kohavi, R., Mason, L., Parekh, R., Zheng, Z.: Lessons and Challenges from Mining Retail E-Commerce Data. Mach. Learn. 57, 83–113 (2004)
Munk, M., Kapusta, J., Švec, P., Turčáni, M.: Data Advance Preparation Factors Affecting Results of Sequence Rule Analysis in Web Log Mining. E+M Economics and Management 13, 143–160 (2010)
Agrawal, R., Imieliski, Swami, A.: Mining Association Rules Between Sets of Items in Large Databases. In: Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data. ACM, Washington, D.C (1993)
Agrawal, R., Srikant, R.: Fast Algorithms for Mining Association Rules in Large Databases. In: Proceedings of the 20th International Conference on Very Large Data Bases. Morgan Kaufmann Publishers Inc., San Francisco (1994)
Han, J., Lakshmanan, L.V.S., Pei, J.: Scalable Frequent-pattern Mining Methods: an Overview. In: Tutorial notes of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, San Francisco (2001)
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann, New York (2000)
Electronic Statistics Textbook. StatSoft, Tulsa (2010)
Romero, C., Ventura, S.: Educational Data Mining: A Survey from 1995 to 2005. Expert Systems with Applications 33, 135–146 (2007)
Berry, M.J., Linoff, G.S.: Data Mining Techniques: For Marketing, Sales, and Customer Relationship Management. Wiley Publishing, Inc., Chichester (2004)
Hays, W.L.: Statistics. CBS College Publishing, New York (1988)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Munk, M., Drlik, M. (2011). Influence of Different Session Timeouts Thresholds on Results of Sequence Rule Analysis in Educational Data Mining. In: Cherifi, H., Zain, J.M., El-Qawasmeh, E. (eds) Digital Information and Communication Technology and Its Applications. DICTAP 2011. Communications in Computer and Information Science, vol 166. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21984-9_6
Download citation
DOI: https://doi.org/10.1007/978-3-642-21984-9_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21983-2
Online ISBN: 978-3-642-21984-9
eBook Packages: Computer ScienceComputer Science (R0)