Abstract
The use of machine learning techniques for educational purposes (educational data mining) is an emerging field that develops methods for exploring data from computer-based educational settings and discovering meaningful patterns in it. The data these settings store (virtual courses, e-learning log files, students’ demographic and academic data, admissions and registration information, and so on) can serve as input for machine learning algorithms. In this article, we review the most recent articles that apply machine learning techniques for educational purposes and present a case study on predicting students’ marks. Students’ key demographic characteristics and their marks in a small number of written assignments can form the training set for a regression method that predicts a student’s performance. Finally, a prototype software support tool for tutors has been constructed.
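The idea of training a regression method on demographic attributes and early assignment marks can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the feature layout (age, two assignment marks), the function names, the synthetic marks, and the choice of ordinary least squares, which merely stands in for whichever regression method the article actually evaluates.

```python
from typing import List, Sequence


def solve_linear_system(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve a @ x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix [a | b] so each row operation updates both sides.
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular system: features are linearly dependent")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def fit_least_squares(rows: Sequence[Sequence[float]],
                      targets: Sequence[float]) -> List[float]:
    """Return [intercept, w1, ..., wk] minimising the squared error."""
    # Prepend a constant 1.0 to every row so the model learns an intercept.
    x = [[1.0] + [float(v) for v in row] for row in rows]
    k = len(x[0])
    # Normal equations: (X^T X) w = X^T y.
    xtx = [[sum(r[i] * r[j] for r in x) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * t for r, t in zip(x, targets)) for i in range(k)]
    return solve_linear_system(xtx, xty)


def predict(weights: Sequence[float], row: Sequence[float]) -> float:
    """Predicted final mark for one student's feature row."""
    return weights[0] + sum(w * v for w, v in zip(weights[1:], row))


# Hypothetical training set: (age, assignment 1 mark, assignment 2 mark).
students = [(25, 6.0, 7.0), (31, 8.5, 9.0), (42, 4.0, 5.5),
            (28, 7.0, 6.0), (36, 9.0, 8.0), (23, 5.0, 4.0)]
# Synthetic final marks from a made-up linear rule; NOT data from the article.
final_marks = [0.5 - 0.01 * age + 0.3 * a1 + 0.6 * a2
               for age, a1, a2 in students]

weights = fit_least_squares(students, final_marks)
print(round(predict(weights, (30, 7.5, 8.0)), 2))
```

Because the synthetic marks follow an exact linear rule, the fit recovers that rule; real student data would be noisy, and a tutor-facing tool would also need to report the expected prediction error.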
About this article
Cite this article
Kotsiantis, S.B. Use of machine learning techniques for educational proposes: a decision support system for forecasting students’ grades. Artif Intell Rev 37, 331–344 (2012). https://doi.org/10.1007/s10462-011-9234-x
DOI: https://doi.org/10.1007/s10462-011-9234-x