Comparative Study of Supervised Algorithms for Prediction of Students’ Performance

Full Text (PDF, 819KB), PP.1-21

Views: 0 Downloads: 0

Author(s)

Madhuri T. Sathe 1,* Amol C. Adamuthe 2

1. Department of Computer Engineering, Rajarambapu Institute of Technology, Rajaramnagar, MS, India

2. Department of Computer Science and Information Technology, Rajarambapu Institute of Technology, Rajaramnagar, MS, India

* Corresponding author.

DOI: https://doi.org/10.5815/ijmecs.2021.01.01

Received: 8 May 2020 / Revised: 2 Jun. 2020 / Accepted: 28 Jun. 2020 / Published: 8 Feb. 2021

Index Terms

Educational data mining, Machine learning, Random forest, C5.0

Abstract

Predicting academic performance of the student is crucial task as it depends on various factors. To perform such predictions the machine learning and data mining algorithms are useful. This paper presents investigation of application of C5.0, J48, CART, Naïve Bayes (NB), K-Nearest Neighbour (KNN), Random Forest and Support Vector Machine for prediction of students’ performance. Three datasets from school level, college level and e-learning platform with varying input parameters are considered for comparison between C5.0, NB, J48, Multilayer Perceptron (MLP), PART, Random Forest, BayesNet, and Artificial Neural Network (ANN). Paper presents comparative results of C5.0, J48, CART, NB, KNN, Random forest and SVM on changing tuning parameters. The performance of these techniques is tested on three different datasets. Results show that the performances of Random forest and C5.0 are better than J48, CART, NB, KNN, and SVM.

Cite This Paper

Madhuri T. Sathe, Amol C. Adamuthe, "Comparative Study of Supervised Algorithms for Prediction of Students’ Performance", International Journal of Modern Education and Computer Science(IJMECS), Vol.13, No.1, pp. 1-21, 2021.DOI: 10.5815/ijmecs.2021.01.01

Reference

[1] http://educationaldatamining.org/

[2] Mahboob, T., Irfan, S., & Karamat, A., “A machine learning approach for student assessment in E-learning using Quinlan's C4. 5, Naive Bayes and Random Forest algorithms,” IEEE 19th International Multi-Topic Conference (INMIC), pp. 1-8, December 2016.

[3] Marbouti, F., Diefes-Dux, H. A., & Madhavan, K., “Models for early prediction of at-risk students in a course using standards-based grading”, Computers & Education, vol 103,pp 1-15, 2016.

[4] Costa, E. B., Fonseca, B., Santana, M. A., de Araújo, F. F., & Rego, J., “Evaluating the effectiveness of educational data mining techniques for early prediction of students' academic failure in introductory programming courses”, Computers in Human Behavior, vol 73, pp 247-256, 2017.

[5] Zhang, X., Xue, R., Liu, B., Lu, W., & Zhang, Y., “Grade Prediction of Student Academic Performance with Multiple Classification Models”, IEEE 14th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD), pp. 1086-1090, 2018.

[6] Al-Shehri, H., Al-Qarni, A., Al-Saati, L., Batoaq, A., Badukhen, H., Alrashed, S., ... & Olatunji, S. O., “Student performance prediction using support vector machine and k-nearest neighbor”, IEEE 30th Canadian Conference on Electrical and Computer Engineering (CCECE) pp. 1-4, April 2017.

[7] Amrieh, E. A., Hamtini, T., & Aljarah, I., “Preprocessing and analyzing educational data set using X-API for improving student's performance”, IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT), pp. 1-5, November 2015.

[8] Hussain, S., Dahan, N. A., Ba-Alwib, F. M., & Ribata, N., “Educational data mining and analysis of students’ academic performance using WEKA”, Indonesian Journal of Electrical Engineering and Computer Science, vol 9(2), pp. 447-459, 2018.

[9] Parack, S., Zahid, Z., & Merchant, F.,” Application of data mining in educational databases for predicting academic trends and patterns”, IEEE International Conference on Technology Enhanced Education (ICTEE), pp. 1-4, January 2012.

[10] Alfiani, A. P., & Wulandari, F. A. (2015). Mapping student's performance based on data mining approach (a case study). Agriculture and Agricultural Science Procedia, 3, 173-177. 2015

[11] Christian, T. M., & Ayub, M., “Exploration of classification using NBTree for predicting students' performance”, IEEE International Conference on Data and Software Engineering (ICODSE), pp. 1-6, November 2014.

[12] Natek, S., & Zwilling, M.,“Student data mining solution–knowledge management system related to higher education institutions” Expert systems with applications, vol. 41(14), pp. 6400-6407, 2014.

[13] Goga, M., Kuyoro, S., & Goga, N., “A recommender for improving the student academic performance”, Procedia-Social and Behavioral Sciences, vol. 180, pp. 1481-1488, 2015.

[14] Huang, S., & Fang, N., “Predicting student academic performance in an engineering dynamics course: A comparison of four types of predictive mathematical models” Computers & Education, vol. 61, pp. 133-145, 2013.

[15] Guruler, H., Istanbullu, A., & Karahasan, M., “A new student performance analysing system using knowledge discovery in higher educational databases”, Computers & Education, vol. 55(1), pp. 247-254, 2010.

[16] Sorour, S. E., & Mine, T., “Building an interpretable model of predicting student performance using comment data mining”, IEEE 5th IIAI International Congress on Advanced Applied Informatics (IIAI-AAI), pp. 285-291, July 2016.

[17] Vandamme, J. P., Meskens, N., & Superby, J. F., “Predicting academic performance by data mining methods”, Education Economics, vol. 15(4), pp. 405-419, 2007.

[18] Gray, G., McGuinness, C., & Owende, P., “An application of classification models to predict learner progression in tertiary education”, IEEE International Advance Computing Conference (IACC), pp. 549-554, February 2014.

[19] Macfadyen, L. P., & Dawson, S., “Mining LMS data to develop an “early warning system” for educators: A proof of concept”, Computers & education, vol. 54(2), pp. 588-599, 2010.

[20] Hoffait, A. S., & Schyns, M., “Early detection of university students with potential difficulties”, Decision Support Systems, vol. 101, pp. 1-11, 2017.

[21] Palmer, S., “Modelling engineering student academic performance using academic analytics”, International journal of engineering education, vol. 29(1), pp. 132-138, 2013.

[22] Romero, C., Espejo, P. G., Zafra, A., Romero, J. R., & Ventura, S., “Web usage mining for predicting final marks of students that use Moodle courses”, Computer Applications in Engineering Education, vol. 21(1), pp. 135-146, 2013.

[23] Mishra, T., Kumar, D., & Gupta, S., “Mining students’ data for performance prediction”, Proceedings of international conference on advanced computing & communication technologies, pp. 255-263, February 2014.

[24] Miguéis, V. L., Freitas, A., Garcia, P. J., & Silva, A., “Early segmentation of students according to their academic performance: A predictive modelling approach”, Decision Support Systems, vol. 115, pp. 36-51, 2018.

[25] Laugerman, M., Rover, D. T., Shelley, M. C., & Mickelson, S. K., “Determining graduation rates in engineering for community college transfer students using data mining”, International Journal of Engineering Education, vol. 31(6A), pp. 1448, 2015.

[26] Romero, C., & Ventura, S., “Data mining in education”, Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, vol. 3(1), pp. 12-27, 2013.

[27] Arsad, P. M., & Buniyamin, N., “A neural network students' performance prediction model (NNSPPM)”, IEEE International Conference on Smart Instrumentation, Measurement and Applications (ICSIMA), pp. 1-5, November 2013.

[28] Roy, S., & Garg, A., “Predicting academic performance of student using classification techniques”, 4th IEEE Uttar Pradesh Section International Conference on Electrical, Computer and Electronics (UPCON), pp. 568-572, October 2017.

[29] Mueen, A., Zafar, B., & Manzoor, U., “Modeling and predicting students' academic performance using data mining techniques”, International Journal of Modern Education and Computer Science, vol. 8(11), pp. 36-42, 2016.

[30] Khan, I. H., “A Unified Framework for Systematic Evaluation of ABET Student Outcomes and Program Educational Objectives”, 2019.

[31] Kumar, M., Singh, A. J., & Handa, D., “Literature survey on student’s performance prediction in education using data mining techniques”, International Journal of Education and Management Engineering, vol. 7(6), pp. 40-49, 2017.

[32] Inyang, U. G., Eyoh, I. J., Robinson, S. A., & Udo, E. N., “Visual Association Analytics Approach to Predictive Modelling of Students’ Academic Performance”, 2019.

[33] https://archive.ics.uci.edu/ml/datasets/Student+Academics+Performance#

[34] https://archive.ics.uci.edu/ml/datasets/student+performance

[35] https://www.kaggle.com/aljarah/xAPI-Edu-Data

[36] Badr, G., Algobail, A., Almutairi, H., & Almutery, M., “Predicting students’ performance in university courses: a case study and tool in KSU mathematics department”, Procedia Computer Science, vol. 82, pp. 80-89, 2016.

[37] https://www.statisticssolutions.com/correlation-pearson-kendall-spearman/

[38] Drazin, S., & Montag, M., “Decision tree analysis using weka”, Machine Learning-Project II, University of Miami, pp. 1-3, 2012.

[39] Azmi, M. S. B. M., & Paris, I. H. B. M., “Academic performance prediction based on voting technique”, IEEE 3rd International Conference on Communication Software and Networks, pp. 24-27, May 2011.

[40] Amra, I. A. A., & Maghari, A. Y., “Students performance prediction using KNN and Naïve Bayesian”, 8th International Conference on Information Technology (ICIT), pp. 909-913, May 2017.

[41] Sokolova, M., & Lapalme, G., “A systematic analysis of performance measures for classification tasks”, Information processing & management, vol. 45(4), pp. 427-437, 2009.