1. bookVolume 26 (2021): Edizione 2 (December 2021)
Dettagli della rivista
License
Formato
Rivista
eISSN
2255-8691
Prima pubblicazione
08 Nov 2012
Frequenza di pubblicazione
2 volte all'anno
Lingue
Inglese
Accesso libero

Academic Performance Modelling with Machine Learning Based on Cognitive and Non-Cognitive Features

Pubblicato online: 30 Dec 2021
Volume & Edizione: Volume 26 (2021) - Edizione 2 (December 2021)
Pagine: 122 - 131
Dettagli della rivista
License
Formato
Rivista
eISSN
2255-8691
Prima pubblicazione
08 Nov 2012
Frequenza di pubblicazione
2 volte all'anno
Lingue
Inglese
Abstract

The academic performance of students is essential for academic progression at all levels of education. However, the availability of several cognitive and non-cognitive factors that influence students’ academic performance makes it challenging for academic authorities to use conventional analytical tools to extract hidden knowledge in educational data. Therefore, Educational Data Mining (EDM) requires computational techniques to simplify planning and determining students who might be at risk of failing or dropping from school due to academic performance, thus helping resolve student retention. The paper studies several cognitive and non-cognitive factors such as academic, demographic, social and behavioural and their effect on student academic performance using machine learning algorithms. Heterogenous lazy and eager machine learning classifiers, including Decision Tree (DT), K-Nearest-Neighbour (KNN), Artificial Neural Network (ANN), Logistic Regression (LR), Random Forest (RF), AdaBoost and Support Vector Machine (SVM) were adopted and training was performed based on k-fold (k = 10) and leave-one-out cross-validation. We evaluated their predictive performance using well-known evaluation metrics like Area under Curve (AUC), F-1 score, Precision, Accuracy, Kappa, Matthew’s correlation coefficient (MCC) and Recall. The study outcome shows that Student Absence Days (SAD) are the most significant predictor of students’ academic performance. In terms of prediction accuracy and AUC, the RF (Acc = 0.771, AUC = 0.903), LR (Acc = 0.779, AUC = 0.90) and ANN (Acc = 0.760, AUC = 0.895) outperformed all other algorithms (KNN (Acc = 0.638, AUC = 0.826), SVM (Acc = 0.727, AUC = 0.80), DT (Acc = 0.733, AUC = 0.876) and AdaBoost (Acc = 0.748, AUC = 0.808)), making them more suitable for predicting students’ academic performance.

Keywords

[1] E. I. Ani, “Debating the roots of poor academic performance in the West African subregion: The perspective of a philosopher,” SAGE Open, vol. 7, no. 2, Art no. 2158244017707795, May 2017. https://doi.org/10.1177/215824401770779510.1177/2158244017707795 Search in Google Scholar

[2] B. G. Adams, N. Wiium, and A. Abubakar, “Developmental assets and academic performance of adolescents in Ghana, Kenya, and South Africa,” Child & Youth Care Forum, vol. 48, no. 2, pp. 207–222, Nov. 2019. https://doi.org/10.1007/s10566-018-9480-z10.1007/s10566-018-9480-z Search in Google Scholar

[3] S. Venkatesh, Y. K. Rao, H. Nagaraja, T. Woolley, F. O. Alele, and B. S. Malau-Aduli, “Factors influencing medical students’ experiences and satisfaction with blended integrated e-learning,” Medical Principles and Practice, vol. 29, no. 4, pp. 396–402, Jul. 2020. https://doi.org/10.1159/00050521010.1159/000505210744566831801145 Search in Google Scholar

[4] L. F. Casinillo, M. A. E. Palen, E. L. Casinillo, and P. G. Batidor, “Assessing senior high student’s learning experiences in mathematics,” Indonesian Journal of Educational Studies, vol. 23, no. 1, pp. 44–60, 2020. https://doi.org/10.26858/ijes.v23i1.1343710.26858/ijes.v23i1.13437 Search in Google Scholar

[5] C. Semeraro, D. Giofrè, G. Coppola, D. Lucangeli, and R. Cassibba, “The role of cognitive and non-cognitive factors in mathematics achievement: The importance of the quality of the student-teacher relationship in middle school,” PLoS ONE, vol. 15, no. 4, Art no. e0231381, 2020. https://doi.org/10.1371/journal.pone.023138110.1371/journal.pone.0231381717024732310988 Search in Google Scholar

[6] R. Ghorbani and R. Ghousi, “Comparing different resampling methods in predicting students’ performance using machine learning techniques,” IEEE Access, vol. 8, pp. 67899–67911, Apr. 2020. https://doi.org/10.1109/ACCESS.2020.298680910.1109/ACCESS.2020.2986809 Search in Google Scholar

[7] I. K. Nti, A. Y. Appiah, and O. Nyarko-Boateng, “Assessment and prediction of earthing resistance in domestic installation,” Engineering Reports, vol. 2, no. 1, Art no. e12090, Jan. 2020. https://doi.org/10.1002/eng2.1209010.1002/eng2.12090 Search in Google Scholar

[8] I. K. Nti, M. Teimeh, A. F. Adekoya, and O. Nyarko-Boateng, “Forecasting electricity consumption of residential users based on lifestyle data using artificial neural networks,” ICTACT Journal on Soft Computing, vol. 10, no. 3, pp. 2107–2116, 2020. Search in Google Scholar

[9] O. Nyarko-Boateng, A. F. Adekoya, and B. A. Weyori, “Predicting the actual location of faults in underground optical networks using linear regression,” Engineering Reports, vol. 3, no. 3, Art no. e212304, Mar. 2021. https://doi.org/10.1002/eng2.1230410.1002/eng2.12304 Search in Google Scholar

[10] O. Nyarko-Boateng, A. F. Adekoya, and B. A. Weyori, “Tracing the exact location of failures in underground optical networks using LSTM deep learning model,” Indian Journal of Science and Technology, vol. 14, no. 4, pp. 297–309, 2021. https://doi.org/10.17485/IJST/v14i4.200810.17485/IJST/v14i4.2008 Search in Google Scholar

[11] I. K. Nti, A. F. Adekoya, and B. A. Weyori, “Efficient stock-market prediction using ensemble support vector machine,” Open Computer Science, vol. 10, no. 1, pp. 153–163, Jul. 2020. https://doi.org/10.1515/comp-2020-019910.1515/comp-2020-0199 Search in Google Scholar

[12] I. K. Nti, A. F. Adekoya, and B. A. Weyori, “Random forest based feature selection of macroeconomic variables for stock market prediction,” American Journal of Applied Sciences, vol. 16, no. 7, pp. 200–212, Jul. 2019. https://doi.org/10.3844/ajassp.2019.200.21210.3844/ajassp.2019.200.212 Search in Google Scholar

[13] F. Ecer, S. Ardabili, S. S. Band, and A. Mosavi, “Training multilayer perceptron with genetic algorithms and particle swarm optimization for modeling stock price index prediction,” Entropy, vol. 22, no. 11, Art no. 1239, 2020. https://doi.org/10.3390/e2211123910.3390/e22111239771211133287007 Search in Google Scholar

[14] Isha, S. Dixit, M. K. Ahirwar, D. Sakethnath, and M. Rakha, “Stock prediction by analyzing the past market trend,” in 2021 9th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO), Noida, India, Sep. 2021, pp. 1–4. https://doi.org/10.1109/ICRITO51393.2021.959626310.1109/ICRITO51393.2021.9596263 Search in Google Scholar

[15] I. K. Nti and J. A. Quarcoo, “Self-motivation and academic performance in computer programming language using a hybridised machine learning technique,” International Journal of Artificial Intelligence and Expert Systems, vol. 8, no. 2, pp. 12–30, 2019. Search in Google Scholar

[16] D. Aggarwal, S. Mittal, and V. Bali, “Prediction model for classifying students based on performance using machine learning techniques,” International Journal of Recent Technology and Engineering, vol. 8, no. 2S7, pp. 496–503, Jul. 2019. https://doi.org/10.35940/ijrte.B1093.0782S71910.35940/ijrte.B1093.0782S719 Search in Google Scholar

[17] S. Hussain, N. A. Dahan, F. M. Ba-Alwi, and N. Ribata, “Educational data mining and analysis of students’ academic performance using WEKA,” Indonesian Journal of Electrical Engineering and Computer Science, vol. 9, no. 2, pp. 447–459, 2018. https://doi.org/10.11591/ijeecs.v9.i2.pp447-45910.11591/ijeecs.v9.i2.pp447-459 Search in Google Scholar

[18] H. Almarabeh, “Analysis of students’ performance by using different data mining classifiers,” International Journal of Modern Education and Computer Science, vol. 9, no. 8, pp. 9–15, Aug. 2017. https://doi.org/10.5815/ijmecs.2017.08.0210.5815/ijmecs.2017.08.02 Search in Google Scholar

[19] K. T. Chui, R. W. Liu, M. Zhao, and P. O. De Pablos, “Predicting students’ performance with school and family tutoring using generative adversarial network-based deep support vector machine,” IEEE Access, vol. 8, pp. 86745–86752, May 2020. https://doi.org/10.1109/ACCESS.2020.299286910.1109/ACCESS.2020.2992869 Search in Google Scholar

[20] A. K. Hamoud, A. S. Hashim, and W. A. Awadh, “Predicting student performance in higher education institutions using decision tree analysis,” International Journal of Interactive Multimedia and Artificial Intelligence, vol. 5, no. 2, pp. 26–31, 2018. https://doi.org/10.9781/ijimai.2018.02.00410.9781/ijimai.2018.02.004 Search in Google Scholar

[21] M. Akour, H. Al Sghaier, and O. Al Qasem, “The effectiveness of using deep learning algorithms in predicting students achievements,” Indonesian Journal of Electrical Engineering and Computer Science, vol. 19, no. 1, pp. 388–394, 2020. https://doi.org/10.11591/ijeecs.v19.i1.pp388-39410.11591/ijeecs.v19.i1.pp388-394 Search in Google Scholar

[22] Z. Ahmad and E. Shahzadi, “Prediction of students’ academic performance using artificial neural network,” Bulletin of Education and Research, vol. 40, no. 3, pp. 157–164, 2018. Search in Google Scholar

[23] I. Burman and S. Som, “Predicting students academic performance using support vector machine,” in 2019 Amity International Conference on Artificial Intelligence (AICAI), Dubai, United Arab Emirates, Feb. 2019, pp. 756–759. https://doi.org/10.1109/AICAI.2019.870126010.1109/AICAI.2019.8701260 Search in Google Scholar

[24] V. Cernat and L. Moldovan, “Emotional problems and academic performance of students in manufacturing,” in 11th International Conference on Interdisciplinarity in Engineering, Tirgu Mures, Romania, Oct. 2017. Procedia Manufacturing, vol. 22, pp. 833–839, 2018. https://doi.org/10.1016/j.promfg.2018.03.11810.1016/j.promfg.2018.03.118 Search in Google Scholar

[25] D. Oreski, S. Oreski, and B. Klicek, “Effects of dataset characteristics on the performance of feature selection techniques,” Applied Soft Computing, vol. 52, pp. 109–119, 2017. https://doi.org/10.1016/j.asoc.2016.12.02310.1016/j.asoc.2016.12.023 Search in Google Scholar

[26] I. H. Sarker, “Machine learning: Algorithms, real-world applications and research directions,” SN Computer Science, vol. 2, no. 3, Art no. 160, Mar. 2021. https://doi.org/10.1007/s42979-021-00592-x10.1007/s42979-021-00592-x798309133778771 Search in Google Scholar

[27] D. C. T. Hernández, “An experimental study of K* algorithm,” International Journal of Information Engineering and Electronic Business, vol. 7, no. 2, pp. 14–19, Mar. 2015. https://doi.org/10.5815/ijieeb.2015.02.0310.5815/ijieeb.2015.02.03 Search in Google Scholar

[28] A. M. Adam, “Sample size determination in survey research,” Journal of Scientific Research and Reports, vol. 26, no. 5, pp. 90–97, June 2020. https://doi.org/10.9734/JSRR/2020/v26i53026310.9734/jsrr/2020/v26i530263 Search in Google Scholar

[29] J. G. Carbonell, R. S. Michalski, and T. M. Mitchell, “An overview of machine learning,” in Machine Learning: An Artificial Intelligence Approach, R. S. Michalski, J. G. Carbonell, and T. M. Mitchell, Eds. Morgan Kaufmann, vol. 1, 1983, pp. 3–23. https://doi.org/10.1016/B978-0-08-051054-5.50005-410.1016/B978-0-08-051054-5.50005-4 Search in Google Scholar

[30] G. Bonaccorso, Machine Learning Algorithms. Packt Publishing Ltd., 2017. Search in Google Scholar

[31] A. Dey, “Machine learning algorithms: A review,” International Journal of Computer Science and Information Technologies, vol. 7, no. 3, pp. 1174–1179, 2016. Search in Google Scholar

[32] S. Abirami and P. Chitra, “Energy-efficient edge based real-time healthcare support system,” in The Digital Twin Paradigm for Smarter Systems and Environments: The Industry Use Cases. Advances in Computers, P. Raj and E. Preetha, Eds. Elsevier,vol. 117, no. 1, pp. 339–368, Oct. 2020. https://doi.org/10.1016/bs.adcom.2019.09.00710.1016/bs.adcom.2019.09.007 Search in Google Scholar

[33] H.-B. Ly, T.-A. Nguyen, and B. T. Pham, “Estimation of soil cohesion using machine learning method: A random forest approach,” Advances in Civil Engineering, vol. 2021, Art no. 8873993, 2021. https://doi.org/10.1155/2021/887399310.1155/2021/8873993 Search in Google Scholar

[34] I. K. Nti, A. F. Adekoya, and B. A. Weyori, “A comprehensive evaluation of ensemble learning for stock-market prediction,” Journal of Big Data, vol. 7, no. 1, Art no. 20, Mar. 2020. https://doi.org/10.1186/s40537-020-00299-510.1186/s40537-020-00299-5 Search in Google Scholar

[35] V. Ramesh, P. Parkavi, and K. Ramar, “Predicting student performance: A statistical and data mining approach,” International Journal of Computer Applications, vol. 63, no. 8, pp. 35–39, Feb. 2013. https://doi.org/10.5120/10489-524210.5120/10489-5242 Search in Google Scholar

[36] P. Kaur, M. Singh, and G. S. Josan, “Classification and prediction based data mining algorithms to predict slow learners in education sector,” in 3rd International Conference on Recent Trends in Computing, Delhi, India, Mar. 2015. Procedia Computer Science, vol. 57, pp. 500–508, 2015. https://doi.org/10.1016/j.procs.2015.07.37210.1016/j.procs.2015.07.372 Search in Google Scholar

[37] I. K. Nti, A. F. Adekoya, and B. A. Weyori, “A systematic review of fundamental and technical analysis of stock market predictions,” Artificial Intelligence Review, vol. 53, no. 4, pp. 3007–3057, Aug. 2019. https://doi.org/10.1007/s10462-019-09754-z10.1007/s10462-019-09754-z Search in Google Scholar

[38] A. A. Taha and S. J. Malebary, “An intelligent approach to credit card fraud detection using an optimized light gradient boosting machine,” IEEE Access, vol. 8, pp. 25579–25587, Feb. 2020. https://doi.org/10.1109/ACCESS.2020.297135410.1109/ACCESS.2020.2971354 Search in Google Scholar

Articoli consigliati da Trend MD

Pianifica la tua conferenza remota con Sciendo