An Ensemble Learning Approach for Student Performance Analysis of a Higher Educational Institute using a SHAP-Based Feature Selection and Optuna Optimization

Open AccessArticle

An Ensemble Learning Approach for Student Performance Analysis of a Higher Educational Institute using a SHAP-Based Feature Selection and Optuna Optimization

Volume 11, Issue 2, Page No 1–11, 2026

1 Electrical and Computer Engineering Department, Rafik Hariri University, Meshref, Lebanon
2 Department of Financial Studies, Rafik Hariri University, Meshref, Lebanon
3 Department of Human Resources, Saint Joseph University of Beirut, Beirut, Lebanon
4 Department of Management and Marketing Studies, Rafik Hariri University, Meshref, Lebanon
*whom correspondence should be addressed. E-mail: nassreddinega@rhu.edu.lb

Adv. Sci. Technol. Eng. Syst. J. 11(2), 1–11 (2026); crossref symbol DOI: 10.25046/aj110201

Keywords: Student performance, Higher education, Ensemble learning, AI-powered tools, Explanaible AI, SHAP feature selection, Optuna optimization

Received: 21 January 2026, Revised: 7 March 2026, Accepted: 10 March 2026, Published Online: 19 March 2026
(This article belongs to the SP20 (Special Issue on Multidisciplinary Frontiers in Engineering, Computing and Applied Sciences 2026) & Section Artificial Intelligence in Computer Science (CAI))
282 Downloads
Export Citations

Forecasting and assessing student performance are crucial for allowing educators to pinpoint deficiencies and promote grade improvement. A thorough comprehension of feature contributions is crucial for improving model interpretability and facilitating informed decision-making in academic institutions. Explainable artificial intelligence encompasses methodologies and strategies designed to deliver transparent and accessible rationales for the decisions rendered by artificial intelligence and machine learning algorithms. In this research paper, an interpretable gradient boosting approach for predicting student performance is introduced, including both feature selection using SHapley Additive exPlanations (SHAP)-based features and cost-sensitive decision thresholds. The proposed methodology includes hybrid resampling with SMOTE-Tomek, Optuna hyperparameter optimization with stratified cross-validation, and SHAP-guided feature selection strategy. The proposed approach is tested using a dataset of a higher educational institute in the Middle East, including student information, learning management, and a video interaction system, to make an analysis and evaluate the performance of the students. The results show both improvement of the macro F1-score and the fail-class recall by achieving an accuracy of 94%, a weighted/macro F1-score of 0.9399, and a fail-class recall of 0.9619. The suggested method facilitates trade-offs among prediction accuracy, interpretability, and fairness, bridging the divide between high-performing machine learning models and practical educational applications, hence aiding in the formulation of data-driven policies and the customization of learning experiences.

  1. C. Koukaras, S. G. Stavrinides, E. Hatzikraniotis, M. Mitsiaki, P. Koukaras, C. Tjortjis, “Navigating the future of education: A review on telecommunications and AI technologies, ethical implications, and equity challenges,” Telecom, 7(2), MDPI, 2026, https://doi.org/10.3390/telecom7010002.
  2. P. Naayini, “AI and the future of education: Advancing personalized learning and intelligent tutoring systems,” Frontiers in Educational Innovation and Research, 1(1), 29–39, 2025, https://doi.org/10.62762/FEIR.2025.332098.
  3. S. Arora, A. Rajesh, R. Misra, G. Singh, “Bridging technology and trust: the role of AI-driven robo-advisors in middle-class financial management,” Management Decision, 1–24, 2025, https://doi.org/10.1108/MD-01-2025-0093.
  4. T. Heafner, D. Maxwell, “CIVIC: Five pillars for using artificial intelligence in social studies education,” Society for Information Technology & Teacher Education International Conference, 2581–2589, AACE, 2025.
  5. R. Daher, “Integrating AI literacy into teacher education: a critical perspective paper,” Discover Artificial Intelligence, 5(1), 217, 2025, https://doi.org/10.1007/s44163-025-00475-7.
  6. G. Nassreddine, L. Saleh, M. Al Majzoub, A. El Arid, “SHAP explainability: An ensemble learning approach for student performance prediction,” 2025 IEEE International Conference on Industry 4.0, Artificial Intelligence, and Communications Technology (IAICT), 432–438, IEEE, 2025, https://doi.org/10.1109/IAICT65714.2025.11100707.
  7. Y. Luo, G. Zhou, Y. Cui, “Understanding generative artificial intelligence adoption in higher education faculty: evidence from Chinese Universities and technical and vocational colleges,” Education and Information Technologies, 1–41, 2026, https://doi.org/10.1007/s10639-025-13885-y.
  8. S. Sengupta, S. Chakrabarti, “Towards a smarter education system: an investigation into ML and DL for information retrieval,” Multimedia Tools and Applications, 1–34, 2025, https://doi.org/10.1007/s11042-025-20656-x.
  9. W. Cao, N. Mai, “Predictive analytics for student success: AI-driven early warning systems and intervention strategies for educational risk management,” Educational Research and Human Development, 2(2), 36–48, 2025.
  10. I. Gligorea, M. Cioca, R. Oancea, A.-T. Gorski, H. Gorski, P. Tudorache, “Adaptive learning using artificial intelligence in e-learning: A literature review,” Education Sciences, 13(12), 1216, 2023, https://doi.org/10.3390/educsci13121216.
  11. Hariyanto, F. X. D. Kristianingsih, R. Maharani, “Artificial intelligence in adaptive education: a systematic review of techniques for personalized learning,” Discover Education, 4(1), 458, 2025, https://doi.org/10.1007/s44217-025-00908-6.
  12. S. Rawat, M. Rodrigues, P. Sheregar, K. A. Wagaskar, A. K. Tripathy, “Computer vision based hybrid classroom attention monitoring,” 2024 IEEE International Conference on Information Technology, Electronics and Intelligent Communication Systems (ICITEICS), 1–6, IEEE, 2024, https://doi.org/10.1109/ICITEICS61368.2024.10624965.
  13. G. Zhang, “Transformer-based AI framework for optimising English teaching evaluation strategies: a data-driven and explainable approach,” International Journal of Information and Communication Technology, 26(9), 107–127, 2025, https://doi.org/10.1504/IJICT.2025.145828.
  14. G. Gokmen, T. C. Akinci, M. Tektas, N. Onat, G. Kocyigit, N. Tektas, “Evaluation of student performance in laboratory applications using fuzzy logic,” Procedia-Social and Behavioral Sciences, 2(2), 902–909, 2010, https://doi.org/10.1016/j.sbspro.2010.03.124.
  15. Y. A. Alsariera, Y. Baashar, G. Alkawsi, A. Mustafa, A. A. Alkahtani, N. Ali, “Assessment and evaluation of different machine learning algorithms for predicting student performance,” Computational Intelligence and Neuroscience, 2022(1), 4151487, 2022, https://doi.org/10.1155/2022/4151487.
  16. G. Feng, M. Fan, Y. Chen, “Analysis and prediction of students’ academic performance based on educational data mining,” IEEE Access, 10, 19558–19571, 2022, https://doi.org/10.1109/ACCESS.2022.3151652.
  17. M. Abdasalam, A. Alzubi, K. Iyiola, “Student grade prediction for effective learning approaches using the optimized ensemble deep neural network,” Education and Information Technologies, 30(8), 10159–10183, 2025, https://doi.org/10.1007/s10639-024-13224-7.
  18. R. Dwivedi, D. Dave, H. Naik, S. Singhal, R. Omer, P. Patel, B. Qian, Z. Wen, T. Shah, G. Morgan, R. Ranjan, “Explainable AI (XAI): Core ideas, techniques, and solutions,” ACM Computing Surveys, 55(9), 2023, https://doi.org/10.1145/3561048.
  19. E. Ahmed, “Student performance prediction using machine learning algorithms,” Applied Computational Intelligence and Soft Computing, 2024(1), 4067721, 2024, https://doi.org/10.1155/2024/4067721.
  20. N. Abuzinadah, M. Umer, A. Ishaq, A. Al Hejaili, S. Alsubai, A. Eshmawi, A. Mohamed, I. Ashraf, “Role of convolutional features and machine learning for predicting student academic performance from MOODLE data,” PLOS ONE, 18(11), e0293061, 2023, https://doi.org/10.1371/journal.pone.0293061.
  21. S. Sarwat, N. Ullah, S. Sadiq, R. Saleem, M. Umer, A. Eshmawi, A. Mohamed, I. Ashraf, “Predicting students’ academic performance with conditional generative adversarial network and deep SVM,” Sensors, 22(13), 4834, 2022, https://doi.org/10.3390/s22134834.
  22. F. Qiu, G. Zhang, X. Sheng, L. Jiang, L. Zhu, Q. Xiang, B. Jiang, P.-K. Chen, “Predicting students’ performance in e-learning using learning process and behaviour data,” Scientific Reports, 12(1), 453, 2022, https://doi.org/10.1038/s41598-021-03867-8.
  23. F. Ouyang, M. Wu, L. Zheng, L. Zhang, P. Jiao, “Integration of artificial intelligence performance prediction and learning analytics to improve student learning in online engineering course,” International Journal of Educational Technology in Higher Education, 20(1), 4, 2023.
  24. J. Niyogisubizo, L. Liao, E. Nziyumva, E. Murwanashyaka, P. C. Nshimyumukiza, “Predicting student’s dropout in university classes using two-layer ensemble machine learning approach: A novel stacked generalization,” Computers and Education: Artificial Intelligence, 3, 100066, 2022, https://doi.org/10.1016/j.caeai.2022.100066.
  25. P. Guleria, M. Sood, “Explainable AI and machine learning: performance evaluation and explainability of classifiers on educational data mining inspired career counseling,” Education and Information Technologies, 28(1), 1081–1116, 2023, https://doi.org/10.1007/s10639-022-11221-2.
  26. M. F. Shahzad, S. Xu, W. M. Lim, X. Yang, Q. R. Khan, “Artificial intelligence and social media on academic performance and mental well-being: Student perceptions of positive impact in the age of smart learning,” Heliyon, 10(8), 2024, https://doi.org/10.1016/j.heliyon.2024.e29523.
  27. S. M. Lundberg, S.-I. Lee, “A unified approach to interpreting model predictions,” Advances in Neural Information Processing Systems, 30, Curran Associates, Inc., 2017.
  28. S. R. Rebeka, R. Thomas, “Fostering engagement and trust in e-learning communities through social media platforms,” in Innovative Approaches to Social Media in Education, chapter 11, 241–255, IGI Global, 2025, https://doi.org/10.4018/979-8-3693-3868-1.ch011.
  29. P. Muthulakshmi, M. Parveen, “Z-score normalized feature selection and iterative African buffalo optimization for effective heart disease prediction,” International Journal of Intelligent Engineering & Systems, 16(1), 2023.
  30. T. Yin, H. Chen, Z. Yuan, J. Wan, K. Liu, S.-J. Horng, T. Li, “A robust multi-label feature selection approach based on graph structure considering fuzzy dependency and feature interaction,” IEEE Transactions on Fuzzy Systems, 31(12), 4516–4528, 2023, https://doi.org/10.1109/TFUZZ.2023.3287193.
  31. I. L. Cherif, A. Kortebi, “On using extreme gradient boosting (XGBoost) machine learning algorithm for home network traffic classification,” 2019 Wireless Days (WD), 1–6, 2019, https://doi.org/10.1109/WD.2019.8734193.
  32. H. Wang, Q. Liang, J. T. Hancock, T. M. Khoshgoftaar, “Feature selection strategies: a comparative analysis of SHAP-value and importance-based methods,” Journal of Big Data, 11(1), 44, 2024, https://doi.org/10.1186/s40537-024-00905-w.
  33. G. Nassreddine, A. El Arid, M. Nassereddine, O. Al Khatib, “Fault detection and classification for photovoltaic panel system using machine learning techniques,” Applied AI Letters, 6(2), e115, 2025, https://doi.org/10.1002/ail2.115.
  34. G. Nassreddine, A. El Arid, M. Nasseredine, “Solar PV power prediction system based on machine learning approach,” 2023 IEEE International Conference on Energy Technologies for Future Grids (ETFG), 1–7, 2023, https://doi.org/10.1109/ETFG55873.2023.10407291.
  35. R. Hasan, S. Palaniappan, S. Mahmood, A. Abbas, K. U. Sarker, “Dataset of students’ performance using student information system, Moodle and the mobile application ‘eDify’,” Data, 6(11), 2021, https://doi.org/10.3390/data6110110.

Citations by Dimensions

Citations by PlumX

Google Scholar

Crossref Citations

No. of Downloads Per Month
No. of Downloads Per Country

Journal Menu

Journal Browser


Special Issues

Special Issue on Digital Frontiers of Entrepreneurship: Integrating AI, Gender Equity, and Sustainable Futures
Guest Editors: Dr. Muhammad Nawaz Tunio, Dr. Aamir Rashid, Dr. Imamuddin Khoso
Deadline: May 30, 2026

Special Issue on Indigenous Knowledge Systems of the Tribal Communities of the Asia Pacific
Guest Editors: Dr. Anurag Hazarika
Deadline: October 31, 2026