An Ensemble Learning Approach for Student Performance Analysis of a Higher Educational Institute using a SHAP-Based Feature Selection and Optuna Optimization
Volume 11, Issue 2, Page No 1–11, 2026
Adv. Sci. Technol. Eng. Syst. J. 11(2), 1–11 (2026);
DOI: 10.25046/aj110201
Keywords: Student performance, Higher education, Ensemble learning, AI-powered tools, Explanaible AI, SHAP feature selection, Optuna optimization
Forecasting and assessing student performance are crucial for allowing educators to pinpoint deficiencies and promote grade improvement. A thorough comprehension of feature contributions is crucial for improving model interpretability and facilitating informed decision-making in academic institutions. Explainable artificial intelligence encompasses methodologies and strategies designed to deliver transparent and accessible rationales for the decisions rendered by artificial intelligence and machine learning algorithms. In this research paper, an interpretable gradient boosting approach for predicting student performance is introduced, including both feature selection using SHapley Additive exPlanations (SHAP)-based features and cost-sensitive decision thresholds. The proposed methodology includes hybrid resampling with SMOTE-Tomek, Optuna hyperparameter optimization with stratified cross-validation, and SHAP-guided feature selection strategy. The proposed approach is tested using a dataset of a higher educational institute in the Middle East, including student information, learning management, and a video interaction system, to make an analysis and evaluate the performance of the students. The results show both improvement of the macro F1-score and the fail-class recall by achieving an accuracy of 94%, a weighted/macro F1-score of 0.9399, and a fail-class recall of 0.9619. The suggested method facilitates trade-offs among prediction accuracy, interpretability, and fairness, bridging the divide between high-performing machine learning models and practical educational applications, hence aiding in the formulation of data-driven policies and the customization of learning experiences.
- C. Koukaras, S. G. Stavrinides, E. Hatzikraniotis, M. Mitsiaki, P. Koukaras, C. Tjortjis, “Navigating the future of education: A review on telecommunications and AI technologies, ethical implications, and equity challenges,” Telecom, 7(2), MDPI, 2026, https://doi.org/10.3390/telecom7010002.
- P. Naayini, “AI and the future of education: Advancing personalized learning and intelligent tutoring systems,” Frontiers in Educational Innovation and Research, 1(1), 29–39, 2025, https://doi.org/10.62762/FEIR.2025.332098.
- S. Arora, A. Rajesh, R. Misra, G. Singh, “Bridging technology and trust: the role of AI-driven robo-advisors in middle-class financial management,” Management Decision, 1–24, 2025, https://doi.org/10.1108/MD-01-2025-0093.
- T. Heafner, D. Maxwell, “CIVIC: Five pillars for using artificial intelligence in social studies education,” Society for Information Technology & Teacher Education International Conference, 2581–2589, AACE, 2025.
- R. Daher, “Integrating AI literacy into teacher education: a critical perspective paper,” Discover Artificial Intelligence, 5(1), 217, 2025, https://doi.org/10.1007/s44163-025-00475-7.
- G. Nassreddine, L. Saleh, M. Al Majzoub, A. El Arid, “SHAP explainability: An ensemble learning approach for student performance prediction,” 2025 IEEE International Conference on Industry 4.0, Artificial Intelligence, and Communications Technology (IAICT), 432–438, IEEE, 2025, https://doi.org/10.1109/IAICT65714.2025.11100707.
- Y. Luo, G. Zhou, Y. Cui, “Understanding generative artificial intelligence adoption in higher education faculty: evidence from Chinese Universities and technical and vocational colleges,” Education and Information Technologies, 1–41, 2026, https://doi.org/10.1007/s10639-025-13885-y.
- S. Sengupta, S. Chakrabarti, “Towards a smarter education system: an investigation into ML and DL for information retrieval,” Multimedia Tools and Applications, 1–34, 2025, https://doi.org/10.1007/s11042-025-20656-x.
- W. Cao, N. Mai, “Predictive analytics for student success: AI-driven early warning systems and intervention strategies for educational risk management,” Educational Research and Human Development, 2(2), 36–48, 2025.
- I. Gligorea, M. Cioca, R. Oancea, A.-T. Gorski, H. Gorski, P. Tudorache, “Adaptive learning using artificial intelligence in e-learning: A literature review,” Education Sciences, 13(12), 1216, 2023, https://doi.org/10.3390/educsci13121216.
- Hariyanto, F. X. D. Kristianingsih, R. Maharani, “Artificial intelligence in adaptive education: a systematic review of techniques for personalized learning,” Discover Education, 4(1), 458, 2025, https://doi.org/10.1007/s44217-025-00908-6.
- S. Rawat, M. Rodrigues, P. Sheregar, K. A. Wagaskar, A. K. Tripathy, “Computer vision based hybrid classroom attention monitoring,” 2024 IEEE International Conference on Information Technology, Electronics and Intelligent Communication Systems (ICITEICS), 1–6, IEEE, 2024, https://doi.org/10.1109/ICITEICS61368.2024.10624965.
- G. Zhang, “Transformer-based AI framework for optimising English teaching evaluation strategies: a data-driven and explainable approach,” International Journal of Information and Communication Technology, 26(9), 107–127, 2025, https://doi.org/10.1504/IJICT.2025.145828.
- G. Gokmen, T. C. Akinci, M. Tektas, N. Onat, G. Kocyigit, N. Tektas, “Evaluation of student performance in laboratory applications using fuzzy logic,” Procedia-Social and Behavioral Sciences, 2(2), 902–909, 2010, https://doi.org/10.1016/j.sbspro.2010.03.124.
- Y. A. Alsariera, Y. Baashar, G. Alkawsi, A. Mustafa, A. A. Alkahtani, N. Ali, “Assessment and evaluation of different machine learning algorithms for predicting student performance,” Computational Intelligence and Neuroscience, 2022(1), 4151487, 2022, https://doi.org/10.1155/2022/4151487.
- G. Feng, M. Fan, Y. Chen, “Analysis and prediction of students’ academic performance based on educational data mining,” IEEE Access, 10, 19558–19571, 2022, https://doi.org/10.1109/ACCESS.2022.3151652.
- M. Abdasalam, A. Alzubi, K. Iyiola, “Student grade prediction for effective learning approaches using the optimized ensemble deep neural network,” Education and Information Technologies, 30(8), 10159–10183, 2025, https://doi.org/10.1007/s10639-024-13224-7.
- R. Dwivedi, D. Dave, H. Naik, S. Singhal, R. Omer, P. Patel, B. Qian, Z. Wen, T. Shah, G. Morgan, R. Ranjan, “Explainable AI (XAI): Core ideas, techniques, and solutions,” ACM Computing Surveys, 55(9), 2023, https://doi.org/10.1145/3561048.
- E. Ahmed, “Student performance prediction using machine learning algorithms,” Applied Computational Intelligence and Soft Computing, 2024(1), 4067721, 2024, https://doi.org/10.1155/2024/4067721.
- N. Abuzinadah, M. Umer, A. Ishaq, A. Al Hejaili, S. Alsubai, A. Eshmawi, A. Mohamed, I. Ashraf, “Role of convolutional features and machine learning for predicting student academic performance from MOODLE data,” PLOS ONE, 18(11), e0293061, 2023, https://doi.org/10.1371/journal.pone.0293061.
- S. Sarwat, N. Ullah, S. Sadiq, R. Saleem, M. Umer, A. Eshmawi, A. Mohamed, I. Ashraf, “Predicting students’ academic performance with conditional generative adversarial network and deep SVM,” Sensors, 22(13), 4834, 2022, https://doi.org/10.3390/s22134834.
- F. Qiu, G. Zhang, X. Sheng, L. Jiang, L. Zhu, Q. Xiang, B. Jiang, P.-K. Chen, “Predicting students’ performance in e-learning using learning process and behaviour data,” Scientific Reports, 12(1), 453, 2022, https://doi.org/10.1038/s41598-021-03867-8.
- F. Ouyang, M. Wu, L. Zheng, L. Zhang, P. Jiao, “Integration of artificial intelligence performance prediction and learning analytics to improve student learning in online engineering course,” International Journal of Educational Technology in Higher Education, 20(1), 4, 2023.
- J. Niyogisubizo, L. Liao, E. Nziyumva, E. Murwanashyaka, P. C. Nshimyumukiza, “Predicting student’s dropout in university classes using two-layer ensemble machine learning approach: A novel stacked generalization,” Computers and Education: Artificial Intelligence, 3, 100066, 2022, https://doi.org/10.1016/j.caeai.2022.100066.
- P. Guleria, M. Sood, “Explainable AI and machine learning: performance evaluation and explainability of classifiers on educational data mining inspired career counseling,” Education and Information Technologies, 28(1), 1081–1116, 2023, https://doi.org/10.1007/s10639-022-11221-2.
- M. F. Shahzad, S. Xu, W. M. Lim, X. Yang, Q. R. Khan, “Artificial intelligence and social media on academic performance and mental well-being: Student perceptions of positive impact in the age of smart learning,” Heliyon, 10(8), 2024, https://doi.org/10.1016/j.heliyon.2024.e29523.
- S. M. Lundberg, S.-I. Lee, “A unified approach to interpreting model predictions,” Advances in Neural Information Processing Systems, 30, Curran Associates, Inc., 2017.
- S. R. Rebeka, R. Thomas, “Fostering engagement and trust in e-learning communities through social media platforms,” in Innovative Approaches to Social Media in Education, chapter 11, 241–255, IGI Global, 2025, https://doi.org/10.4018/979-8-3693-3868-1.ch011.
- P. Muthulakshmi, M. Parveen, “Z-score normalized feature selection and iterative African buffalo optimization for effective heart disease prediction,” International Journal of Intelligent Engineering & Systems, 16(1), 2023.
- T. Yin, H. Chen, Z. Yuan, J. Wan, K. Liu, S.-J. Horng, T. Li, “A robust multi-label feature selection approach based on graph structure considering fuzzy dependency and feature interaction,” IEEE Transactions on Fuzzy Systems, 31(12), 4516–4528, 2023, https://doi.org/10.1109/TFUZZ.2023.3287193.
- I. L. Cherif, A. Kortebi, “On using extreme gradient boosting (XGBoost) machine learning algorithm for home network traffic classification,” 2019 Wireless Days (WD), 1–6, 2019, https://doi.org/10.1109/WD.2019.8734193.
- H. Wang, Q. Liang, J. T. Hancock, T. M. Khoshgoftaar, “Feature selection strategies: a comparative analysis of SHAP-value and importance-based methods,” Journal of Big Data, 11(1), 44, 2024, https://doi.org/10.1186/s40537-024-00905-w.
- G. Nassreddine, A. El Arid, M. Nassereddine, O. Al Khatib, “Fault detection and classification for photovoltaic panel system using machine learning techniques,” Applied AI Letters, 6(2), e115, 2025, https://doi.org/10.1002/ail2.115.
- G. Nassreddine, A. El Arid, M. Nasseredine, “Solar PV power prediction system based on machine learning approach,” 2023 IEEE International Conference on Energy Technologies for Future Grids (ETFG), 1–7, 2023, https://doi.org/10.1109/ETFG55873.2023.10407291.
- R. Hasan, S. Palaniappan, S. Mahmood, A. Abbas, K. U. Sarker, “Dataset of students’ performance using student information system, Moodle and the mobile application ‘eDify’,” Data, 6(11), 2021, https://doi.org/10.3390/data6110110.
- Glender Brás, Samara Leal, Breno Sousa, Gabriel Paes, Cleberson Junior, João Souza, Rafael Assis, Tamires Marques, Thiago Teles Calazans Silva, "Machine Learning Methods for University Student Performance Prediction in Basic Skills based on Psychometric Profile", Advances in Science, Technology and Engineering Systems Journal, vol. 10, no. 4, pp. 1–13, 2025. doi: 10.25046/aj100401
- Andi Kristanto, Utari Dewi, Dina Fitria Murad, Yumiati, Santi Dewiki, Tiara Sevi Nurmania, "Utilization of Generative Artificial Intelligence to Improve Students’ Visual Literacy Skills", Advances in Science, Technology and Engineering Systems Journal, vol. 10, no. 3, pp. 1–8, 2025. doi: 10.25046/aj100301
- Anh-Thu Mai, Duc-Huy Nguyen, Thanh-Tin Dang, "Transfer and Ensemble Learning in Real-time Accurate Age and Age-group Estimation", Advances in Science, Technology and Engineering Systems Journal, vol. 7, no. 6, pp. 262–268, 2022. doi: 10.25046/aj070630
- Maria J. Poblaciones, "Opinion and Effectiveness of Kahoot! use in Online Distance Learning in Crop Production at Higher Education Level: A Case of Study", Advances in Science, Technology and Engineering Systems Journal, vol. 7, no. 1, pp. 8–13, 2022. doi: 10.25046/aj070102
- Seok-Jun Bu, Hae-Jung Kim, "Ensemble Learning of Deep URL Features based on Convolutional Neural Network for Phishing Attack Detection", Advances in Science, Technology and Engineering Systems Journal, vol. 6, no. 5, pp. 291–296, 2021. doi: 10.25046/aj060532
- Mariutsi Alexandra Osorio-Sanabria, Astrid Jaime, Tamara Alcantara-Concepcion, Piedad Barreto, "Open Access Research Trends in Higher Education: A Literature Review", Advances in Science, Technology and Engineering Systems Journal, vol. 6, no. 2, pp. 499–511, 2021. doi: 10.25046/aj060257
- Dionisius Saviordo Thenuardi, Benfano Soewito, "Indoor Positioning System using WKNN and LSTM Combined via Ensemble Learning", Advances in Science, Technology and Engineering Systems Journal, vol. 6, no. 1, pp. 242–249, 2021. doi: 10.25046/aj060127
- Meyliana, Yakob Utama Chandra, Cadelina Cassandra, Surjandy, Erick Fernando, Henry Antonius Eka Widjaja, Harjanto Prabowo, "Education Value Chain Model for Examination, Grading, and Evaluation Process in Higher Education based on Blockchain Technology", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 6, pp. 1698–1703, 2020. doi: 10.25046/aj0506202
- Tedi Priatna, Dian Sa’adillah Maylawati, Hamdan Sugilar, Muhammad Ali Ramdhani, "Social Engineering to Establish Digital Culture in Higher Education", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 6, pp. 1474–1479, 2020. doi: 10.25046/aj0506177
- Fernando Richter Vidal, Feliz Gouveia, Christophe Soares, "Blockchain Application in Higher Education Diploma Management and Results Analysis", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 6, pp. 871–882, 2020. doi: 10.25046/aj0506104
- Ghassan Frache, Hector Nistazakis, George Tombras, "Constructing Learning-by-Doing Pedagogical Model for Delivering 21st Century Engineering Education", Advances in Science, Technology and Engineering Systems Journal, vol. 3, no. 1, pp. 115–124, 2018. doi: 10.25046/aj030114
- Lucila Romero, Milagros Gutierrez, Laura Caliusco, "Semantic modeling of portfolio assessment in e-learning environment", Advances in Science, Technology and Engineering Systems Journal, vol. 2, no. 1, pp. 149–156, 2017. doi: 10.25046/aj020117