An Ensemble Learning Approach for Student Performance Analysis of a Higher Educational Institute using a SHAP-Based Feature Selection and Optuna Optimization

Open AccessArticle

An Ensemble Learning Approach for Student Performance Analysis of a Higher Educational Institute using a SHAP-Based Feature Selection and Optuna Optimization

Volume 11, Issue 2, Page No 1–11, 2026

1 Electrical and Computer Engineering Department, Rafik Hariri University, Meshref, Lebanon
2 Department of Financial Studies, Rafik Hariri University, Meshref, Lebanon
3 Department of Human Resources, Saint Joseph University of Beirut, Beirut, Lebanon
4 Department of Management and Marketing Studies, Rafik Hariri University, Meshref, Lebanon
*whom correspondence should be addressed. E-mail: nassreddinega@rhu.edu.lb

Adv. Sci. Technol. Eng. Syst. J. 11(2), 1–11 (2026); crossref symbol DOI: 10.25046/aj110201

Keywords: Student performance, Higher education, Ensemble learning, AI-powered tools, Explanaible AI, SHAP feature selection, Optuna optimization

Received: 21 January 2026, Revised: 7 March 2026, Accepted: 10 March 2026, Published Online: 19 March 2026
(This article belongs to the SP20 (Special Issue on Multidisciplinary Frontiers in Engineering, Computing and Applied Sciences 2026) & Section Artificial Intelligence in Computer Science (CAI))
2 Downloads
Export Citations

Forecasting and assessing student performance are crucial for allowing educators to pinpoint deficiencies and promote grade improvement. A thorough comprehension of feature contributions is crucial for improving model interpretability and facilitating informed decision-making in academic institutions. Explainable artificial intelligence encompasses methodologies and strategies designed to deliver transparent and accessible rationales for the decisions rendered by artificial intelligence and machine learning algorithms. In this research paper, an interpretable gradient boosting approach for predicting student performance is introduced, including both feature selection using SHapley Additive exPlanations (SHAP)-based features and cost-sensitive decision thresholds. The proposed methodology includes hybrid resampling with SMOTE-Tomek, Optuna hyperparameter optimization with stratified cross-validation, and SHAP-guided feature selection strategy. The proposed approach is tested using a dataset of a higher educational institute in the Middle East, including student information, learning management, and a video interaction system, to make an analysis and evaluate the performance of the students. The results show both improvement of the macro F1-score and the fail-class recall by achieving an accuracy of 94%, a weighted/macro F1-score of 0.9399, and a fail-class recall of 0.9619. The suggested method facilitates trade-offs among prediction accuracy, interpretability, and fairness, bridging the divide between high-performing machine learning models and practical educational applications, hence aiding in the formulation of data-driven policies and the customization of learning experiences.

  1. N. Kalra, S. M. Paddock, “Driving to safety: How many miles of driving would it take to demonstrate autonomous vehicle reliability?” Transportation Research Part A, 94, 182–193, 2016, doi: 10.1016/j.tra.2016.09.010.
  2. A. Dosovitskiy, G. Ros, F. Codevilla, A. Lopez, V. Koltun, “CARLA: An open urban driving simulator”, CoRL, 2017, doi: 10.48550/arXiv.1711.03938.
  3. H. Caesar, V. Bankiti, A. H. Lang, S. Vora, V. E. Liong, Q. Xu, A. Krishnan, Y. Pan, G. Baldan, O. Beijbom, “nuScenes: A multimodal dataset for autonomous driving”, in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 11621–11631, 2020, doi: 10.1109/CVPR42600.2020.01164.
  4. F. Codevilla, E. Santana, A. Lopez, A. Gaidon, “Exploring the limitations of behavior cloning for autonomous driving”, in International Conference on Computer Vision (ICCV), 9328–9337, 2019, doi: 10.1109/ICCV.2019.00942.
  5. Y. Zhang, A. Carballo, H. Yang, K. Takeda, “Autonomous driving in adverse weather conditions: A survey”, arXiv preprint arXiv:2112.08936, 2021, doi: 10.1016/j.isprsjprs.2022.12.021.
  6. S. Zang, M. Ding, D. Smith, P. Tyler, T. Rakotoarivelo, M. A. Kaafar, “The impact of adverse weather conditions on autonomous vehicles: How rain, snow, fog, and hail affect the performance of a self-driving car”, IEEE Vehicular Technology Magazine, 14, 103–111, 2019, doi: 10.1109/MVT.2019.2892497.
  7. D. Neumeister, D. Pape, “Automated vehicles and adverse weather: Final report”, U.S. Department of Transportation, Federal Highway Administration, June 2019. Available: www.its.dot.gov/index.htm
  8. R. Li, T. Qin, C. Widdershoven, “ISS-Scenario: Scenario-based testing in CARLA”, in Theoretical Aspects of Software Engineering (TASE), 279–286, 2024, doi: 10.1007/978-3-031-64626-3_16.
  9. M. Čávojský, E. Šlapak, M. Dopiriak, G. Bugar, J. Gazda, “3CSim: CARLA corner case simulation for control assessment in autonomous driving”, arXiv preprint arXiv:2409.10524, 2024, doi: 10.48550/arXiv.2409.10524.
  10. B. Osiński, P. Milos, A. Jakubowski, P. Zięcina, M. Martyniak, C. Galias, A. Breuer, S. Homoceanu, H. Michalewski, “CARLA real traffic scenarios – novel training ground and benchmark for autonomous driving”, arXiv preprint arXiv:2012.11329, 2020, doi: 10.48550/arXiv.2012.11329.
  11. D. J. Fremont, E. Kim, Y. V. Pant, S. A. Seshia, A. Acharya, X. Bruso, P. Wells, S. Lemke, Q. Lu, S. Mehta, “Formal scenario-based testing of autonomous vehicles: From simulation to the real world”, in International Conference on Intelligent Transportation (ITSC), 1–8, 2020, doi: 10.1109/ITSC45102.2020.9294368.
  12. N. Hanselmann, K. Renz, K. Chitta, A. Bhattacharyya, A. Geiger, “KING: Generating safety-critical driving scenarios for robust imitation via kinematics gradients”, in Proceedings of the European Conference on Computer Vision, 333–350, 2022, doi: 10.1007/978-3-031-19839-7_20.
  13. H.-S. Cho, Y.-J. Park, M. Park, J. Son, “Study on designing scenarios to evaluate adverse condition positioning for highly reliable autonomous driving”, The Transactions of the Korean Society of Automotive Engineers, 31, 1021–1037, 2023, doi: 10.7467/KSAE.2023.31.12.1021.

Citations by Dimensions

Citations by PlumX

Google Scholar

Crossref Citations

No. of Downloads Per Month
No. of Downloads Per Country

Journal Menu

Journal Browser


Special Issues

Special Issue on Digital Frontiers of Entrepreneurship: Integrating AI, Gender Equity, and Sustainable Futures
Guest Editors: Dr. Muhammad Nawaz Tunio, Dr. Aamir Rashid, Dr. Imamuddin Khoso
Deadline: 30 May 2026

Special Issue on Sustainable Technologies for a Resilient Future
Guest Editors: Dr. Debasis Mitra, Dr. Sourav Chattaraj, Dr. Addisu Assefa
Deadline: 30 April 2026