Toward a Sustainable Innovation in the Faculty Evaluation Process: A Systematic Review of AI, Data Science, and NLP Applications in Higher Education
DOI:
https://doi.org/10.55965/setp.6.11.a1Abstract
Context. Teaching evaluation in public multicampus universities still relies on averages and closed-ended surveys, which are limited in capturing the complexity of academic performance. In response, this study presents a systematic review (2019–2024) on the use of Artificial Intelligence (AI) and Natural Language Processing (NLP) in university teaching evaluation processes, conceptualized as process innovations from Oslo Manual and aligned with Sustainable Development Goals (SDGs) 4 and 9.
Problem. Traditional systems in high-enrollment institutions exhibit low analytical precision, limited use of qualitative data, and delayed feedback. Accordingly, the following research question is posed: which techniques and algorithms does the literature report for the integrated analysis of quantitative and qualitative data in university teaching evaluation?
Purpose. To critically analyze the use of data science and AI—particularly NLP—to enhance teaching feedback, overcoming the limitations of traditional approaches and promoting timely, in-depth, and scalable evaluation processes.
Methodology. The PRISMA protocol was followed through searches using Boolean operators in Scopus, Web of Science, IEEE Xplore, and ERIC. After applying inclusion criteria and a two-phase peer-review process, 17 studies published between 2019 and 2024 were analyzed.
Findings. Techniques such as sentiment analysis, topic modeling (LDA), and large language models (LLMs)—notably DistilBERT, with accuracy levels close to 93%—consistently outperform traditional methods in managing large volumes of information.
Originality. The study’s originality lies in integrating dispersed literature on AI and NLP in higher education within a coherent process-innovation framework, combining technical, empirical, and ethical rigor.
Conclusions and Limitations. Advanced AI and NLP models show high potential to transform evaluation in Latin American university networks by enabling personalized feedback. However, challenges remain regarding model interpretability, bias mitigation, the scarcity of labeled data in Spanish, and institutional resistance to change, opening relevant avenues for future research and applied developments
Downloads
References
Arriaga-Cárdenas, O. G., Lara-Magaña, P. del C., & Pasciuta-Marco, P. D. (2022). Los centros públicos de investigación, como eje central de la innovación y la educación en México. Revista científica et Scientia et PRAXIS, 2(04), 66–81. https://doi.org/10.55965/setp.2.coed.a3 DOI: https://doi.org/10.55965/setp.2.coed.a3
Das, S., Roy, S., Bose, R., Acharjya, P. P., & Mondal, H. (2022). Analysis of student sentiment dynamics to evaluate teachers’ performance in online courses using machine learning. In 2022 International Conference on Applied Artificial Intelligence and Computing (ICAAIC) (pp. 668–673). IEEE. https://doi.org/10.1109/ICAAIC53929.2022.9792958 DOI: https://doi.org/10.1109/ICAAIC53929.2022.9792958
Esparza, G. G., de-Luna, A., Zezzatti, A. O., Hernandez, A., Ponce, J., Álvarez, M., Cossio, E., & de Jesus Nava, J. (2018). A sentiment analysis model to analyze students reviews of teacher performance using support vector machines. In 14th International Conference Distributed Computing and Artificial Intelligence (Vol. 14, pp. 157–164). Springer Nature. https://doi.org/10.1007/978-3-319-62410-5_19 DOI: https://doi.org/10.1007/978-3-319-62410-5_19
Fargues, M., Kadry, S., Lawal, I. A., Yassine, S., & Rauf, H. T. (2023). Automated analysis of open-ended students’ feedback using sentiment, emotion, and cognition classifications. Applied Sciences, 13(4), 2061. https://doi.org/10.3390/app13042061 DOI: https://doi.org/10.3390/app13042061
Hu, X., Yang, Y., Wu, X., & Li, Y.(2020). Text analysis of teaching evaluation based on machine learning. In Proceedings of the 10th International Workshop on Computer Science and Engineering (WCSE 2020). WCSE. https://doi.org/10.18178/wcse.2020.06.004 DOI: https://doi.org/10.18178/wcse.2020.06.004
Martínez Téllez, M. Á., & Celaya Lozano, A. (2023). Evaluación institucional de contenidos y proyectos de investigación en un CPI: Caso de estudio (2013-2022). Scientia et PRAXIS, 3(05), 51–77. https://doi.org/10.55965/setp.3.coed1.a3 DOI: https://doi.org/10.55965/setp.3.coed1.a3
Nikolić, N., Grljević, O., & Kovačević, A. (2020). Aspect-based sentiment analysis of reviews in the domain of higher education. The Electronic Library, 38(1), 44–64. https://doi.org/10.1108/EL-06-2019-0140 DOI: https://doi.org/10.1108/EL-06-2019-0140
Okoye, K., Arrona-Palacios, A., Camacho-Zuñiga, C., Achem, J. A. G., Escamilla, J., & Hosseini, S. (2022). Towards teaching analytics: A contextual model for analysis of students’ evaluation of teaching through text mining and machine learning classification. Education and Information Technologies, 27(3), 3891–3933. https://doi.org/10.1007/s10639-021-10751-5 DOI: https://doi.org/10.1007/s10639-021-10751-5
Okoye, K., Daruich, S. D. N., de La O, J. F. E., Castano, R., Escamilla, J., & Hosseini, S. (2023). A text mining and statistical approach for assessment of pedagogical impact of students’ evaluation of teaching and learning outcome in education. IEEE Access, 11, 9577–9596. https://doi.org/10.1109/ACCESS.2023.3239779 DOI: https://doi.org/10.1109/ACCESS.2023.3239779
Peng, H., Zhang, Z., & Liu, H. (2022). A sentiment analysis method for teaching evaluation texts using attention mechanism combined with CNN-BLSTM model. Scientific Programming, 2022, 1–9. https://doi.org/10.1155/2022/8496151 DOI: https://doi.org/10.1155/2022/8496151
Peña-Torres, J. A. (2024). Towards an improved of teaching practice using sentiment analysis in student evaluation. Ingeniería y Competitividad, 26(2). https://doi.org/10.25100/iyc.v26i2.13759 DOI: https://doi.org/10.25100/iyc.v26i2.13759
Ponmalar, S. P., V. S., Srithar, V., Morankar, G., Jebaseelan, S. D. S., & Harikrishnan, R. (2024). AI-driven CASGN framework for evaluating descriptive answers in online education. In 2024 International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems (ICSES) (pp. 1–8). IEEE. https://doi.org/10.1109/ICSES63760.2024.10910723 DOI: https://doi.org/10.1109/ICSES63760.2024.10910723
Pramod, D., Bharathi, S. V., & Raman, R. (2022). Faculty effectiveness prediction using machine learning and text analytics. In 2022 IEEE Technology and Engineering Management Conference (TEMSCON EUROPE) (pp. 40-47). IEEE. https://doi.org/10.1109/TEMSCONEUROPE54743.2022.9801997 DOI: https://doi.org/10.1109/TEMSCONEUROPE54743.2022.9801997
Pu, X., Yan, G., Yu, C., Mi, X., & Yu, C. (2021). Sentiment analysis of online course evaluation based on a new ensemble deep learning mode: Evidence from Chinese. Applied Sciences, 11(23), 11313. https://doi.org/10.3390/app112311313 DOI: https://doi.org/10.3390/app112311313
Resuello, J. L., Feliscuzo, L. S., & Pantaleon, C. B. (2024). Text mining student comments for teaching performance evaluation using VADER and Latent Dirichlet Allocation algorithm. In 2024 International Conference Automatics and Informatics (ICAI) (pp. 260–265). IEEE. https://doi.org/10.1109/ICAI63388.2024.10851580 DOI: https://doi.org/10.1109/ICAI63388.2024.10851580
Santamaria-Velasco, C. A., Montiel-Méndez, O. J. M.-M., & Montañez-Moya, G. S. (2024). Liderazgo transformacional y emprendimiento en estudiantes: Una vía hacia el desarrollo educativo sostenible. Scientia et PRAXIS, 4(08), 90–119. https://doi.org/10.55965/setp.4.08.uady.a4 DOI: https://doi.org/10.55965/setp.4.08.uady.a4
Shaik, T., Tao, X., Dann, C., Quadrelli, C., Li, Y., & O’Neill, S. (2022). Educational decision support system adopting sentiment analysis on student feedback. In 2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT) (pp. 377–383). IEEE. https://doi.org/10.1109/WI-IAT55865.2022.00062 DOI: https://doi.org/10.1109/WI-IAT55865.2022.00062
Shuqin, H., & Raga, R. C. (2024). A deep learning model for student sentiment analysis on course reviews. IEEE Access, 12(2169–3536), 136747–136758. https://doi.org/10.1109/ACCESS.2024.3463793 DOI: https://doi.org/10.1109/ACCESS.2024.3463793
Vera-Noriega, J. Á., Borbón-Morales, C. G., Mejía-Trejo, J., & Durazo-Salas, F. F. (2023). Relación y comparación entre las variables de clima organizacional, satisfacción y calidad de vida en una empresa de ventas por teléfono en el noroeste de México. Scientia et PRAXIS, 3(05), 83–109. https://doi.org/10.55965/setp.3.05.a4 DOI: https://doi.org/10.55965/setp.3.05.a4
Wu, Y., Ming, Z., Allen, J. K., & Mistree, F. (2023). Evaluation of students’ learning through reflection on doing based on sentiment analysis. Journal of Mechanical Design, 145(3). https://doi.org/10.1115/1.4056495 DOI: https://doi.org/10.1115/1.4056495
Yuan, H., & Hernandez, A. A. (2023). Sentiment analysis of student evaluation of teaching based on Bi-LSTM algorithm. In 2023 IEEE 15th International Conference on Advanced Infocomm Technology (ICAIT) (pp. 417–422). IEEE. https://doi.org/10.1109/ICAIT59485.2023.10367267 DOI: https://doi.org/10.1109/ICAIT59485.2023.10367267
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Cristian Ulises Barenca-Sotelo, Ma. Del Rocío Maciel-Arellano, Víctor Manuel Larios-Rosillo

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

