TY - GEN
T1 - The Research of Predicting Student's Academic Performance Based on Educational Data
AU - Zhang, Yubo
AU - Liu, Yanfang
N1 - Publisher Copyright:
© 2021 ACM.
PY - 2021/12/4
Y1 - 2021/12/4
N2 - In recent years, with the continuous improvement of teaching informatization, online teaching or online and offline hybrid teaching has become the new normal of teaching in some schools. However, the biggest problem in online teaching is that it is difficult to predict students' academic performance. Therefore, it is necessary to design an effective method to predict students' academic performance more accurately. In this paper, a student academic level prediction method based on stacking model fusion is proposed. Logistic regression, random forest, XGBoost, and Naive Bayes are selected as base learners according to optimal fusion criteria and model properties. Furthermore, the structure and distribution of the features of the dataset are optimized by data preprocessing, feature coding, and feature selection, and the upper limit of the model expression is effectively raised. On this basis, according to the features of dataset and model performance, we select the appropriate model for model fusion, and further improve the prediction effect. Experiments are conducted on OULAD and xAPI datasets, and the results show that the prediction accuracy of the proposed method is better than that of traditional prediction methods. Finally, we analyze the factors that affect academic performance and give some specific suggestions.
AB - In recent years, with the continuous improvement of teaching informatization, online teaching or online and offline hybrid teaching has become the new normal of teaching in some schools. However, the biggest problem in online teaching is that it is difficult to predict students' academic performance. Therefore, it is necessary to design an effective method to predict students' academic performance more accurately. In this paper, a student academic level prediction method based on stacking model fusion is proposed. Logistic regression, random forest, XGBoost, and Naive Bayes are selected as base learners according to optimal fusion criteria and model properties. Furthermore, the structure and distribution of the features of the dataset are optimized by data preprocessing, feature coding, and feature selection, and the upper limit of the model expression is effectively raised. On this basis, according to the features of dataset and model performance, we select the appropriate model for model fusion, and further improve the prediction effect. Experiments are conducted on OULAD and xAPI datasets, and the results show that the prediction accuracy of the proposed method is better than that of traditional prediction methods. Finally, we analyze the factors that affect academic performance and give some specific suggestions.
KW - Data Mining
KW - Model Fusion
KW - Performance Prediction
UR - https://www.scopus.com/pages/publications/85126393776
U2 - 10.1145/3507548.3507578
DO - 10.1145/3507548.3507578
M3 - 会议稿件
AN - SCOPUS:85126393776
T3 - ACM International Conference Proceeding Series
SP - 193
EP - 201
BT - Proceedings of 2021 5th International Conference on Computer Science and Artificial Intelligence, CSAI 2021
PB - Association for Computing Machinery
T2 - 5th International Conference on Computer Science and Artificial Intelligence, CSAI 2021
Y2 - 4 December 2021 through 6 December 2021
ER -