TY - JOUR
T1 - CTCPPre
T2 - A prediction method for accepted pull requests in GitHub
AU - Jiang, Jing
AU - Zheng, Jia teng
AU - Yang, Yun
AU - Zhang, Li
N1 - Publisher Copyright:
© 2020, Central South University Press and Springer-Verlag GmbH Germany, part of Springer Nature.
PY - 2020/2/1
Y1 - 2020/2/1
N2 - As the popularity of open source projects, the volume of incoming pull requests is too large, which puts heavy burden on integrators who are responsible for accepting or rejecting pull requests. An accepted pull request prediction approach can help integrators by allowing them either to enforce an immediate rejection of code changes or allocate more resources to overcome the deficiency. In this paper, an approach CTCPPre is proposed to predict the accepted pull requests in GitHub. CTCPPre mainly considers code features of modified changes, text features of pull requests’ description, contributor features of developers’ previous behaviors, and project features of development environment. The effectiveness of CTCPPre on 28 projects containing 221096 pull requests is evaluated. Experimental results show that CTCPPre has good performances by achieving accuracy of 0.82, AUC of 0.76 and F1-score of 0.88 on average. It is compared with the state of art accepted pull request prediction approach RFPredict. On average across 28 projects, CTCPPre outperforms RFPredict by 6.64%, 16.06% and 4.79% in terms of accuracy, AUC and F1-score, respectively.
AB - As the popularity of open source projects, the volume of incoming pull requests is too large, which puts heavy burden on integrators who are responsible for accepting or rejecting pull requests. An accepted pull request prediction approach can help integrators by allowing them either to enforce an immediate rejection of code changes or allocate more resources to overcome the deficiency. In this paper, an approach CTCPPre is proposed to predict the accepted pull requests in GitHub. CTCPPre mainly considers code features of modified changes, text features of pull requests’ description, contributor features of developers’ previous behaviors, and project features of development environment. The effectiveness of CTCPPre on 28 projects containing 221096 pull requests is evaluated. Experimental results show that CTCPPre has good performances by achieving accuracy of 0.82, AUC of 0.76 and F1-score of 0.88 on average. It is compared with the state of art accepted pull request prediction approach RFPredict. On average across 28 projects, CTCPPre outperforms RFPredict by 6.64%, 16.06% and 4.79% in terms of accuracy, AUC and F1-score, respectively.
KW - GitHub
KW - accepted pull request
KW - code review
KW - prediction
KW - pull-based software development
UR - https://www.scopus.com/pages/publications/85083370210
U2 - 10.1007/s11771-020-4308-z
DO - 10.1007/s11771-020-4308-z
M3 - 文章
AN - SCOPUS:85083370210
SN - 2095-2899
VL - 27
SP - 449
EP - 468
JO - Journal of Central South University
JF - Journal of Central South University
IS - 2
ER -