TY - GEN
T1 - Imperfect Code Generation
T2 - 46th International Conference on Software Engineering: Companion, ICSE-Companion 2024
AU - Lian, Xiaoli
AU - Wang, Shuaisong
AU - Ma, Jieping
AU - Tan, Xin
AU - Liu, Fang
AU - Shi, Lin
AU - Zhang, Li
AU - Gao, Cuiyun
N1 - Publisher Copyright:
© 2024 IEEE Computer Society. All rights reserved.
PY - 2024/5/23
Y1 - 2024/5/23
N2 - The task of code generation has received significant attention in recent years, especially when the pre-trained large language models (LLMs) for code have consistently achieved state-of-the-art performance. However, there is currently a lack of a comprehensive weakness taxonomy in the field, uncovering weaknesses in automatic code generation by LLMs. This may lead the community to invest excessive efforts into well-known hotspots while neglecting many crucial yet unrecognized issues that deserve more attention. To bridge this gap, we conduct a systematic study on analyzing the weaknesses based on three state-of-the-art LLMs across three widely-used code generation datasets. Our study identifies eight types of weaknesses and assesses their prevalence across each LLM and dataset, aiming to inform and shape the trajectory of future research in the domain.
AB - The task of code generation has received significant attention in recent years, especially when the pre-trained large language models (LLMs) for code have consistently achieved state-of-the-art performance. However, there is currently a lack of a comprehensive weakness taxonomy in the field, uncovering weaknesses in automatic code generation by LLMs. This may lead the community to invest excessive efforts into well-known hotspots while neglecting many crucial yet unrecognized issues that deserve more attention. To bridge this gap, we conduct a systematic study on analyzing the weaknesses based on three state-of-the-art LLMs across three widely-used code generation datasets. Our study identifies eight types of weaknesses and assesses their prevalence across each LLM and dataset, aiming to inform and shape the trajectory of future research in the domain.
UR - https://www.scopus.com/pages/publications/85194820776
U2 - 10.1145/3639478.3643081
DO - 10.1145/3639478.3643081
M3 - 会议稿件
AN - SCOPUS:85194820776
T3 - Proceedings - International Conference on Software Engineering
SP - 422
EP - 423
BT - Proceedings - 2024 ACM/IEEE 46th International Conference on Software Engineering
PB - IEEE Computer Society
Y2 - 14 April 2024 through 20 April 2024
ER -