Deep learning for code generation: a survey

  • Huangzhao Zhang
  • , Kechi Zhang
  • , Zhuo Li
  • , Jia Li
  • , Jia Li
  • , Yongmin Li
  • , Yunfei Zhao
  • , Yuqi Zhu
  • , Fang Liu
  • , Ge Li*
  • , Zhi Jin*
  • *Corresponding author for this work

Research output: Contribution to journalReview articlepeer-review

Abstract

In the past decade, thanks to the powerfulness of deep-learning techniques, we have witnessed a whole new era of automated code generation. To sort out developments, we have conducted a comprehensive review of solutions to deep learning-based code generation. In this survey, we generally formalize the pipeline and procedure of code generation and categorize existing solutions according to taxonomy from perspectives of architecture, model-agnostic enhancing strategy, metrics, and tasks. In addition, we outline the challenges faced by current dominant large models and list several plausible directions for future research. We hope that this survey may provide handy guidance to understanding, utilizing, and developing deep learning-based code-generation techniques for researchers and practitioners.

Original languageEnglish
Article number191101
JournalScience China Information Sciences
Volume67
Issue number9
DOIs
StatePublished - Sep 2024

Keywords

  • artificial intelligence
  • automated software engineering
  • code generation
  • deep learning
  • large model

Fingerprint

Dive into the research topics of 'Deep learning for code generation: a survey'. Together they form a unique fingerprint.

Cite this