跳到主要导航 跳到搜索 跳到主要内容

MCEVAL: MASSIVELY MULTILINGUAL CODE EVALUATION

  • Linzheng Chai
  • , Shukai Liu
  • , Jian Yang*
  • , Yuwei Yin
  • , Ke Jin
  • , Jiaheng Liu
  • , Tao Sun
  • , Ge Zhang
  • , Changyu Ren
  • , Hongcheng Guo
  • , Zekun Wang
  • , Boyang Wang
  • , Xianjie Wu
  • , Bing Wang
  • , Tongliang Li
  • , Liqun Yang
  • , Sufeng Duan
  • , Zhoujun Li
  • *此作品的通讯作者
  • Beihang University
  • University of British Columbia
  • University of Waterloo
  • Beijing Information Science & Technology University
  • Shanghai Jiao Tong University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Code large language models (LLMs) have shown remarkable advances in code understanding, completion, and generation tasks. Programming benchmarks, comprised of a selection of code challenges and corresponding test cases, serve as a standard to evaluate the capability of different LLMs in such tasks. However, most existing benchmarks primarily focus on Python and are still restricted to a limited number of languages, where other languages are translated from the Python samples degrading the data diversity. To further facilitate the research of code LLMs, we propose a massively multilingual code benchmark covering 40 programming languages (MCEVAL) with 16K test samples, which substantially pushes the limits of code LLMs in multilingual scenarios. The benchmark contains challenging code completion, understanding, and generation evaluation tasks with finely curated massively multilingual instruction corpora MCEVAL-INSTRUCT. In addition, we introduce an effective multilingual coder MCODER trained on MCEVAL-INSTRUCT to support multilingual programming language generation. Extensive experimental results on MCEVAL show that there is still a difficult journey between open-source models and closed-source LLMs in numerous languages. The instruction corpora and evaluation benchmark are available at https://github.com/MCEVAL/McEval.

源语言英语
主期刊名13th International Conference on Learning Representations, ICLR 2025
出版商International Conference on Learning Representations, ICLR
87030-87056
页数27
ISBN(电子版)9798331320850
出版状态已出版 - 2025
活动13th International Conference on Learning Representations, ICLR 2025 - Singapore, 新加坡
期限: 24 4月 202528 4月 2025

出版系列

姓名13th International Conference on Learning Representations, ICLR 2025

会议

会议13th International Conference on Learning Representations, ICLR 2025
国家/地区新加坡
Singapore
时期24/04/2528/04/25

指纹

探究 'MCEVAL: MASSIVELY MULTILINGUAL CODE EVALUATION' 的科研主题。它们共同构成独一无二的指纹。

引用此