跳到主要导航 跳到搜索 跳到主要内容

MARBLE: Music Audio Representation Benchmark for Universal Evaluation

  • Ruibin Yuan
  • , Yinghao Ma
  • , Yizhi Li
  • , Ge Zhang
  • , Xingran Chen
  • , Hanzhi Yin
  • , Le Zhuo
  • , Yiqi Liu
  • , Jiawen Huang
  • , Zeyue Tian
  • , Binyue Deng
  • , Ningzhi Wang
  • , Chenghua Lin*
  • , Emmanouil Benetos
  • , Anton Ragni
  • , Norbert Gyenge
  • , Roger Dannenberg
  • , Wenhu Chen
  • , Gus Xia
  • , Wei Xue
  • Si Liu, Shi Wang, Ruibo Liu, Yike Guo, Jie Fu*
*此作品的通讯作者
  • Multimodal Art Projection Research Community
  • Beijing Academy of Artificial Intelligence
  • Carnegie Mellon University
  • Queen Mary University of London
  • University of Manchester
  • University of Michigan, Ann Arbor
  • Beihang University
  • University of Sheffield
  • Hong Kong University of Science and Technology
  • Peking University
  • University of Waterloo
  • New York University
  • Mohamed Bin Zayed University of Artificial Intelligence
  • Chinese Academy of Sciences
  • Dartmouth College

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

In the era of extensive intersection between art and Artificial Intelligence (AI), such as image generation and fiction co-creation, AI for music remains relatively nascent, particularly in music understanding. This is evident in the limited work on deep music representations, the scarcity of large-scale datasets, and the absence of a universal and community-driven benchmark. To address this issue, we introduce the Music Audio Representation Benchmark for universaL Evaluation, termed MARBLE. It aims to provide a benchmark for various Music Information Retrieval (MIR) tasks by defining a comprehensive taxonomy with four hierarchy levels, including acoustic, performance, score, and high-level description. We establish a unified protocol based on 18 tasks on 12 public-available datasets, providing a fair and standard assessment of representations of all open-sourced pre-trained models developed on music recordings as baselines. MARBLE offers an easy-to-use, extendable, and reproducible suite for the community, with clear statements on dataset copyright. Results suggest that recently proposed large-scale pre-trained musical language models perform the best in most tasks, with room for further improvement. The leaderboard and toolkit repository are published34 to promote future music AI research.

源语言英语
主期刊名Advances in Neural Information Processing Systems 36 - 37th Conference on Neural Information Processing Systems, NeurIPS 2023
编辑A. Oh, T. Neumann, A. Globerson, K. Saenko, M. Hardt, S. Levine
出版商Neural information processing systems foundation
ISBN(电子版)9781713899921
出版状态已出版 - 2023
活动37th Conference on Neural Information Processing Systems, NeurIPS 2023 - New Orleans, 美国
期限: 10 12月 202316 12月 2023

出版系列

姓名Advances in Neural Information Processing Systems
36
ISSN(印刷版)1049-5258

会议

会议37th Conference on Neural Information Processing Systems, NeurIPS 2023
国家/地区美国
New Orleans
时期10/12/2316/12/23

指纹

探究 'MARBLE: Music Audio Representation Benchmark for Universal Evaluation' 的科研主题。它们共同构成独一无二的指纹。

引用此