MARBLE: Music Audio Representation Benchmark for Universal Evaluation

  • Ruibin Yuan
  • , Yinghao Ma
  • , Yizhi Li
  • , Ge Zhang
  • , Xingran Chen
  • , Hanzhi Yin
  • , Le Zhuo
  • , Yiqi Liu
  • , Jiawen Huang
  • , Zeyue Tian
  • , Binyue Deng
  • , Ningzhi Wang
  • , Chenghua Lin*
  • , Emmanouil Benetos
  • , Anton Ragni
  • , Norbert Gyenge
  • , Roger Dannenberg
  • , Wenhu Chen
  • , Gus Xia
  • , Wei Xue
  • Si Liu, Shi Wang, Ruibo Liu, Yike Guo, Jie Fu*
*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In the era of extensive intersection between art and Artificial Intelligence (AI), such as image generation and fiction co-creation, AI for music remains relatively nascent, particularly in music understanding. This is evident in the limited work on deep music representations, the scarcity of large-scale datasets, and the absence of a universal and community-driven benchmark. To address this issue, we introduce the Music Audio Representation Benchmark for universaL Evaluation, termed MARBLE. It aims to provide a benchmark for various Music Information Retrieval (MIR) tasks by defining a comprehensive taxonomy with four hierarchy levels, including acoustic, performance, score, and high-level description. We establish a unified protocol based on 18 tasks on 12 public-available datasets, providing a fair and standard assessment of representations of all open-sourced pre-trained models developed on music recordings as baselines. MARBLE offers an easy-to-use, extendable, and reproducible suite for the community, with clear statements on dataset copyright. Results suggest that recently proposed large-scale pre-trained musical language models perform the best in most tasks, with room for further improvement. The leaderboard and toolkit repository are published34 to promote future music AI research.

Original languageEnglish
Title of host publicationAdvances in Neural Information Processing Systems 36 - 37th Conference on Neural Information Processing Systems, NeurIPS 2023
EditorsA. Oh, T. Neumann, A. Globerson, K. Saenko, M. Hardt, S. Levine
PublisherNeural information processing systems foundation
ISBN (Electronic)9781713899921
StatePublished - 2023
Event37th Conference on Neural Information Processing Systems, NeurIPS 2023 - New Orleans, United States
Duration: 10 Dec 202316 Dec 2023

Publication series

NameAdvances in Neural Information Processing Systems
Volume36
ISSN (Print)1049-5258

Conference

Conference37th Conference on Neural Information Processing Systems, NeurIPS 2023
Country/TerritoryUnited States
CityNew Orleans
Period10/12/2316/12/23

Fingerprint

Dive into the research topics of 'MARBLE: Music Audio Representation Benchmark for Universal Evaluation'. Together they form a unique fingerprint.

Cite this