跳到主要导航 跳到搜索 跳到主要内容

A bag-of-tones model with MFCC features for musical genre classification

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Musical genres are categorical labels created by humans to characterize pieces of music. These labels may be highly subjective but typically are related to the instrumentation, rhythmic structure, and harmonic content of the music. In this paper, we propose a model for music genre classification. The new model is referred to as the bag-of-tones (BOT) model which follows the conceptually similar idea of the bag-of-words (BOW) model in natural language processing and the bag-of-feature (BOF) model in image processing. The basic low-level music features such as Mel-frequency cepstral coefficients (MFCC) are clustered into a set of codewords referred to as "tones". By using such a model, each piece of music can be represented by a new feature vector of distribution on tones. Classical machine learning models such as support vector machines (SVM) can be applied for genre classification. The model is tested using two datasets. We found that the polynomial kernel function has the best performance in the SVM classification. By comparing to the previous work, we found the new proposed model outperform classical models on a given benchmark dataset. In general, this model can be used to structure the large collections of music available on the Web. It can play an important role in automatic digital music categorization and retrieval.

源语言英语
主期刊名Advanced Data Mining and Applications - 9th International Conference, ADMA 2013, Proceedings
564-575
页数12
版本PART 1
DOI
出版状态已出版 - 2013
活动9th International Conference on Advanced Data Mining and Applications, ADMA 2013 - Hangzhou, 中国
期限: 14 12月 201316 12月 2013

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
编号PART 1
8346 LNAI
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议9th International Conference on Advanced Data Mining and Applications, ADMA 2013
国家/地区中国
Hangzhou
时期14/12/1316/12/13

指纹

探究 'A bag-of-tones model with MFCC features for musical genre classification' 的科研主题。它们共同构成独一无二的指纹。

引用此