Transfer Learning for Music Genre Classification
Abstract
Modern music information retrieval systems provide high-level features (genre, instrument, mood, and so on) for convenient searching and recommendation. Among these music tags, genre is the most widely used in practice. Machine learning techniques can catalogue different genres from raw music, but their final performance depends heavily on the features used. As powerful learning algorithms, deep neural networks can extract useful features automatically and effectively, replacing time-consuming feature engineering. However, deeper architectures require larger amounts of data to train, and in many cases we may not have enough data to train a deep network. Transfer learning addresses this problem by pre-training the network on a similar task that has enough data and then fine-tuning the parameters of the pre-trained network on the target dataset. The MagnaTagATune dataset is used to pre-train the proposed five-layer Recurrent Neural Network (RNN) with Gated Recurrent Units (GRU), and the scattering transform is used to reduce the dimensionality of the network input. The GTZAN dataset is then used as the target dataset for genre classification. Experimental results show that the transfer learning approach achieves a higher average classification accuracy (95.8%) than the same deep RNN with randomly initialized parameters (93.5%). In addition, the deep RNN using transfer learning converges to its final accuracy faster than with random initialization.
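The overall recipe can be illustrated with a minimal PyTorch sketch: pre-train a stacked GRU on the tagging task, then reuse its recurrent layers and attach a fresh classification head for the genre task. All layer sizes, tag/genre counts, and hyperparameters below are illustrative assumptions, not the paper's exact configuration.

```python
# Minimal sketch of the transfer-learning setup described above (assumed PyTorch).
import torch
import torch.nn as nn

class GenreGRU(nn.Module):
    """Five-layer stacked GRU over a sequence of scattering-transform frames."""
    def __init__(self, input_dim: int, hidden_dim: int, num_outputs: int):
        super().__init__()
        # Stacked GRU: 5 recurrent layers, batch-first input of shape (B, T, input_dim).
        self.gru = nn.GRU(input_dim, hidden_dim, num_layers=5, batch_first=True)
        # Task-specific head: tag logits for pre-training, genre logits for fine-tuning.
        self.head = nn.Linear(hidden_dim, num_outputs)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        _, h_n = self.gru(x)       # h_n: (num_layers, B, hidden_dim)
        return self.head(h_n[-1])  # classify from the last layer's final hidden state

# 1) Pre-train on MagnaTagATune (multi-label tagging; 50 tags is an assumption).
pretrain_model = GenreGRU(input_dim=430, hidden_dim=256, num_outputs=50)
# ... train pretrain_model with nn.BCEWithLogitsLoss() on MagnaTagATune here ...

# 2) Transfer: copy the recurrent layers, attach a fresh 10-genre head for GTZAN.
finetune_model = GenreGRU(input_dim=430, hidden_dim=256, num_outputs=10)
finetune_model.gru.load_state_dict(pretrain_model.gru.state_dict())
# ... fine-tune finetune_model with nn.CrossEntropyLoss() on GTZAN here ...

# The random-initialization baseline simply skips the load_state_dict step.
```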
Domains
Computer Science [cs]

Origin: Files produced by the author(s)