Abstract
In Music Information Retrieval, classification of music genres is a core task and has been gaining increasing interest by adopting automated classification methods. Among these approaches, ensemble learning techniques have emerged as a promising solution by demonstrating their ability to enhance classification performance across diverse domains. However, traditional ensemble learning techniques may not deliver the desired accuracy improvements when applied to music datasets characterized by highly correlated low-level features associated with music genres. This study presents an innovative ensemble learning technique to address this challenge. The effectiveness of this approach is evaluated alongside established ensemble learning techniques by utilizing three publicly available music datasets, with two containing high-level sentiment-related features and one comprising low-level features extracted from music signals. The empirical experiments indicate that the proposed ensemble learning technique constantly outperforms conventional techniques in terms of classification accuracy. Notably, the proposed technique demonstrates remarkable performance enhancements when processing low-level features, whereas the traditional techniques failed to do so. This research highlights the substantial potential of advanced ensemble learning techniques in music genre classification and provides valuable insights into the strengths and limitations of various ensemble learning techniques when confronted with complex heterogeneous music datasets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bellman, R.E.: Dynamic Programming. Princeton University Press, New Jersey (2010)
Bonnin, G., Jannach, D.: Automated generation of music playlists: survey and experiments. ACM Comput. Surv. (CSUR) 47(2), 1–35 (2014)
Breiman, L.: Bagging predictors. Mach. Learn. 24, 123–140 (1996)
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
Chathuranga, D., Jayaratne, L.: Musical genre classification using ensemble of classifiers. In: 2012 Fourth International Conference on Computational Intelligence, Modelling and Simulation, pp. 237–242. IEEE (2012)
Chen, T., et al.: Xgboost: extreme gradient boosting. R Package Version 0.4-2, vol. 1, no. 4, pp. 1–4 (2015)
Cunningham, S.J., Jones, M., Jones, S.: Organizing digital music for use: an examination of personal music collections. In: International Society for Music Information Retrieval Conference. ISMIR (2004)
de Almeida, P.R.L., da Silva Júnior, E.J., Celinski, T.M., de Souza Britto, A., de Oliveira, L.E.S., Koerich, A.L.: Music genre classification using dynamic selection of ensemble of classifiers. In: 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 2700–2705. IEEE (2012)
Downie, J.S.: Music information retrieval. Ann. Rev. Inf. Sci. Technol. 37(1), 295–340 (2003)
Fu, Z., Lu, G., Ting, K.M., Zhang, D.: A survey of audio-based music classification and annotation. IEEE Trans. Multimedia 13(2), 303–319 (2010)
GuolinKe, Q.M., et al.: Lightgbm: a highly efficient gradient boosting decision tree. Adv. Neural Inf. Process. Syst. 30, 52 (2017)
Hamidani, Z.: Ultimate spotify tracks db. Kaggle (2021)
Hauke, J., Kossowski, T.: Comparison of values of Pearson’s and spearman’s correlation coefficients on the same sets of data. Quaestiones Geographicae 30(2), 87–93 (2011)
King, R.D., Orhobor, O.I., Taylor, C.C.: Cross-validation is safe to use. Nat. Mach. Intell. 3(4), 276 (2021)
Leartpantulak, K., Kitjaidure, Y.: Music genre classification of audio signals using particle swarm optimization and stacking ensemble. In: 2019 7th International Electrical Engineering Congress (iEECON), pp. 1–4. IEEE (2019)
Moura, L., Fontelles, E., Sampaio, V., França, M.: Music topics and metadata. Mendeley Data, V1 (2020)
Murtagh, F.: Multilayer perceptrons for classification and regression. Neurocomputing 2(5–6), 183–197 (1991)
Nanni, L., Costa, Y.M.G., Aguiar, R.L., Silla Jr, C.N., Brahnam, S.: Ensemble of deep learning, visual and acoustic features for music genre classification. J. New Music Res. 47(4), 383–397 (2018)
Olteanu, A.: Gtzan dataset - music genre classification. Kaggle (2021)
Peterson, L.E.: K-nearest neighbor. Scholarpedia 4(2), 1883 (2009)
Refaeilzadeh, P., Tang, L., Liu, H.: Cross-validation. Encycl. Database Syst. 5, 532–538 (2009)
Sanden, C., Zhang, J.Z.: Enhancing multi-label music genre classification through ensemble techniques. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 705–714 (2011)
Schapire, R.E.: The strength of weak learnability. Mach. Learn. 5, 197–227 (1990)
Schedl, M., Gómez, E., Urbano, J., et al.: Music information retrieval: recent developments and applications. Found. Trends® Inf. Retrieval 8(2–3), 127–261 (2014)
Sharma, S., Fulzele, P., Sreedevi, I.: Novel hybrid model for music genre classification based on support vector machine. In: 2018 IEEE Symposium on Computer Applications & Industrial Electronics, pp. 395–400. IEEE (2018)
Silla, C.N., Koerich, A.L., Kaestner, C.A.A.: A machine learning approach to automatic music genre classification. J. Braz. Comput. Soc. 14, 7–18 (2008)
Wolpert, D.H.: Stacked generalization. Neural Netw. 5(2), 241–259 (1992)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2025 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Shariat, R., Zhang, J.Z. (2025). Enhancing Music Genre Classification Using Augmented Features Ensemble Learning Technique. In: Hadfi, R., Anthony, P., Sharma, A., Ito, T., Bai, Q. (eds) PRICAI 2024: Trends in Artificial Intelligence. PRICAI 2024. Lecture Notes in Computer Science(), vol 15281. Springer, Singapore. https://doi.org/10.1007/978-981-96-0116-5_12
Download citation
DOI: https://doi.org/10.1007/978-981-96-0116-5_12
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-96-0115-8
Online ISBN: 978-981-96-0116-5
eBook Packages: Computer ScienceComputer Science (R0)