Abstract
With the rapid increase of digital music on online music platforms, it has become difficult for users to find unknown but interesting songs. Although many collaborative filtering or content based recommendation methods have been proposed, they have various relatively serious some problems, including cold start, diversity of recommendations. etc. Therefore, we propose a reinforcement personal music recommendation system (RPMRS) to address these problems. RPMRS comprises two main components. First, deep representation of audio and lyrics extracted by WaveNet and Word2Vec models, respectively, and apply a proposed content based recommendation method from these. Second, we employ reinforcement learning is to learn user preferences from their song playing log. Experimental results confirm, that hybrid features are superior to audio or lyrics based features for content recommendation, largely because independent audio features significantly outperform lyrics features; and reinforcement learning improves personalized recommendations. Overall, the proposed RPMRS provides dynamic and personalized music recommendations for the user.
Similar content being viewed by others
Notes
Note: cite in Magenta | Make Music and Art Using Machine Learning. (Date Published: 2017, April 05). Retrieved from https://magenta.tensorflow.org/
Pre-trained WaveNet model, downloadable from NSynth – Neural Audio Synthesis: http://download.magenta.tensorflow.org/models/nsynth/wavenet-ckpt.tar
Note: cite in Magenta | The NSynth Dataset. (Date Published: 2017, April 05). Retrieved from https://magenta.tensorflow.org/datasets/nsynth#files
References
Hurley N, Zhang M (2011) Novelty and diversity in top-n recommendation – analysis and evaluation. ACM Trans Internet Technol (TOIT) 10(4):14
Cheng Z, Shen J, Nie L, Chua TS, Kankanhalli M (2017) Exploring user-specific information in music retrieval. In: Proceedings of the 40th international ACM SIGIR conference on Research and Development in Information Retrieval. ACM, pp 655–664
Wang X, Wang Y (2014) Improving content-based and hybrid music recommendation using deep learning. In: Proceedings of the 22nd ACM international conference on Multimedia. ACM, pp 627–636
Logan B (2004) Music recommendation from song sets. In: ISMIR, pp 425–428
Ricci F, Rokach L, Shapira B (2011) Introduction to recommender systems handbook. In: Recommender systems handbook. Springer, Boston, pp 1–35
Liang D, Zhan M, Ellis DP (2015) Content-aware collaborative music recommendation using pre-trained neural networks. In: ISMIR, pp 295–301
Gao H, Tang J, Hu X, Liu H (2015) Content-aware point of interest recommendation on location-based social networks. In: Twenty-ninth AAAI conference on Artificial Intelligence
Lu Z, Dou Z, Lian J, Xie X, Yang Q (2015) Content-based collaborative filtering for news topic recommendation. In: Twenty-ninth AAAI conference on Artificial Intelligence
Soares M, Viana P (2015) Tuning metadata for better movie content-based recommendation systems. Multimed Tools Appl 74(17):7015–7036
Yeh CH, Tseng WY, Chen CY, Lin YD, Tsai YR, Bi HI, Lin YC, Lin HY (2014) Popular music representation: chorus detection & emotion recognition. Multimed Tools Appl 73(3):2103–2128. https://doi.org/10.1007/s11042-013-1687-2
Tzanetakis G (2007) Marsyas submissions to MIREX 2007. In: Proceedings of the international conference on Music Information Retrieval
Peeters G (2008) A generic training and classification system for MIREX08 classification tasks: audio music mood, audio genre, audio artist and audio tag. In: Proceedings of the international conference on Music Information Retrieval
Mokhsin MB, Rosli NB, Wan Adnan WA, Abdul Manaf N (2014). Automatic music emotion classification using artificial neural network based on vocal and instrumental sound timbres. In: New trends in software methodologies, tools and techniques – Proceedings of the 13th SoMeT 2014. Frontiers in artificial intelligence and applications, vol 265, IOS Press, pp 3–14. https://doi.org/10.3233/978-1-61499-434-3-3
Kim Y, Schmidt E, Migneco R, Morton B, Richardson P, Scott J et al (2010) Music emotion recognition: a state of the art review. In: Proceedings of the 11th international Society for Music Information Retrieval Conference (ISMIR). ISMIR, Utrecht, pp 255–266
Li T, Ogihara M (2003) Detecting emotion in music
Van Den Oord A, Dieleman S, Zen H, Simonyan K, Vinyals O, Graves A, Kavukcuoglu K (2016) WaveNet: a generative model for raw audio. SSW 125
Engel J, Resnick C, Roberts A, Dieleman S, Norouzi M, Eck D, Simonyan K (2017) Neural audio synthesis of musical notes with wavenet autoencoders. In: Proceedings of the 34th international conference on Machine Learning-Volume 70, pp 1068–1077. JMLR.org
Huang JW, Chiang CW, Chang JW (2018) Email security level classification of imbalanced data using artificial neural network: the real case in a world-leading enterprise. Eng Appl Artif Intell 75:11–21
Chang JW, Lee MC, Su CY, Wang TI (2017) Effects of using self-explanation on a web-based Chinese sentence-learning system. Comput Assist Lang Learn 30(1–2):44–63
Chang JW, Lee MC, Wang TI (2016) Integrating a semantic-based retrieval agent into case-based reasoning systems: a case study of an online bookstore. Comput Ind 78:29–42
Huang PS, Chiu PS, Chang JW, Huang YM, Lee MC (2019) A study of using syntactic cues in short-text similarity measure. J Internet Technol 20(3):839–850
Lee MC, Chang JW, Hsieh TC (2014) A grammar-based semantic similarity algorithm for natural language sentences. Sci World J:2014
Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems, pp 3111–3119
Choi K, Downie JS (2018) Exploratory investigation of word embedding in song lyric topic classification: promising preliminary results. In: JCDL, pp 327–328
Delbouys R, Hennequin R, Piccoli F, Royo-Letelier J, Moussallam M (2018) Music mood detection based on audio and lyrics with deep neural net. arXiv preprint arXiv:1809.07276
Watkins CJCH (1989) Learning from delayed rewards
King J, Imbrasaitė V (2015) Generating music playlists with hierarchical clustering and Q-learning. In: European conference on Information Retrieval. Springer, Cham, pp 315–326
Zhao X, Zhang L, Ding Z, Xia L, Tang J, Yin D (2018) Recommendations with negative feedback via pairwise deep reinforcement learning. In: Proceedings of the 24th ACM SIGKDD international conference on Knowledge Discovery & Data Mining. ACM, pp 1040–1048
Taghipour N, Kardan A (2008) A hybrid web recommender system based on q-learning. In: Proceedings of the 2008 ACM symposium on Applied Computing. ACM, pp 1164–1168
Srivihok A, Sukonmanee P (2005) E-commerce intelligent agent: personalization travel support agent using Q Learning. In: Proceedings of the 7th international conference on Electronic Commerce. ACM, pp 287–292
Zheng G, Zhang F, Zheng Z, Xiang Y, Yuan NJ, Xie X, Li Z (2018). DRN: a deep reinforcement learning framework for news recommendation. In Proceedings of the 2018 World Wide Web conference on World Wide Web. International World Wide Web Conferences Steering Committee, pp 167–176
Iyyer M, Manjunatha V, Boyd-Graber J, Daumé III H (2015) Deep unordered composition rivals syntactic methods for text classification. In: Proceedings of the 53rd annual meeting of the Association for Computational Linguistics and the 7th international joint conference on Natural Language Processing (Volume 1: Long Papers), vol 1, pp 1681–1691
Acknowledgements
This work was supported by the Ministry of Science and Technology, Taiwan, R.O.C. [grant number MOST 108-2218-E-025-002-MY3], [grant number MOST 108-2218-E-001-001].
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Chang, JW., Chiou, CY., Liao, JY. et al. Music recommender using deep embedding-based features and behavior-based reinforcement learning. Multimed Tools Appl 80, 34037–34064 (2021). https://doi.org/10.1007/s11042-019-08356-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-019-08356-9