|
|
Predicting Online Music Playbacks and Influencing Factors |
Liu Yuanchen,Wang Hao(),Gao Yaqi |
School of Information Management, Nanjing University, Nanjing 210023, China Jiangsu Key Laboratory of Data Engineering and Knowledge Service, Nanjing 210023, China |
|
|
Abstract [Objective] This paper predicts the amount of music playbacks and explores the influencing factors, aiming to help online music platforms evaluate the quality of music lists. [Methods] First, we used a web-crawler to retrieve the numerical and text features of music playlists from the Netease cloud. Then, we pre-trained the texts with Word2Vec and BERT. Third, we established RF, XGBoost and DNN models to predict the amount of playbacks. [Results] We found the prediction accuracy of DNN was higher than those of RF and XGBoost. The numbers of initial playbacks, comments, favorites and forwarding of music list had the most significant impacts on the amount of the music list playbacks. However, the text features reduce the prediction accuracy. [Limitations] The Netease cloud music updated everyday, therefore, we only examined the playback data collected 12 hours following the updates. [Conclusions] This study could help online music websites preliminarily judge the popularity of their music lists.
|
Received: 18 October 2020
Published: 15 September 2021
|
|
Fund:National Social Science Fund of China(17ZDA291);Youth of Excellence in Social Sciences of Jiangsu Prince, Tang Scholar of Nanjing University |
Corresponding Authors:
Wang Hao ORCID:0000-0002-0131-0823
E-mail: ywhaowang@nju.edu.cn
|
[1] |
CNNIC. 第45次中国互联网络发展状况统计报告[R]. 中国互联网络信息中心, 2020.
|
[1] |
(CNNIC. The 45th China Statistical Report on Internet Development[R]. China Internet Network Information Center, 2020.)
|
[2] |
崔新平. 移动互联网时代中国音乐文化传播思考[J]. 四川戏剧, 2020(4):147-149.
|
[2] |
( Cui Xinping. Thoughts on the Dissemination of Chinese Musical Culture in the Era of Mobile Internet[J]. Sichuan Theatre, 2020(4):147-149.)
|
[3] |
刘晓明, 聂新磊. 网易云音乐定制化营销发展策略研究[J]. 价值工程, 2019, 38(28):88-89.
|
[3] |
( Liu Xiaoming, Nie Xinlei. Research on Netease Cloud Music Customization Marketing Development Strategy[J]. Value Engineering, 2019, 38(28):88-89.)
|
[4] |
陈晓宇, 付少雄, 邓胜利. 社会化问答用户信息搜寻的影响因素研究——一种混合方法的视角[J]. 图书情报工作, 2018, 62(20):102-111.
|
[4] |
( Chen Xiaoyu, Fu Shaoxiong, Deng Shengli. Analyzing the Influencing Factors of Internet Users’ Information-seeking Behavior: A Mixed-method Perspective[J]. Library and Information Service, 2018, 62(20):102-111.)
|
[5] |
崔连广, 闫旭, 张玉利. 心理因素联动对创业者决策逻辑的影响——一个基于QCA方法的研究[J]. 科学学与科学技术管理, 2020, 41(9):123-135.
|
[5] |
( Cui Lianguang, Yan Xu, Zhang Yuli. The Impact of Psychological Factors on Entrepreneurs’ Decision Logics: A Fuzzy-Set Qualitative Comparative Analysis[J]. Science of Science and Management of S.&.T., 2020, 41(9):123-135.)
|
[6] |
张宁, 袁勤俭. 用户视角下的学术社交网络信息质量影响因素研究——基于扎根理论方法[J]. 图书情报知识, 2018, 62(5):105-113.
|
[6] |
( Zhang Ning, Yuan Qinjian. The Influence Factors of Information Quality in Academic Social Networks from User' Perspective Based on Grounded Theory[J]. Document, Informaiton & Knowledge, 2018, 62(5):105-113.)
|
[7] |
姜文学, 王妍. “一带一路”电子产品贸易格局演变特征及影响因素研究——基于复杂网络分析方法[J]. 国际商务研究, 2020, 41(5):26-40.
|
[7] |
( Jiang Wenxue, Wang Yan. Research on Structural Change Characteristics and Influencing Factors of Electronic Products Trade Network along the Belt and Road: Based on Complex Network Analysis Method[J]. International Business Research, 2020, 41(5):26-40.)
|
[8] |
边璐, 王晓贺, 张江朋, 等. 稀土产品价格决定: 影响因素与预测方法综述[J]. 稀土, 2020, 41(4):146-158.
|
[8] |
( Bian Lu, Wang Xiaohe, Zhang Jiangpeng, et al. Review of Rare Earth Price: Influencing Factors and Forecasting Methods[J]. Chinese Rare Earths, 2020, 41(4):146-158.)
|
[9] |
李舟军, 范宇, 吴贤杰. 面向自然语言处理的预训练技术研究综述[J]. 计算机科学, 2020, 47(3):162-173.
|
[9] |
( Li Zhoujun, Fan Yu, Wu Xianjie. Survey of Natural Language Processing Pre-training Techniques[J]. Computer Science, 2020, 47(3):162-173.)
|
[10] |
黄丽明, 陈维政, 闫宏飞, 等. 基于循环神经网络和深度学习的股票预测方法[J]. 广西师范大学学报(自然科学版), 2019, 37(1):13-22.
|
[10] |
( Huang Liming, Chen Weizheng, Yan Hongfei, et al. A Stock Prediction Method Based on Recurrent Neural Network and Deep Learning[J]. Journal of Guangxi Normal University(Natural Science Edition), 2019, 37(1):13-22.)
|
[11] |
张晗, 贾甜远, 骆方, 等. 面向网络文本的BERT心理特质预测研究[J/OL]. 计算机科学与探索.[2020-11-21]. DOI: 10.3778/j.issn.1673-9418.2007009.
doi: 10.3778/j.issn.1673-9418.2007009
|
[11] |
( Zhang Han, Jia Tianyuan, Luo Fang, et al. A Study on Predicting Psychological Traits of Online Text by BERT[J/OL]. Journal of Frontiers of Computer Science and Technology.[2020-11-21]. DOI: 10.3778/j.issn.1673-9418.2007009.)
doi: 10.3778/j.issn.1673-9418.2007009
|
[12] |
方匡南, 吴见彬, 朱建平, 等. 随机森林方法研究综述[J]. 统计与信息论坛, 2011, 26(3):32-38.
|
[12] |
( Fang Kuangnan, Wu Jianbin, Zhu Jianping, et al. A Review of Technologies on Random Forests[J]. Journal of Statistics and Information, 2011, 26(3):32-38.)
|
[13] |
Malekipirbazari M, Aksakalli V. Risk Assessment in Social Lending via Random Forests[J]. Expert Systems with Applications, 2015, 42(10):4621-4631.
doi: 10.1016/j.eswa.2015.02.001
|
[14] |
Chen T, Guestrin C. XGBoost: A Scalable Tree Boosting System[C]// Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2016: 785-794.
|
[15] |
Pan B Y. Application of XGBoost Algorithm in Hourly PM2.5 Concentration Prediction[J]. IOP Conference Series: Earth and Environmental Science, 2018, 113:012127.
doi: 10.1088/1755-1315/113/1/012127
|
[16] |
Hinton G E, Osindero S, Teh Y W. A Fast Learning Algorithm for Deep Belief Nets[J]. Neural Computation, 2006, 18(7):1527-1554.
doi: 10.1162/neco.2006.18.7.1527
|
[17] |
Wu Y K, Tan H C, Qin L Q, et al. A Hybrid Deep Learning Based Traffic Flow Prediction Method and Its Understanding[J]. Transportation Research Part C-Emerging Technologies, 2018, 90:166-180.
doi: 10.1016/j.trc.2018.03.001
|
[18] |
Putin E, Mamoshina P, Aliper A, et al. Deep Biomarkers of Human Aging: Application of Deep Neural Networks to Biomarker Development[J]. Aging (Albany NY), 2016, 8(5):1021-1033.
|
[19] |
Mudambi S M, Schuff D. What Makes a Helpful Online Review? A Study of Customer Reviews on amazon.com[J]. MIS Quarterly, 2010, 34(1):185-200.
doi: 10.2307/20721420
|
[20] |
Chevalier J A, Mayzlin D. The Effect of Word of Mouth on Sales: Online Book Reviews[J]. Journal of Marketing Research, 2006, 43(3):345-354.
doi: 10.1509/jmkr.43.3.345
|
[21] |
李进华, 张婷婷. 社会化问答知识分享用户感知有用性影响因素研究——以知乎为例[J]. 现代情报, 2018, 38(4):20-28.
|
[21] |
( Li Jinhua, Zhang Tingting. Research on Influencing Factors of User Perceived Usefulness of Knowledge Sharing in Social Q&A——A Case Study of Zhihu[J]. Modern Information, 2018, 38(4):20-28.)
|
[22] |
单英骥, 邵鹏. 信息过载视角下用户创建资源列表扩散效果的影响因素研究——以网易云音乐为例[J]. 现代情报, 2019, 39(7):93-101.
|
[22] |
( Shan Yingji, Shao Peng. Research on the Influencing Factors of the User Generated Resource List Diffusion Effect in the Perspective of Information Overload——Take Netease Cloud Music as an Example[J]. Modern Information, 2019, 39(7):93-101.)
|
[23] |
Susarla A, Oh J H, Tan Y. Social Networks and the Diffusion of User-Generated Content: Evidence from YouTube[J]. Information Systems Research, 2012, 23(1):23-41.
doi: 10.1287/isre.1100.0339
|
[24] |
Mikolov T, Corrado G S, Chen K, et al. Efficient Estimation of Word Representations in Vector Space[C]// Proceedings of the International Conference on Learning Representations. 2013.
|
[25] |
Zheng X Q, Chen H Y, Xu T Y. Deep Learning for Chinese Word Segmentation and POS Tagging[C]// Proceedings of 2013 Conference on Empirical Methods in Natural Language Processing. 2013: 647-657.
|
[26] |
Xing C, Wang D, Zhang X W, et al. Document Classification with Distributions of Word Vectors[C]// Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference. 2014. DOI: 10.1109/APSIPA.2014.7041633.
doi: 10.1109/APSIPA.2014.7041633
|
[27] |
Kim H K, Kim H, Cho S. Bag-of-Concepts: Comprehending Document Representation Through Clustering Words in Distributed Representation[J]. Neurocomputing, 2017, 266:336-352.
doi: 10.1016/j.neucom.2017.05.046
|
[28] |
唐明, 朱磊, 邹显春. 基于Word2Vec的一种文档向量表示[J]. 计算机科学, 2016, 43(6):214-217, 269.
|
[28] |
( Tang Ming, Zhu Lei, Zou Xianchun. Document Vector Representation Based on Word2Vec[J]. Computer Science, 2016, 43(6):214-217, 269.)
|
[29] |
Devlin J, Chang M W, Lee K, et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding[OL]. arXiv Preprint, arXiv: 1810.04805.
|
[30] |
刘欢, 张智雄, 王宇飞. BERT模型的主要优化改进方法研究综述[J]. 数据分析与知识发现, 2021, 5(1):3-15.
|
[30] |
( Liu Huan, Zhang Zhixiong, Wang Yufei. A Review on Main Optimization Methods of BERT[J]. Data Analysis and Knowledge Discovery, 2021, 5(1):3-15.)
|
[31] |
Breiman L. Random Forests[J]. Machine Learning, 2001, 45:5-32.
doi: 10.1023/A:1010933404324
|
[32] |
刘艳丽. 随机森林综述[D]. 天津: 南开大学, 2008.
|
[32] |
( Liu Yanli. A Review of Random Forests[D]. Tianjin: Nankai University, 2008.)
|
[33] |
Chen T Q, He T. Higgs Boson Discovery with Boosted Trees[C]// Proceedings of the 2014 International Conference on High-Energy Physics and Machine Learning. 2014: 69-80.
|
[34] |
张永梅, 陈惠妮, 张奕. 基于XGBoost的雾霾预测方法[J]. 计算机工程与设计, 2019, 40(12):3631-3638.
|
[34] |
( Zhang Yongmei, Chen Huini, Zhang Yi. Haze Prediction Method Based on XGBoost[J]. Computer Engineering and Design, 2019, 40(12):3631-3638.)
|
[35] |
杨柳青, 查蓓, 陈伟. 基于深度神经网络的砂岩储层孔隙度预测方法[J]. 中国科技论文, 2020, 15(1):73-80.
|
[35] |
( Yang Liuqing, Zha Bei, Chen Wei. Prediction Method of Reservoir Porosity Based on Deep Neural Network[J]. China Sciencepaper, 2020, 15(1):73-80.)
|
[36] |
Sussman S W, Siegal W S. Informational Influence in Organizations: An Integrated Approach to Knowledge Adoption[J]. Information Systems Research, 2003, 14(1):47-65.
doi: 10.1287/isre.14.1.47.14767
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|