[Objective] This study is to address the issues of data sparsity and user selection bias in explicit rating data in recommender systems, by proposing a rating data filling model based on uninteresting item injection. [Methods] A general rating data filling model is constructed based on Conditional Generative Adversarial Networks framework. Denoising Auto-Encoder is used as the generator to capture the nonlinear potential factors behind the interaction and improve the robustness of model. To address the selection bias problem, uninteresting items are identified based on the user’s time point visibility, and are injected into the model by modifying the mask operation to generate data consistent with the user’s real rating distribution. [Results] Our experiments on MovieLens and Amazon datasets show that after data filling, the recommendation accuracy of ItemCF, BiasSVD, and AutoRec improves by more than three times on average. [Limitations] The data generation method relies on rating data and may not be effective in the case of extremely sparse rating data, such as in cold start scenarios. [Conclusions] The proposed model effectively alleviates data sparsity and eliminates selection bias, significantly improving the performance of recommended tasks of existing collaborative filtering methods.
石磊, 李树青, 蒋明锋, 张志旺, 王愈. 融合选择数据偏差消除和条件生成对抗网络的显式评分填充策略*[J]. 数据分析与知识发现, 2023, 7(6): 1-14.
Shi Lei, Li Shuqing, Jiang Mingfeng, Zhang Zhiwang, Wang Yu. Explicit Rating Filling Strategy Based on Selection Data Bias Elimination and Conditional Generative Adversarial Networks. Data Analysis and Knowledge Discovery, 2023, 7(6): 1-14.
Lu J, Wu D S, Mao M S, et al. Recommender System Application Developments: A Survey[J]. Decision Support Systems, 2015, 74: 12-32.
doi: 10.1016/j.dss.2015.03.008
[2]
Pan W S, Cui S, Wen H Y, et al. Correcting the User Feedback-Loop Bias for Recommendation Systems[OL]. arXiv Preprint, arXiv:2109.06037.
[3]
Marlin B M, Zemel R S. Collaborative Prediction and Ranking with Non-random Missing Data[C]// Proceedings of the 3rd ACM Conference on Recommender Systems. 2009: 5-12.
[4]
Schnabel T, Swaminathan A, Singh A, et al. Recommendations as Treatments: Debiasing Learning and Evaluation[C]// Proceedings of the 33rd International Conference on Machine Learning. 2016: 1670-1679.
[5]
Steck H. Evaluation of Recommendations: Rating-Prediction and Ranking[C]// Proceedings of the 7th ACM Conference on Recommender Systems. 2013: 213-220.
[6]
Steck H. Training and Testing of Recommender Systems on Data Missing not at Random[C]// Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2010: 713-722.
[7]
Goodfellow I J, Pouget-Abadie J, Mirza M, et al. Generative Adversarial Nets[C]// Proceedings of the 27th International Conference on Neural Information Processing Systems - Volume 2. 2014: 2672-2680.
[8]
Radford A, Metz L, Chintala S. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks[OL]. arXiv Preprint, arXiv:1511.06434.
[9]
Yu L T, Zhang W N, Wang J, et al. SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient[C]// Proceedings of the 31st AAAI Conference on Artificial Intelligence. 2017: 2852-2858.
[10]
Donahue C, McAuley J, Puckette M. Synthesizing Audio with Generative Adversarial Networks[OL]. arXiv Preprint, arXiv:1802.04208.
[11]
Antoniou A, Storkey A, Edwards H. Data Augmentation Generative Adversarial Networks[OL]. arXiv Preprint, arXiv:1711.04340.
[12]
Wang J, Yu L T, Zhang W N, et al. IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models[C]// Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2017: 515-524.
[13]
Wang H W, Wang J, Wang J L, et al. GraphGAN: Graph Representation Learning with Generative Adversarial Nets[C]// Proceedings of the 32nd AAAI Conference on Artificial Intelligence. 2018: 2508-2515.
[14]
Chae D K, Kim S W. Adversarial Training of Deep Autoencoders Towards Recommendation Tasks[C]// Proceedings of 2018 International Conference on Network Infrastructure and Digital Content. 2018: 91-95.
[15]
Chae D K, Kang J S, Kim S W, et al. CFGAN: A Generic Collaborative Filtering Framework Based on Generative Adversarial Networks[C]// Proceedings of the 27th ACM International Conference on Information and Knowledge Management. 2018: 137-146.
[16]
Maurera F B P, Dacrema M F, Cremonesi P. An Evaluation Study of Generative Adversarial Networks for Collaborative Filtering[OL]. arXiv Preprint, arXiv:2201.01815.
[17]
Wang Q Y, Yin H Z, Wang H, et al. Enhancing Collaborative Filtering with Generative Augmentation[C]// Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2019: 548-556.
[18]
Chae D K, Kim J, Chau D H, et al. AR-CF: Augmenting Virtual Users and Items in Collaborative Filtering for Addressing Cold-Start Problems[C]// Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 2020: 1251-1260.
[19]
Chen J W, Dong H D, Wang X, et al. Bias and Debias in Recommender System: A Survey and Future Directions[OL]. arXiv Preprint, arXiv:2010.03240.
[20]
Marlin B M, Zemel R S, Roweis S, et al. Collaborative Filtering and the Missing at Random Assumption[C]// Proceedings of the 23rd Conference on Uncertainty in Artificial Intelligence. 2007: 267-275.
[21]
Cremonesi P, Koren Y, Turrin R. Performance of Recommender Algorithms on Top-n Recommendation Tasks[C]// Proceedings of the 4th ACM Conference on Recommender Systems. 2010: 39-46.
(Li Yang, Xiao Zeqing, Nie Songsong, et al. Review of Research on Generative Adversarial Network and Its Application in New Energy Data Quality[J]. Southern Power System Technology, 2020(2): 25-33.)
[23]
Mirza M, Osindero S. Conditional Generative Adversarial Nets[OL]. arXiv Preprint, arXiv:1411.1784.
[24]
Wu Y, DuBois C, Zheng A X, et al. Collaborative Denoising Auto-Encoders for Top-n Recommender Systems[C]// Proceedings of the 9th ACM International Conference on Web Search and Data Mining. 2016: 153-162.
[25]
Chen H L, Wang S, Jiang N, et al. Trust‐Aware Generative Adversarial Network with Recurrent Neural Network for Recommender Systems[J]. International Journal of Intelligent Systems, 2021, 36(2): 778-795.
doi: 10.1002/int.v36.2
[26]
Liang D W, Charlin L, McInerney J, et al. Modeling User Exposure in Recommendation[C]// Proceedings of the 25th International Conference on World Wide Web. 2016: 951-961.
[27]
He X N, Gao M, Kan M Y, et al. Predicting the Popularity of Web 2.0 Items Based on User Comments[C]// Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval. 2014: 233-242.
(Shi Lei, Li Shuqing. Mining Uninteresting Items with Visibility of User Time Points and Collaborative Filtering Recommendation Method[J]. Data Analysis and Knowledge Discovery, 2022, 6(5): 64-76.)
[29]
Sarwar B, Karypis G, Konstan J, et al. Item-Based Collaborative Filtering Recommendation Algorithms[C]// Proceedings of the 10th International Conference on World Wide Web. 2001: 285-295.
[30]
Koren Y, Bell R, Volinsky C. Matrix Factorization Techniques for Recommender Systems[J]. Computer, 2009, 42(8): 30-37.
[31]
Sedhain S, Menon A K, Sanner S, et al. AutoRec: Autoencoders Meet Collaborative Filtering[C]// Proceedings of the 24th International Conference on World Wide Web. 2015: 111-112.
[32]
Hwang W S, Parc J, Kim S W, et al. “Told You I Didn’t Like It”: Exploiting Uninteresting Items for Effective Collaborative Filtering[C]// Proceedings of 2016 IEEE 32nd International Conference on Data Engineering. 2016: 349-360.
[33]
He X N, Zhang H W, Kan M Y, et al. Fast Matrix Factorization for Online Recommendation with Implicit Feedback[C]// Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2016: 549-558.