Explicit Rating Filling Strategy Based on Selection Data Bias Elimination and Conditional Generative Adversarial Networks

doi:10.11925/infotech.2096-3467.2022.0605

Data Analysis and Knowledge Discovery

2023, Vol. 7

Issue (6): 1-14 DOI: 10.11925/infotech.2096-3467.2022.0605

Current Issue | Archive | Adv Search

Explicit Rating Filling Strategy Based on Selection Data Bias Elimination and Conditional Generative Adversarial Networks

Shi Lei,Li Shuqing(

),Jiang Mingfeng,Zhang Zhiwang,Wang Yu

College of Information Engineering, Nanjing University of Finance & Economics, Nanjing 210023, China

Download: PDF (1271 KB) HTML ( 27 )
Export: BibTeX | EndNote (RIS)

Abstract

[Objective] This study is to address the issues of data sparsity and user selection bias in explicit rating data in recommender systems, by proposing a rating data filling model based on uninteresting item injection. [Methods] A general rating data filling model is constructed based on Conditional Generative Adversarial Networks framework. Denoising Auto-Encoder is used as the generator to capture the nonlinear potential factors behind the interaction and improve the robustness of model. To address the selection bias problem, uninteresting items are identified based on the user’s time point visibility, and are injected into the model by modifying the mask operation to generate data consistent with the user’s real rating distribution. [Results] Our experiments on MovieLens and Amazon datasets show that after data filling, the recommendation accuracy of ItemCF, BiasSVD, and AutoRec improves by more than three times on average. [Limitations] The data generation method relies on rating data and may not be effective in the case of extremely sparse rating data, such as in cold start scenarios. [Conclusions] The proposed model effectively alleviates data sparsity and eliminates selection bias, significantly improving the performance of recommended tasks of existing collaborative filtering methods.

Key words： Data Sparsity Selection Bias Generative Adversarial Networks Uninteresting Items Data Filling

Received: 13 June 2022 Published: 21 March 2023

ZTFLH:

TP393

Fund:Natural Science Major Foundation of the Jiangsu Higher Education Institutions of China(19KJA510011);National Natural Science Foundation of China(61877061)

Corresponding Authors: Li Shuqing，ORCID：0000-0001-9814-5766，E-mail：leeshuqing@163.com。

	Service

	E-mail this article
	Add to my bookshelf
	Add to citation manager
	E-mail Alert
	RSS
	Articles by authors
	Lei Shi
	Shuqing Li
	Mingfeng Jiang
	Zhiwang Zhang
	Yu Wang

Cite this article:

Shi Lei, Li Shuqing, Jiang Mingfeng, Zhang Zhiwang, Wang Yu. Explicit Rating Filling Strategy Based on Selection Data Bias Elimination and Conditional Generative Adversarial Networks. Data Analysis and Knowledge Discovery, 2023, 7(6): 1-14.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.2096-3467.2022.0605 OR https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2023/V7/I6/1

GAN Structure

DAE Structure

DAEGAN Structure

UIIGAN Structure

Statistics of Dataset

Data Enhancement Experiment Result of DAEGAN

Generated Rating Distribution of DAEGAN

θ on Error Rate
">

Effect of Parameters $θ$ on Error Rate

? δ in UIIGAN
">

Data Filling Effect of Parameter $? δ$ in UIIGAN

Recommendation Accuracy of MovieLens 100k

Recommendation Accuracy of MovieLens latest

Recommendation Accuracy of Amazon CD

Experiment Result on MovieLens 100k

Experiment Result on MovieLens latest

Experiment Result on Amazon CD

[1]	Lu J, Wu D S, Mao M S, et al. Recommender System Application Developments: A Survey[J]. Decision Support Systems, 2015, 74: 12-32. doi: 10.1016/j.dss.2015.03.008
[2]	Pan W S, Cui S, Wen H Y, et al. Correcting the User Feedback-Loop Bias for Recommendation Systems[OL]. arXiv Preprint, arXiv:2109.06037.
[3]	Marlin B M, Zemel R S. Collaborative Prediction and Ranking with Non-random Missing Data[C]// Proceedings of the 3rd ACM Conference on Recommender Systems. 2009: 5-12.
[4]	Schnabel T, Swaminathan A, Singh A, et al. Recommendations as Treatments: Debiasing Learning and Evaluation[C]// Proceedings of the 33rd International Conference on Machine Learning. 2016: 1670-1679.
[5]	Steck H. Evaluation of Recommendations: Rating-Prediction and Ranking[C]// Proceedings of the 7th ACM Conference on Recommender Systems. 2013: 213-220.
[6]	Steck H. Training and Testing of Recommender Systems on Data Missing not at Random[C]// Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2010: 713-722.
[7]	Goodfellow I J, Pouget-Abadie J, Mirza M, et al. Generative Adversarial Nets[C]// Proceedings of the 27th International Conference on Neural Information Processing Systems - Volume 2. 2014: 2672-2680.
[8]	Radford A, Metz L, Chintala S. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks[OL]. arXiv Preprint, arXiv:1511.06434.
[9]	Yu L T, Zhang W N, Wang J, et al. SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient[C]// Proceedings of the 31st AAAI Conference on Artificial Intelligence. 2017: 2852-2858.
[10]	Donahue C, McAuley J, Puckette M. Synthesizing Audio with Generative Adversarial Networks[OL]. arXiv Preprint, arXiv:1802.04208.
[11]	Antoniou A, Storkey A, Edwards H. Data Augmentation Generative Adversarial Networks[OL]. arXiv Preprint, arXiv:1711.04340.
[12]	Wang J, Yu L T, Zhang W N, et al. IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models[C]// Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2017: 515-524.
[13]	Wang H W, Wang J, Wang J L, et al. GraphGAN: Graph Representation Learning with Generative Adversarial Nets[C]// Proceedings of the 32nd AAAI Conference on Artificial Intelligence. 2018: 2508-2515.
[14]	Chae D K, Kim S W. Adversarial Training of Deep Autoencoders Towards Recommendation Tasks[C]// Proceedings of 2018 International Conference on Network Infrastructure and Digital Content. 2018: 91-95.
[15]	Chae D K, Kang J S, Kim S W, et al. CFGAN: A Generic Collaborative Filtering Framework Based on Generative Adversarial Networks[C]// Proceedings of the 27th ACM International Conference on Information and Knowledge Management. 2018: 137-146.
[16]	Maurera F B P, Dacrema M F, Cremonesi P. An Evaluation Study of Generative Adversarial Networks for Collaborative Filtering[OL]. arXiv Preprint, arXiv:2201.01815.
[17]	Wang Q Y, Yin H Z, Wang H, et al. Enhancing Collaborative Filtering with Generative Augmentation[C]// Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2019: 548-556.
[18]	Chae D K, Kim J, Chau D H, et al. AR-CF: Augmenting Virtual Users and Items in Collaborative Filtering for Addressing Cold-Start Problems[C]// Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 2020: 1251-1260.
[19]	Chen J W, Dong H D, Wang X, et al. Bias and Debias in Recommender System: A Survey and Future Directions[OL]. arXiv Preprint, arXiv:2010.03240.
[20]	Marlin B M, Zemel R S, Roweis S, et al. Collaborative Filtering and the Missing at Random Assumption[C]// Proceedings of the 23rd Conference on Uncertainty in Artificial Intelligence. 2007: 267-275.
[21]	Cremonesi P, Koren Y, Turrin R. Performance of Recommender Algorithms on Top-n Recommendation Tasks[C]// Proceedings of the 4th ACM Conference on Recommender Systems. 2010: 39-46.
[22]	李洋, 肖泽青, 聂松松, 等. 生成对抗网络及其在新能源数据质量中的应用研究综述[J]. 南方电网技术, 2020(2): 25-33.
[22]	(Li Yang, Xiao Zeqing, Nie Songsong, et al. Review of Research on Generative Adversarial Network and Its Application in New Energy Data Quality[J]. Southern Power System Technology, 2020(2): 25-33.)
[23]	Mirza M, Osindero S. Conditional Generative Adversarial Nets[OL]. arXiv Preprint, arXiv:1411.1784.
[24]	Wu Y, DuBois C, Zheng A X, et al. Collaborative Denoising Auto-Encoders for Top-n Recommender Systems[C]// Proceedings of the 9th ACM International Conference on Web Search and Data Mining. 2016: 153-162.
[25]	Chen H L, Wang S, Jiang N, et al. Trust‐Aware Generative Adversarial Network with Recurrent Neural Network for Recommender Systems[J]. International Journal of Intelligent Systems, 2021, 36(2): 778-795. doi: 10.1002/int.v36.2
[26]	Liang D W, Charlin L, McInerney J, et al. Modeling User Exposure in Recommendation[C]// Proceedings of the 25th International Conference on World Wide Web. 2016: 951-961.
[27]	He X N, Gao M, Kan M Y, et al. Predicting the Popularity of Web 2.0 Items Based on User Comments[C]// Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval. 2014: 233-242.
[28]	石磊, 李树青. 基于用户时点可见性的无趣项挖掘及协同过滤推荐方法[J]. 数据分析与知识发现, 2022, 6(5):64-76.
[28]	(Shi Lei, Li Shuqing. Mining Uninteresting Items with Visibility of User Time Points and Collaborative Filtering Recommendation Method[J]. Data Analysis and Knowledge Discovery, 2022, 6(5): 64-76.)
[29]	Sarwar B, Karypis G, Konstan J, et al. Item-Based Collaborative Filtering Recommendation Algorithms[C]// Proceedings of the 10th International Conference on World Wide Web. 2001: 285-295.
[30]	Koren Y, Bell R, Volinsky C. Matrix Factorization Techniques for Recommender Systems[J]. Computer, 2009, 42(8): 30-37.
[31]	Sedhain S, Menon A K, Sanner S, et al. AutoRec: Autoencoders Meet Collaborative Filtering[C]// Proceedings of the 24th International Conference on World Wide Web. 2015: 111-112.
[32]	Hwang W S, Parc J, Kim S W, et al. “Told You I Didn’t Like It”: Exploiting Uninteresting Items for Effective Collaborative Filtering[C]// Proceedings of 2016 IEEE 32nd International Conference on Data Engineering. 2016: 349-360.
[33]	He X N, Zhang H W, Kan M Y, et al. Fast Matrix Factorization for Online Recommendation with Implicit Feedback[C]// Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2016: 549-558.

[1]	Shi Lei, Li Shuqing. Mining Uninteresting Items with Visibility of User Time Points and Collaborative Filtering Recommendation Method[J]. 数据分析与知识发现, 2022, 6(5): 64-76.
[2]	Wang Yong,Wang Yongdong,Guo Huifang,Zhou Yumin. Measuring Item Similarity Based on Increment of Diversity[J]. 数据分析与知识发现, 2018, 2(5): 70-76.
[3]	Li Daoguo,Li Lianjie,Shen Enping. New Collaborative Filtering Recommendation Algorithm Based on User Rating Time[J]. 现代图书情报技术, 2016, 32(9): 65-69.

Viewed

Full text

Abstract

Cited

Shared

Discussed