Please wait a minute...
Advanced Search
数据分析与知识发现  2018, Vol. 2 Issue (5): 23-31     https://doi.org/10.11925/infotech.2096-3467.2017.1218
  研究论文 本期目录 | 过刊浏览 | 高级检索 |
众包社区中基于敏感性分析的用户偏好挖掘模型及实验*
张亭亭1(), 赵宇翔2, 朱庆华3
1 南京大学工程管理学院 南京 210093
2 南京理工大学经济管理学院 南京 210094
3南京大学信息管理学院 南京 210093
Mining User Preferences in Crowdsourcing Community with Sensitivity Analysis
Zhang Tingting1(), Zhao Yuxiang2, Zhu Qinghua3
1School of Engineering and Management, Nanjing University, Nanjing 210093, China
2School of Economics and Management, Nanjing University of Science and Technology, Nanjing 210094, China
3School of Information Management, Nanjing University, Nanjing 210093, China
全文: PDF (577 KB)   HTML ( 3
输出: BibTeX | EndNote (RIS)      
摘要 

【目的】 对众包社区中用户及任务特征进行分析, 识别出众包用户的潜在兴趣偏好。【方法】在现有研究的基础上, 本文运用敏感性分析方法研究了众包用户对各任务属性特征的敏感性程度, 并结合二部图原理构建相应的众包用户潜在偏好挖掘模型, 挖掘出众包用户行为规律中所包含的隐性偏好信息, 并通过实验分析说明了该模型的有效性。【结果】本文提出的模型可以有效识别众包用户对于Books、Software、Music等属性特征的敏感性程度, 并挖掘出用户对于Pyrex Oblong Roaster、Oxford、Cashback等任务的潜在偏好, 预测其选择倾向。较传统协同过滤算法相比, 具有更小的MAE值。【局限】 本文偏好挖掘模型仅从竞赛型众包环境中的用户角度出发, 尚未考虑到协作型众包中不同用户的兴趣特征间的互补。【结论】本文模型不仅能够全面准确理解众包用户兴趣偏好, 还能挖掘众包用户潜在的偏好信息, 使得众包任务的分配更具有针对性, 从而增加众包任务分配的准确性。

服务
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章
张亭亭
赵宇翔
朱庆华
关键词 众包任务-用户匹配属性特征敏感性分析潜在偏好挖掘    
Abstract

[Objective] This paper analyzes the attributes and task characteristics of crowdsourcing community users, aiming to identify their potential interests or preferences. [Methods] First, we studied the user’s sensitivity to task attributes based on sensitivity analysis method, and constructed a model for mining user’s potential preferences with bipartite graph. Then, we used this model to discover the implicit preferences from user’s behaviors. Finally, we confirmed the validity of the proposed model through experimental analysis. [Results] Our model could effectively identify the degrees of users’ sensitivity to Books, Software, and Music etc. It could also discover users’ potential interests or preferences to Pyrex Oblong Roaster, Oxford, and Cashback etc. to predict their choices. Compared with traditional collaborative filtering algorithms, the proposed model has a smaller MAE value. [Limitations] Our preferences mining model is based on users in the competitive environment, and it does not consider the complementarity among the interests of collaborative users. [Conclusions] The proposed model could accurately understand the users’ interests in crowdsourcing community, and then reveal their potential preferences. It helps us effectively distribute crowdsourcing tasks.

Key wordsCrowdsourcing    Task-User Fit    Attribute Feature    Sensitivity Analysis    Potential Preferences Mining
收稿日期: 2017-12-05      出版日期: 2018-06-20
ZTFLH:  N99  
基金资助:*本文系国家社会科学基金重大项目“面向大数据的数字图书馆移动视觉搜索机制及应用研究”(项目编号: 15ZDB126)、国家自然科学基金面上项目“基于科研众包模式的公众科学项目运作与管理机制研究”(项目编号: 71774083)和国家自然科学基金青年项目“基于行动者网络框架的众包模式设计与管理研究”(项目编号: 71403119)的研究成果之一
引用本文:   
张亭亭, 赵宇翔, 朱庆华. 众包社区中基于敏感性分析的用户偏好挖掘模型及实验*[J]. 数据分析与知识发现, 2018, 2(5): 23-31.
Zhang Tingting,Zhao Yuxiang,Zhu Qinghua. Mining User Preferences in Crowdsourcing Community with Sensitivity Analysis. Data Analysis and Knowledge Discovery, 2018, 2(5): 23-31.
链接本文:  
https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.2096-3467.2017.1218      或      https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2018/V2/I5/23
  用户和任务属性特征图
User ID Products Categories of the Products Rating
5247778 ‘Wader Basics Tow Truck’ ‘Family’ 5
5247778 ‘Arthur in a pickle - Marc Brown' 'Books' 4
5647565 'Straightheads (DVD)' 'DVDs' 4
3338 'Oxford (England)' 'Travel' 4
6496990 'Top Gun (Blu-ray)' 'DVDs' 5
  实验数据结构
用户 用户编号
众包用户1 5247778
众包用户2 5647565
众包用户3 3338
众包用户4 5769137
众包用户5 6496990
众包用户6 5241292
  社区众包用户表
众包任务 描述
任务1 Pyrex Oblong Roaster
任务2 Wader Basics Tow Truck
任务3 Straw Craft: More GoldenDollies-M.Lambeth
任务4 Arthur in a pickle-Marc Brown
任务5 Straight heads (DVD)
任务6 The Daughter of Time-Josephine Tey
任务7 Rebecca Wheatley-The New Me Workout(DVD)
任务8 Oxford (England)
任务9 Cashback (DVD)
任务10 Movie Collector: DVD Database Software
任务11 Free All Angels [ECD]-Ash
任务12 Take A Break''s Fate & Fortune
  社区众包任务表
属性 描述 属性 描述
属性1 House & Garden 属性6 Software
属性2 Family 属性7 Food & Drink
属性3 Books 属性8 Beauty
属性4 DVDs 属性9 Music
属性5 Travel 属性10 Entertainment
  属性特征表
  社区性众包用户对于任务属性的敏感性值
  社区用户对众包任务的预测偏好值
  初始社区众包用户对众包任务的评分值
  两种方法预测效果的比较
[1] Howe J.The Rise of Crowdsourcing[J]. Wired Magazine, 2006, 14(6): 1-4.
[2] Zhao Y, Zhu Q.Evaluation on Crowdsourcing Research: Current Status and Future Direction[J]. Information Systems Frontiers, 2014, 16(3): 417-434.
doi: 10.1007/s10796-012-9350-4
[3] Nakatsu R T, Grossman E B, Iacovou C L.A Taxonomy of Crowdsourcing Based on Task Complexity[J]. Journal of Information Science, 2014, 40(6): 823-834.
doi: 10.1177/0165551514550140
[4] Zhao Y, Zhu Q.Effects of Extrinsic and Intrinsic Motivation on Participation in Crowdsourcing Contest[J]. Online Information Review, 2014, 38(7): 896-917.
doi: 10.1108/OIR-08-2014-0188
[5] 李肇明. 基于个人兴趣的用户偏好建模[D]. 昆明: 云南大学, 2013.
[5] (Li Zhaoming.User Preference Modeling Based on Personal Interest[D]. Kunming: Yunnan University, 2013.)
[6] Zhao Y, Zhu Q.Conceptualizing Task Affordance in Online Crowdsourcing Context[J]. Online Information Review, 2016, 40(7): 938-958.
doi: 10.1108/OIR-06-2015-0192
[7] Ho C J, Jabbari S, Vaughan J W.Adaptive Task Assignment for Crowdsourced Classification[C]//Proceedings of International Conference on Machine Learning. 2013: 534-542.
[8] Feldman M, Bernstein A.Cognition-based Task Routing: Towards Highly-Effective Task-Assignments in Crowdsourcing Settings[C]//Proceedings of the 35th International Conference on Information Systems(ICIS), Auckland, New Zealand.2014.
[9] Yuen M C, King I, Leung K S.Task Matching in Crowdsourcing[C]//Proceedings of the IEEE International Conferences on Internet of Things, and Cyber, Physical and Social Computing. 2012: 409-412.
[10] Geiger D, Schader M.Personalized Task Recommendation in Crowdsourcing Information Systems — Current State of the Art[J]. Decision Support Systems, 2014, 65(C): 3-16.
doi: 10.1016/j.dss.2014.05.007
[11] Herlocker J L, Konstan J A, Terveen L G, et al.Evaluating Collaborative Filtering Recommender Systems[J]. ACM Transactions on Information Systems, 2004, 22(1): 5-53.
doi: 10.1145/963770
[12] 胡昌平, 邵其赶, 孙高岭. 个性化信息服务中的用户偏好与行为分析[J]. 情报理论与实践, 2008, 31(1): 4-6.
[12] (Hu Changping, Shao Qigan, Sun Gaoling.User Preference and Behavior Analysis in Individual Information Service[J]. Information Studies: Theory & Application, 2008, 31(1): 4-6.)
[13] 刘远超, 王晓龙, 刘秉权, 等. 基于聚类分析策略的用户偏好挖掘[J]. 计算机应用研究, 2005, 22(12): 21-23.
doi: 10.3969/j.issn.1001-3695.2005.12.006
[13] (Liu Yuanchao, Wang Xiaolong, Liu Bingquan, et al.A Cluster-based Approach on Mining Text Preference[J]. Application Research of Computers, 2005, 22(12): 21-23.)
doi: 10.3969/j.issn.1001-3695.2005.12.006
[14] 孔繁超. 个性化信息服务中用户偏好的动态挖掘[J]. 情报理论与实践, 2009, 32(6): 111-113.
[14] (Kong Fanchao.Dynamic Mining of User Preferences in Individual Information Service[J]. Information Studies: Theory & Application, 2009, 32(6): 111-113.)
[15] 林霜梅, 汪更生, 陈弈秋. 个性化推荐系统中的用户建模及特征选择[J]. 计算机工程, 2007, 33(17): 196-198.
[15] (Lin Shuangmei, Wang Gengsheng, Chen Yiqiu.User Modeling and Feature Selection in Personalized Recommending System[J]. Computer Engineering, 2007, 33(17): 196-198.)
[16] 朱小宁, 双锴, 程祥. 基于用户兴趣和能力实现任务分发的众包平台[OL]. 中国科技论文在线, 2014. .
[16] (Zhu Xiaoning, Shuang Kai, Cheng Xiang. A Crowdsourcing Platform for Task Distribution Based on User Interest and Expertise[OL]. China Sciencepaper Online, 2014.
[17] Minder P, Seuken S, Bernstein A.Crowdmanager- combinatorial Allocation and Pricing of Crowdsourcing Tasks with Time Constraints[C]// Proceedings of the Workshop on Social Computing and User Generated Content in Conjunction with ACM Conference on Electronic Commerce, Valencia, Spain.2012: 1-18.
[18] Karger D R, Oh S, Shah D.Budget-optimal Crowdsourcing Using Low-rank Matrix Approximations[C]// Proceedings of the 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton). 2011.
[19] Peter J P, Olson J C.Consumer Behavior & Marketing Strategy[M]. McGraw-Hill, 2010.
[20] 贾大文, 曾承, 彭智勇, 等. 一种基于用户偏好自动分类的社会媒体共享和推荐方法[J]. 计算机学报, 2012, 35(11): 2381-2391.
doi: 10.3724/SP.J.1016.2012.02381
[20] (Jia Dawen, Zeng Cheng, Peng Zhiyong, et al.A User Preference Based Automatic Potential Group Generation Method for Social Media Sharing and Recommendation[J]. Chinese Journal of Computers, 2012, 35(11): 2381-2391.)
doi: 10.3724/SP.J.1016.2012.02381
[21] 李聪, 梁昌勇. 基于属性值偏好矩阵的协同过滤推荐算法[J]. 情报学报, 2008, 27(6): 884-890.
doi: 10.3969/j.issn.1000-0135.2008.06.013
[21] (Li Cong, Liang Changyong.A Collaborative Filtering Recommendation Algorithm Based on Attributes-value Preference Matrix[J]. Journal of the China Society for Scientific and Technical Information, 2008, 27(6): 884-890.)
doi: 10.3969/j.issn.1000-0135.2008.06.013
[22] Horvath T.A Model of User Preference Learning for Content-Based Recommender Systems[J]. Computing & Informatics, 2012, 28(4): 453-481.
doi: 10.1016/j.cviu.2008.08.001
[23] Frey H C, Patil S R.Identification and Review of Sensitivity Analysis Methods[J]. Risk Analysis, 2002, 22(3): 553-578.
doi: 10.1111/risk.2002.22.issue-3
[24] Boudreau K J, Lakhani K R.Using the Crowd as an Innovation Partner[J]. Harvard Business Review, 2013, 91(4): 60-69.
[25] Afuah A, Tucci C L.Crowdsourcing as a Solution to Distant Search[J]. Academy of Management Review, 2012, 37(3): 355-375.
doi: 10.5465/amr.2010.0146
[26] Kleinbaum D, Kupper L.Applied Regression Analysis and Other Multivariate Methods[J]. Technometrics, 2008, 31(1): 117-118.
doi: 10.1080/00401706.1989.10488486
[27] 何晓群, 刘文卿. 应用回归分析[M]. 北京: 中国人民大学出版社, 2007: 58-79.
[27] (He Xiaoqun, Liu Wenqing.Applied Regression Analysis[M]. Beijing: China Renmin University Press, 2007: 58-79.)
[28] 张新猛, 蒋盛益. 基于加权二部图的个性化推荐算法[J]. 计算机应用, 2012, 32(3): 654-657, 678.
doi: 10.3724/SP.J.1087.2012.00654
[28] (Zhang Xinmeng, Jiang Shengyi.Personalized Recommendation Algorithm Based on Weighted Bipartite Network[J]. Journal of Computer Applications, 2012, 32(3): 654-657, 678.)
doi: 10.3724/SP.J.1087.2012.00654
[29] Zhou T, Ren J, Medo M, et al.How to Project a Bipartite Network?[J]. Physics, 2007, 76(4): 70-80.
[1] 赵宇翔,练靖雯. 数字人文视域下文化遗产众包研究综述*[J]. 数据分析与知识发现, 2021, 5(1): 36-55.
[2] 高广尚. 关于实体解析基本方法的研究和述评*[J]. 数据分析与知识发现, 2019, 3(5): 27-40.
[3] 李慧,柴亚青. 基于卷积神经网络的细粒度情感分析方法*[J]. 数据分析与知识发现, 2019, 3(1): 95-103.
[4] 周成, 魏红芹. 基于随机森林属性约简的众包竞赛参与者识别体系研究*[J]. 数据分析与知识发现, 2018, 2(7): 46-54.
[5] 王倩倩, 袁勤俭. 卖家描述与买家评论相符度模型研究[J]. 现代图书情报技术, 2014, 30(5): 58-65.
[6] 李兵, 徐玮佳, 张婧璇. 利用微博数据的产品评价方法研究——以“安卓系统”评价为例[J]. 现代图书情报技术, 2014, 30(4): 92-98.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
版权所有 © 2015 《数据分析与知识发现》编辑部
地址:北京市海淀区中关村北四环西路33号 邮编:100190
电话/传真:(010)82626611-6626,82624938
E-mail:jishu@mail.las.ac.cn