[Objective] Online "water armies" distort the information available on the Web. This paper proposes two methods for detecting them. [Context] The methods can be used to detect "water army" activity on movie websites, e-commerce websites, and similar platforms. [Methods] The paper proposes a static and a dynamic detection method, and designs an intensity index that shows how the number of reviews posted in one day fluctuates relative to the overall level. [Results] Rating data are collected from the Douban movie site with data mining techniques and then analysed to identify the "water army", which verifies the effectiveness of the two detection methods. [Conclusions] Combining the static and dynamic detection methods can effectively detect the presence of the "water army" phenomenon. The approach has limitations, however; for example, insufficient rating data impairs detection.
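The abstract describes the intensity index only in words and does not give its formula. As a minimal sketch of the idea, assume the index for a day is that day's review count divided by the mean daily count over the whole observation period. The ratio, the function names `daily_intensity` and `flag_bursts`, and the threshold value are all illustrative assumptions, not the authors' definitions.

```python
from collections import Counter
from datetime import date


def daily_intensity(review_dates):
    """Map each day with reviews to (count that day) / (mean daily count).

    ASSUMPTION: this ratio is a hypothetical stand-in for the paper's
    intensity index, whose exact formula is not given in the abstract.
    The mean is taken over every calendar day from the first to the last
    review, so days without reviews lower the baseline.
    """
    counts = Counter(review_dates)
    if not counts:
        return {}
    first, last = min(counts), max(counts)
    n_days = (last - first).days + 1
    mean_per_day = sum(counts.values()) / n_days
    return {day: counts[day] / mean_per_day for day in sorted(counts)}


def flag_bursts(intensity, threshold=2.0):
    """Return the days whose intensity reaches the (assumed) threshold."""
    return [day for day, value in intensity.items() if value >= threshold]


# Toy data: one review per day for three days, then nine on the fourth day.
dates = (
    [date(2014, 1, 1), date(2014, 1, 2), date(2014, 1, 3)]
    + [date(2014, 1, 4)] * 9
)
intensity = daily_intensity(dates)
suspicious_days = flag_bursts(intensity)
```

In this toy data the fourth day carries nine of the twelve reviews, so it stands out against the average of three reviews per day. A real analysis would need a threshold calibrated on actual rating data.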
Wang Shuo, Xu Jian, Liu Ying. Research on Online "Water Army" Detection Methods[J]. New Technology of Library and Information Service, 2014, 30(7): 92-100.