面向企业微博的客户细分框架<sup>*</sup>

doi:10.11925/infotech.1003-3513.2016.02.06

现代图书情报技术

2016, Vol. 32

Issue (2): 43-51 https://doi.org/10.11925/infotech.1003-3513.2016.02.06

研究论文

本期目录 | 过刊浏览 | 高级检索

面向企业微博的客户细分框架^*

陈东沂^1,³,周子程¹(

),蒋盛益¹,王连喜²,吴佳林¹

¹广东外语外贸大学信息学院广州 510006
²广东外语外贸大学图书馆广州 510420
³顺丰科技有限公司深圳 518000

A Framework for Customer Segmentation on Enterprises’ Microblog

Chen Dongyi^1,³,Zhou Zicheng¹(

),Jiang Shengyi¹,Wang Lianxi²,Wu Jialin¹

¹School of Informatics, Guangdong University of Foreign Studies, Guangzhou 510006, China
²Guangdong University of Foreign Studies Library, Guangzhou 510420, China
³S.F.EXPRESS Co. Ltd., Shenzhen 518000, China

摘要
参考文献
相关文章
Metrics

全文: PDF (1455 KB) HTML ( 48 )
输出: BibTeX | EndNote (RIS)

摘要

【目的】为有效解决微博客户特性的表示问题, 以更好地实施企业微博客户细分。【方法】借助微博平台上客户的个人和社会关系特性, 利用客户及其好友的自定义标签表示客户的特性, 采用基于非负矩阵分解的文本聚类方法, 提出一种面向企业微博的客户细分框架。【结果】实验结果表明, 基于非负矩阵分解的方法取得约86.130%的asw指标平均值, 远远超出基于K-means和层次聚类的方法。【局限】只通过融合微博客户个人及其关注好友的标签表示微博客户特性的方法不能够全面刻画客户特征。【结论】能够为企业微博客户细分中的客户特性的表示、细分、评价及结果可视化等问题提供参考和借鉴。

	服务

	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	陈东沂
	周子程
	蒋盛益
	王连喜
	吴佳林

关键词 ：客户细分, 微博营销, 文本聚类, 非负矩阵分解

Abstract：

[Objective] This study tried to describe the customers’ characteristics effectively. [Methods] The proposed framework aimed to explore the personal and social relationship among the customers and their friends on the microblog platform. We described the customers’ characteristics using self-defined tags and then created segmentation with the help of text clustering and non-negative matrix factorization technologies. [Results] The method based on non-negative matrix factorization achieved an approximately 86.130% on average asw index, which outperformed traditional methods based on K-means and hierarchical clustering. [Limitations] The customers’ characteristic cannot be described only by himself and his friends with self-defined tags on Microblogging. [Conclusions] The proposed framework could improve the effectiveness of characteristics description, evaluation and visualization of microblog customer segmentation.

Key words： Customer segmentation Microblogging marketing Text clustering Non-negative matrix factorization

收稿日期: 2015-07-27 出版日期: 2016-03-08

基金资助:*本文系国家自然科学基金项目“面向微博公共事件的反向社会情绪识别及演化分析研究”(项目编号:61572145)、广东省科技计划项目 “广东省企业竞争情报信息提取及态势推理机制研究——以汽车行业为例”(项目编号:2015A030401093)和广东大学生科技创新培育专项资金项目“微博用户生成内容挖掘及其在微博营销领域的应用研究”(项目编号:308-GK151019)的研究成果之一

引用本文:

陈东沂,周子程,蒋盛益,王连喜,吴佳林. 面向企业微博的客户细分框架^*[J]. 现代图书情报技术, 2016, 32(2): 43-51.
Chen Dongyi,Zhou Zicheng,Jiang Shengyi,Wang Lianxi,Wu Jialin. A Framework for Customer Segmentation on Enterprises’ Microblog. New Technology of Library and Information Service, 2016, 32(2): 43-51.

链接本文:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.1003-3513.2016.02.06 或 https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2016/V32/I2/43

[1]	Pang G S, Jiang S Y, Chen D Y.A Simple Integration of Social Relationship and Text Data for Identifying Potential Customers in Microblogging [A]. //Advanced Data Mining and Applications[M]. Springer Berlin Heidelberg, 2013: 397-409.
[2]	Hennig-Thurau T, Malthouse E C, Friege C, et al.The Impact of New Media on Customer Relationships[J]. Journal of Service Research, 2010, 13(3): 311-330.
[3]	Stelzner M A. Social Media Marketing Industry Report [EB/OL]. [2016-06-15]. .
[4]	Rajagopal S.Customer Data Clustering Using Data Mining Technique[J]. International Journal of Database Management Systems, 2011, 3(4): 1-11.
[5]	Lefait G, Kechadi T.Customer Segmentation Architecture Based on Clustering Techniques [C]. In: Proceedings of the 4th International Conference on Digital Society. IEEE, 2010: 243-248.
[6]	Wu J, Lin Z.Research on Customer Segmentation Model by Clustering [C]. In: Proceedings of the 7th International Conference on Electronic Commerce. ACM, 2005: 316-318.
[7]	Pennacchiotti M, Popescu A M.Democrats, Republicans and Starbucks Afficionados: User Classification in Twitter [C]. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2011: 430-438.
[8]	Tinati R, Carr L, Hall W, et al.Identifying Communicator Roles in Twitter[C]. In: Proceedings of the 21st International Conference Companion on World Wide Web. ACM, 2012: 1161-1168.
[9]	Fink C, Kopecky J, Morawskib M.Inferring Gender from the Content of Tweets: A Region Specific Example [C]. In: Proceedings of the 6th International AAAI Conference on Weblogs and Social Media, Dublin, Ireland. AAAI, 2012: 459-462.
[10]	Steinbach M, Karypis G, Kumar V.A Comparison of Document Clustering Techniques [C]. In: Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2000: 1-20.
[11]	Jain A K, Murty M N, Flynn P J.Data Clustering: A Review[J]. ACM Computing Surveys, 1999, 31(3): 264-323.
[12]	Willett P.Recent Trends in Hierarchic Document Clustering: A Critical Review[J]. Information Processing and Management, 1988, 24(5): 577-597.
[13]	Rao D, Yarowsky D, Shreevats A, et al.Classifying Latent User Attributes in Twitter [C]. In: Proceedings of the 2nd International Workshop on Search and Mining User-generated Contents. ACM, 2010: 37-44.
[14]	Lee D D, Seung H S.Learning the Parts of Objects by Non-negative Matrix Factorization[J]. Nature, 1999, 401(6755): 788-791.
[15]	Shahnaz F, Berry M W, Pauca V P, et al.Document Clustering Using Nonnegative Matrix Factorization[J]. Information Processing & Management, 2006, 42(2): 373-386.
[16]	Wang X, Tang J, Liu H.Document Clustering via Matrix Representation [C]. In: Proceedings of the 11th International Conference on Data Mining. IEEE, 2011: 804-813.
[17]	Gautam B P, Shrestha D.Document Clustering Through Non-Negative Matrix Factorization: A Case Study of Hadoop for Computational Time Reduction of Large Scale[J]. 稚内北星学園大学紀要, 2010, 10(3): 15-25.
[18]	黄钢石, 陆建江, 张亚非. 基于NMF的文本聚类方法[J]. 计算机工程, 2004, 30(11):113-114.
[18]	(Huang Ggangshi, Lu Jianjiang, Zhang Yafei.Text Clustering Method Based on Non-negative Matrix Factorization[J]. Computer Engineering, 2004, 30(11): 113-114.)
[19]	张磊, 冯晓森, 项学智. 基于非负矩阵分解的中文文本主题分类[J]. 计算机工程, 2009, 35(13):26-27.
[19]	(Zhang Lei, Feng Xiaosen, Xiang Xuezhi.Topic Classification of Chinese Document Based on NMF[J]. Computer Engineering, 2009, 35(13): 26-27.)
[20]	Calinski T, Harabasz J.A Dendrite Method for Cluster Analysis[J]. Communications in Statistics, 1974, 3(1): 1-27.
[21]	Rousseeuw P J.Silhouettes: A Graphical Aid to the Interpretation and Validation of Cluster Analysis[J]. Journal of Computational and Applied Mathematics, 1987, 20(1): 53-65.
[22]	Brunet J P, Tamayo P, Golub T, et al.Metagenes and Molecular Pattern Discovery Using Matrix Factorization[J]. Proceedings of the National Academy of Sciences (PNAS), 2004, 101(12): 4164-4169.

[1]	闫春,刘璐. 基于改进SOM神经网络模型与RFM模型的非寿险客户细分研究*[J]. 数据分析与知识发现, 2020, 4(4): 83-90.
[2]	赵华茗,余丽,周强. 基于均值漂移算法的文本聚类数目优化研究 ^*[J]. 数据分析与知识发现, 2019, 3(9): 27-35.
[3]	陆泉,朱安琪,张霁月,陈静. *中文网络健康社区中的用户信息需求挖掘研究^——以求医网肿瘤板块数据为例**[J]. 数据分析与知识发现, 2019, 3(4): 22-32.
[4]	张涛, 马海群. 一种基于LDA主题模型的政策文本聚类方法研究^*[J]. 数据分析与知识发现, 2018, 2(9): 59-65.
[5]	施晓华, 卢宏涛. 基于矩阵分解学习的科学合作网络社区发现研究^*[J]. 数据分析与知识发现, 2017, 1(9): 49-56.
[6]	官琴, 邓三鸿, 王昊. 中文文本聚类常用停用词表对比研究^*[J]. 数据分析与知识发现, 2017, 1(3): 72-80.
[7]	胡晓雪. 考虑类结构变动的自适应进化聚类及其在客户细分中的应用[J]. 数据分析与知识发现, 2017, 1(12): 21-31.
[8]	龚凯乐,成颖,孙建军. 基于参与者共现分析的博文聚类研究^*[J]. 现代图书情报技术, 2016, 32(10): 50-58.
[9]	赵华茗. 分布式环境下的文本聚类研究与实现[J]. 现代图书情报技术, 2015, 31(1): 82-88.
[10]	顾晓雪, 章成志. 结合内容和标签的Web文本聚类研究[J]. 现代图书情报技术, 2014, 30(11): 45-52.
[11]	许鑫, 洪韵佳. 专题知识库中文本聚类结果的可视化研究——以中华烹饪文化知识库为例[J]. 现代图书情报技术, 2014, 30(10): 25-32.
[12]	邓三鸿,万接喜,王昊,刘喜文. 基于特征翻译和潜在语义标引的跨语言文本聚类实验分析^*[J]. 现代图书情报技术, 2014, 30(1): 28-35.
[13]	赵辉, 刘怀亮. 面向用户生成内容的短文本聚类算法研究[J]. 现代图书情报技术, 2013, 29(9): 88-92.
[14]	何文静, 何琳. 基于社会标签的文本聚类研究[J]. 现代图书情报技术, 2013, 29(7/8): 49-54.
[15]	洪韵佳, 许鑫. 基于领域本体的知识库多层次文本聚类研究——以中华烹饪文化知识库为例[J]. 现代图书情报技术, 2013, (12): 19-26.

Viewed

Full text

Abstract

Cited

Shared

Discussed