1School of Informatics, Guangdong University of Foreign Studies, Guangzhou 510006, China 2Guangdong University of Foreign Studies Library, Guangzhou 510420, China 3S.F.EXPRESS Co. Ltd., Shenzhen 518000, China
[Objective] This study tried to describe the customers’ characteristics effectively. [Methods] The proposed framework aimed to explore the personal and social relationship among the customers and their friends on the microblog platform. We described the customers’ characteristics using self-defined tags and then created segmentation with the help of text clustering and non-negative matrix factorization technologies. [Results] The method based on non-negative matrix factorization achieved an approximately 86.130% on average asw index, which outperformed traditional methods based on K-means and hierarchical clustering. [Limitations] The customers’ characteristic cannot be described only by himself and his friends with self-defined tags on Microblogging. [Conclusions] The proposed framework could improve the effectiveness of characteristics description, evaluation and visualization of microblog customer segmentation.
陈东沂,周子程,蒋盛益,王连喜,吴佳林. 面向企业微博的客户细分框架*[J]. 现代图书情报技术, 2016, 32(2): 43-51.
Chen Dongyi,Zhou Zicheng,Jiang Shengyi,Wang Lianxi,Wu Jialin. A Framework for Customer Segmentation on Enterprises’ Microblog. New Technology of Library and Information Service, 2016, 32(2): 43-51.
Pang G S, Jiang S Y, Chen D Y.A Simple Integration of Social Relationship and Text Data for Identifying Potential Customers in Microblogging [A]. //Advanced Data Mining and Applications[M]. Springer Berlin Heidelberg, 2013: 397-409.
[2]
Hennig-Thurau T, Malthouse E C, Friege C, et al.The Impact of New Media on Customer Relationships[J]. Journal of Service Research, 2010, 13(3): 311-330.
[3]
Stelzner M A. Social Media Marketing Industry Report [EB/OL]. [2016-06-15]. .
[4]
Rajagopal S.Customer Data Clustering Using Data Mining Technique[J]. International Journal of Database Management Systems, 2011, 3(4): 1-11.
[5]
Lefait G, Kechadi T.Customer Segmentation Architecture Based on Clustering Techniques [C]. In: Proceedings of the 4th International Conference on Digital Society. IEEE, 2010: 243-248.
[6]
Wu J, Lin Z.Research on Customer Segmentation Model by Clustering [C]. In: Proceedings of the 7th International Conference on Electronic Commerce. ACM, 2005: 316-318.
[7]
Pennacchiotti M, Popescu A M.Democrats, Republicans and Starbucks Afficionados: User Classification in Twitter [C]. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2011: 430-438.
[8]
Tinati R, Carr L, Hall W, et al.Identifying Communicator Roles in Twitter[C]. In: Proceedings of the 21st International Conference Companion on World Wide Web. ACM, 2012: 1161-1168.
[9]
Fink C, Kopecky J, Morawskib M.Inferring Gender from the Content of Tweets: A Region Specific Example [C]. In: Proceedings of the 6th International AAAI Conference on Weblogs and Social Media, Dublin, Ireland. AAAI, 2012: 459-462.
[10]
Steinbach M, Karypis G, Kumar V.A Comparison of Document Clustering Techniques [C]. In: Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2000: 1-20.
[11]
Jain A K, Murty M N, Flynn P J.Data Clustering: A Review[J]. ACM Computing Surveys, 1999, 31(3): 264-323.
[12]
Willett P.Recent Trends in Hierarchic Document Clustering: A Critical Review[J]. Information Processing and Management, 1988, 24(5): 577-597.
[13]
Rao D, Yarowsky D, Shreevats A, et al.Classifying Latent User Attributes in Twitter [C]. In: Proceedings of the 2nd International Workshop on Search and Mining User-generated Contents. ACM, 2010: 37-44.
[14]
Lee D D, Seung H S.Learning the Parts of Objects by Non-negative Matrix Factorization[J]. Nature, 1999, 401(6755): 788-791.
[15]
Shahnaz F, Berry M W, Pauca V P, et al.Document Clustering Using Nonnegative Matrix Factorization[J]. Information Processing & Management, 2006, 42(2): 373-386.
[16]
Wang X, Tang J, Liu H.Document Clustering via Matrix Representation [C]. In: Proceedings of the 11th International Conference on Data Mining. IEEE, 2011: 804-813.
[17]
Gautam B P, Shrestha D.Document Clustering Through Non-Negative Matrix Factorization: A Case Study of Hadoop for Computational Time Reduction of Large Scale[J]. 稚内北星学園大学紀要, 2010, 10(3): 15-25.
(Zhang Lei, Feng Xiaosen, Xiang Xuezhi.Topic Classification of Chinese Document Based on NMF[J]. Computer Engineering, 2009, 35(13): 26-27.)
[20]
Calinski T, Harabasz J.A Dendrite Method for Cluster Analysis[J]. Communications in Statistics, 1974, 3(1): 1-27.
[21]
Rousseeuw P J.Silhouettes: A Graphical Aid to the Interpretation and Validation of Cluster Analysis[J]. Journal of Computational and Applied Mathematics, 1987, 20(1): 53-65.
[22]
Brunet J P, Tamayo P, Golub T, et al.Metagenes and Molecular Pattern Discovery Using Matrix Factorization[J]. Proceedings of the National Academy of Sciences (PNAS), 2004, 101(12): 4164-4169.