[Objective] This research designed a new model of profiling big data users’ portraits, aiming to address the fusion issue facing qualitative and quantitative methods. [Methods] We combined qualitative and quantitative methods to design the new model, which has a user value map based on sociological and psychological theories. Then, we used the Look-alike algorithm to build a map data label system, and used the K-Means clustering algorithm to processs the data. Finally, we interpret the clustered data. [Results] We examined our model with 200 million data points, and successfully divided young users into 20 groups. The total amount of data reached 17 million with 606 labels, which are better than the survey data. [Limitations] More research is needed to extract more original data, improve the subjective control of the user value map, as well as conduct heterogeneous data profiling. [Conclusions] The proposed model is of significance for related studies.
吴文瀚. 基于定性定量融合方法的大数据用户画像模型设计与实证[J]. 数据分析与知识发现, 2022, 6(4): 108-119.
Wu Wenhan. Profiling Big Data Users with Qualitative and Quantitative Fusion Methods. Data Analysis and Knowledge Discovery, 2022, 6(4): 108-119.
Fullerton R A. The Birth of Consumer Behavior: Motivation Research in the 1940s and 1950s[J]. Journal of Historical Research in Marketing, 2013, 5(2):212-222.
doi: 10.1108/17557501311316833
[2]
Bartels R. The History of Marketing Thought[M]. The 2nd Edition. New York: Gorsuch Scarisbrick Pub, 1976: 1-33.
[3]
Deaton A, Muellbauer J. Economics and Consumer Behavior [M]. Cambridge: Cambridge University Press, 1980: 25-53.
[4]
刘江. 消费者行为研究[M]. 北京: 北京广告函授学院出版社, 1985: 1-11.
[4]
( Liu Jiang. Consumer Behavior Research[M]. Beijing: Beijing Advertising Correspondence Academy Press, 1985: 1-11.)
( Lu Taihong. 50 Years of Consumer Behavior: Evolution and Overthrow[J]. Foreign Economics & Management, 2017, 39(6):23-38.)
[7]
van Maanen J. Ethnography as Work: Some Rules of Engagement[J]. Journal of Management Studies, 2011, 48(1):218-234.
doi: 10.1111/j.1467-6486.2010.00980.x
( Wang Tian, Mei Hongchang, Zhang Wei. Analysis of Factors Influencing Consumption and Their Model Description Method[J]. Consumer Economics, 2005, 21(5):9-14.)
( Ni Hongyao. Research on the Influencing Factors of B2C E-commerce Consumers’ Repeat Purchase: Empirical Research Based on Structured Equation Model[J]. Consumer Economics, 2013, 29(3):60-64.)
[10]
Wang X, Bendle N T, Mai F, et al. The Journal of Consumer Research at 40: A Historical Analysis[J]. Journal of Consumer Research, 2015, 42(1):5-18.
doi: 10.1093/jcr/ucv009
Institute S. SIGMA, Organization for International Market Research and Consulting [EB/OL]. (2015-01-01). [2020-04-20]. http://www.sigma-online.com/en/About_SIGMA/.
[13]
Price L L, Rowntree B S. Poverty: A Study of Town Life[J]. The Economic Journal, 1902, 12(45):56.
doi: 10.2307/2957025
[14]
Wells W D, Gubar G. Life Cycle Concept in Marketing Research[J]. Journal of Marketing Research, 1966, 3(4):355-363.
doi: 10.1177/002224376600300403
[15]
Riesman D, Glazer N, Denney R, et al. The Lonely Crowd [M]. New Haven: Yale University Press, 1950: 15-30.
[16]
Rokeach M. The Role of Values in Public Opinion Research[J]. Public Opinion Quarterly, 1968, 32(4):547-559.
doi: 10.1086/267645
[17]
Cooper A, Reimann M. About Face 2.0: The Essentials of Interaction Design[M]. New Jersey: John Wiley & Sons, Inc., 2007: 223-225.
[18]
Teixeira C, Sousa P J, Arnaldo M J. User Profiles in Organizational Environments[J]. Campus-Wide Information Systems, 2008, 25(3):128-144.
doi: 10.1108/10650740810886312
( Hua Bolin, Zhao Hui. Discussion about Application on User Profile Method in the Demand Detection of Science and Technology Intelligence[J]. Information Studies: Theory & Application, 2020, 43(9):93-99.)
( Zhao Yahui, Liu Fanglin, Luo Lin. A Review of User Profile in the Context of Big Data: Knowledge System and Research Prospect[J]. Research on Library Science, 2019(24):13-24.)
( Zhou Guanghua, Xin Ying, Zhang Yajie, et al. Study on Big Data’s Applications in Medical and Health Field[J]. Chinese Journal of Health Informatics and Management, 2013, 10(4):296-300.)
[23]
赵博. 大数据在金融领域的应用研究[J]. 信息通信技术, 2018, 12(3):22-26.
[23]
( Zhao Bo. Research on the Application of Big Data in Finance Industry[J]. Information and Communications Technologies, 2018, 12(3):22-26.)
[24]
郑淑蓉. 零售业大数据:形成、应用及启示[J]. 理论探索, 2014(2):90-94.
[24]
( Zheng Shurong. Big Data in the Retail Industry: Formation, Application and Enlightenment[J]. Theoretical Exploration, 2014(2):90-94.)
( Xin Yu, Zheng Xin. Data Driven and Customer Life Cycle Theory——Analysis of the Automobile Industry as an Example[J]. Henan Social Sciences, 2014, 22(3):71-77.)
( Xu Tao, Huang Li, Li Minlei, et al. Research on Portrait Method of Residential Users Based on Multi-Dimensional Fine-Grained Behavior Data[J]. Power Demand Side Management, 2019, 21(3):47-52.)
[29]
Godoy D. Learning User Interests for User Profiling in Personal Information Agents[J]. AI Communications, 2006, 19(4):391-394.
[30]
Kim E G, Chun S H. Analyzing Online Car Reviews Using Text Mining[J]. Sustainability, 2019, 11(6):1611.
doi: 10.3390/su11061611
( Zhang Qi. Research on Charging Demand Analysis and Driving Range of Battery Electric Vehicle Based on User Characteristics[D]. Nanjing: Southeast University, 2019.)
( Wang Zhenfei. Research on the Group Profiles of Bloggers in Science Net Based on RFM Model: Case Study of Library Science,Information Science,and Archival Science[J]. Information Research, 2020, 7(11):26-33.)
( Liu Yan, Li Luqi, Hou Li. Knowledge Service System-Orientated User Portrait and Its Application[J]. Chinese Journal of Medical Library and Information Science, 2020, 29(11):16-23.)
( Zhang Aiqing. 20th Century’s Motivation Research[J]. Journal of Central China Normal University (Humanities and Social Sciences), 1999, 38(3):26-31.)
[35]
Callebaut J. The Naked Consumer Today: Or an Overview of Why Consumers Really Buy Things, and What This Means for Marketing[M]. Chicago: Garant Publishers, 2002: 5-30.
( Nitin Nishandar. TNS: Demystifying the Source of Gravity of the Eight Major Brands[J]. China Advertising, 2014(7):86-88.)
[37]
Mangalampalli A, Ratnaparkhi A, Hatch A O, et al. A Feature-Pair-Based Associative Classification Approach to Look-Alike Modeling for Conversion-Oriented User-Targeting in Tail Campaigns[C]// Proceedings of the 20th International Conference Companion on World Wide Web. 2011: 85-86.
[38]
Ma Q, Wen M S, Xia Z, et al. A Sub-Linear, Massive-Scale Look-alike Audience Extension System[C]//Proceedings of the 5th International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications at KDD 2016. 2016:51-67.
[39]
Liu Y D, Ge K K, Zhang X, et al. Real-Time Attention Based Look-Alike Model for Recommender System[C]//Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2019: 2765-2773.
( Ren Zhengdong, Zhang Junteng, Ren Dongxiao. Portrait Analysis of College Students Based on Target Group Index[J]. Journal of Heilongjiang Vocational Institute of Ecological Engineering, 2021, 34(2):113-116.)
( Ma Xin, Duan Ganglong, Wang Jianren, et al. Research on Airline Customer Clustering Based on Improved Silhouette Coefficient Method[J]. Operations Research and Management Science, 2021, 30(1):140-146.)
(Implementation of Global Strategy Great Wall Motors will Once Again Set Off at the Frankfurt Motor Show [EB/OL]. (2019-09-04). [2021-04-20]. https://www.gwm.com.cn/news_detail-16513.html.)