Please wait a minute...
Data Analysis and Knowledge Discovery  2023, Vol. 7 Issue (10): 25-36    DOI: 10.11925/infotech.2096-3467.2022.0795
Current Issue | Archive | Adv Search |
A Decentralized Classification Algorithm for Online Consumers Based on Improved LPA
Liu Zhu1,Qian Xiaodong2()
1School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
2School of Economics & Management, Lanzhou Jiaotong University, Lanzhou 730070, China
Download: PDF (1276 KB)   HTML ( 14
Export: BibTeX | EndNote (RIS)      
Abstract  Abstract: [Objective] This paper proposes a classification algorithm based on the improved LPA model,aiming to improve the label propagation distance and node similarity judgment in decentralized e-commerce network consumer classification. [Methods] Firstly, we introduced the cosine similarity formula to measure the similarity of nodes and constructed a similarity adjacency matrix. These steps improved the measurement of node distance according to their shared relationship in the LPA algorithm. We also introduced the principle of a back lookup table to conform to the characteristics of locality and reduce the time complexity. Secondly, we selected the initial center point with the degree centrality index and used the clustering coefficient index to update the label rules. We proposed the label propagation distance optimization formula to make the LPA algorithm meet the locality requirements. [Results] The category structure modularity Q of the improved LPA algorithm was 0.054 and 0.145 higher than the traditional LPA algorithm in the network with two neighbor similarity thresholds. The modular Q value increased up to 0.092 on data of different scales. [Limitations] The paper needs to set two parameters and use the principle of the back lookup table. The relationship between time complexity and network size is square. [Conclusions] The improved LPA can more effectively limit label propagation, which creates higher intra-categories node similarity and lower inter-categories node similarity. It is suitable for analyzing decentralized e-commerce consumer networks.
Key wordsDecentralization      E-Commerce      Consumer Network      Locality      Tag Propagation      Classification     
Received: 31 July 2022      Published: 22 March 2023
ZTFLH:  TP393  
  G350  
Fund:National Natural Science Foundation of China(71461017)
Corresponding Authors: Qian Xiaodong,ORCID:0000-0001-6425-7559,E-mail:qianxd@mail.lzjtu.cn。   

Cite this article:

Liu Zhu, Qian Xiaodong. A Decentralized Classification Algorithm for Online Consumers Based on Improved LPA. Data Analysis and Knowledge Discovery, 2023, 7(10): 25-36.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.2096-3467.2022.0795     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2023/V7/I10/25

ρ and λ Relation Diagram
">
ρ and λ Relation Diagram
“Barbell Network” Topology
Schematic Diagram of Dividing “Barbell Network” Category by This Algorithm
Schematic Diagram of LPA Dividing “Barbell Network” Category
ψ = 0.95)
">
Network Topology( ψ = 0.95)
ψ = 0.90)
">
Network Topology ( ψ = 0.90)
算法 Q
ψ = 0.95 ψ = 0.90
LPA 0.629 0.478
本文算法 0.683 0.623
Test Results of Real Network Q Value
Network Modularity of Different Scales
Q Contribution Value Analysis( ψ = 0.95
">
Q Contribution Value Analysis( ψ = 0.95
Q Contribution Value Analysis( ψ = 0.90
">
Q Contribution Value Analysis( ψ = 0.90
Convergence Analysis of Experimental Results
类别序号 类型
1 Bread
2 Instant Noodle
3 Dehydrated Beef
4 Spicy Strip
5 Cracker
6 Instant Noodle、Dehydrated Beef、Spicy Strip
7 Bread、Cracker
8 Bread、Instant Noodle
Types of Consumer Groups
[1] Dibb S. Market Segmentation: Strategies for Success[J]. Marketing Intelligence & Planning, 1998, 16(7): 394-406.
[2] Heilman C M, Bowman D. Segmenting Consumers Using Multiple-Category Purchase Data[J]. International Journal of Research in Marketing, 2002, 19(3): 225-252.
doi: 10.1016/S0167-8116(02)00077-0
[3] 曾小青, 徐秦, 张丹, 等. 基于消费数据挖掘的多指标客户细分新方法[J]. 计算机应用研究, 2013, 30(10): 2944-2947.
[3] (Zeng Xiaoqing, Xu Qin, Zhang Dan, et al. New Multi-Indicator Customer Segmentation Method Based on Consuming Data Mining[J]. Application Research of Computers, 2013, 30(10): 2944-2947.)
[4] 胡凤英, 周正龙. 考虑社交关系的网购拼单研究[J]. 中国管理科学, 2021, 29(11): 191-202.
[4] (Hu Fengying, Zhou Zhenglong. Online Interpersonal Group-Buying Based on Social Relationship[J]. Chinese Journal of Management Science, 2021, 29(11): 191-202.)
[5] Jarukasemratana S, Murata T, Liu X. Community Detection Algorithm Based on Centrality and Node Distance in Scale-Free Networks[C]// Proceedings of the 24th ACM Conference on Hypertext and Social Media. 2013: 258-262.
[6] Shang C X, Feng S Z, Zhao Z Y, et al. Efficiently Detecting Overlapping Communities Through Seeding and Semi-Supervised Learning[J]. International Journal of Machine Learning and Cybernetics, 2017, 8(2): 455-468.
doi: 10.1007/s13042-015-0338-5
[7] Huang J B, Sun H L, Liu Y G, et al. Towards Online Multiresolution Community Detection in Large-scale Networks[J]. PLoS One, 2011, 6(8): e23829.
[8] 吴建, 王梓权, 易亿, 等. 基于图遍历的局部社区发现算法[J]. 计算机应用研究, 2019, 36(9): 2636-2638, 2670.
[8] (Wu Jian, Wang Ziquan, Yi Yi, et al. Local Community Detection Algorithm Based on Graph Traversal[J]. Application Research of Computers, 2019, 36(9): 2636-2638, 2670.)
[9] Raghavan U N, Albert R, Kumara S. Near Linear Time Algorithm to Detect Community Structures in Large-scale Networks[J]. Physical Review E, Statistical, Nonlinear and Soft Matter Physics, 2007, 76(3 Pt 2): 036106.
[10] Leung I X Y, Hui P, Liò P, et al. Towards Real-Time Community Detection in Large Networks[J]. Physical Review E, Statistical, Nonlinear, and Soft Matter Physics, 2009, 79(6 Pt 2): 066107.
[11] 张蕾, 钱峰, 赵姝, 等. 基于边权的稳定标签传播社区发现算法[J]. 小型微型计算机系统, 2019, 40(2): 314-319.
[11] (Zhang Lei, Qian Feng, Zhao Shu, et al. Stable Label Propagation Algorithm Based on Edge Weights for Community Detection[J]. Journal of Chinese Computer Systems, 2019, 40(2): 314-319.)
[12] 刘扬, 郑文萍, 张川, 等. 一种基于局部随机游走的标签传播算法[J]. 计算机科学, 2022, 49(10): 103-110.
doi: 10.11896/jsjkx.220400145
[12] (Liu Yang, Zheng Wenping, Zhang Chuan, et al. Local Random Walk Based Label Propagation Algorithm[J]. Computer Science, 2022, 49(10): 103-110.)
doi: 10.11896/jsjkx.220400145
[13] Gregory S. Finding Overlapping Communities in Networks by Label Propagation[J]. New Journal of Physics, 2010, 12(10): 103018.
doi: 10.1088/1367-2630/12/10/103018
[14] Attal J P, Malek M, Zolghadri M. Overlapping Community Detection Using Core Label Propagation Algorithm and Belonging Functions[J]. Applied Intelligence, 2021, 51(11): 8067-8087.
doi: 10.1007/s10489-021-02250-4
[15] 梁世娇, 柴争义. 基于节点亲密度的标签传播重叠社区发现算法[J]. 云南大学学报(自然科学版), 2020, 42(1): 58-65.
[15] Liang Shijiao, Chai Zhengyi. Label Propagation Overlapping Community Detection Algorithm Based on Node Intimacy[J]. Journal of Yunnan University(Natural Sciences Edition), 2020, 42(01):58-65.)
[16] Barber M J, Clark J W. Detecting Network Communities by Propagating Labels under Constraints[J]. Physical Review E, Statistical, Nonlinear, and Soft Matter Physics, 2009, 80(2 Pt 2): 026129.
[17] Šubelj L, Bajec M. Robust Network Community Detection Using Balanced Propagation[J]. The European Physical Journal B, 2011, 81(3): 353-362.
doi: 10.1140/epjb/e2011-10979-2
[18] Zhou K, Martin A, Pan Q, et al. Evidential Label Propagation Algorithm for Graphs[C]// Proceedings of the 19th International Conference on Information Fusion. IEEE, 2016: 1316-1323.
[19] Palla G, Derényi I, Farkas I, et al. Uncovering the Overlapping Community Structure of Complex Networks in Nature and Society[J]. Nature, 2005, 435(7043): 814-818.
doi: 10.1038/nature03607
[20] Clauset A. Finding Local Community Structure in Networks[J]. Physical Review E, Statistical, Nonlinear, and Soft Matter Physics, 2005, 72(2 Pt 2): 026132.
[21] Aitzhan N Z, Svetinovic D. Security and Privacy in Decentralized Energy Trading Through Multi-Signatures, Blockchain and Anonymous Messaging Streams[J]. IEEE Transactions on Dependable and Secure Computing, 2018, 15(5): 840-852.
doi: 10.1109/TDSC.8858
[22] 范吉立, 何蒲, 李晓华, 等. 基于区块链的去中心化物品共享交易服务系统[J]. 计算机应用, 2019, 39(5): 1330-1335.
doi: 10.11772/j.issn.1001-9081.2018112512
[22] (Fan Jili, He Pu, Li Xiaohua, et al. Blockchain Based Decentralized Item Sharing and Transaction Service System[J]. Journal of Computer Applications, 2019, 39(5): 1330-1335.)
doi: 10.11772/j.issn.1001-9081.2018112512
[23] Ge C P, Liu Z, Fang L M. A Blockchain Based Decentralized Data Security Mechanism for the Internet of Things[J]. Journal of Parallel and Distributed Computing, 2020, 141: 1-9.
doi: 10.1016/j.jpdc.2020.03.005
[24] 徐志明, 李栋, 刘挺, 等. 微博用户的相似性度量及其应用[J]. 计算机学报, 2014, 37(1): 207-218.
[24] (Xu Zhiming, Li Dong, Liu Ting, et al. Measuring Similarity between Microblog Users and its Application[J]. Chinese Journal of Computers, 2014, 37(1): 207-218.)
[25] 戴维·诺克, 杨松. 社会网络分析[M]. 李兰, 译. 2版. 上海: 格致出版社, 上海人民出版社, 2012: 103-104.
[25] (David Knock, Yang Song. Social Network Analysis[M]. Translated by Li Lan. The 2nd Edition. Shanghai: Truth & Wisdom Press, Shanghai People’s Publishing House, 2012: 103-104.)
[26] Newman M E J. Modularity and Community Structure in Networks[J]. Proceedings of the National Academy of Sciences of the United States of America, 2006, 103(23): 8577-8582.
doi: 10.1073/pnas.0601602103 pmid: 16723398
[1] Xu Chen, Zhang Wei. Detecting Crowdfunding Frauds Based on Textual and Imbalanced Data[J]. 数据分析与知识发现, 2023, 7(9): 125-135.
[2] Cheng Quan, Dong Jia. Hierarchical Multi-label Classification of Children's Literature for Graded Reading[J]. 数据分析与知识发现, 2023, 7(7): 156-169.
[3] Xu Guixian, Zhang Zixin, Yu Shaona, Dong Yushuang, Tian Yuan. Tibetan News Text Classification Based on Graph Convolutional Networks[J]. 数据分析与知识发现, 2023, 7(6): 73-85.
[4] Ye Guanghui, Li Songye, Song Xiaoying. Text Classification Method for Urban Portrait Based on Multi-Label Annotation Learning[J]. 数据分析与知识发现, 2023, 7(5): 60-70.
[5] Lv Qi, Shangguan Yanhong, Zhang Lin, Huang Ying. Interdisciplinary Measurement Based on Automatic Classification of Text Content[J]. 数据分析与知识发现, 2023, 7(4): 56-67.
[6] Deng Na, He Xinyang, Chen Weijie, Chen Xu. MPMFC: A Traditional Chinese Medicine Patent Classification Model Integrating Network Neighborhood Structural Features and Patent Semantic Features[J]. 数据分析与知识发现, 2023, 7(4): 145-158.
[7] Wang Weijun, Ning Zhiyuan, Du Yi, Zhou Yuanchun. Identifying Interdisciplinary Sci-Tech Literature Based on Multi-Label Classification[J]. 数据分析与知识发现, 2023, 7(1): 102-112.
[8] Wang jinzheng, Yang Ying, Yu Bengong. Classifying Customer Complaints Based on Multi-head Co-attention Mechanism[J]. 数据分析与知识发现, 2023, 7(1): 128-137.
[9] Cao Zhe, Guo Huilan, Wu Jiang, Hu Zhongyi. The Ideal and Reality of Metaverse: User Perception of VR Products Based on Review Mining[J]. 数据分析与知识发现, 2023, 7(1): 49-62.
[10] Zhang Zhipeng, Mao Yusheng, Zhang Liyi. Classifying Reasons of Hotel Reviews with Domain ERNIE and BiLSTM Model[J]. 数据分析与知识发现, 2022, 6(9): 65-76.
[11] Ye Han,Sun Haichun,Li Xin,Jiao Kainan. Classification Model for Long Texts with Attention Mechanism and Sentence Vector Compression[J]. 数据分析与知识发现, 2022, 6(6): 84-94.
[12] Tu Zhenchao, Ma Jing. Item Categorization Algorithm Based on Improved Text Representation[J]. 数据分析与知识发现, 2022, 6(5): 34-43.
[13] Feng Yong, Xu Wentao, Wang Rongbing, Xu Hongyan, Zhang Yonggang. User Community Partition Based on Multi-layer Information Fusion in E-commerce Heterogeneous Network[J]. 数据分析与知识发现, 2022, 6(5): 89-98.
[14] Chen Guo, Ye Chao. News Classification with Semi-Supervised and Active Learning[J]. 数据分析与知识发现, 2022, 6(4): 28-38.
[15] Xiao Yuejun, Li Honglian, Zhang Le, Lv Xueqiang, You Xindong. Classifying Chinese Patent Texts with Feature Fusion[J]. 数据分析与知识发现, 2022, 6(4): 49-59.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn