[Objective] This paper tries to improve the DBSCAN algorithm and verify its feasibility and effectiveness in social tagging. [Methods] First, we analyzed the frequency of social tags for resources and their total appearances. Then, we examined the relationship between tags and resources to improve the DBSCAN clustering algorithm. Finally, we applied the new algorithm to cluster tags, and users. [Results] We ran our experiment with data from Douban Movies. The modified DBSCAN algorithm improved the inter-object and inter-cluster correlations of social taggings. [Limitations] The sample datasets need more in-depth mining. [Conclusions] The improved DBSCAN algorithm could effectively cluster social tags.
Hotho A, Jäschke R, Schmitz C, et al.Information Retrieval in Folksonomies: Search and Ranking[C]// Proceedings of the 3rd European Conference on the Semantic Web: Research and Applications. 2006: 411-426.
[2]
熊回香. 面向Web3.0的大众分类研究[D]. 武汉: 华中师范大学, 2011.
[2]
(Xiong Huixiang.Research on Folksonomy Oriented to Web3.0[D]. Wuhan: Central China Normal University, 2011.)
[3]
Hayman S.Folksonomies and Tagging: New Developments in Social Bookmarking[C]// Proceedings of the 2007 Ark Group Conference: Developing and Improving Classification Schemes. 2007.
(Su Xinning, Yang Jianlin, Jiang Niannan, et al.Data Warehouse and Data Mining[M]. Beijing: Tsinghua University Press, 2006.)
[5]
Martin P, Eklund P.Embedding Knowledge in Web Documents: CGs Versus XML-based Metadata Languages[C]// Proceedings of the 7th International Conference on Conceptual Structures: Standards and Practices. 1999: 230-246.
[6]
Razmerita L, Lytras M D.Ontology-Based User Modelling Personalization: Analyzing the Requirements of a Semantic Learning Portal[C]// Proceedings of the 1st World Summit on Knowledge Society. Springer, 2008: 354-363.
(Fang Xiaoke, Ji Chunguang.Research on the Personalized Recommendation Based on Tag Topic and Concept Space[J]. Information Studies: Theory & Application, 2015, 38(5): 105-111.)
doi: 10.16353/j.cnki.1000-7490.2015.05.021
[8]
Sood S, Owsley S, Hammond K J, et al.TagAssist: Automatic Tag Suggestion for Blog Posts[C]//Proceedings of ICWSM’ 2007, Boulder, Colorado, USA. 2007.
[9]
Zhang Z K, Liu C.A Hypergraph Model of Social Tagging Networks[J]. Journal of Statistical Mechanics: Theory and Experiment, 2010(10): P10005.
doi: 10.1088/1742-5468/2010/10/P10005
(Zhong Qingyan, Su Yidan, Liang Shengyong.Tag Recommendation Research Base on Hierarchical Clustering and Semantic[J]. Microcomputer Information, 2010, 26(12-3): 199-203.)
doi: 10.3969/j.issn.2095-6835.2010.36.080
(Liao Zhifang, Wang Chaoqun, Li Xiaoqing, et al.Tag Recommendation and New User Tag Recommendation Algorithms Based on Tensor Decomposition[J]. Journal of Chinese Computer Systems, 2013, 34(11): 2472-2476.)
doi: 10.3969/j.issn.1000-1220.2013.11.011
(Zhang Bin, Zhang Yin, Gao Kening, et al.Combining Relation and Content Analysis for Social Tagging Recommendation[J]. Journal of Software, 2012, 23(3): 476-488.)
doi: 10.3724/SP.J.1001.2012.04001
(Yi Ming, Cao Yujie, Shen Jinzhi, et al.An Approach to Web User Interest Modeling Based on Density-based Clustering Algorithm in the Social Tag System[J]. Journal of the China Society for Scientific and Technical Information, 2011, 30(1): 37-43.)
doi: 10.3772/j.issn.1000-0135.2011.01.005
[14]
Begelman G, Keller P, Smadja F.Automated Tag Clustering: Improving Search and Exploration in the Tag Space[C]// Proceedings of the Collaborative Web Tagging Workshop at WWW2006. 2006: 15-33.
(Cao Gaohui, Jiao Yuying, Cheng Quan.Research on Tag Cluster Based on Hierarchical Agglomerative Clustering Algorithm[J]. New Technology of Library and Information Service, 2008(4): 23-28.)
doi: 10.3969/j.issn.1003-3513.2008.04.005
[16]
Gemmell J, Shepitsen A, Mobasher B, et al.Personalizing Navigation in Folksonomies Using Hierarchical Tag Clustering[C]// Proceedings of the 10th International Conference on Data Warehousing and Knowledge Discovery. Springer, 2008: 196-205.
(Wang Cuiying.Study on Tag Clustering Analysis[J]. New Technology of Library and Information Service, 2008(5): 67-71.)
doi: 10.3969/j.issn.1003-3513.2008.05.012
(Li Shuangqing, Mu Shengdi.Improved DBSCAN Algorithm and Its Application[J]. Computer Engineering and Applications, 2014, 50(8): 72-76.)
doi: 10.3778/j.issn.1002-8331.1212-0093
[20]
Li P, Wang B, Jin W, et al.User-Related Tag Expansion for Web Document Clustering[C]// Proceedings of the 33rd European Conference on Information Retrieval. Springer, 2011: 19-31.
[21]
Zezula P, Amato G, Dohnal V, et al.Similarity Search: The Metric Space Approach[M]. Springer Science & Business Media, 2006.