Research on Automatic Classification of Chinese Books Based on Social Tagging
He Lin1, Wan Jian2, He Juan1, Guo Shiyun1
1. College of Information Science and Technology, Nanjing Agricultural University, Nanjing 210095, China;
2. College of Public Administration, Nanjing Agricultural University, Nanjing 210095, China
[Objective] The paper aims to improve the ability of automatic text classification of social tagging by controlling the relation and quality of social tagging. [Methods] A classification model called “core controlled, shell uncontrolled” is constructed based on the control of a concept space called Social tagging-Keyword in order to realize the regulation control of social tagging based on subject headings. [Results] The validity tests show that this new method has a better performance on the text classification based on social tagging in consideration of efficiency and the cost. [Limitations] The data used for concept space is not as much as possible due to the restriction of the Website. Also, the concept space is lack of deep semantic relations which would be richer in the future. [Conclusions] This study proposes a feasible solution for improving the quality of social tags and the capacity of automatic text classification.
何琳, 万健, 何娟, 郭诗云. 基于社会标签的中文图书自动分类研究[J]. 现代图书情报技术, 2014, 30(9): 1-7.
He Lin, Wan Jian, He Juan, Guo Shiyun. Research on Automatic Classification of Chinese Books Based on Social Tagging. New Technology of Library and Information Service, 2014, 30(9): 1-7.
[1] 曹高辉, 焦玉英, 成全. 基于凝聚式层次聚类算法的标签聚类研究[J]. 现代图书情报技术, 2008(4): 23-28. (Cao Gaohui, Jiao Yuying, Cheng Quan. Research on Tag Cluster Based on Hierarchical Agglomerative Clustering Algorithm [J]. New Technology of Library and Information Service, 2008(4): 23-28.)
[2] Begelman G, Keller P, Smadja F. Automated Tag Clustering: Improving Search and Exploration in the Tag Space[OL]. [2012-12-16]. http://www.ra.ethz.ch/cdstore/www2006/www. rawsugar.com/www2006/20.pdf.
[3] Heymann P, Garcia-Molinay H. Collaborative Creation of Communal Hierarchical Taxonomies in Social Tagging Systems[OL]. [2012-12-16]. http://ilpubs.stanford.edu:8090/ 775/1/2006-10.pdf.
[4] Christiaens S. Metadata Mechanisms: From Ontology to Folksonomy and Back [A]//On the Move to Meaningful Internet Systems 2006: OTM 2006 Workshop [C]. Berlin: Springer, 2006(4277): 199-207.
[5] 易明, 王学东, 邓卫华. 基于社会网络分析的社会化标签网络分析与个性化信息服务研究[J]. 中国图书馆学报, 2010, 36(2): 107-114. (Yi Ming, Wang Xuedong, Deng Weihua. Social Tagging Network Analysis and Personalized Information Service Based on Social Network Analysis[J]. Journal of Library Science in China, 2010, 36(2): 107-114.)
[6] 李亚婷, 马费成. 基于标签共现的社会网络分析研究[J]. 情报杂志, 2012, 31(7): 103-109. (Li Yating, Ma Feicheng. Social Network Analysis Based on Tags Co-occurrence[J]. Journal of Intelligence, 2012, 31(7): 103-109.)
[7] 任家乐, 雷若寒, 姜晓. OPAC与“美味书签”相结合的学术资源导航系统构建探索[J]. 图书馆杂志, 2010, 29(6): 21-24, 20. (Ren Jiale, Lei Ruohan, Jiang Xiao. Integrating OPAC with Delicious: A New Guidance System for Academic Resources [J]. Library Journal, 2010, 29(6): 21-24, 20.)
[8] Quintarelli E, Resmini A, Rosati L. FaceTag: Integrating Bottom-up and Top-down Classification in a Social Tagging System [OL]. [2014-02-25]. https://asis.org/Bulletin/Jun-07/ QuintarelliEtc.pdf.
[9] Munk T B, Mork K. Folksonomy: The Power Law the Significance of the Least Effort [J]. Knowledge Organization, 2007, 34(1): 16-33.
[10] Berendt B, Hanser C. Tags are Not Metadata, but “Just More Content”- to Some People [EB/OL]. [2013-12-03]. http:// www.icwsm.org/papers/paper12.html.
[11] Sun A, Suryanto M A, Liu Y. Blog Classification Using Tags: An Empirical Study [C]. In: Proceedings of the 10th International Conference on Asian Digital Libraries. Berlin, Germany: Springer, 2007: 307-316.
[12] Razikin K, Goh D H L, Chua A Y K, et al. Can Social Tags Help You Find What You Want? [C]. In: Proceedings of the 12th European Conference on Digital Libraries (ECDL 2008). Berlin: Springer, 2008: 50-61.
[13] 丛鲁丽. 基于大众分类法的中文博客分类方法[J]. 情报杂志, 2009, 28(9): 50-52. (Cong Luli. Chinese Weblog Pages Classification Based on Folksonomy [J]. Journal of Intelligence, 2009, 28(9): 50-52.)
[14] 李劲, 张华, 吴浩雄, 等. 基于社会标注质量的文本分类模型框架[J]. 计算机应用, 2012, 32(5): 1335-1339. (Li Jin, Zhang Hua, Wu Haoxiong, et al. Text Classification Model Framework Based on Social Annotation Quality [J]. Journal of Computer Applications, 2012, 32(5): 1335-1339.)
[15] 马张华, 侯汉清. 文献分类法主题法导论[M]. 北京: 北京图书馆出版社, 1999: 153-155. (Ma Zhanghua, Hou Hanqing. Introduction to Literature Classification Act Themes [M]. Beijing: Beijing Library Press, 1999: 153-155.)
[16] Sahon G. Mathematics and Information Retrieval [J]. Journal of Documentation, 1979, 35(1): 1-29.