New Technology of Library and Information Service  2009, Vol. Issue (9): 40-44    DOI: 10.11925/infotech.1003-3513.2009.09.07
An English Tag Clustering Method Based on the Porter Stemming Algorithm
Dou Yongxiang, Su Shanjia, Zhao Pengwei
(School of Economics and Management, Xidian University, Xi’an 710071, China)
Abstract  

In folksonomy systems, users add tags freely and without any controlled vocabulary, so this paper first introduces the Porter stemming algorithm to extract the roots (stems) of English tags. It then proposes a method for clustering English tags in which the clustering precision is chosen by the user. Finally, a simulation experiment using a tag cloud shows that the method clusters English tags according to the user’s requirements and describes the resource better.
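
The idea summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it applies only step 1a of the Porter stemmer (plural suffixes) rather than the full algorithm, and the `level` parameter is a hypothetical stand-in for the user-selected precision, whose actual definition is not given in the abstract. The tag names and counts are made-up sample data.

```python
from collections import defaultdict


def porter_step_1a(word):
    """Apply only step 1a of the Porter stemmer (plural suffixes).

    This is a deliberately reduced stand-in for the full algorithm.
    """
    if word.endswith("sses"):
        return word[:-2]   # caresses -> caress
    if word.endswith("ies"):
        return word[:-2]   # ponies -> poni
    if word.endswith("ss"):
        return word        # caress -> caress
    if word.endswith("s"):
        return word[:-1]   # cats -> cat
    return word


def cluster_tags(tag_counts, level=1):
    """Group tags that share a normalized key.

    level is an assumed precision knob (hypothetical):
      0 -> case folding only (finer clusters)
      1 -> case folding plus Porter step 1a (coarser clusters)
    """
    clusters = defaultdict(dict)
    for tag, count in tag_counts.items():
        key = tag.lower()
        if level >= 1:
            key = porter_step_1a(key)
        clusters[key][tag] = count
    return dict(clusters)


def cloud_weights(clusters):
    """Tag-cloud weight of a cluster = total usage count of its members."""
    return {key: sum(members.values()) for key, members in clusters.items()}


# Made-up sample data: tag -> number of times it was used.
tags = {"blog": 5, "Blogs": 3, "blogs": 2, "class": 4, "classes": 1}

coarse = cluster_tags(tags, level=1)
fine = cluster_tags(tags, level=0)

print(cloud_weights(coarse))
print(cloud_weights(fine))
```

With the coarser setting, variants such as "blog", "Blogs", and "blogs" fall into one cluster whose tag-cloud weight is the sum of their counts; with the finer setting, singular and plural forms stay separate. A production system would more likely use a complete Porter stemmer from an established library.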

Key words: Folksonomy; Tag; Cluster
Received: 22 June 2009      Published: 25 September 2009
CLC number: G250.7

Corresponding Author: Su Shanjia     E-mail: sushanjia119@yeah.net
About authors: Dou Yongxiang, Su Shanjia, Zhao Pengwei

Cite this article:

Dou Yongxiang, Su Shanjia, Zhao Pengwei. An English Tag Clustering Method Based on the Porter Stemming Algorithm. New Technology of Library and Information Service, 2009, (9): 40-44.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2009.09.07     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2009/V/I9/40

