Please wait a minute...
Advanced Search
现代图书情报技术  2010, Vol. 26 Issue (3): 47-51     https://doi.org/10.11925/infotech.1003-3513.2010.03.08
  知识组织与知识管理 本期目录 | 过刊浏览 | 高级检索 |
基于PLSA的大众标注资源主题挖掘*
王嵩   代逸生  李保珍
(江苏科技大学经济管理学院    镇江 212003)
Explore Network Resource Topics from Social Annotations System Based on PLSA
Wang Song   Dai Yisheng   Li Baozhen
(Economic & Management School, Jiangsu University of Science and Technology, Zhenjiang 212003,China)
全文: PDF (498 KB)   HTML  
输出: BibTeX | EndNote (RIS)      
摘要 

针对大众标注中用户生成标签的随意性以及无规则性所导致的标签混乱问题,引入潜在语义索引分析PLSA算法,得到特定资源主题下的标签集,为网络信息组织及用户获取提供有效的途径。通过抽取Delicious网站中的用户标注信息,证实PLSA方法对于特定资源的主题特征具有比较好的效果。

服务
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章
王嵩
代逸生
李保珍
关键词 大众标注概率潜在语义分析语义标签资源主题    
Abstract

Due to the random of mass tagging user-generated tags and non-regularity of confusion arising from the label, this paper introduces the Probabilistic Latent Semantic Analysis (PLSA) algorithm for latent semantic indexing analysis,gets the label set of specific resources under the theme and provides an effective approach for the network information organization and the user’s access. By taking the user annotation information through Delicious site,the paper substantiates that the PLSA approach can achieve a good result for the subjects of particular resources.

Key wordsSocial tagging    Probabilistic latent semantic analysis    Semantic tags    Resource topics
收稿日期: 2010-02-01      出版日期: 2010-03-25
: 

G232

 
基金资助:

*本文系教育部人文社会科学研究项目“Web2.0环境下基于大众标注的网络民意跟踪与鉴别模式”(项目编号: 09YJC870010) 和江苏省教育厅高校哲学社科基金项目“基于大众标注的网络舆情监测”(项目编号:09SJB860002)的研究成果之一。

通讯作者: 王嵩     E-mail: wshappy888@126.com
作者简介: 王嵩,代逸生,李保珍
引用本文:   
王嵩,代逸生,李保珍. 基于PLSA的大众标注资源主题挖掘*[J]. 现代图书情报技术, 2010, 26(3): 47-51.
Wang Song,Dai Yisheng,Li Baozhen. Explore Network Resource Topics from Social Annotations System Based on PLSA. New Technology of Library and Information Service, 2010, 26(3): 47-51.
链接本文:  
https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.1003-3513.2010.03.08      或      https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2010/V26/I3/47

[1] 刘延章.近五年来网络信息分类组织研究的现状及其展望[J].情报学报,2004,23(2): 173-178.
[2] Smith G.Folksonomy:Social Classification[EB/OL].[2006-11-02].http://atomiq.org/archives/2004/08/folksonomy social classification.html.
[3] Mathes A.Folksonomies-Cooperative Classification and Communication Through Shared Metadata[EB/OL].[2006-11-02].http://www.adammathes.com/academic/computer-mediated-communication/folksonomies.html.
[4] Folksonomies?How about Metadata Ecologies[EB/OL].[2006-11-02]. http://www.louisrosenfeld.com/home/bloug_archive/000330.html.
[5] Folksonomy Explanations[EB/OL].(2005-01-18).[2006-11-02].http://www.vanderwal.net/random/entrysel.php?blog=1622.
[6] Quintarelli E.Folksonomies:Power to the People[C/OL].In:Proceedings of ISKO Italy-UniMIB Meeting,Milan.2005.(2005-01-24).[2006-11-02].http://www.iskoi.org/doc/folksonomies.htm.
[7] Lin X, Beaudoin J E, Bui Y, et al. Exploring Characteristics of Social Classification[EB/OL].[2008-04-24]. http://www.slais.ubc.ca/users/sigcr/sigcr-06lin.pdf.
[8] Kipp M E. Exploring the Context of User, Creator and Intermediate Tagging[EB/OL].[2008-04-24]. http://www.iasummit.org/2006/files/109_Presentation_Desc.pdf.
[9]Brooks C H,Montanez N. Improved Annotation of the Blogosphere via Autotagging and Hierarchical Clustering[C]. In: Proceedings of the 15th International Conference on World Wide Web.2006:625-632.
[10] Sun A, Suryanto M A, Liu Y. Blog Classification Using Tags: An Empirical Study[C]. In: Proceedings of the 10th International Conference on Asian Digital Librarie.2007:307-316.
[11]Razikin K, Goh D H, Cheong E K C, et al. The Efficacy of Tags in Social Tagging Systems[C]. In: Proceedings of the 10th International Conference on Asian Digital Libraries.2007:506-507.
[12] Deerwester S,Dumas S T, Furnas G W, et al. Indexing by Latent Semantic Analysis[C]. In: Proceedings of American Society for Information Scienc.1990:391-407.
[13] Hofmann T. Probabilistic Latent Semantic Indexing[C].In:Proceedings of the 22nd International Conference on Research and Development in Information Retrieval. 1999: 50-57.

 

[1] 黄红霞, 章成志. 中文微博用户标签的调查分析——以新浪微博为例[J]. 现代图书情报技术, 2012, (10): 49-54.
[2] 何继媛, 窦永香, 刘东苏. 大众标注系统中基于本体的语义检索研究综述[J]. 现代图书情报技术, 2011, 27(3): 51-56.
[3] 窦永香,苏山佳,杨美姣,陈丽霞. 基于大众标注的P2P语义检索系统模型研究*[J]. 现代图书情报技术, 2010, 26(2): 50-55.
[4] 蒋翠清, 张玉, 丁勇. 基于PLSA的大众标注潜在语义发现[J]. 现代图书情报技术, 2010, 26(10): 28-32.
[5] 窦永香,苏山佳,赵捧未. 基于Porter算法的英文标签聚类方法研究*[J]. 现代图书情报技术, 2009, (9): 40-44.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
版权所有 © 2015 《数据分析与知识发现》编辑部
地址:北京市海淀区中关村北四环西路33号 邮编:100190
电话/传真:(010)82626611-6626,82624938
E-mail:jishu@mail.las.ac.cn