The basic theory and its features about Latent Semantic Indexing(LSI) are analyzed.For the three factors of LSI, the word selection,dimension simplification, words weighting have been engaged and improved. Scientific and technical literatures from computing are used as testing documents, also the improved weight algorithm and the retrieval results about two LSI systems are analyzed. The experimental results show that the feature choice and retrieval results are superior improved and hard performance with the new weight algorithm.
李媛媛,马永强. 基于潜在语义索引的特征选择与权重改进若干关键问题的研究与实现[J]. 现代图书情报技术, 2007, 2(10): 80-84.
Li Yuanyuan,Ma Yongqiang. Research and Implementation of Several Key Problems in Feature Choice and Weight Improvement Based on Latent Semantic Indexing. New Technology of Library and Information Service, 2007, 2(10): 80-84.
[1] Gao J, Zhang J. Clustered SVD Strategies in Latent Semantic Indexing[J]. Information Processing & Management, 2005, 41(3): 1051-1063.
[2] Zha H Y, Marques O, Simon H. A Subspace - based Model for Information Retrieval with Applications in Latent Semantic Indexing[R]. U. K. :CSE Tech Report CSE - 98 - 002 ,1998.
[3] Papadimitriou C H, Raghavan P, Tamaki H, et al. Latent Semantic Indexing :A Probabilistic Analysis[C]. In : Proceedings of PODS'98[C].Washington:Seattle,1998:159-168.
[4] 陈越, 郭力. 隐含语义检索及其应用[J]. 现代图书情报技术.2001 (6) : 27-29.
[5] 盖杰,王怡,武港山. 基于潜在语义分析的信息检索[J]. 计算机工程.2004, 30(6):58-60.
[6] 韩客松,王永成.一种用于主题提取的非线性加权方法[J].情报学报.2000,19(6):650-653.
[7] 郑家恒,卢娇丽.关键词抽取方法的研究[J].计算机工程.2005,31(18):194-196.