Identifying the Topic of Web Information in Web Information Gathering
Shao Xiaoliang, Liu Hong
(The Network Center of the Second Military Medical University, Shanghai 200433, China)
Abstract: This paper introduces a core component of the Web topic information gathering system that we designed: identifying the topic of Web information. The algorithm starts by constructing a professional topic dictionary and takes full account of the characteristics of Web page text. It greatly improves the efficiency and accuracy of the system, and it is also applicable to other topic fields.
Received: 16 April 2004
Published: 25 October 2004
Corresponding Author:
Shao Xiaoliang
E-mail: xlshao@smmu.edu.cn
About authors: Shao Xiaoliang, Liu Hong