|
|
Research on Recognition of Sudden Events on Web Based on Combination of Rules and Statistical Method |
Xia Yan, He Lin, Pan Yunlai, Ouyang Chenchen |
College of Information and Technology, Nanjing Agricultural University, Nanjing 210095,China |
|
|
Abstract The paper focuses on a large number of news corpus, pretreats the titles and abstracts of training documents, then builds up the feature vector library. At last, it uses matching method of decision table rules and vector space method to identificate the articles in two ways, and makes better service of the sudden events recognition on Web.
|
Received: 16 August 2010
Published: 04 January 2011
|
|
[1] 张广渊,李晶皎,王爱侠.基于知识的满文识别后处理 [J]. 计算机辅助工程 ,2006,15(3):69-71.
[2] 徐文海, 温有奎.一种基于TFIDF方法的中文关键词抽取算法 [J]. 情报理论与实践 ,2008,31(2):298-302.
[3] 张庆国,章成志,薛德军,等.适用于隐含主题抽取的K最近邻关键词自动抽取 [J]. 情报学报 ,2009,28(2):163-168.
[4] 张虹.基于自动文本分类的关键词抽取算法 [J]. 计算机工程 ,2009,35(12):145-147.
[5] Hkkinena J, Suontaustab J, Riisc S, et al. Assessing Text-to-phoneme Mapping Strategies in Speaker Independent Isolated Word Recognition [J]. Speech Communication, 2003, 41(2-3):455-467 .
[6] 程岚岚,何丕廉,孙越恒.基于朴素贝叶斯模型的中文关键词提取算法研究 [J]. 计算机应用 ,2005,25(12):2780-2782.
[7] 张爱华,荆继武,向继.中文文本分类中的文本表示因素比较 [J]. 中国科学院研究生院学报 ,2009,26(3):400-407.
[8] 李渝勤,孙丽华.基于规则的自动分类在文本分类中的应用 [J]. 中文信息学报 ,2004,18(4):9-14.
[9] 章成志,白振田.文本自动标引与自动分类研究 [M].南京:东南大学出版社,2009:151.
[10] 徐波,孙茂松,靳光瑾.中文信息处理若干重要问题 [M].北京:科学出版社,2003:14-26.
[11] 钟义信.全信息自然语言理解方法论 [J]. 北京邮电大学学报 ,2004,27(4):1-12.
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|