Construction of a Super Classed and Denoted Corpus
Liu Hua
(College of Chinese Language and Culture, Jinan University, Guangzhou 510610, China)
Abstract: To address the lack of training and test corpora for text classification, we have built a super-large classified and annotated corpus with rich domain information, a systematic category scheme, an extensible storage format, and structured semantic annotations. It is suited to building training and test corpora for text classification, topic identification, and information retrieval (IR).
Received: 24 October 2005
Published: 25 January 2006
Corresponding Author:
Liu Hua
E-mail: liuhua0461@sina.com
About the author: Liu Hua