|
|
Mining Chinese New Word in BBS |
Lv Xueqiang Huang He Li Yuqin Shi Shuicai |
(Chinese Information Processing Research Center, Beijing Information Science andTechnology University, Beijing 100101, China) |
|
|
Abstract A simple method using statistics and rule is presented for mining Chinese new words in BBS texts automatically, in which we use such key technologies as Chinese segmentation, frequency statistics, speech pattern filter and a series of operations on word fragments. A system developed in this method can mine random context-insensitive new words in any length and in any field, of any kind.
|
Received: 13 October 2006
Published: 25 January 2007
|
|
Corresponding Authors:
Lv Xueqiang
E-mail: lv.xueqian@trs.com.cn
|
About author:: Lv Xueqiang,Huang He,Li Yuqin,Shi Shuicai |
1尚英.现代汉语新词语研究现状及趋势:[学位论文].山东:烟台师范学院,1997
2亢世勇.新词语大词典.上海:上海辞书出版社,2003
3高永伟.近20年英语国家对新词的研究.外语与外语教学,1998(11):9-11
4郑家恒,杜永萍,宋礼鹏,农业病虫害词汇获取方法初探.孙茂松,陈群秀.语言计算与基于内容的文本处理.北京:清华大学出版社,2003.61-66
5郑家恒,李文花.基于构词法的网络新词自动识别初探.山西大学学报(自然科学版),2002,25(2):115-119
6沈丽琴,施勤,柴海新.自动新词提取方法和系统[专利].中国,00126471.0,2002-03-20
7邹刚,刘洋,刘群,孟遥,于浩,亢世勇.面向Internet的中文新词语检测.中文信息学报,2004,18(6):1-9
8Chen A T.Chinese Word Segmentation Using Minimal Linguistic Knowledge: [dissertation]. University of California at Berkeley,2004 |
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|