Please wait a minute...
New Technology of Library and Information Service  2007, Vol. 2 Issue (1): 37-39    DOI: 10.11925/infotech.1003-3513.2007.01.09
Current Issue | Archive | Adv Search |
Mining Chinese New Word in BBS
Lv Xueqiang   Huang He   Li Yuqin   Shi Shuicai
(Chinese Information Processing Research Center, Beijing Information Science andTechnology University, Beijing  100101, China)
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  

A simple method using statistics and rule is presented for mining Chinese new words in BBS texts automatically, in which we use such key technologies as Chinese segmentation, frequency statistics, speech pattern filter and a series of operations on word fragments. A system developed in this method can mine random context-insensitive new words in any length and in any field, of any kind.

Key wordsAuto-mining      New word      Statistics      Rule     
Received: 13 October 2006      Published: 25 January 2007
: 

TP391

 
Corresponding Authors: Lv Xueqiang     E-mail: lv.xueqian@trs.com.cn
About author:: Lv Xueqiang,Huang He,Li Yuqin,Shi Shuicai

Cite this article:

Lv Xueqiang,Huang He,Li Yuqin,Shi Shuicai . Mining Chinese New Word in BBS. New Technology of Library and Information Service, 2007, 2(1): 37-39.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2007.01.09     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2007/V2/I1/37

1尚英.现代汉语新词语研究现状及趋势:[学位论文].山东:烟台师范学院,1997
2亢世勇.新词语大词典.上海:上海辞书出版社,2003
3高永伟.近20年英语国家对新词的研究.外语与外语教学,1998(11):9-11
4郑家恒,杜永萍,宋礼鹏,农业病虫害词汇获取方法初探.孙茂松,陈群秀.语言计算与基于内容的文本处理.北京:清华大学出版社,2003.61-66
5郑家恒,李文花.基于构词法的网络新词自动识别初探.山西大学学报(自然科学版),2002,25(2):115-119
6沈丽琴,施勤,柴海新.自动新词提取方法和系统[专利].中国,00126471.0,2002-03-20
7邹刚,刘洋,刘群,孟遥,于浩,亢世勇.面向Internet的中文新词语检测.中文信息学报,2004,18(6):1-9
8Chen A T.Chinese Word Segmentation Using Minimal Linguistic Knowledge: [dissertation]. University of California at Berkeley,2004

[1] Li Tiejun,Yan Duanwu,Yang Xiongfei. Recommending Microblogs Based on Emotion-Weighted Association Rules[J]. 数据分析与知识发现, 2020, 4(4): 27-33.
[2] Wei Wei,Guo Chonghui,Xing Xiaoyu. Annotating Knowledge Points & Recommending Questions Based on Semantic Association Rules[J]. 数据分析与知识发现, 2020, 4(2/3): 182-191.
[3] Mingxuan Huang,Shoudong Lu,Hui Xu. Cross-Language Information Retrieval Based on Weighted Association Patterns and Rule Consequent Expansion[J]. 数据分析与知识发现, 2019, 3(9): 77-87.
[4] Xianlai Chen,Chaopeng Han,Ying An,Li Liu,Zhongmin Li,Rong Yang. Extracting New Words with Mutual Information and Logistic Regression[J]. 数据分析与知识发现, 2019, 3(8): 105-113.
[5] Shaohua Qiang,Yunlu Luo,Yupeng Li,Peng Wu. Ontology Reasoning for Financial Affairs with RBR and CBR[J]. 数据分析与知识发现, 2019, 3(8): 94-104.
[6] Yong Zhang,Shuqing Li,Yongshang Cheng. Mining Algorithm for Weighted Association Rules Based on Frequency Effective Length[J]. 数据分析与知识发现, 2019, 3(7): 85-93.
[7] Ru Li,Rui Li,Jie Jiang,Huayi Wu. Spatio-Temporal Characteristics of WMTS Access Sessions[J]. 数据分析与知识发现, 2019, 3(6): 1-11.
[8] Zhanglu Tan,Zhaogang Wang,Han Hu. Study on a Method of Feature Classification Selection Based on χ2 Statistics[J]. 数据分析与知识发现, 2019, 3(2): 72-78.
[9] Qiang Lu,Zhenfang Zhu,Fuyong Xu,Qiangqiang Guo. Chinese Sentiment Classification Method with Bi-LSTM and Grammar Rules[J]. 数据分析与知识发现, 2019, 3(11): 99-107.
[10] He Yue,Feng Yue,Zhao Shupeng,Ma Yufeng. Recommending Contents Based on Zhihu Q&A Community: Case Study of Logistics Topics[J]. 数据分析与知识发现, 2018, 2(9): 42-49.
[11] He Yue,Wang Aixin,Feng Yue,Wang Li. Optimizing Layouts of Outpatient Pharmacy Based on Association Rules[J]. 数据分析与知识发现, 2018, 2(1): 99-108.
[12] Wei Xing,Hu Dehua,Yi Minhan,Zhu Qizhen,Zhu Wenjie. Extracting Disease-Gene-Drug Correlations Based on Data Cube[J]. 数据分析与知识发现, 2017, 1(10): 94-104.
[13] Huang Mingxuan. Cross Language Information Retrieval Model Based on Matrix-weighted Association Patterns Mining[J]. 数据分析与知识发现, 2017, 1(1): 26-36.
[14] Li Xiaoying,Xia Guanghui,Li Danya. Finding Semantic Relations Among Subject Indexed Papers[J]. 现代图书情报技术, 2016, 32(7-8): 87-93.
[15] Ma Tianyi,Zhang Pengzhu,Feng Haoyin. Knowledge Requirement Model for Online Outsourcing Tasks[J]. 现代图书情报技术, 2016, 32(3): 74-81.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn