Please wait a minute...
New Technology of Library and Information Service  2006, Vol. 1 Issue (7): 33-36    DOI: 10.11925/infotech.1003-3513.2006.07.08
Current Issue | Archive | Adv Search |
A Chinese Automatic Word Segmentation Method for Chinese Information Retrieval
Sun Wei
(School of Information Management, Heilongjiang University, Harbin 150080,China)
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  

This paper discusses what kinds of characteristics Chinese word segmentation should have during information retrieval,analyzes solving difficulties to combine Chinese information retrieval with Chinese word segmentation,and on the basis of these characteristics and difficulties, puts forward a Chinese automatic word segmentation method for Chinese information retrieval.

Key wordsChinese information retrieval      Chinese automatic word segmentation      Dictionary     
Received: 06 April 2006      Published: 25 July 2006
: 

TP391

 
Corresponding Authors: Sun Wei     E-mail: sunwei_wei@126.com
About author:: Sun Wei

Cite this article:

Sun Wei . A Chinese Automatic Word Segmentation Method for Chinese Information Retrieval. New Technology of Library and Information Service, 2006, 1(7): 33-36.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2006.07.08     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2006/V1/I7/33

1孙宾.现代汉语文本的词语切分技术.http://www.ce86.com/lunwen/computer/ai/3814.html (Accessed Feb.10,2006)
2闫引堂,周小强.交集型歧义字段切分方法研究.情报学报,2000,19(6):637-643
3孙茂松,左正平.高频最大交集型歧义切分字段在汉语自动分词中的作用.中文信息学报,1999,13(1):27-34
4费洪晓,康松林,朱小娟等.基于词频统计的中文分词的研究.计算机工程与应用,2005(7):67-68,100
5孙建军等. 信息检索技术.北京:科学出版社,2004.238-240
6翟凤文.统计与字典相结合的中文分词[学位论文].吉林:吉林大学,2005
7郑逢斌,付征叶等.HENU汉语自动分词系统中歧义字段消除算法.河南大学学报(自然科学版),2004(4):49-52

[1] Zhang Mengyao, Zhu Guangli, Zhang Shunxiang, Zhang Biao. Grouping Microblog Users of Trending Topics Based on Sentiment Analysis[J]. 数据分析与知识发现, 2021, 5(2): 43-49.
[2] Peng Chen,Lv Xueqiang,Sun Ning,Zang Le,Jiang Zhaocai,Song Li. Building Phrase Dictionary for Defective Products with Convolutional Neural Network[J]. 数据分析与知识发现, 2020, 4(11): 112-120.
[3] Feng Guoming,Zhang Xiaodong,Liu Suhui. DBLC Model for Word Segmentation Based on Autonomous Learning[J]. 数据分析与知识发现, 2018, 2(5): 40-47.
[4] Hu Jiaheng,Cen Yonghua,Wu Chengyao. Constructing Sentiment Dictionary with Deep Learning: Case Study of Financial Data[J]. 数据分析与知识发现, 2018, 2(10): 95-102.
[5] Li Weiqing,Wang Weijun. Building Product Feature Dictionary with Large-scale Review Data[J]. 数据分析与知识发现, 2018, 2(1): 41-50.
[6] Li Yazi,Zheng Jianli,Zhou Yiyang,Li Guolei. Building a National System for the Reimbursable Prescription Drugs[J]. 现代图书情报技术, 2016, 32(6): 96-101.
[7] Wang Xiaoyun,Yuan Yuan,Shi Lingling. Predicting Opening Weekend Box Office Prediction Based on Microblog[J]. 现代图书情报技术, 2016, 32(4): 31-39.
[8] Guo Shunli,Zhang Xiangxian. Building Sentiment Analysis Dictionary for Chinese Book Reviews[J]. 现代图书情报技术, 2016, 32(2): 67-74.
[9] Nie Hui, Rong Zhe. Review Helpfulness Prediction Research Based on Review Sentiment Feature Sets[J]. 现代图书情报技术, 2015, 31(7-8): 113-121.
[10] Gu Wei, Li Chaofan, Wang Hongjun, Xiao Shibin, Shi Shuicai. Acquisition of Synonym from Patent Query Logs[J]. 现代图书情报技术, 2015, 31(2): 24-30.
[11] Zhang Jie, Zhang Haichao, Zhai Dongsheng. Research of the Word Segmentation for Chinese Patent Claims[J]. 现代图书情报技术, 2014, 30(9): 91-98.
[12] Song Peiyan, Li Jingjing, Zhao Xing. Recommended Method for Cross-language Term Synonymous Relationship and Its Empirical Research[J]. 现代图书情报技术, 2013, (5): 40-45.
[13] Tang Xiaobo, Xiao Lu. Research of Co-word Analysis Method of Combining Keywords Extension and Domain Ontology[J]. 现代图书情报技术, 2013, 29(11): 60-67.
[14] Wu Shixian,Zhang Bilan. An Extraction Model of Experience and Evaluation Article[J]. 现代图书情报技术, 2009, 25(4): 88-92.
[15] Gao Wenli,Li Dehua. Chinese Dictionary Query Mechanism Based on Tri-array Trie[J]. 现代图书情报技术, 2007, 2(7): 76-78.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn