%A Duan Yufeng, Ju Fei %T Research on Chinese New Word Recognition in Specialized Field Based on N-Gram %0 Journal Article %D 2012 %J Data Analysis and Knowledge Discovery %R 10.11925/infotech.1003-3513.2012.02.07 %P 41-47 %V 28 %N 2 %U {https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/abstract/article_3520.shtml} %8 2012-02-25 %X The paper researches automatic new word recognition in specialized field which is represented by phytology. A set of 200 documents on plant description randomly drawn from “Flora of China” is taken as sample set. At first, draw new words candidates are drawn by N-Gram method based on words split by ICTCLAS. Then all the new words candidates are sorted respectively by term frequency (TF), document frequency (D) and average term frequency (TF/D) and the candidates are selected among certain boundary as true new words. The experiments show that new words recognition according to TF is the best and F measurement is 0.65. This method can automatically produce user dictionary of specialized field and is highly portable.