Please wait a minute...
New Technology of Library and Information Service  2013, Vol. 29 Issue (3): 33-37    DOI: 10.11925/infotech.1003-3513.2013.03.06
Current Issue | Archive | Adv Search |
Research of Mining the Word Category Knowledge for Chinese Syntactic Function Distribution Knowledge Base
Wang Dongbo1, Zhu Danhao2
1. College of Information and Technology Science, Nanjing Agricultural University, Nanjing 210095, China;
2. International Institute for Software Technology, United Nations University, Macao 3058, China
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  According to the Chinese word syntactic function distribution, the paper constructs syntactic function distribution knowledge in multi-way tree storage structure base based on Tsinghua treebank. The Chinese word category knowledge is mined by using the K-medoids clustering algorithm of Sparse Feature Clustering based on syntactic function distribution knowledge base.
Key wordsTreebank      Word syntactic function      Knowledge base      SFC     
Received: 20 November 2012      Published: 14 May 2013
:  TP391  

Cite this article:

Wang Dongbo, Zhu Danhao. Research of Mining the Word Category Knowledge for Chinese Syntactic Function Distribution Knowledge Base. New Technology of Library and Information Service, 2013, 29(3): 33-37.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2013.03.06     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2013/V29/I3/33

[1] 陈小荷.从自动句法分析角度看汉语词类问题[J]. 语言教学与研究 ,1999(3):63-72.(Chen Xiaohe. Chinese Words’Classes from the Perspective of Automatic Syntactic Analysis[J].Language Teaching and Research, 1999(3):63-72.)
[2] 徐艳华.现代汉语实词语法功能考察及词类体系重构[D].南京:南京师范大学,2006.(Xu Yanhua.Survey on Modern Chinese Notional Word Grammar Function and Reconstructing the POS System[D].Nanjing: Nanjing Normal University,2006.)
[3] 陈锋,陈小荷.基于树库的现代汉语短语分布考察[J]. 语言科学 ,2008, 7(1):12-17.(Chen Feng,Chen Xiaohe.A Study on Grammartical Functions of Phrases in Mandarin Chinese Based on Chinese TreeBank[J].Linguistic Sciences,2008, 7(1):12-17.)
[4] 卢俊之,陈小荷, 王东波, 等.基于语法功能匹配的汉语句法分析算法[J]. 计算机工程与应用 ,2008,44(16):151-153, 159.(Lu Junzhi,Chen Xiaohe, Wang Dongbo, et al.Chinese Parsing Algorithm Based on Grammar Function Match[J].Computer Engineering and Applications,2008,44(16):151-153,159.)
[5] 崔尚卿, 马秀莉, 唐世渭,等.基于不均匀密度的自动聚类算法[J]. 计算机工程 ,2008, 34(23):86-88.(Cui Shangqing, Ma Xiuli, Tang Shiwei, et al.Auto-clustering Algorithm Based on Non-uniform Density[J].Computer Engineering,2008, 34(23):86-88.)
[6] 王伟.文本自动聚类技术研究[J]. 情报杂志 ,2009, 28(2):94-96.(Wang Wei.Research on Text Automatic Clustering[J].Journal of Intelligence,2009,28(2):94-96.)
[7] 王舵, 郄君, 张娟, 等.一种快速词自动聚类算法[J]. 计算机应用与软件 ,2010, 27(8):277-278.(Wang Duo, Qie Jun, Zhang Juan, et al.A New Algorithm of Words Automatic Clustering[J].Computer Applications and Software,2010, 27(8):277-278.)
[8] 潘章明.半监督的自动聚类[J]. 计算机应用 ,2010, 30(10):2614-2617.(Pan Zhangming.Semi-supervised Automatic Clustering[J].Journal of Computer Applications, 2010, 30(10):2614-2617.)
[9] 于洪, 储双双.一种基于决策粗糙集的自动聚类方法[J]. 计算机科学 ,2011, 38(1):221-224.(Yu Hong, Chu Shuangshuang.Novel Autonomous Clustering Method Based on Decision-theoretic Rough Set[J].Computer Science,2011, 38(1):221-224.)
[10] Boley D, Gini M, Gross R, et al. Partitioning-based Clustering for Web Document Categorization[J]. Decision Support Systems, 1999, 27(3):329-341.
[11] Mao J, Jain A K. A Self-organizing Network for Hyperellipsoidal Clustering [J]. IEEE Transactions on Neural Networks, 1996, 7(1):16-29.
[12] Cai W, Chen S, Zhang D. Fast and Robust Fuzzy C-means Clustering Algorithms Incorporating Local Information for Image Segmentation[J]. Pattern Recognition, 2007, 40(3):825-838.
[13] Chen H H, Lin C J. A Multilingual News Summarizer[C]. In: Proceedings of the 18th International Conference on Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2000:159-165.
[14] Leftin L J.Newsblaster Russian-English Clustering Performance Analysis[R].Columbia Computer Science Technical Reports, 2003.
[15] Evans D K,Klavans J L,McKeown K R.Columbia Newsblaster: Multilingual News Summarization on the Web Demonstration[C].In: Proceedings of HLT-NAACL 2004. Stroudsburg: Association for Computational Linguistics, 2004:1-4.
[16] Mathieu B, Besancon R, Fluhr C. Multilingual Document Clusters Discovery[C]. In: Proceedings of RIAO 2004. 2004:116-125.
[17] 周强, 张伟, 俞士汶.汉语树库的构建[J]. 中文信息学报 ,1997(4):42-51. (Zhou Qiang,Zhang Wei,Yu Shiwen.Building a Chinese Treebank[J].Journal of Chinese Information Processing,1997(4): 42-51.)
[18] Dhillon I S, Mallela S, Kumar R.A Divisive Information Theoretic Feature Clustering Algorithm for Text Classification[J].The Journal of Machine Learning Research,2003,3(1):1265-1287.
[19] Marcus M P,Marcinkiewicz M A,Santorini B.Building a Large Annotated Corpus of English: The Penn Treebank[J].Computational Linguistics,1993,19(2):313-330.
[1] Li Wenna,Zhang Zhixiong. Research on Knowledge Base Error Detection Method Based on Confidence Learning[J]. 数据分析与知识发现, 2021, 5(9): 1-9.
[2] He Xueyao, Ma Tingcan, Yue Mingliang, Ou Guiyan. Analyzing Highly Cited Papers Sponsored by National Natural Science Foundation of China[J]. 数据分析与知识发现, 2021, 5(2): 61-69.
[3] Wen Pingmei,Ye Zhiwei,Ding Wenjian,Liu Ying,Xu Jian. Developments of Named Entity Disambiguation[J]. 数据分析与知识发现, 2020, 4(9): 15-25.
[4] Ruihua Qi,Junyi Zhou,Xu Guo,Caihong Liu. Extracting Book Review Topics with Knowledge Base[J]. 数据分析与知识发现, 2019, 3(6): 83-91.
[5] Chen Guo,Xiao Lu. Linking Knowledge Elements from Online Community[J]. 数据分析与知识发现, 2017, 1(11): 75-83.
[6] Zhou Pengcheng,Wu Chuan,Lu Wei. Entity Linking Method for Short Texts with Multi-Knowledge Bases: Case Study of Wikipedia and Freebase[J]. 现代图书情报技术, 2016, 32(6): 1-11.
[7] Dongsheng Zhai, He Liu, Jie Zhang, Liwei Cai. Managing Patent Semantic Knowledge with Graph Database[J]. 数据分析与知识发现, 2016, 32(12): 66-75.
[8] Jiang Xun, Xu Xukan, Su Xinning. Knowledge Service-oriented Model of Knowledge Base Frame Structure Research Based on Double-base Cooperating[J]. 现代图书情报技术, 2014, 30(2): 55-62.
[9] Xu Xin, Hong Yunjia. Study on Text Visualization of Clustering Result for Domain Knowledge Base —— Take Knowledge Base of Chinese Cuisine Culture as the Object[J]. 现代图书情报技术, 2014, 30(10): 25-32.
[10] Xu Xin, Guo Jinlong. Construction of Subject Knowledge Base——Taking the Domain of Chinese Cuisine Culture as an Example[J]. 现代图书情报技术, 2013, (12): 2-9.
[11] Guo Jinlong, Hong Yunjia, Xu Xin. Construction and Application of Ontology in the Domain of Chinese Cuisine Culture[J]. 现代图书情报技术, 2013, (12): 10-18.
[12] Hong Yunjia, Xu Xin. Study on Multi-level Text Clustering for Knowledge Base Based on Domain Ontology——Taking Knowledge Base of Chinese Cuisine Culture as an Example[J]. 现代图书情报技术, 2013, (12): 19-26.
[13] Zhang Pengyi, Qu Yan, Huang Chen. Design and Application of the S&T Innovation Group and Environment Ontology[J]. 现代图书情报技术, 2013, (12): 42-47.
[14] Li Jianwei, Song Wen, Tang Yijie, Liu Yi, Wang Xinglan. Research on Data Building for Knowledge Base Based on Scientific Research Ontology[J]. 现代图书情报技术, 2013, 29(11): 15-21.
[15] Hong Yunjia, Xu Xin. Knowledge Base of Collaborative Virtual Reference Systems:State of the Art and Future Trends[J]. 现代图书情报技术, 2012, (9): 2-9.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn