|
|
Study on the Differentiating Method of Technical and Effect Words in Patent |
Chen Ying1, Zhang Xiaolin2 |
1. Institute of Medical Information, Chinese Academy of Medical Sciences, Beijing 100020, China;
2. National Science Library, Chinese Academy of Sciences, Beijing 100190, China |
|
|
Abstract In analyzing unstructured information of patents, there is a problem in identifying and defining the technology innovations and the effect of patent currently.This paper puts forward a method to differentiate technical and effect words in patent,based on the features of patents’ structure-grammar-clue word.The method can synthetically consider three feature factors: the structure, the grammar and clues word, then improve the recognition result of the technical and effect words in patents.
|
Received: 09 October 2011
Published: 02 February 2012
|
|
[1] Huang S H,Ke H R, Yang W P. Structure Clustering for Chinese Patent Documents[J]. Expert Systems with Applications, 2008,34(4):2290-2297.[2] Fujii A, Ishikawa T. Document Structure Analysis for the NTCIR-5 Patent Retrieval Task[C]. In:Proceedings of the 5th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, Question Answering and Cross-lingual Information Access,Tokyo,Japan.2005:292-296.[3] Shinmori A, Okumura M, Marukawa Y, et al. Can Claim Analysis Contribute toward Patent Map Generation?[C].In: Proceedings of NTCIR-4,Tokyo.2004.[4] 张惠, 邱清盈, 冯培恩, 等. 产品专利设计知识获取方法研究[J]. 哈尔滨工程大学学报, 2009,30(7): 785-791.[5] Indukuri K, Ambekar A, Sureka A. Similarity Analysis of Patent Claims Using Natural Language Processing Techniques[C].In: Proceedings of the International Conference on Computational Intelligence and Multimedia Applications,Sivakasi, Tamil Nadu. IEEE Computer Society, 2007:169-175.[6] 孙鑫.自然语言处理中语法分析研究[J]. 现代图书情报技术, 2004(Z1):44-46.[7] Shinmori A, Okumura M, Marukawa Y, et al. Patent Claim Processing for Readability: Structure Analysis and Term Explanation [EB/OL].[2011-09-02]. http://acl.ldc.upenn.edu/W/W03/W03-2007.pdf.[8] Shinmori A, Okumura M, Marukawa Y, et al. Rhetorical Structure Analysis of Japanese Patent Claims Using Cue Phrases[C].In: Proceedings of the 3rd NTCIR Workshop, Tokyo,Japan.2002.[9] 杨清亮. 发明是这样诞生的——TRIZ理论全接触[M]. 北京: 机械工业出版社,2008: 169.[10] 刘翰卿. 基于SAO结构之中文专利文件自动摘要技术研究[D]. 台湾: 国立交通大学, 2005.[11] 于立彪, 赵静. 专利权利要求中功能性特征的解释原则探析[J]. 中国专利与商标, 2007(3): 63-70.[12] Atwell E. Brown语料库标记集[EB/OL].[2011-09-01].http://www.comp.leeds.ac.uk/amalgam/tagsets/brown.html.[13] 林士能. 专利文件语意之撷取与比对[D]. 台湾: 国立清华大学, 2003.[14] Cascini G,Lucchesi D,Rissone P. Automatic Patents Functional Analysis Through Semantic Processing[C]. In: Proceedings of the 12th ADM International Conference,Italy.2001.[15] Cascini G, Russo D. Computer-aided Analysis of Patents and Search for TRIZ Contradictions[J]. International Journal of Product Development, 2007,4(1-2): 52-67.[16] Onix Stop Word List[EB/OL].[2011-09-02]. http://www.lextek.com/manuals/onix/stopwords1.html.[17] LingPipe [EB/OL].[2011-09-01]. http://alias-i.com/lingpipe/index.html.[18] Tseng Y H, Lin C J, Lin Y I. Text Mining Techniques for Patent Analysis[J]. Information Processing & Management, 2007,43(5): 1216-1247.[19] 李育嫦. 词表在网络信息检索中的应用分析[J]. 情报理论与实践, 2006,29(2): 161-163,193.[20] Database UPF-TaI. Stopwords[EB/OL].[2011-09-02]. http://www.uspto.gov/patft/help/stopword.htm. |
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|