Study on Detection Method of Similarity Patents
Zhou Qunfang
Baoshan Iron and Steel Co., Ltd., Shanghai 201900, China
Abstract: To retrieve similar patents from patent databases, this paper first merges similar concepts through ontology-based identifier replacement, then builds a vector space model from the TF-IDF values of those concepts to extract candidate similar patent documents. Within the extracted documents, an ordered longest common subsequence (LCS) method matches sentences against one another, yielding a collection of similar clauses. The method can support enterprise intelligence work by detecting similar patents effectively.
Received: 02 November 2012
Published: 06 February 2013
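
The two matching stages named in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes the ontology-based concept merging has already been done, so each document is a list of concept identifiers, and the function names, the TF-IDF weighting variant, and the LCS normalization are all illustrative assumptions.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Build one sparse TF-IDF vector (dict: concept -> weight) per document.

    Each document is a list of concept identifiers (assumed to be the output
    of a prior concept-merging step that is not shown here).
    """
    n = len(docs)
    # Document frequency: in how many documents each concept occurs.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append(
            {t: (c / len(doc)) * math.log(n / df[t]) for t, c in tf.items()}
        )
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors; 0.0 if either is all zero."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def lcs_length(a, b):
    """Length of the longest common subsequence of two sequences (order-preserving)."""
    prev = [0] * (len(b) + 1)
    for x in a:
        cur = [0]
        for j, y in enumerate(b, start=1):
            cur.append(prev[j - 1] + 1 if x == y else max(prev[j], cur[j - 1]))
        prev = cur
    return prev[-1]


def sentence_similarity(a, b):
    """LCS length normalized by the mean sentence length (an assumed normalization)."""
    if not a or not b:
        return 0.0
    return 2 * lcs_length(a, b) / (len(a) + len(b))


if __name__ == "__main__":
    # Hypothetical toy corpus of concept identifiers.
    docs = [
        ["steel", "rolling", "mill", "cooling"],
        ["steel", "rolling", "mill", "coating"],
        ["battery", "anode", "lithium", "cell"],
    ]
    vecs = tfidf_vectors(docs)
    print(cosine(vecs[0], vecs[1]), cosine(vecs[0], vecs[2]))
    print(sentence_similarity(docs[0], docs[1]))
```

In this sketch the cosine score would select candidate documents, and the LCS score would then be applied sentence by sentence within those candidates. Any cut-off thresholds for either score would need to be chosen empirically.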