Please wait a minute...
Data Analysis and Knowledge Discovery  2019, Vol. 3 Issue (5): 117-124    DOI: 10.11925/infotech.2096-3467.2018.0674
Current Issue | Archive | Adv Search |
Evaluating and Classifying Patent Values Based on Self-Organizing Maps and Support Vector Machine
Cheng Zhou(),Hongqin Wei
Glorious Sun School of Business and Management, Donghua University, Shanghai 200051, China
Download: PDF(581 KB)   HTML ( 2
Export: BibTeX | EndNote (RIS)      

[Objective] This paper proposes a new method for evaluating and classifying patent values. [Methods] With the help of value indicators, we designed a patent value analysis and classification system based on self-organizing maps (SOM) and support vector machine (SVM) techniques. We used the SOM to determine value categories, and then applied the random forest (RF) algorithm to rank value indictors based on their significance. Finally, we improved classification performance with the wrapped feature reduction method. [Results] The value tags determined by SOM effectively represented the patent values. Meanwhile, the value indictors were reduced from 14 to 10, and the classification accuracy was increased from 76.28% to 86.89%. [Limitations] Further refinement of patent values in each category is needed, which might reduce the patent value indicators. [Conclusions] The proposed SOM-RF-SVM method could support research and development activities as well as reduce the dependence on human factors.

Key wordsEvaluation of Patent Values      Data Clustering      Classification of Patent Values      Feature Selection      Self-Organizing Maps      Support Vector Machine     
Received: 25 June 2018      Published: 03 July 2019

Cite this article:

Cheng Zhou,Hongqin Wei. Evaluating and Classifying Patent Values Based on Self-Organizing Maps and Support Vector Machine. Data Analysis and Knowledge Discovery, 2019, 3(5): 117-124.

URL:     OR

[1] 阮敏. 企业所有权性质、环境规制与发明专利的研发效率[J]. 软科学, 2016, 30(2): 55-59.
[1] (Ruan Min. Corporate Ownership Nature, Environmental Regulation and R&D Efficiencies of the Invention Patent[J]. Soft Science, 2016, 30(2): 55-59.)
[2] 张耀天, 杜慰纯, 贾明顺, 等. 基于自适应层次分析法的企业专利质量评价研究[J]. 图书情报工作, 2016, 60(7): 110-115.
[2] (Zhang Yaotian, Du Weichun, Jia Mingshun, et al.Research on the Evaluation of Enterprise Patents Quality Based on the Adaptive Analytic Hierarchy Process[J]. Library and Information Service, 2016, 60(7): 110-115.)
[3] Dereli T, Durmu?o?lu A.Classifying Technology Patents to Identify Trends: Applying a Fuzzy-Based Clustering Approach in the Turkish Textile Industry[J]. Technology in Society, 2009, 31(3): 263-272.
[4] Narin F.Patent Bibliometrics[J]. Scientometrics, 1994, 30(1): 147-155.
[5] Ashtor J H.Redefining “Valuable Patents”: Analysis of the Enforcement Value of U.S. Patents[J]. Social Science Electronic Publishing, 2015. .
[6] 邱一卉, 张驰雨, 陈水宣. 基于分类回归树算法的专利价值评估指标体系研究[J]. 厦门大学学报:自然科学版, 2017, 56(2): 244-251.
[6] (Qiu Yihui, Zhang Chiyu, Chen Shuixuan.Research of Patent-Value Assessment Indictor System Based on Classification and Regression Tree Algorithm[J]. Journal of Xiamen University: Natural Science, 2017, 56(2): 244-251.)
[7] Archontopoulos E.Prior Art Search Tools on the Internet and Legal Status of the Results: A European Patent Office Perspective[J]. World Patent Information, 2004, 26(2): 113-121.
[8] Dang J, Motohashi K.Patent Statistics: A Good Indicator for Innovation in China? Patent Subsidy Program Impacts on Patent Quality[J]. China Economic Review, 2015, 35: 137-155.
[9] Chen Y S, Chang K C.The Relationship Between a Firm’s Patent Quality and Its Market Value—The Case of US Pharmaceutical Industry[J]. Technological Forecasting & Social Change, 2010, 77(1): 20-33.
[10] Frietsch R, Neuh?usler P, Jung T, et al.Patent Indicators for Macroeconomic Growth—The Value of Patents Estimated by Export Volume[J]. Technovation, 2014, 34(9): 546-558.
[11] Harhoff D, Scherer F M, Vopel K. Citations, Family Size, Opposition and the Value of Patent Rights[J]. Research Policy, 2003, 32(8): 1343-1363.
[12] 郑素丽, 宋明顺. 专利价值由何决定?——基于文献综述的整合性框架[J]. 科学学研究, 2012, 30(9): 1316-1323.
[12] (Zheng Suli, Song Mingshun.Review on the Determinants of Patent Value: An Integrate Framework[J]. Studies in Science of Science, 2012, 30(9): 1316-1323.)
[13] Marco A C.The Dynamics of Patent Citations[J]. Economics Letters, 2006, 94(2): 290-296.
[14] 苏健美. 基于收益法的专利权价值评估研究[D]. 昆明:云南大学, 2014.
[14] (Su Jianmei.Research on Patent Value Evaluation Based on Income Law[D]. Kunming: Yunnan University, 2014.)
[15] 杨冠灿, 刘彤, 李纲, 等. 基于综合引用网络的专利价值评价研究[J]. 情报学报, 2013, 32(12):1265-1277.
[15] (Yang Guancan, Liu Tong, Li Gang, et al.Research on Patent Value Evaluation Based on Comprehensive Citation Network[J]. Journal of the China Society for Scientific and Technical Information, 2013, 32(12): 1265-1277.)
[16] 赵蕴华, 张静, 李岩, 等. 基于机器学习的专利价值评估方法研究[J]. 情报科学, 2013, 31(12): 15-18.
[16] (Zhao Yunhua, Zhang Jing, Li Yan, et al.Study on Evaluation for Patent Value Based on Machine Learning[J]. Information Science, 2013, 31(12): 15-18.)
[17] 吕璐成, 刘娅, 杨冠灿. 基于决策树方法的专利被引影响因素研究[J]. 情报理论与实践, 2015, 38(2): 28-32.
[17] (Lv Lucheng, Liu Ya, Yang Guancan.Research on the Influencing Factors of Patent Citation Based on Decision Tree Method[J]. Information Studies: Theory & Application, 2015, 38(2): 28-32.)
[18] Chiu C Y, Huang P T.Application of the Honeybee Mating Optimization Algorithm to Patent Document Classification in Combination with the Support Vector Machine[J]. International Journal of Automation and Smart Technology, 2013, 3(3): 179-191.
[19] Wu C H, Ken Y, Huang T.Patent Classification System Using a New Hybrid Genetic Algorithm Support Vector Machine[J]. Applied Soft Computing, 2010, 10(4): 1164-1177.
[20] Breiman L.Random Forests[J]. Machine Learning, 2001, 45(1): 5-32.
[21] 姚登举, 杨静, 詹晓娟. 基于随机森林的特征选择算法[J]. 吉林大学学报: 工学版, 2014, 44(1): 137-141.
[21] (Yao Dengju, Yang Jing, Zhan Xiaojuan.Feature Selection Algorithm Based on Random Forest[J]. Journal of Jilin University: Engineering and Technology Edition, 2014, 44(1): 137-141.)
[22] Strobl C, Boulesteix A L, Kneib T, et al.Conditional Variable Importance for Random Forests[J]. BMC Bioinformatics, 2008, 9: 307.
[23] Cortes C, Vapnik V.Support Vector Network[J]. Machine Learning, 1995, 20(3): 273-297.
[24] 裴云龙, 蔡虹, 王晓南. 中外科学文献对中国高新产业技术创新质量的影响——基于专利的科学引文的分析[J]. 情报学报, 2013, 32(12): 1333-1344.
[24] (Pei Yunlong, Cai Hong, Wang Xiaonan.How Does Domestic and Foreign Scientific Literature Affect Technology Innovation Quality of High-tech Industries in China: An Analysis Based on Patent Citations Scientific Publications[J]. Journalof the China Society for Scientific and Technical Information, 2013, 32(12): 1333-1344.)
[25] 周成, 魏红芹. 基于随机森林属性约简的众包竞赛参与者识别体系研究[J]. 数据分析与知识发现, 2018, 2(7): 46-54.
[25] (Zhou Cheng, Wei Hongqin.Identifying Crowd Participants with Modified Random Forests Algorithm[J]. Data Analysis and Knowledge Discovery, 2018, 2(7): 46-54)
[26] 徐庆富, 康旭东, 杨中楷, 等. 基于专利权转让的我国省际技术转移特征研究[J]. 情报杂志, 2017, 36(7): 66-72.
[26] (Xu Qingfu, Kang Xudong, Yang Zhongkai, et al.Research on the Characteristics of Inter-Provincial Technology Transfer in China Based on Patent Right Transfer[J]. Journal of Intelligence, 2017, 36(7): 66-72.)
[27] Wu J L, Chang P C, Tsao C C, et al.A Patent Quality Analysis and Classification System Using Self-Organizing Maps with Support Vector Machine[J]. Applied Soft Computing, 2016, 41: 305-316.
[28] 孙玉涛, 栾倩. 专利质量测度“三阶段—两维度”模型及实证研究——以C9联盟高校为例[J]. 科学学与科学技术管理, 2016, 37(6): 23-32.
[28] (Sun Yutao, Luan Qian.A ‘Three Stages-Two Dimensions’ Model of Patent Quality Measuring and Its Empirical Study: A Case Study of C9 League[J]. Science of Science and Management of S.& T., 2016, 37(6): 23-32.)
[29] 乔永忠, 谭婉琳. 专利权利要求数与维持时间关系实证研究——以中日授权专利数据为例[J]. 科学学与科学技术管理, 2017, 38(2): 77-86.
[29] (Qiao Yongzhong, Tan Wanlin.Empirical Studies of the Relationship of the Claims Number and the Maintenance Time of Patents: Based on the Data of Patents Granted by China and Japan[J]. Science of Science and Management of S.& T., 2017, 38(2): 77-86.)
[1] Qingtian Zeng,Mingdi Dai,Chao Li,Hua Duan,Zhongying Zhao. Discovering Important Locations with User Representation and Trace Data[J]. 数据分析与知识发现, 2019, 3(6): 75-82.
[2] Jiaming Liang,Jie Zhao,Zhou Jianlong,Zhenning Dong. Detecting Collusive Fraudulent Online Transaction with Implicit User Behaviors[J]. 数据分析与知识发现, 2019, 3(5): 125-138.
[3] Tingxin Wen,Yangzi Li,Jingshuang Sun. News Hotspots Discovery Method Based on Multi Factor Feature Selection and AFOA/K-means[J]. 数据分析与知识发现, 2019, 3(4): 97-106.
[4] Zhanglu Tan,Zhaogang Wang,Han Hu. Study on a Method of Feature Classification Selection Based on χ2 Statistics[J]. 数据分析与知识发现, 2019, 3(2): 72-78.
[5] Tingxin Wen,Yangzi Li,Jingshuang Sun. Extracting Text Features with Improved Fruit Fly Optimization Algorithm[J]. 数据分析与知识发现, 2018, 2(5): 59-69.
[6] Xiaoxi Huang,Hanyu Li,Rongbo Wang,Xiaohua Wang,Zhiqun Chen. Recognizing Metaphor with Convolution Neural Network and SVM[J]. 数据分析与知识发现, 2018, 2(10): 77-83.
[7] Zhipeng Li,Weizhong Li. Feature Selection Based on Modified QPSO Algorithm[J]. 数据分析与知识发现, 2017, 1(7): 82-89.
[8] Jin Zeng,Wei Lu,Heng Ding,Haihua Chen. Modeling User’s Interests Based on Image Semantics[J]. 数据分析与知识发现, 2017, 1(4): 76-83.
[9] Shihai Tian,Deli Lyu. An Early Warning Algorithm for Public Opinion of Safety Emergency[J]. 数据分析与知识发现, 2017, 1(2): 11-18.
[10] Yue Zhang,Dongbo Wang,Danhao Zhu. Segmenting Chinese Words from Food Safety Emergencies[J]. 数据分析与知识发现, 2017, 1(2): 64-72.
[11] Shuang Yang,Fen Chen. Analyzing Sentiments of Micro-blog Posts Based on Support Vector Machine[J]. 数据分析与知识发现, 2017, 1(2): 73-79.
[12] Xiangdong Li,Tao Ruan,Kang Liu. Automatic Classification of Documents from Wikipedia[J]. 数据分析与知识发现, 2017, 1(10): 43-52.
[13] Yonghe Lu,Jinghuang Chen. Optimizing Feature Selection Method for Text Classification with Shuffled Frog Leaping Algorithm[J]. 数据分析与知识发现, 2017, 1(1): 91-101.
[14] Liu Hongguang,Ma Shuanggang,Liu Guifeng. Classifying Chinese News Texts with Denoising Auto Encoder[J]. 现代图书情报技术, 2016, 32(6): 12-19.
[15] Meng Yuan,Wang Hongwei. Evaluating Online Reviews Based on Text Content Features[J]. 现代图书情报技术, 2016, 32(4): 40-47.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938