Review of Latent Knowledge Discovery Methods Based on Association Between Scientific Papers and Technology Patents
Wang Shiwei1,2,Chen Chun1,2()
1Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences, Lanzhou 730000, China 2Department of Information Resources Management, School of Economics and Management, University of Chinese Academy of Sciences, Beijing 100190, China
【目的】 通过文献调研梳理总结基于科学论文和技术专利的潜在知识发现方法,总结研究不足和未来发展方向。【文献范围】 以Patents and Papers,Science and Technology,Knowledge Discovery,专利和论文,科学和技术,知识发现等为关键词分别在Web of Science、Springer Link和CNKI等学术平台检索文献,筛选出75篇具有代表性的文献进行综述。【方法】 在科学-技术关联关系的基础上,从数据关联、主体关联、主题关联以及多维度关联4方面对文献进行归纳梳理。【结果】 现有研究方法存在不足,包括识别语料的数据来源具有局限性且异构数据源的不规范性;识别方法的潜在知识发现语义性不足、粒度较粗;基于论文和专利的知识体系和测度指标不完善;识别结果缺乏全面性、动态性、探索性。【局限】 主要选取部分代表性文献进行综述,深入阐述不够深刻;在内容分析层面上,科学-技术关联关系的多策略综合分析方法是目前的热点研究,本文对此方法分析系统性不足;对检索得出的代表性综述文献的选择具有一定的主观性。【结论】 在未来的研究中要整合多源数据库资源并规范化异构数据,增强识别方法的语义分析能力和细化识别粒度,完善知识组织体系并丰富测度指标,加强对潜在知识发现动态演变的研究。
[Objective] This paper reviews the latent knowledge discovery methods based on scientific papers and technology patents to identify deficiencies in current studies and future development directions.[Coverage] A total of 75 representative articles were retrieved using keywords such as “Patents and Papers”, “Science and Technology”, and “Knowledge Discovery” from the Web of Science, Springer Link, and CNKI. [Methods] Based on the scientific-technical association, we reviewed the literature from four aspects: data association, subject association, theme association, and multi-dimensional association. [Results] The existing research methods have limitations, such as the need for more data sources for identifying corpus and the non-standardization of heterogeneous data sources. The potential knowledge discovery of the recognition method needs more semantics and better granularity. The knowledge system and measurement index based on papers and patents still need to be completed. The recognition results need more comprehensiveness, dynamic and exploratory nature. [Limitations] Mainly select some representative literature to review, in-depth elaboration is not deep enough. At the level of content analysis, the multi-strategy comprehensive analysis method of science-technology correlation is a hot research at present, but the analysis of this method is not systematic enough in this paper. The selection of representative review literature obtained from the search has a certain degree of individual subjectivity. [Conclusions] In future research, we should integrate multi-source databases and standardize heterogeneous data, enhance the semantic analysis ability of recognition methods, and refine the recognition granularity. We also need to improve the knowledge organization system, enrich the measurement indicators, and strengthen the research on the dynamic evolution of latent knowledge discovery.
王诗炜, 陈春. 基于科学论文和技术专利关联关系识别潜在知识发现方法研究综述*[J]. 数据分析与知识发现, 2023, 7(7): 18-31.
Wang Shiwei, Chen Chun. Review of Latent Knowledge Discovery Methods Based on Association Between Scientific Papers and Technology Patents. Data Analysis and Knowledge Discovery, 2023, 7(7): 18-31.
(Zhang Xue, Zhang Zhiqiang. Research on the Evolution Law and Influence of Patent Knowledge Absorption and Diffusion[J]. Science Research Management, 2022, 43(6): 160-169.)
(Fan Hongxia. Knowledge Discovery in Database and Its Utilization in Digital Library[J]. Modern Information, 2008, 28(8): 90-92.)
[3]
Swanson D R. Fish Oil, Raynaud's Syndrome, and Undiscovered Public Knowledge[J]. Perspectives in Biology and Medicine, 1986, 30(1): 7-18.
pmid: 3797213
[4]
Gordon M D, Lindsay R K. Toward Discovery Support Systems: A Replication, Re-examination, and Extension of Swanson's Work on Literature-Based Discovery of a Connection Between Raynaud's and Fish Oil[J]. Journal of the American Society for Information Science, 1996, 47(2): 116-128.
doi: 10.1002/(ISSN)1097-4571
[5]
Kostoff R N. Literature-Related Discovery(LRD): Introduction and Background[J]. Technological Forecasting and Social Change, 2008, 75(2): 165-185.
doi: 10.1016/j.techfore.2007.11.004
[6]
Kostoff R N. Literature-Related Discovery and Innovation—Update[J]. Technological Forecasting and Social Change, 2012, 79(4): 789-800.
doi: 10.1016/j.techfore.2012.02.002
pmid: 32287411
[7]
Ittipanuvat V, Fujita K, Kajikawa Y, et al. Finding Linkage Between Technology and Social Issues: A Literature Based Discovery Approach[C]// Proceedings of PICMET’12:Technology Management for Emerging Technologies. 2012: 2310-2321.
(Cao Zhijie, Leng Fuhai. Research on the Application of Disjointed Literature-Based Knowledge Discovery Method in Aerospace Scientific and Technical Information Service[J]. Information Studies: Theory & Application, 2008, 31(4): 569-572.)
(Cao Shujin, Cao Ruye. Research on Interdisciplinary Knowledge Discovery Based on Knowledge Graph to Support Scientific Research Innovation[J]. Information Studies: Theory & Application, 2022, 45(11): 10-20.)
(Hu Yuning, Li Xiaotao, Zhu Xuefang. The Knowledge Discovery of Integrating Subject Words and Citation: Data Optimization and Content Visualization[J]. Journal of Intelligence, 2022, 41(10): 130-137, 155.)
(Deng Jun, Wang Ruan. Research on Knowledge Map and Multidimensional Knowledge Discovery of Oral History Archives Resources[J]. Library and Information Service, 2022, 66(7): 4-16.)
doi: 10.13266/j.issn.0252-3116.2022.07.001
(Zhang Han, An Xinyu, Liu Chunhe. Building Multi-source Semantic Knowledge Graph for Drug Repositioning[J]. Data Analysis and Knowledge Discovery, 2022, 6(7): 87-98.)
[14]
Chen C M. CiteSpace II: Detecting and Visualizing Emerging Trends and Transient Patterns in Scientific Literature[J]. Journal of the American Society for Information Science and Technology, 2006, 57(3): 359-377.
doi: 10.1002/(ISSN)1532-2890
(Zeng Haijiao, Sun Wei. Identification of Potential Scientific Frontiers Based on Correlation Between Patents and Papers—A Case Study of Biopesticide[J]. Agricultural Outlook, 2020, 16(9): 93-100.)
(Luo Rui, Xu Haiyun, Dong Kun. A Review of the Main Recognition Methods of Frontier Research[J]. Library and Information Service, 2018, 62(23): 119-131.)
doi: 10.13266/j.issn.0252-3116.2018.23.015
[18]
Pan W W, Jian L R, Liu T. Knowledge Generation and Diffusion in Science & Technology: An Empirical Study of SiC-MOSFET Based on Scientific Papers and Patents[J]. Technology Analysis & Strategic Management, 2022. DOI: 10.1080/09537325.2022.2106419.
doi: 10.1080/09537325.2022.2106419
[19]
Shibata N, Kajikawa Y, Sakata I. Detecting Potential Technological Fronts by Comparing Scientific Papers and Patents[J]. Foresight, 2011, 13(5): 51-60.
doi: 10.1108/14636681111170211
[20]
de Solla P D J. Is Technology Historically Independent of Science? A Study in Statistical Historiography[J]. Technology and Culture, 1965, 6(4): 553-568.
doi: 10.2307/3101749
[21]
Narin F, Noma E. Is Technology Becoming Science?[J]. Scientometrics, 1985, 7: 369-381.
doi: 10.1007/BF02017155
[22]
Verbeek A, Debackere K, Luwel M, et al. Linking Science to Technology: Using Bibliographic References in Patents to Build Linkage Schemes[J]. Scientometrics, 2002, 54(3): 399-420.
doi: 10.1023/A:1016034516731
[23]
Casimir H B G. Industries and Academic Freedom[J]. Research Policy, 1971, 1(1): 3-8.
doi: 10.1016/0048-7333(71)90003-5
[24]
Bhattacharya S, Kretschmer H, Meyer M. Characterizing Intellectual Spaces Between Science and Technology[J]. Scientometrics, 2003, 58(2): 369-390.
doi: 10.1023/A:1026244828759
[25]
van Looy B, Debackere K, Callaert J, et al. Scientific Capabilities and Technological Performance of National Innovation Systems: An Exploration of Emerging Industrial Relevant Research Domains[J]. Scientometrics, 2006, 66(2): 295-310.
doi: 10.1007/s11192-006-0030-3
(Liu Xiaoling, Tan Zongying, Zhang Chaoxing. Research Review of “Science-Technology Relationship” Research Methods: Highlights on Bibliometrics Method[J]. Library and Information Service, 2015, 59(13): 142-148.)
doi: 10.13266/j.issn.0252-3116.2015.13.020
[27]
Narayanamurti V, Odumosu T. Cycles of Invention and Discovery: Rethinking the Endless Frontier[M]. Cambridge: Harvard University Press, 2016.
[28]
Godin B. Models of Innovation: The History of an Idea[M]. Cambridge: The MIT Press, 2017.
[29]
Han F, Magee C L. Testing the Science/Technology Relationship by Analysis of Patent Citations of Scientific Papers after Decomposition of Both Science and Technology[J]. Scientometrics, 2018, 116(2): 767-796.
doi: 10.1007/s11192-018-2774-y
[30]
Zhang G J, Liu L N, Wei F F. Key Nodes Mining in the Inventor-Author Knowledge Diffusion Network[J]. Scientometrics, 2019, 118(3): 721-735.
doi: 10.1007/s11192-019-03005-2
[31]
Xu H Y, Yue Z H, Pang H S, et al. Integrative Model for Discovering Linked Topics in Science and Technology[J]. Journal of Informetrics, 2022, 16(2): 101265.
doi: 10.1016/j.joi.2022.101265
[32]
Yu D J, Yan Z P. Combining Machine Learning and Main Path Analysis to Identify Research Front: From the Perspective of Science-Technology Linkage[J]. Scientometrics, 2022, 127(7): 4251-4274.
[33]
V.布什. 科学:没有止境的前沿[M]. 范岱年译. 北京: 商务印书馆, 2004.
[33]
(Bush Vannevar. Science: The Endless Frontier[M]. Translated by Fan Dainian. Beijing: The Commercial Press, 2004.)
(OECD. Frattie Handbook: Implementation Standards of Research and Experimental Development Survey[M]. Beijing: Scientific and Technical Documents Publishing House, 2010.)
[35]
澳大利亚科学技术委员会. 基础科学与国家目标[M]. 澳大利亚政府出版局, 1984.
[35]
(Australian Science and Technology Commission. Basic Science and National Goals[M]. Australian Government Publishing Service, 1984.)
(Stokes D E. Pasteur's Quadrant: Basic Science and Technological Innovation[M]. Translated by Zhou Chunyan, Gu Chunli. Beijing: Science Press, 1999.)
[37]
Guan J C, He Y. Patent-Bibliometric Analysis on the Chinese Science—Technology Linkages[J]. Scientometrics, 2007, 72(3): 403-425.
doi: 10.1007/s11192-007-1741-1
(Dong Kun, Xu Haiyun, Luo Rui, et al. Review of the Research on Relationship Between Science and Technology[J]. Journal of the China Society for Scientific and Technical Information, 2018, 37(6): 642-652.)
[39]
Hammarfelt B. Linking Science to Technology: The “Patent Paper Citation” and the Rise of Patentometrics in the 1980s[J]. Journal of Documentation, 2021, 77(6): 1413-1429.
[40]
Egghe L, Guns R, Rousseau R. Thoughts on Uncitedness: Nobel Laureates and Fields Medalists as Case Studies[J]. Journal of the American Society for Information Science and Technology, 2011, 62(8): 1637-1644.
doi: 10.1002/asi.v62.8
Zhang G J, Feng Y Q, Yu G, et al. Analyzing the Time Delay Between Scientific Research and Technology Patents Based on the Citation Distribution Model[J]. Scientometrics, 2017, 111(3): 1287-1306.
doi: 10.1007/s11192-017-2357-3
[43]
Qu Z, Zhang S S. References to Literature from the Business Sector in Patent Documents: A Case Study of Charging Technologies for Electric Vehicles[J]. Scientometrics, 2020, 124(2): 867-886.
doi: 10.1007/s11192-020-03518-1
[44]
Liaw Y C, Chan T Y, Fan C Y, et al. Can the Technological Impact of Academic Journals be Evaluated? The Practice of Non-patent Reference (NPR) Analysis[J]. Scientometrics, 2014, 101(1): 17-37.
doi: 10.1007/s11192-014-1337-0
[45]
van Raan A F J, Winnink J J. Do Younger Sleeping Beauties Prefer a Technological Prince?[J]. Scientometrics, 2018, 114(2): 701-717.
doi: 10.1007/s11192-017-2603-8
pmid: 29449753
[46]
Glänzel W, Meyer M. Patents Cited in the Scientific Literature: An Exploratory Study of ‘Reverse’ Citation Relations[J]. Scientometrics, 2003, 58(2): 415-428.
[47]
Hou J H, Yang X C. Patent Sleeping Beauties: Evolutionary Trajectories and Identification Methods[J]. Scientometrics, 2019, 120(1): 187-215.
doi: 10.1007/s11192-019-03123-x
[48]
Gao J P, Ding K, Teng L, et al. Hybrid Documents Co-citation Analysis: Making Sense of the Interaction Between Science and Technology in Technology Diffusion[J]. Scientometrics, 2012, 93(2): 459-471.
doi: 10.1007/s11192-012-0691-z
[49]
Huang M H, Yang H W, Chen D Z. Increasing Science and Technology Linkage in Fuel Cells: A Cross Citation Analysis of Papers and Patents[J]. Journal of Informetrics, 2015, 9(2): 237-249.
doi: 10.1016/j.joi.2015.02.001
[50]
Noyons E C M, van Raan A F J, Grupp H, et al. Exploring the Science and Technology Interface: Inventor-Author Relations in Laser Medicine Research[J]. Research Policy, 1994, 23(4): 443-457.
doi: 10.1016/0048-7333(94)90007-8
[51]
Breschi S, Catalini C. Tracing the Links Between Science and Technology: An Exploratory Analysis of Scientists’ and Inventors’ Networks[J]. Research Policy, 2010, 39(1): 14-26.
doi: 10.1016/j.respol.2009.11.004
[52]
Forti E, Franzoni C, Sobrero M. Bridges or Isolates? Investigating the Social Networks of Academic Inventors[J]. Research Policy, 2013, 42(8): 1378-1388.
doi: 10.1016/j.respol.2013.05.003
[53]
Chang S H. A Pilot Study on the Connection Between Scientific Fields and Patent Classification Systems[J]. Scientometrics, 2018, 114(3): 951-970.
doi: 10.1007/s11192-017-2613-6
[54]
Wang G B, Guan J C. Measuring Science-Technology Interactions Using Patent Citations and Author-Inventor Links: An Exploration Analysis from Chinese Nanotechnology[J]. Journal of Nanoparticle Research, 2011, 13(12): 6245-6262.
doi: 10.1007/s11051-011-0549-y
[55]
Maraut S, Martínez C. Identifying Author-Inventors from Spain: Methods and a First Insight into Results[J]. Scientometrics, 2014, 101(1): 445-476.
[56]
Li X, Zhao D Z, Hu X J. Gatekeepers in Knowledge Transfer Between Science and Technology: An Exploratory Study in the Area of Gene Editing[J]. Scientometrics, 2020, 124(2): 1261-1277.
doi: 10.1007/s11192-020-03537-y
(Song Yanhui, Qiu Junping. A Comparative Study of Inventor Bibliographic-Patent Coupling and Inventor-Patent-Classification-Coupling——Non-Practicing Entities as Example[J]. Journal of the China Society for Scientific and Technical Information, 2021, 40(4): 364-374.)
(Bai Rujiang, Leng Fuhai. Knowledge Innovational Evolution Analysis Based on k-clique Community Network[J]. Library and Information Service, 2013, 57(17): 86-94.)
doi: 10.7536/j.issn.0252-3116.2013.17.017
[59]
赖院根. 期刊论文与专利文献的链接研究[J]. 图书情报知识, 2011(1): 63-69.
[59]
(Lai Yuangen. Research on Linking Method Between Periodical Thesis and Patent Literature[J]. Document, Information & Knowledge, 2011(1): 63-69.)
[60]
Qi Y S, Zhu N, Zhai Y J, et al. The Mutually Beneficial Relationship of Patents and Scientific Literature: Topic Evolution in Nanoscience[J]. Scientometrics, 2018, 115(2): 893-911.
doi: 10.1007/s11192-018-2693-y
[61]
Kuhn T, Perc M, Helbing D. Inheritance Patterns in Citation Networks Reveal Scientific Memes[J]. Physical Review X, 2014, 4(4): 041036.
doi: 10.1103/PhysRevX.4.041036
[62]
Sun X L, Ding K. Identifying and Tracking Scientific and Technological Knowledge Memes from Citation Networks of Publications and Patents[J]. Scientometrics, 2018, 116(3): 1735-1748.
doi: 10.1007/s11192-018-2836-1
[63]
Takano Y, Kajikawa Y. Extracting Commercialization Opportunities of the Internet of Things: Measuring Text Similarity Between Papers and Patents[J]. Technological Forecasting and Social Change, 2019, 138: 45-68.
doi: 10.1016/j.techfore.2018.08.008
(Liu Ziqiang, Xu Haiyun, Luo Rui, et al. Research on Scientific and Technological Interaction Patterns Based on Topic Relevance Analysis[J]. Journal of the China Society for Scientific and Technical Information, 2019, 38(10): 997-1011.)
[65]
Xu S, Zhai D S, Wang F F, et al. A Novel Method for Topic Linkages Between Scientific Publications and Patents[J]. Journal of the Association for Information Science and Technology, 2019, 70(9): 1026-1042.
doi: 10.1002/asi.v70.9
(Han Yan, Peng Aidong. Study on Technology Opportunity Identification Based on Elements of Technology Formation: A Case of the Technology of Medical Service Robot[J]. Information Studies: Theory & Application, 2020, 43(5): 156-162.)
(Han Xiaotong, Zhu Donghua, Wang Xuefeng. Research on the Method of Technology Opportunity Discovery Promoted by Science[J]. Library and Information Service, 2022, 66(10): 19-32.)
doi: 10.13266/j.issn.0252-3116.2022.10.002
[68]
Winnink J J, Tijssen R J W. R&D Dynamics and Scientific Breakthroughs in HIV/AIDS Drugs Development: The Case of Integrase Inhibitors[J]. Scientometrics, 2014, 101(1): 1-16.
doi: 10.1007/s11192-014-1330-7
[69]
Li X, Fan M J, Zhou Y, et al. Monitoring and Forecasting the Development Trends of Nanogenerator Technology Using Citation Analysis and Text Mining[J]. Nano Energy, 2020, 71: 104636.
[70]
Xu H Y, Winnink J, Yue Z H, et al. Topic-Linked Innovation Paths in Science and Technology[J]. Journal of Informetrics, 2020, 14(2): 101014.
doi: 10.1016/j.joi.2020.101014
[71]
Ba Z C, Liang Z T. A Novel Approach to Measuring Science-Technology Linkage: From the Perspective of Knowledge Network Coupling[J]. Journal of Informetrics, 2021, 15(3): 101167.
doi: 10.1016/j.joi.2021.101167
(Zhang Nan, Zhao Hui. Identification of Key & Core Technology Innovation Based on Patent and Paper Data in Graphene Field[J]. Chinese High Technology Letters, 2021, 31(8): 892-900.)
(Lu Jiayue, Li Yan. Mining the Cutting Edge Based on Scientific Papers and Patents——A Case Study on Intelligent and Connected Vehicle[J]. China Invention & Patent, 2021, 18(1): 13-20.)
[74]
Ferreira R B, Parreira M R, Nabout J C. Is There Concordance Between Science and Technology in Natural Science? Mapping the Relationship among Number of Papers and Patents from Research on Cerrado Plants[J]. World Patent Information, 2022, 69: 102108.
doi: 10.1016/j.wpi.2022.102108
(Gao Junguo, Zhang Huanhuan, Li Yan. Research on the Development Trends of Detonation Spraying Technology Based on Patents and Papers[J]. Chinese High Technology Letters, 2022, 32(4): 421-429.)