Data Analysis and Knowledge Discovery
Research and Application of Triplet Extraction of Patented Technology for TRIZ
Liu Chunjiang; Li Shuying; Fang Shu; Hu Zhengyin; Qian Li
(Chengdu Library and Information Center, Chinese Academy of Sciences, Chengdu 610041, China) (Department of Information Resources Management, School of Economics and Management, University of Chinese Academy of Sciences, Beijing 100190, China) (National Science Library, Chinese Academy of Sciences, Beijing 100190, China)
Abstract

[Objective] To address the low accuracy and efficiency of automatic patent technology triplet extraction, this paper studies a triplet extraction model that improves the accuracy of personalized, fine-grained, multi-dimensional deep extraction and semantic association.

[Methods] For four technical thematic dimensions (technical problems, solutions, technical functions, and technical effects), this paper proposes an extraction method based on the WeakLabel-Bert-BiGRU-CRF model and evaluates it with indicators such as the macro average.

[Results] Patents on graphene energy storage applications were used as the dataset. Experiments show that, compared with the Bert-BiGRU-CRF model, the proposed method achieves a macro average above 0.8 for triplet extraction, further reduces the data annotation workload, and yields better extraction results.

[Limitations] The proposed model requires domain experts and patent intelligence analysts to annotate data jointly, and differences in annotation quality affect how well it performs in application.

[Conclusions] Based on this model, the authors developed a prototype system to support further use and promotion of the patent technology triplet extraction method. The method also has broad application prospects in knowledge mining of scientific and technological literature.

Key words: TRIZ; triplet extraction; patented technology; WeakLabel-Bert-BiGRU-CRF
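The abstract reports a macro average over the four thematic dimensions without stating which per-class metric is averaged. The sketch below is one plausible reading, not the authors' evaluation code: it macro-averages precision, recall and F1 over exact-match labeled spans. The label strings, the `(span_id, label)` representation and the toy data are all assumptions made for illustration.

```python
from collections import Counter

# The four thematic dimensions named in the abstract; label strings are illustrative.
DIMENSIONS = ["problem", "solution", "function", "effect"]


def macro_average(gold, pred, labels=DIMENSIONS):
    """Return macro-averaged (precision, recall, F1) over the given labels.

    gold and pred are sets of (span_id, label) pairs. A prediction is
    correct only if the identical pair appears in gold (assumed criterion).
    """
    tp, n_pred, n_gold = Counter(), Counter(), Counter()
    for item in pred:
        n_pred[item[1]] += 1
        if item in gold:
            tp[item[1]] += 1
    for item in gold:
        n_gold[item[1]] += 1

    precisions, recalls, f1s = [], [], []
    for label in labels:
        # Per-class scores; an empty denominator is scored as 0.0.
        p = tp[label] / n_pred[label] if n_pred[label] else 0.0
        r = tp[label] / n_gold[label] if n_gold[label] else 0.0
        f = 2 * p * r / (p + r) if p + r else 0.0
        precisions.append(p)
        recalls.append(r)
        f1s.append(f)

    # Macro averaging: every class contributes equally, regardless of its size.
    n = len(labels)
    return sum(precisions) / n, sum(recalls) / n, sum(f1s) / n


# Hypothetical toy annotations, not data from the paper.
gold = {(0, "problem"), (1, "solution"), (2, "function"), (3, "effect"), (4, "effect")}
pred = {(0, "problem"), (1, "solution"), (2, "effect"), (3, "effect"), (4, "effect")}

macro_p, macro_r, macro_f1 = macro_average(gold, pred)
print(round(macro_p, 3), round(macro_r, 3), round(macro_f1, 3))
```

Because each dimension is weighted equally, a dimension that is never predicted correctly (here `function`) lowers the macro score as much as any other, which is why the macro average is a common choice when class sizes are uneven.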
Published: 2024-03-15
CLC number (ZTFLH): TP393, G250
Cite this article:
Liu Chunjiang, Li Shuying, Fang Shu, Hu Zhengyin, Qian Li. Research and Application of Triplet Extraction of Patented Technology for TRIZ [J]. Data Analysis and Knowledge Discovery, DOI: 10.11925/infotech.2096-3467.2023.0492.
Link to this article:
https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.2096-3467.2023.0492      or      https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y0/V/I/1