Please wait a minute...
Advanced Search
现代图书情报技术  2012, Vol. Issue (11): 53-59     https://doi.org/10.11925/infotech.1003-3513.2012.11.09
  知识组织与知识管理 本期目录 | 过刊浏览 | 高级检索 |
专利文献中新技术术语识别研究
谷俊
宝山钢铁股份有限公司 上海 201900
Study on New Technology Detection in Patents Documents
Gu Jun
Baoshan Iron and Steel Co., Ltd., Shanghai 201900, China
全文: PDF (611 KB)   HTML  
输出: BibTeX | EndNote (RIS)      
摘要 主要介绍从中文专利文本中识别新技术术语的方法。利用ICTCLAS分词系统和停用词表抽取文档词元,通过改进的TFIDF模型计算词元权重并筛选出热点词元,再通过词间距测算对热点词元按顺序进行组配,经权重计算和阈值筛选后得到术语集,由专家人工判定识别出有效的新技术术语。最后给出应用实例并进行分析,验证该方法的有效性。
服务
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章
谷俊
关键词 技术生命周期术语识别热点词元    
Abstract:This paper promotes a method which detecting new technology term from the texts of Chinese patents. Firstly, the element of terms in patents are extracted by ICTCLAS segmentation system and stop words lists. Then the hot elements of terms are filtered based on terms weights computing by improved TFIDF model. Secondly, the hot elements of terms are combined orderly by computing the distance between two words, and obtain the terms collection by terms weights computing and threshold filtering. The valid new technology terms are detected by the experts artificially. Finally, the availability of the method is proved through the applied example.
Key wordsTechnology life cycle    Term detection    Hot elements of terms
收稿日期: 2012-11-07      出版日期: 2013-02-06
:  TP391  
通讯作者: 谷俊     E-mail: jungu@yahoo.cn
引用本文:   
谷俊. 专利文献中新技术术语识别研究[J]. 现代图书情报技术, 2012, (11): 53-59.
Gu Jun. Study on New Technology Detection in Patents Documents. New Technology of Library and Information Service, 2012, (11): 53-59.
链接本文:  
https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.1003-3513.2012.11.09      或      https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2012/V/I11/53
[1] 邵波. 企业竞争与反竞争情报中的专利分析研究[J]. 情报科学, 2006, 24(2):235-238.(Shao Bo.The Patent Analysis of Enterprise Competitive Intelligence and Counterintelligence[J].Information Science,2006,24(2):235-238)
[2] 专利分析系统:专利生命周期评价模型[EB/OL].[2011-08-02]. http://www.iprtop.com/pages/view/fn/fxxt_7/.(Patent Analyze System:Patent Lifecycle Evaluation Model[EB/OL].[2011-08-02].http://www.iprtop.com/pages/view/fn/fxxt_7/.)
[3] Zhang K, Zi J, Wu L G. New Event Detection Based on Indexing-Tree and Named Entity[C]. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007: 215-222.
[4] Chanlekha H, Collier N. A Methodology to Enhance Spatial Understanding of Disease Outbreak Events Reported in News Articles[J]. International Journal of Medical Informatics, 2010,79(4):284-296.
[5] Ramadan Q H, Mohd M. A Review of Retrospective News Event Detection[C].In: Proceedings of the International Conference on Semantic Technology and Information Retrieval (STAIR).2011: 209-214.
[6] Sun A, Hu M. Query-guided Event Detection from News and Blog Streams[J]. IEEE Transactions on Systems, Man and Cybernetics—Part A: Systems and Humans, 2011,41(5): 834-839.
[7] Tu Y N, Seng J L. Indices of Novelty for Emerging Topic Detection[J]. Information Processing & Management,2012,48(2): 303-325.
[8] Dai X, He Y, Sun Y. A Two-layer Text Clustering Approach for Retrospective News Event Detection[C]. In: Proceedings of the 2010 International Conference on Artificial Intelligence and Computational Intelligence (AICI). 2010: 364-368.
[9] 张阔,李涓子,吴刚,等. 基于词元再评估的新事件检测模型[J]. 软件学报, 2008, 19(4): 817-828.(Zhang Kuo, Li Juanzi, Wu Gang, et al.A New Event Detection Model Based on Term Reweighting[J].Journal of Software,2008, 19(4): 817-828.)
[10] 洪宇,张宇,范基礼,等. 基于子话题分治匹配的新事件检测[J]. 计算机学报,2008, 31(4): 687-695.(Hong Yu, Zhang Yu, Fan Jiji,et al.New Event Detection Based on Division Comparison of Subtopic[J].Chinese Journal of Computers,2008, 31(4): 687-695.)
[11] 贾自艳,何清,张海俊,等. 一种基于动态进化模型的事件探测和追踪算法[J]. 计算机研究与发展,2004, 41(7): 1273-1280.(Jia Ziyan,He Qing,Zhang Haijun,et al. A News Event Detection and Tracking Algorithm Based on Dynamic Evolution Model[J]. Journal of Computer Research and Development,2004, 41(7): 1273-1280.)
[12] 姚占雷,许鑫. 互联网新闻报道中的突发事件识别研究[J]. 现代图书情报技术,2011(4): 52-57.(Yao Zhanlei, Xu Xin.Research on the Detection of Sudden Events in News Stories of Online Information[J]. New Technology of Library and Information Service,2011(4): 52-57.)
[13] 陈伟,张成,王灿,等.新闻数据流的在线事件检测[J]. 浙江大学学报:工学版,2011,45(6):1006-1012.(Chen Wei,Zhang Cheng,Wang Can,et al.Online Event Detection in News Stream[J]. Journal of Zhejiang University:Engineering Science,2011,45(6);1006-1012.)
[14] 国内外三种专利申请受理状况总累计表[EB/OL].[2011-07-22]. http://www.sipo.gov.cn/sipo2008/ghfzs/zltj/zljb/201101/t20110110_562647.html.(Three Kinds of Patents Received Total Cumulative Table[EB/OL].[2011-07-22]. http://www.sipo.gov.cn/sipo2008/ghfzs/zltj/zljb/201101/t20110 110_562647.html.)
[15] ICTCLAS简介[EB/OL].[2011-06-10]. http://ictclas.org/ictclas_feature.html.(Introduction to ICTCLAS[EB/OL].[2011-06-10]. http://ictclas.org/ictclas_feature.html.)
[16] Kyoto Protocol to the United Nations Framework Convention on Climate Change[EB/OL].[2011-08-12]. http://unfccc.int/resource/docs/convkp/kpeng.html.
[1] 侯剑华,刘盼. 专利技术系统演化的技术熵测度模型与实证研究 *[J]. 数据分析与知识发现, 2019, 3(8): 21-29.
[2] 何远标, 乐小虬, 张帆. 学术论文大纲中关键术语抽取方法研究[J]. 现代图书情报技术, 2014, 30(3): 73-79.
[3] 叶春蕾, 冷伏海. 科技文献全文主题识别方法实证研究[J]. 现代图书情报技术, 2012, 28(1): 53-57.
[4] 姚占雷, 许鑫. 互联网新闻报道中的突发事件识别研究[J]. 现代图书情报技术, 2011, 27(4): 52-57.
[5] 许德山, 张智雄, 王峰, 邢美凤. 上下文分析与统计特征相结合的英文术语抽取研究[J]. 现代图书情报技术, 2010, 26(12): 28-33.
[6] 刘建华,张智雄,徐健,许雁冬. 自动术语识别——对科技文献进行文本挖掘的重要技术方法*[J]. 现代图书情报技术, 2008, 24(8): 12-17.
[7] 岑咏华,韩哲,季培培. 基于隐马尔科夫模型的中文术语识别研究[J]. 现代图书情报技术, 2008, 24(12): 54-58.
[8] 王昊 . 基于层次模式匹配的命名实体识别模型[J]. 现代图书情报技术, 2007, 2(5): 62-68.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
版权所有 © 2015 《数据分析与知识发现》编辑部
地址:北京市海淀区中关村北四环西路33号 邮编:100190
电话/传真:(010)82626611-6626,82624938
E-mail:jishu@mail.las.ac.cn