New Technology of Library and Information Service  2010, Vol. 26 Issue (9): 67-73    DOI: 10.11925/infotech.1003-3513.2010.09.11
Design and Implementation of Data Integration over Heterogeneous Patent Sources
Zhai Dongsheng, He Wenhui
School of Economics and Management, Beijing University of Technology, Beijing 100124, China

To address common weaknesses in the data underlying patent analysis, namely reliance on a single data source, coarse preprocessing, and shallow data mining, this paper designs and implements data integration over heterogeneous patent sources. A local patent database, populated with data acquired from heterogeneous sources covering two organizations and seven countries, serves as the base data source. After data cleaning and data transformation with the SSIS tool, the data from the local database are loaded into a data warehouse built around key performance indicators, which provides data support for more advanced analysis.
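The clean, transform, and load flow described in the abstract can be illustrated with a minimal sketch. The paper itself uses SSIS; the Python and SQLite code below is only a stand-in under stated assumptions. The sample records, the field names (`pub_no`, `applicant`, `filed`, `source`), the `fact_patent` table, and the "patents per filing year" indicator are all hypothetical and do not come from the paper.

```python
import sqlite3
from datetime import datetime

# Hypothetical raw rows standing in for a local patent database filled from
# heterogeneous sources. All field names and values are invented.
RAW_PATENTS = [
    {"pub_no": " US1234567 ", "applicant": "acme corp.", "filed": "2009-03-01", "source": "USPTO"},
    {"pub_no": "EP7654321", "applicant": "ACME Corp.", "filed": "01/04/2009", "source": "EPO"},
    {"pub_no": "US1234567", "applicant": "Acme Corp.", "filed": "2009-03-01", "source": "USPTO"},
]


def clean(record):
    """Data cleaning: trim whitespace and normalize applicant casing."""
    return {
        "pub_no": record["pub_no"].strip(),
        "applicant": record["applicant"].strip().upper(),
        "filed": record["filed"].strip(),
        "source": record["source"].strip(),
    }


def transform(record):
    """Data transformation: unify date formats and derive the filing year."""
    raw = record["filed"]
    for fmt in ("%Y-%m-%d", "%d/%m/%Y"):  # assumed source formats
        try:
            parsed = datetime.strptime(raw, fmt)
            break
        except ValueError:
            continue
    else:
        raise ValueError(f"unrecognized date: {raw!r}")
    return {**record, "filed": parsed.date().isoformat(), "year": parsed.year}


def load(conn, records):
    """Load into a hypothetical warehouse table, skipping duplicate numbers."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS fact_patent ("
        "pub_no TEXT PRIMARY KEY, applicant TEXT, filed TEXT, "
        "year INTEGER, source TEXT)"
    )
    conn.executemany(
        "INSERT OR IGNORE INTO fact_patent "
        "VALUES (:pub_no, :applicant, :filed, :year, :source)",
        records,
    )
    conn.commit()


def run_etl(conn, raw_records):
    """Run clean -> transform -> load, then query one example indicator."""
    load(conn, [transform(clean(r)) for r in raw_records])
    return conn.execute(
        "SELECT year, COUNT(*) FROM fact_patent GROUP BY year ORDER BY year"
    ).fetchall()


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    print(run_etl(connection, RAW_PATENTS))
```

Deduplication is pushed into the load step through the primary key on `pub_no`, so the same publication arriving twice is stored once. A real integration over several patent offices would likely need a more careful matching rule than exact equality of publication numbers.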

Key words: Patent information; Data integration; Data warehouse; ETL; Data cleaning; Data transformation
Received: 28 June 2010      Published: 26 October 2010

Cite this article:

Zhai Dongsheng, He Wenhui. Design and Implementation of Data Integration over Heterogeneous Patent Sources. New Technology of Library and Information Service, 2010, 26(9): 67-73.


