Please wait a minute...
New Technology of Library and Information Service  2010, Vol. 26 Issue (9): 67-73    DOI: 10.11925/infotech.1003-3513.2010.09.11
article Current Issue | Archive | Adv Search |
Design and Implementation of Data Integration over Heterogeneous Patent Sources
Zhai Dongsheng, He Wenhui
School of Economics and Management, Beijing University of Technology, Beijing 100124, China
Download: PDF(960 KB)   HTML  
Export: BibTeX | EndNote (RIS)      

With consideration of the problems concerning the data of patent analysis, such as single data source, rough pretreatment, and low-level data mining, this paper designs and achieves the data integration over heterogeneous patent sources. Specifically, the local patent database where the data are acquired from heterogeneous sources including two organizations and seven countries is regarded as basic data source. After using the SSIS tool for data cleaning and data transformation, the data from local database are loaded into data warehouse that is built according to the key performance indicators, which provides data support for more advantaged analysis.

Key wordsPatent      information      Data      integration      Data      warehouse      ETL      Data      cleaning      Data      transformation     
Received: 28 June 2010      Published: 26 October 2010
:  null  

Cite this article:

Zhai Dongsheng, He Wenhui. Design and Implementation of Data Integration over Heterogeneous Patent Sources. New Technology of Library and Information Service, 2010, 26(9): 67-73.

URL:     OR

[1] 张静,刘细文,柯贤能,等.国内外专利分析工具功能比较研究
[J]. 情报理论与实践 ,2008,31(1):141-145

[2] 张伟琼.专利信息采集及分析系统设计与开发 . 杭州:浙江大学,2008.

[3] 王志.基于本体的异构数据源集成的研究及应用 .苏州:苏州大学,2009.

[4] 严小泉.基于XML的异构数据源集成系统框架和关键技术研究 .无锡:江南大学,2009.

[5] 郑娅峰.异构数据集成的研究与实现 .西安:西北大学,2005.

[6] 刘晨.专利信息获取与分析系统关键技术研究 .北京:北京工业大学,2009.

[7] 岳泉,晏一平.中外专利信息网络检索工具的比较研究
[J]. 图书馆情报工作 ,2005,49 (9):84-88.

[8] Wiederhold G. Mediators in the Architecture of Future Information Systems
[J]. IEEE Computer,1992, 25(3):38-49.

[9] 还书国,邱海霞.Web信息抽取的研究
[J]. 消费导刊:理论版, 2008 (12):172.

[10] 官建成,王刚波.技术领域优势的国际比较研究
[J]. 科学学研究, 2008,26(1):90-97.

[11] Fabry B, Ernst H, Langholz J,et al.Patent Portfolio Analysis as a Useful Tool for Identifying R&D and Business Opportunities - An Empirical Application in the Nutrition and Health Industry
[J]. World Patent Information, 2006,28 (3):215-225.

[1] Beibei Kong,Jing Xie,Li Qian,Zhijun Chang,Zhenxin Wu. Methodology and Tools to Enrich Sci-Tech Big Data[J]. 数据分析与知识发现, 2019, 3(7): 113-122.
[2] Ke Li,Yuya Sasaki. Analyzing Sentiment Distribution with Spatial-textual Data of Multi-dimensional Clustering[J]. 数据分析与知识发现, 2019, 3(7): 14-22.
[3] Yong Zhang,Shuqing Li,Yongshang Cheng. Mining Algorithm for Weighted Association Rules Based on Frequency Effective Length[J]. 数据分析与知识发现, 2019, 3(7): 85-93.
[4] Xiaozhou Dong,Xinkang Chen. E-Coupon and Economic Performance of E-commerce[J]. 数据分析与知识发现, 2019, 3(6): 42-49.
[5] Jing Shi,Chenlu Li,Yuxing Qian,Liqin Zhou,Bin Zhang. Information Needs of Domestic and International HCQA Users ——An Empirical Analysis[J]. 数据分析与知识发现, 2019, 3(5): 1-10.
[6] Cheng Zhou,Hongqin Wei. Evaluating and Classifying Patent Values Based on Self-Organizing Maps and Support Vector Machine[J]. 数据分析与知识发现, 2019, 3(5): 117-124.
[7] Jinzhu Zhang,Yiming Hu. Extracting Titles from Scientific References in Patents with Fusion of Representation Learning and Machine Learning[J]. 数据分析与知识发现, 2019, 3(5): 68-76.
[8] Shijie Song,Yuxiang Zhao,Wenting Han,Qinghua Zhu. The Inhibition Effect of Health Literacy on Health Risk Under the Internet Environment: An Empirical Study of Chronic Diseases Based on CHNS Data[J]. 数据分析与知识发现, 2019, 3(4): 13-21.
[9] Quan Lu,Anqi Zhu,Jiyue Zhang,Jing Chen. Research on User Information Requirement in Chinese Network Health Community: Taking Tumor-forum Data of Qiuyi as an Example[J]. 数据分析与知识发现, 2019, 3(4): 22-32.
[10] Dongmei Mu,Hui Fa,Ping Wang,Jing Sun. Research on Disease Risk Factors on Structural Equation Model[J]. 数据分析与知识发现, 2019, 3(4): 80-89.
[11] Lianjie Xiao,Mengrui Gao,Xinning Su. An Under-sampling Ensemble Classification Algorithm Based on Fuzzy C-Means Clustering for Imbalanced Data[J]. 数据分析与知识发现, 2019, 3(4): 90-96.
[12] Xuhui Li,Yang Liu. Review of Spatio-temporal Data Modeling Methods[J]. 数据分析与知识发现, 2019, 3(3): 1-13.
[13] Zhiqiang Liu,Yuncheng Du,Shuicai Shi. Extraction of Key Information in Web News Based on Improved Hidden Markov Model[J]. 数据分析与知识发现, 2019, 3(3): 120-128.
[14] Yue Yuan,Dongbo Wang,Shuiqing Huang,Bin Li. The Comparative Study of Different Tagging Sets on Entity Extraction of Classical Books[J]. 数据分析与知识发现, 2019, 3(3): 57-65.
[15] Xiwei Wang,Duo Wang,Qingxiao Zheng,Ya’nan Wei. Information Interaction Between User and Enterprise in Online Brand Community: A Study of Virtual Reality Industry[J]. 数据分析与知识发现, 2019, 3(3): 83-94.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938