Please wait a minute...
New Technology of Library and Information Service  2009, Vol. Issue (10): 62-66    DOI: 10.11925/infotech.1003-3513.2009.10.11
article Current Issue | Archive | Adv Search |
System Design and Implementation of University Laboratory Web Information Extraction Based on Rules
Hua BolinGuo Jiang2
1(Institute of Scientific and Technical Information of China, Beijing 100038, China)
2(Beijing Used Vehicle Trading Market Inc., Beijing 100070, China)
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  

This paper summarizes the laboratory information characters based on analysis of university laboratory Web information, which is used to formulate rules of laboratory Web information.It designs an information extraction system on university laboratory, and presents system architecture and technical architecture of labIE. It also describes the design of rules on table recognition and methodology of constructing characteristic predicate.

Key wordsLab Web information      Information extraction      Rules      Characteristic predicate      Web page structure     
Received: 21 May 2009      Published: 25 October 2009
: 

 

 
  TP391

 
Corresponding Authors: Hua Bolin     E-mail: huabolin@istic.ac.cn
About author:: Hua Bolin,Guo Jiang

Cite this article:

Hua Bolin,Guo Jiang. System Design and Implementation of University Laboratory Web Information Extraction Based on Rules. New Technology of Library and Information Service, 2009, (10): 62-66.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2009.10.11     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2009/V/I10/62

[1] Muggleton S, Building W, Road P. Inverse Entailment and Progol[J]. New Generation Computing (Special issue on Inductive Logic Programming),1995, 13(3-4):245-286.
[2] DeJong G. An Overview of the FRUMP System[C]. In: Proceedings of Strategies for Natural Language Processing. Hillsdale, NJ:Lawrence Erlbaum Associates, 1982:149-176.
[3] Grishman R, Sundheim B. Message Understanding Conference-6: A Brief History[C].In: Proceedings of the 16th International Conference on Computational Linguistics.1996:466-471. 
[4]Automatic Content Extraction(ACE) Evaluation[EB/OL].[2009-02-20]. http://www.itl.nist.gov/iad/mig/tests/ace/.
[5] Grishman R,Huttunen S,Yangarber R.Information Extraction for Enhanced Access to Disease Outbreak Reports[J].Journal of Biomedical Informatics,2002,35(4):236-246.
[6] 陆伟, 韩曙光, 袁泽林,等.信息检索实验[M].武汉:武汉大学出版社,2008:6-26.
[7] 叶娜.面向信息抽取的文本预处理和规则自动学习技术研究[D].沈阳:东北大学,2005.
[8] 车万翔,刘挺,李生.实体关系自动抽取[J].中文信息学报,2005,19(2):1-6.
[9] HTMLParser的两种使用[EB/OL].[2008-12-21]. http://jansener.javaeye.com/blog/205883.

[1] Tan Ying, Tang Yifei. Extracting Citation Contents with Coreference Resolution[J]. 数据分析与知识发现, 2021, 5(8): 25-33.
[2] Li Tiejun,Yan Duanwu,Yang Xiongfei. Recommending Microblogs Based on Emotion-Weighted Association Rules[J]. 数据分析与知识发现, 2020, 4(4): 27-33.
[3] Wei Wei,Guo Chonghui,Xing Xiaoyu. Annotating Knowledge Points & Recommending Questions Based on Semantic Association Rules[J]. 数据分析与知识发现, 2020, 4(2/3): 182-191.
[4] Wang Yi,Shen Zhe,Yao Yifan,Cheng Ying. Domain-Specific Event Graph Construction Methods:A Review[J]. 数据分析与知识发现, 2020, 4(10): 1-13.
[5] Tao Yue,Yu Li,Zhang Runjie. Active Learning Strategies for Extracting Phrase-Level Topics from Scientific Literature[J]. 数据分析与知识发现, 2020, 4(10): 134-143.
[6] Yong Zhang,Shuqing Li,Yongshang Cheng. Mining Algorithm for Weighted Association Rules Based on Frequency Effective Length[J]. 数据分析与知识发现, 2019, 3(7): 85-93.
[7] Zhiqiang Liu,Yuncheng Du,Shuicai Shi. Extraction of Key Information in Web News Based on Improved Hidden Markov Model[J]. 数据分析与知识发现, 2019, 3(3): 120-128.
[8] Qiang Lu,Zhenfang Zhu,Fuyong Xu,Qiangqiang Guo. Chinese Sentiment Classification Method with Bi-LSTM and Grammar Rules[J]. 数据分析与知识发现, 2019, 3(11): 99-107.
[9] Chengzhi Zhang,Zheng Li. Extracting Sentences of Research Originality from Full Text Academic Articles[J]. 数据分析与知识发现, 2019, 3(10): 12-18.
[10] Mu Dongmei,Jin Shan,Ju Yuanhong. Finding Association Between Diseases and Genes from Literature Abstracts[J]. 数据分析与知识发现, 2018, 2(8): 98-106.
[11] Wei Xing,Hu Dehua,Yi Minhan,Zhu Qizhen,Zhu Wenjie. Extracting Disease-Gene-Drug Correlations Based on Data Cube[J]. 数据分析与知识发现, 2017, 1(10): 94-104.
[12] Li Xiaoying,Xia Guanghui,Li Danya. Finding Semantic Relations Among Subject Indexed Papers[J]. 现代图书情报技术, 2016, 32(7-8): 87-93.
[13] Guangce Ruan, Lei Xia. Mining Document Topics Based on Association Rules[J]. 数据分析与知识发现, 2016, 32(12): 50-56.
[14] Yufeng Duan,Sisi Huang. Information Extraction from Chinese Plant Species Diversity Description Text[J]. 现代图书情报技术, 2016, 32(1): 87-96.
[15] Du Siqi, Li Honglian, Lv Xueqiang. Research of Chinese Chunk Parsing in Application of the Product Feature Extraction[J]. 现代图书情报技术, 2015, 31(9): 26-30.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn