Please wait a minute...
New Technology of Library and Information Service  2008, Vol. 24 Issue (4): 7-11    DOI: 10.11925/infotech.1003-3513.2008.04.02
article Current Issue | Archive | Adv Search |
New Development of Automatic Metadata Extraction
Zeng Su1,2  Ma JianxiaZhang Xiuxiu1
1 (The Lanzhou Branch of the National Science Library, Chinese Academy of Sciences, Lanzhou 730000,China)
2 (Graduate University of Chinese Academy of Sciences, Beijing 100049,China)
Download: PDF(333 KB)   HTML  
Export: BibTeX | EndNote (RIS)      
Abstract  

 This paper analyses realistic demands of automatic metadata extraction, elaborates related research on automatic metadata extraction and compares three typical automatic extractors of metadata, including DROID, NLNZ Metadata Extractor and Metadata Miner Catalogue PRO. On the basis of discussing present limitations of automatic metadata extraction, the authors give a summary and prediction of this technology.

Key wordsMetadata      Automatic extraction      Extractor     
Received: 17 December 2007      Published: 25 April 2008
: 

G250.76

 
Corresponding Authors: Zeng Su     E-mail: zengs@mail.las.ac.cn
About author:: Zeng Su,Ma Jianxia,Zhang Xiuxiu

Cite this article:

Zeng Su,Ma Jianxia,Zhang Xiuxiu. New Development of Automatic Metadata Extraction. New Technology of Library and Information Service, 2008, 24(4): 7-11.

URL:

http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2008.04.02     OR     http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2008/V24/I4/7

[1] Dublin Core Metadata Editor[EB/OL].[2007-11-08].http://www.ukoln.ac.uk/metadata/dcdot/.
[2] Liu Y, Bai K, Mitra P, et al. TableSeer: Automatic Table Metadata Extraction and Searching in Digital Libraries[EB/OL]. [2007-11-10]. http://delivery.acm.org/10.1145/1260000/1255193/p91-liu.pdf?key1=1255193&key2=9007077911&coll=GUIDE&dl=GUIDE&CFID=9677192&CFTOKEN=66821516.
[3] Day M Y, Tsai R T, Sung C L, et al. Reference Metadata Extraction Using a Hierarchical Knowledge Representation Framework[J]. Decision Support Systems, 2007(43): 152-167.
[4] Cortezl E, da Silval A S, Goncalves M A, et al. FLUX-CIM: Flexible Unsupervised Extraction of Citation Metadata[EB/OL]. [2007-12-18]. http://delivery.acm.org/10.1145/1260000/1255219/p215-cortez.pdf?key1=1255219&key2=9296088911&coll=GUIDE&dl=GUIDE&CFID=10613840&CFTOKEN=55320929/.
[5] Hu Y H, Li H, Cao Y B, et al. Automatic Extraction of Titles from General Documents Using Machine Learning[J]. Information Processing and Management , 2006,42(1):1276-1293.
[6] 贺亚锋. Web站点元数据自动生成工具介绍[J]. 图书馆杂志, 2001,20(1): 28-30.
[7] Xue Y W,  Hu Y H, Xin G M, et al. Web Page Title Extraction and Its Application[J]. Information Processing and Management, 2007 (43): 1332-1347.
[8] Yu J D,  Fan X Z. Metadata Extraction from Chinese Research Papers Based on Conditional Random Fields[EB/OL]. [2007-12-01]. http://210.37.44.253/nc2007/fskd2007/data/Volume%201/105-1-Chinese%20Research%20Papers.pdf.
[9] 李朝光, 张铭, 邓志鸿, 等. 论文元数据信息的自动抽取[J]. 计算机工程与应用, 2002,38(21): 189-191,235.
[10] DROID[EB/OL].[2007-11-22].http://droid.sourceforge.net/wiki/index.php/Introduction.
[11] Metadata Extraction Tool[CP/OL].[2007-12-03].http://sourceforge.net/projects/meta-extractor/.
[12] Nation Library of New Zealand.[2007-12-05].http://www.natlib.govt.nz/about-us/current-initiatives/metadata-extraction-tool/.
[13] Catalogue PRO[EB/OL]. [2007-12-08]. http://peccatte.karefil.com/software/Catalogue/catalogueDK.htm/.
[14] Main Features of Catalogue[EB/OL]. [2007-12-10].http://peccatte.karefil.com/software/Catalogue/CatalogueENG.htm/.
[15] Implementing the PREMIS Data Dictionary: A Survey of Approaches[EB/OL]. [2007-12-16]. http://www.loc.gov/standards/premis/implementation-report-woodyard.pdf/.

[1] Jinzhu Zhang,Yiming Hu. Extracting Titles from Scientific References in Patents with Fusion of Representation Learning and Machine Learning[J]. 数据分析与知识发现, 2019, 3(5): 68-76.
[2] Lin Jiang,Dongbo Wang. Automatically Detecting and Tagging Foreign Language Citation Metadata[J]. 数据分析与知识发现, 2017, 1(1): 47-54.
[3] Liu Qingxiang,Zhang Pengzhu,Zhang Xiaoyan,Liu Jingfang. Automatically Extracting Talents’ Knowledge Structure Online[J]. 现代图书情报技术, 2016, 32(4): 56-63.
[4] Qianqian Yu,Jianyong Zhang. Practices of NSTL Integrating and Using Third-party Metadata[J]. 现代图书情报技术, 2016, 32(1): 97-102.
[5] Liu Feng, Zhang Xiaolin. Review on the Scientific Metadata Standards and Research on Its Generic Design[J]. 现代图书情报技术, 2015, 31(12): 3-12.
[6] Wang Hui, Michael Witt, Dou Tianfang. Purdue University Research Repository and Scientific Data Management Services Based on PURR[J]. 现代图书情报技术, 2015, 31(1): 9-16.
[7] Tan Xueqing, He Shan. Research Review on Music Personalized Recommendation System[J]. 现代图书情报技术, 2014, 30(9): 22-32.
[8] Zeng Wen,Xu Shuo,Zhang Yunliang,Zhai Juanhua. The Research and Analysis on Automatic Extraction of Science and Technology Literature Terms[J]. 现代图书情报技术, 2014, 30(1): 51-55.
[9] Cheng Yanyan. Comparative Research on International Electronic Records Metadata Packaging Methods—VEO and METS[J]. 现代图书情报技术, 2011, 27(10): 7-11.
[10] Zhou Jing, Zhao Ying, Yang Xin. CWM-based ETL Metadata System Model Design[J]. 现代图书情报技术, 2011, 27(1): 88-93.
[11] Shen Yunyun, Xiao Long, Feng Ying. Study on General Metadata Application Rules for Digital Library[J]. 现代图书情报技术, 2010, 26(12): 1-8.
[12] Zhang Chunhong, Tang Yong, Shao Ke. Digitalization Standards and the Applications of Objects Resources[J]. 现代图书情报技术, 2010, 26(12): 9-14.
[13] Zhou Yutao, Fan Guoyin. Automatically Generating Program for OAI-METS Metadata of Dissertation[J]. 现代图书情报技术, 2010, 26(10): 91-94.
[14] Han Ying,Zhu Zhongming. Research Progress and Application of Contextual Metadata for Digital Object[J]. 现代图书情报技术, 2009, 25(6): 24-30.
[15] Chen Quan,Yang Xiaojiang. Design and Implementation of a Management System for Digital Resource Collection[J]. 现代图书情报技术, 2009, 25(5): 86-91.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn