New Development of Automatic Metadata Extraction
Zeng Su1,2, Ma Jianxia1, Zhang Xiuxiu1
1 (The Lanzhou Branch of the National Science Library, Chinese Academy of Sciences, Lanzhou 730000, China)
2 (Graduate University of Chinese Academy of Sciences, Beijing 100049, China)

Abstract: This paper analyses the practical demand for automatic metadata extraction, reviews related research, and compares three typical automatic metadata extractors: DROID, the NLNZ Metadata Extractor, and Metadata Miner Catalogue PRO. After discussing the current limitations of automatic metadata extraction, the authors summarize the state of the technology and predict how it will develop.
Received: 17 December 2007
Published: 25 April 2008
Corresponding author: Zeng Su
E-mail: zengs@mail.las.ac.cn
About the authors: Zeng Su, Ma Jianxia, Zhang Xiuxiu