|
|
Optimizing Extraction of Science Documents’ Metadata in PDF Format Based on XSLT |
Chen Junlin Zhang Wende |
(Library of Fuzhou Uninversity, Fuzhou 350002, China) |
|
|
Abstract This paper firstly introduces a format transforming tool and XSLT which is the language used to produce extraction rules, then simply analyses the middle documents generated from PDF to HTML. Thirdly, discusses the problem of metadata existed in the science documents in PDF format, finally gives the methods to solve this problem.
|
Received: 10 November 2006
Published: 25 February 2007
|
|
Corresponding Authors:
Chen Junlin
E-mail: bluesea_cc@163.com
|
About author:: Chen Junlin,Zhang Wende |
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|