New Technology of Library and Information Service  2008, Vol. 24 Issue (1): 21-32    DOI: 10.11925/infotech.1003-3513.2008.01.03
Migration to Intermediate XML for Electronic Data (MIXED)
DIRK ROORDA
Data Archiving and Networked Services (DANS), NL
Abstract  

MIXED is a digital preservation project. It uses a strategy of converting data to intermediate XML. In this paper we position this strategy with respect to the well-known emulation and migration strategies. Then we detail the MIXED strategy and explain why it is an optimized, economical way of migration. Finally, we describe how DANS is implementing a software tool that can perform the migrations needed for this strategy.

Key words: Digital preservation; Migration; File formats; Research data; Software development
Published: 25 January 2008
Corresponding Authors: DIRK ROORDA     E-mail: dirk.roorda@dans.knaw.nl
About author: DIRK ROORDA

Cite this article:

DIRK ROORDA. Migration to Intermediate XML for Electronic Data (MIXED). New Technology of Library and Information Service, 2008, 24(1): 21-32.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2008.01.03     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2008/V24/I1/21

