摘要Is there such a thing as too much data? If not, who is going to be responsible for selecting what we keep? There is only starting to be a profession of data curation. Data curation will need at least three skills: expertise from library, archive and museum studies about choosing, preserving and explaining to users; expertise from computer science and engineering about data processing, data exploration and data storage methods; and expertise from the subject area of the material, so as to know what the data means, where it came from, and what its significance is. Will we do this work with a committee, or train one person to do everything; and if the latter, is that person likely to start from the library, computing, or subject domain?
Abstract:Is there such a thing as too much data? If not, who is going to be responsible for selecting what we keep? There is only starting to be a profession of data curation. Data curation will need at least three skills: expertise from library, archive and museum studies about choosing, preserving and explaining to users; expertise from computer science and engineering about data processing, data exploration and data storage methods; and expertise from the subject area of the material, so as to know what the data means, where it came from, and what its significance is. Will we do this work with a committee, or train one person to do everything; and if the latter, is that person likely to start from the library, computing, or subject domain?
Michael Lesk. Curators of the Future[J]. 现代图书情报技术, 2013, 29(3): 1-7.
Michael Lesk. Curators of the Future. New Technology of Library and Information Service, 2013, 29(3): 1-7.
[1] Hey T, Tansley S, Tolle K. The Fourth Paradigm[M]. Microsoft Research, 2009. [2] Kouzes R T, Elbert S T, Anderson G A. The Changing Paradigm of Data-Intensive Computing[J]. IEEE Computer, 2009, 42(1): 26-34. [3] Taking a Closer Look at LHC: LHC Trigger[EB/OL]. http://www.lhc-closer.es/php/index.php?i=1&s=3&p=13&e=0. [4] The National Archives. Acquisition and Disposition Strategy[EB/OL]. http://www.nationalarchives.gov.uk/information-management/projects-and-work/acquisition-disposition-strategy.htm. [5] Bureau of Labor Statistics. National Occupational Employment and Wage Estimates [EB/OL]. http://www.bls.gov/oes/current/oes_nat.htm#15-0000. [6] Indeed.com[OL]. http://www.indeed.com/salary?q1=Digital+Curator&l1= . [7] Hodges M. Structural Data Falls Under a Woman’s Influence[EB/OL]. http://woodforthetrees.wordpress.com/2010/03/24/structural-data-falls-under-a-woman%E2%80%99s-influence/. [8] Kemball A J, Cornwell T J. A Simple Model of Software Costs for the Square Kilometer Array[J]. Experimental Astronomy, 2004(17): 317-327. [9] Beagrie N, Lavoie B, Woollard M. Keeping Research Data Safe: Phase 2[EB/OL]. http://www.jisc.ac.uk/media/documents/publications/reports/2010/keepingresearchdatasafe2.pdf. [10] Holzner A, Igo-Kemenes P, Mele S. Data Preservation, Re-use and (open) Access: A Case Study in High-energy Physics[EB/OL].(2009-06-21). http://www.parse-insight.eu/downloads/PARSEInsight_event200909_casestudy_HEP.pdf. [11] Chandras C, Weaver T, Zouberakis M, et al. Models for Financial Sustainability of Biological Databases and Resources[J]. Database, 2009, doi:10.1093/database/bap017. [12] Accomazzi A, Henneken E, Erdmann C, et al. Telescope Bibliographies: An Essential Component of Archival Data Management and Operations[C]. In: Proceedings of SPIE Conference on Astronomical Telescopes and Instrumentation. 2012. [13] Fang W. Using Google Analytics for Improving Library Website Content and Design: A Case Study[J/OL]. Library Philosophy and Practice, 2007. http://unllib.unl.edu/LPP/fang.htm. [14] National Climatic Data Center. What is NCDC?[EB/OL]. http://www.ncdc.noaa.gov/oa/about/whatisncdc.html.