Abstract
Results are presented from the Data Curation Profiles project on who is willing to share what data, with whom and when. Scientists’ discussions of sharing reveal several dimensions of variation, both in what it means ‘to share’ and in how sharing is carried out. This research indicates that data curation services will need to accommodate a wide range of subdisciplinary data characteristics and sharing practices. As part of a larger set of strategies emerging across academic institutions, institutional repositories (IRs) will contribute to the stewardship and mobilization of scientific research data for e-Research and learning. Particular types of data can be managed well in an IR context when their characteristics and the associated practices are well understood. Findings from this study elucidate scientists’ views on ‘sharable’ forms of data (the particular representations that they view as most valuable for reuse by others within their own research areas) and the anticipated duration of such reuse. Reported sharing incidents that provide insights into barriers to sharing and related concerns about data misuse are also included.