Presentation by Heather Piwowar at Simon Fraser University in October 2012 at the SFU Research Data Repository Project Launch.
Highlights current state of research data sharing. http://www.lib.sfu.ca/node/11510
1. Momentum of
open research data:
now in 5-D!
Heather
Piwowar
@researchremix
Postdoc
with
NESCent
and
Dryad,
at
Duke
and
UBC
SFU
Research
Data
Repository
Project
Launch
October
2012
some photos NC, SA
29. Gleditsch et al. 2003. Posting Your Data: Will You Be
Scooped or Will You Be Famous?, International Studies
Perspectives 4(1): 89–97.
Piwowar et al. 2007. Sharing Detailed research data is
associated with increased citation Rate. PLoS ONE.
Ioannidis et al. Repeatability of published microarray gene
expression analyses. Nature Genetics 41, 149 - 155
Pienta et al. 2010. NSR Social Science Secondary Use.
Michigan IR.
Henneken et al. 2011. Linking to Data – Effect on Citation
Rates in Astronomy. ESO.
Sears 2011. Data Sharing Effect on Article Citation rate in
Paleoceanography. AGU.
33. Proportion of articles with shared datasets, by year
0.35
Proportion of articles with datasets found in GEO or ArrayExpress
0.30
0.25
0.20
0.15
Across
time
0.10
0.05
2000 2001 2002 2003 2004 2005 2006 2007 2008 2009
Year article published
35. Multivariate nonlinear regression with interactions
Odds Ratio
0.25 0.50 1.00 2.00 4.00
OA journal & previous GEO-AE sharing
Amount of NIH funding
0.95
Journal impact factor and policy
Higher Ed in USA
Cancer & humans
43. 2) more impact per funding dollar
Traditional research funding:
$400k = 16 papers
At Dryad cost levels,
at similar levels of reuse to GEO,
$400k would facilitate 1000 reuse papers
A stellar Scientific ROI is in easy reach.
44. Piwowar, Vision, Whitlock (2011)
Data archiving is a good investment.
Nature 473, 285
http://researchremix.wordpress.com/2011/05/19/nature-letter/
47. journal
data
sharing
policy
“An inherent principle of publication is that
others should be able to replicate and build
upon the authors' published claims.
Therefore, a condition of publication
in a Nature journal is that authors are
required to make materials, data and
associated protocols available in a publicly
accessible database …”
http://www.nature.com/authors/editorial_policies/availability.html
http://www.nature.com/nature/journal/v453/n7197/index.html
48. JDAP
<< Journal>> requires, as a condition for publication, that
data supporting the results in the paper should be archived
in an appropriate public archive, such as << list of approved
archives here >>. Data are important products of the
scientific enterprise, and they should be preserved and
usable for decades in the future. Authors may elect to have
the data publicly available at time of publication, or, if the
technology of the archive allows, may opt to embargo
access to the data for a period up to a year after publication.
Exceptions may be granted at the discretion of the editor,
especially for sensitive information such as human subject
data or the location of endangered species.
66. In 2009, 116 articles cited ORNL DAAC data.
Finding these articles took 70-80 hours
across at least 12 resources
all chosen from a deep understanding
of this specific research domain
then the full text of all the hits were
manually reviewed
Valerie Enriquez interview with James Kidder
http://openwetware.org/wiki/DataONE:Notebook/Reuse_of_repository_data
92. Open up your data
while you are doing it :)
http://www.flickr.com/photos/myklroventine/892446624/
93. thank you!
Todd Vision: PI of Dryad
Jason Priem: cofounder of ImpactStory
Also: Mike Whitlock, Jonathan Carlson, Estephanie Sta Maria
The open science online community and those who release
their articles, datasets and photos openly.
blog: ResearchRemix.wordpress.com
@researchremix