"Undergrad ecologists aren't learning data management" - ESA 2013
DMPTool Overview for UC Merced Research Week
1. The Data Management Planning Tool (DMPTool)
Carly Strasser, @carlystrasser
California Digital Library
UC Merced, March 2013
From Flickr by dipster1
3. Digital data
From Flickr by DW0825
From Flickr by Flickmor
From Flickr by deltaMike
www.woodrow.org
C. Strasser
Courtesy of WHOI
From Flickr by US Army Environmental Command
5. UGLY TRUTH
Many researchers…
• are not taught data management
• don’t know what metadata are
• can’t name data centers or repositories
• don’t share data publicly or store it in an archive
• aren’t convinced they should share data
joyfulmomma.com
8. What is a data management plan?
A document that describes what you will do with your data both during your research and after you complete your project
From Flickr by spanaut
9. DMPs for Funders
A short plan submitted alongside grant applications
An outline of
– what will be created/collected
– methods
– standards
– metadata
– sharing/access
– long-term storage
Includes how and why
But they all have different requirements and express them in different ways
10. Evolution
2006: Federal Funding Accountability and Transparency Act
2010
2010 – present
11. NSF DMP Requirements
From Grant Proposal Guidelines:
DMP supplement may include:
  1. the types of data, samples, physical collections, software, curriculum materials, and other materials to be produced in the course of the project
  2. the standards to be used for data and metadata format and content (where existing standards are absent or deemed inadequate, this should be documented along with any proposed solutions or remedies)
  3. policies for access and sharing including provisions for appropriate protection of privacy, confidentiality, security, intellectual property, or other rights or requirements
  4. policies and provisions for re-use, re-distribution, and the production of derivatives
  5. plans for archiving data, samples, and other research products, and for preservation of access to them
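The five elements above can be treated as a simple checklist when reviewing a draft plan. The following is a minimal sketch, assuming a draft is held as a dictionary of free-text answers; the short keys, the `missing_elements` helper, and the sample draft are all hypothetical names made up for illustration, not part of any NSF or DMPTool interface.

```python
# Illustrative only: the keys and the helper below are hypothetical names,
# not part of any NSF or DMPTool interface.
NSF_DMP_ELEMENTS = {
    "types": "types of data, samples, collections, software, and other materials",
    "standards": "standards for data and metadata format and content",
    "access": "policies for access and sharing",
    "reuse": "policies and provisions for re-use and re-distribution",
    "archiving": "plans for archiving and preservation of access",
}


def missing_elements(draft):
    """Return the keys of elements that are absent or blank in a draft plan."""
    return [
        key for key in NSF_DMP_ELEMENTS
        if not draft.get(key, "").strip()
    ]


# A made-up draft that covers three of the five elements.
draft_plan = {
    "types": "Sensor time series and field notebooks.",
    "standards": "CSV files described with a plain-text data dictionary.",
    "access": "   ",  # present but blank, so it counts as missing
    "archiving": "Deposit in a repository at project end.",
}

for key in missing_elements(draft_plan):
    print(f"Missing: {NSF_DMP_ELEMENTS[key]}")
```

A check like this only confirms that each element has some text; whether the text is adequate still depends on the funder's guidance and on reviewers.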
12. 1. Types of data & other information
• Types of data produced
• Relationship to existing data
• How/when/where will the data be captured or created?
• How will the data be processed?
• Quality assurance & quality control measures
• Security: version control, backing up
• Who will be responsible for data management during/after project?
C. Strasser
biology.kenyon.edu
From Flickr by Lazurite
13. 2. Data & metadata standards
• What metadata are needed to make the data meaningful?
• How will you create or capture these metadata?
• Why have you chosen particular standards and approaches for metadata?
Wired.com
14. 3. Policies for access & sharing
4. Policies for re-use & re-distribution
• Are you under any obligation to share data?
• How, when, & where will you make the data available?
• What is the process for gaining access to the data?
• Who owns the copyright and/or intellectual property?
• Will you retain rights before opening data to wider use? How long?
• Are permission restrictions necessary?
• Embargo periods for political/commercial/patent reasons?
• Ethical and privacy issues?
• Who are the foreseeable data users?
• How should your data be cited?
15. 5. Plans for archiving & preservation
• What data will be preserved for the long term? For how long?
• Where will data be preserved?
• What data transformations need to occur before preservation?
• What metadata will be submitted alongside the datasets?
• Who will be responsible for preparing data for preservation? Who will be the main contact person for the archived data?
From Flickr by theManWhoSurfedTooMuch
16. NSF’s Vision*
DMPs and their evaluation will grow & change over time (similar to broader impacts)
Peer review will determine next steps
Community-driven guidelines
– Discipline-specific
– Flexibility at the directorate and division levels
– Tailor implementation
Evaluation will vary with directorate, division, & program officer
*Unofficially
20. DMPTool Project
• Started working in January 2011
• Developed requirements, divided work among partners
• Self-funded / In-kind
21. DMPTool Participants
CDL/UC3: Trisha Cruse, Perry Willett, Marisa Strong, Tracy Seneca, Scott Fisher, Stephen Abrams, Mark Reyes, Margaret Low, Carly Strasser
Smithsonian: Günter Waibel
University of Illinois: Michael Grady, Howard Ding, Sarah Shreeves
UCLA: Todd Grappone, Gary Thompson, Sharon Farbe, Darrow Cole
University of Virginia: Andrew Sallans, Sherry Lake, Carla Lee
UCSD: Brad Westbrook
Digital Curation Centre: Martin Donnelly
DataONE: Amber Budden
22. dmptool.org
• Free
• Guides through creating a DMP
• Helps meet funder requirements
• Supplies questions
• Includes explanation/context provided by the agency
• Provides links to the agency website
23. Step-by-step wizard for generating DMP
Create | edit | re-use | share | save | generate
Open to community
24. Wait!
Data management planning is complex & requires dialog
Range of support & understanding
Our focus:
• simplify & scale the common parts
• develop community
• provide incremental improvement in functionality
From Flickr by ChrisGoldNY
36. Access & Customization
• DMPTool can be added to campus single sign-on service
• Researchers use campus login for tool
38. Increasing Participation
Organizations with Shibboleth log-in set up
• American University
• Arizona State University
• Cal Poly State University
• Cal State Chico
• Cal State Fresno
• Cal State Los Angeles
• Cal State Office of the Chancellor
• Clemson University
• George Mason University
• Georgia Tech
• Humboldt State University (CSU)
• Indiana University
• Iowa State University
• James Madison University
• Johns Hopkins University
• Michigan State University
• Moss Landing Marine Laboratories (CSU)
• North Carolina State University
• Northwestern University
• Ohio State
• Old Dominion University
• Penn State
• Purdue
• Rice University
• Smithsonian Institution
• Texas A&M
• Texas State University San Marcos
• Tulane University
• University of Arizona
• UC Los Angeles
• UC Berkeley
• UC Davis
• UC Irvine
• UC Merced
• UC Office of the President
• UC San Diego
• UC San Francisco
• University of Chicago
• University of Illinois at Chicago
• University of Illinois at Urbana-Champaign
• University of Iowa
• University of Miami
• University of Michigan
• University of Nebraska-Lincoln
• University of North Carolina-Chapel Hill
• University of Notre Dame
• University of Texas at Austin
• University of Virginia
• University of Wisconsin-Madison
• Yale University
39. Institution-specific resources
Possible customization:
• Help text
• Links to resources and services
• Suggested answers
Can provide specific info at different levels
• All DMPs
• All DMPs for a particular funding agency
• A question within a data management plan
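The three levels of customization listed above can be pictured as layered lookups. This is a rough sketch under stated assumptions, not a description of how the DMPTool stores or applies customizations: the `InstitutionResources` class, its field names, and the sample notes are all hypothetical, and the choice to show every applicable note (broadest first) rather than only the most specific one is an assumption.

```python
from dataclasses import dataclass, field


@dataclass
class InstitutionResources:
    """Hypothetical container for one institution's custom help text."""
    all_plans: list = field(default_factory=list)
    by_funder: dict = field(default_factory=dict)    # funder -> list of notes
    by_question: dict = field(default_factory=dict)  # (funder, question) -> list of notes

    def notes_for(self, funder, question):
        """Collect notes from the broadest level to the most specific one."""
        notes = list(self.all_plans)  # copy so the stored list is not modified
        notes += self.by_funder.get(funder, [])
        notes += self.by_question.get((funder, question), [])
        return notes


# Made-up example content, for illustration only.
campus = InstitutionResources(
    all_plans=["Contact the campus library for data management help."],
    by_funder={"NSF": ["See the campus guide to NSF proposals."]},
    by_question={
        ("NSF", "archiving"): ["Suggested answer: name the repository you will use."],
    },
)

for note in campus.notes_for("NSF", "archiving"):
    print(note)
```

A question with no funder-level or question-level entry falls back to the notes that apply to all DMPs.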
41. DMPTool Uptake
[Chart: Number of Plans & Unique Users and Number of Institutions, Oct-11 through Aug-12. Series: Unique Users, Plans, Institutions]
42. Improvement via A.P. Sloan Grant
Data Management Planning Tool 2: Responding to the Community
  1. Build community of researchers, institutions, funders, & libraries
  2. Expand functionality of the current DMPTool for users & administrators
  3. Release the DMPTool2 and provide training/documentation
  4. Create an open-source community of DMPTool contributors
43. New Areas of Functionality in 2013
• Granular modeling of plan templates
• Granular modeling of institutions
• Role-based user authorization
• DMP life cycle management
• Search and browse
• Organizational branding
• Enhanced search for planning activities
• Institutional reporting and business intelligence
• Advanced administrative interface
• Collaborative plan creation
• Open API
44. IMLS Grant
Improving Data Stewardship with the DMPTool
Provide librarians with the tools and resources to claim the data management education space
45. Materials to be Developed
• Talking points
• Slide decks
• Promotional materials
• Environmental scan kit
• Webinar series
• Case studies
• Customization help
• Online Commons
• Libguide
48. My website: carlystrasser.net
Email me: carlystrasser@gmail.com
Tweet me: @carlystrasser
My slides: slideshare.net/carlystrasser
CDL Blog: datapub.cdlib.org