This presentation was provided by Carly Strasser of the Chan Zuckerberg Initiative during the NISO hot topic virtual conference "Effective Data Management," which was held on September 29, 2021.
Strasser "Effective data management and its role in open research"
1. Effective data management
and its role in open research
Carly Strasser, PhD | @carlystrasser
Chan Zuckerberg Initiative
NISO Effective Data Management
September 2021
Images from unsplash.com unless otherwise attributed
14. Open research is the practice of
sharing research processes and outputs
publicly in a timely and effective manner,
so that other researchers can learn and build on them.
proposals
preprints
datasets
images
software/code
workflows
reviews
publications
What should be
shared?
16. Open research is the practice of
sharing research processes and outputs
publicly in a timely and effective manner,
so that other researchers can learn and build on them.
24. Standard Data Management Plans
1. Types of data
2. Data & metadata standards
3. Policies
4. Plans for preservation
5. Budget
More from Maria later!
26. Collect & organize data so it’s
FAIR from the start
(for both the researcher themselves and others)
● Use good naming schema for files
● Organize files logically
● Use spreadsheets thoughtfully
● Use an appropriate metadata schema throughout
the project
27. Digital context
• Name of the data set
• The name(s) of the data file(s) in the
data set
• Date the data set was last modified
• Example data file records for each data
type file
• Pertinent companion files
• List of related or ancillary data sets
• Software (including version number)
used to prepare/read the data set
• Data processing that was performed
Personnel & stakeholders
• Who collected
• Who to contact with questions
• Funders
Scientific context
• Scientific reason why the data were collected
• What data were collected
• What instruments (including model & serial
number) were used
• Environmental conditions during collection
• Temporal & spatial resolution
• Standards or calibrations used
Information about parameters
• How each was measured or produced
• Units of measure
• Format used in the data set
• Precision & accuracy if known
Information about data
• Definitions of codes used
• Quality assurance & control measures
• Known problems that limit data use (e.g.
uncertainty, sampling problems)
44. ● Cost
● Lack of training
● Fear of lost rights or benefits
● No incentives
What’s the
holdup?
45. Open research is the practice of
sharing research processes and outputs
publicly in a timely and effective manner,
so that other researchers can learn and build on them.
proposals
preprints
datasets
images
software/code
workflows
reviews
publications
47. Open research is the practice of
sharing research processes and outputs
publicly in a timely and effective manner,
so that other researchers can learn and build on them.
proposals
preprints
datasets
images
software/code
workflows
reviews
publications
How can we get
everything to
count?