Open Access to Research Data: Challenges and Solutions

Martin Donnelly
Martin DonnellyDigital preservation, data management and curation. Erstwhile administrator/manager. PRINCE2 Practitioner.
Facilitate Open Science Training for European Research
Open Access to Research Data:
Challenges and Solutions
Martin Donnelly
Digital Curation Centre
University of Edinburgh
FOSTER event, National Library of Latvia
Riga, 20 October 2015
The Digital Curation Centre
• The UK’s centre of expertise in digital preservation and
data management, established 2004
• Provide guidance, training, tools and other services on
all aspects of research data management
• Organise national and international events and webinars
(International Digital Curation Conference, Research
Data Management Forum)
• Principal audience is the UK higher education sector, but
we increasingly work further afield (Europe, North
America, South Africa…)
• Now offering paid consultancy/training services
Overview
1. Background and context
2. An introduction to Research Data Management (RDM)
a. What is RDM?
b. What are the main benefits?
c. What are the main problems?
3. RDM in practice
a. What does it mean for researchers?
b. Research data policies
4. Some useful resources
Overview
1. Background and context
2. An introduction to Research Data Management (RDM)
a. What is RDM?
b. What are the main benefits?
c. What are the main problems?
3. RDM in practice
a. What does it mean for researchers?
b. Research data policies
4. Some useful resources
Background and context
• Research data management exists within a context of ever
greater transparency, accessibility and accountability
• The impetus for openness in research comes from two
directions:
• Ground-up – Open Access began in the High Energy Physics research
community, which saw benefit in not waiting for publication before
sharing research findings (and data / code)
• Top-down – Government/funder support, increasing public and
commercial engagement with research
• The main goals of these developments are to lower barriers
to accessing the outputs of publicly funded research (often
called ‘science’ for short), to speed up the research process,
and to strengthen the quality, integrity and longevity of the
scholarly record…
The old way of doing research
1. Researcher collects data (information)
2. Researcher interprets/synthesises data
3. Researcher writes paper based on data
4. Paper is published (and preserved)
5. Data is left to benign neglect, and
eventually ceases to be
accessible
Without intervention, data + time = no data
Vines et al. “examined the availability of data from 516 studies between 2 and 22 years old”
- The odds of a data set being reported as extant fell by 17% per year
- Broken e-mails and obsolete storage devices were the main obstacles to data sharing
- Policies mandating data archiving at publication are clearly needed
“The current system of leaving data with authors means that almost all of it is lost over
time, unavailable for validation of the original results or to use for entirely new purposes”
according to Timothy Vines, one of the researchers. This underscores the need for
intentional management of data from all disciplines and opened our conversation on
potential roles for librarians in this arena. (“80 Percent of Scientific Data Gone in 20
Years” HNGN, Dec. 20, 2013, http://www.hngn.com/articles/20083/20131220/80-percent-
of-scientific-data-gone-in-20-years.htm.)
Vines et al., The Availability of Research Data Declines Rapidly with Article Age,
Current Biology (2014), http://dx.doi.org/10.1016/j.cub.2013.11.014
The new way of doing research
Plan
Collect
Assure
Describe
Preserve
Discover
Integrate
Analyze
PUBLISH
…and
RE-USE
The DataONE
lifecycle model
Overview
1. Background and context
2. An introduction to Research Data Management (RDM)
a. What is RDM?
b. What are the main benefits?
c. What are the main problems?
3. RDM in practice
a. What does it mean for researchers?
b. Research data policies
4. Some useful resources
What is RDM?
“the active management
and appraisal of data over
the lifecycle of scholarly
and scientific interest”
What sorts of activities?
- Planning and describing data-
related work before it takes place
- Documenting your data so that
others can find and understand it
- Storing it safely during the project
- Depositing it in a trusted archive
at the end of the project
- Linking publications to the
datasets that underpin them
Data management is a part of
good research practice.
- RCUK Policy and Code of Conduct on the
Governance of Good Research Conduct
RDM: who and how?
• RDM is a hybrid activity, involving multiple stakeholder
groups…
• The researchers themselves
• Research support personnel
• Partners based in other institutions, commercial partners, etc
• Data Management Planning (DMP) underpins and pulls
together different strands of data management activities. DMP
is the process of planning, describing and communicating the
activities carried out during the research lifecycle in order to…
• Keep sensitive data safe
• Maximise data’s re-use potential
• Support longer-term preservation
• Data Management Plans are a means of communication, with
contemporaries and potential future re-users alike…
Benefits of RDM and data sharing
• SPEED: The research process becomes faster
• EFFICIENCY: Data collection can be funded once, and
used many times for a variety of purposes
• ACCESSIBILITY: Interested third parties can (where
appropriate) access and build upon publicly-funded
research resources with minimal barriers to access
• IMPACT and LONGEVITY: Open publications and data
receive more citations, over a longer period
• TRANSPARENCY and QUALITY: The evidence that
underpins research can be made open for anyone to
scrutinise, and attempt to replicate findings. This leads
to a more robust scholarly record
“In genomics research, a large-scale
analysis of data sharing shows that
studies that made data available in
repositories received 9% more
citations, when controlling for other
variables; and that whilst self-reuse
citation declines steeply after two
years, reuse by third parties
increases even after six years.”
(Piwowar and Vision, 2013)
Van den Eynden, V. and Bishop, L.
(2014). Incentives and motivations for
sharing research data, a researcher’s
perspective. A Knowledge Exchange
Report,
http://repository.jisc.ac.uk/5662/1/KE
_report-incentives-for-sharing-
researchdata.pdf
Benefits of RDM: Impact and Longevity
“Data is necessary for
reproducibility of
computational research,
but an equal amount of
concern should be directed
at code sharing.”
Victoria Stodden, “Innovation and Growth
through Open Access to Scientific Research:
Three Ideas for High-Impact Rule Changes” in
Litan, Robert E. et al. Rules for Growth:
Promoting Innovation and Growth Through Legal
Reform. SSRN Scholarly Paper. Rochester, NY:
Social Science Research Network, February 8,
2011. http://papers.ssrn.com/abstract=1757982.
Benefits of RDM: Quality
“Conservatively, we estimate that the value of data in
Australia’s public research to be at least $1.9 billion
and possibly up to $6 billion a year at current levels of
expenditure and activity. Research data curation and
sharing might be worth at least $1.8 billion and possibly
up to $5.5 billion a year, of which perhaps $1.4 billion to
$4.9 billion annually is yet to be realized.”
• “Open Research Data”, Report to the Australian National Data Service (ANDS),
November 2014 - John Houghton, Victoria Institute of Strategic Economic
Studies & Nicholas Gruen, Lateral Economics
Benefits of RDM: Financial
J. Manyika et al. "Open data: Unlocking innovation
and performance with liquid information" McKinsey
Global Institute, October 2013
“If we are going to wait
five years for data to
be released, the Arctic
is going to be a very
different place.”
Bryn Nelson, Nature, 10 Sept 2009
http://www.nature.com/nature/jour
nal/v461/n7261/index.html
Benefits of RDM: Speed
https://www.flickr.com/photos/gsfc/7348953774/
- CC-BY
Why don’t we live in a data sharing utopia?
Five main reasons…
i. Lack of widespread understanding of the
fundamental issues
ii. Lack of joined-up thinking within institutions,
countries, internationally…
iii. Issues around ownership/privacy
iv. Technical/financial limitations, and the need
for selection and appraisal of data (which
takes time, and costs money…)
v. Issues around reward and recognition for
researchers
Overview
1. Background and context
2. An introduction to Research Data Management (RDM)
a. What is RDM?
b. What are the main benefits?
c. What are the main problems?
3. RDM in practice
a. What does it mean for researchers?
b. Research data policies
4. Some useful resources
What does it mean for researchers?
• A disruption to previous working processes
• Additional expectations / requirements from
the funders (and sometimes their home
institutions and publishers too)
• But! It provides opportunities for new types
of investigation
• And leads to a more robust scholarly record
What do researchers need to do?
1. Understand funders’ policies (e.g. EC H2020…)
2. Check your intended publisher’s OA policy (e.g. via Sherpa
Romeo)
3. Create a data management plan (e.g. with DMPonline)
4. Decide which data to preserve using the DCC’s How-To guide
and checklist, “Five Steps to Decide what Data to Keep”
5. Identify a long-term home for your data (e.g. via re3data.org)
6. Link your data to your publications with a persistent identifier
(e.g. via DataCite)
• N.B. Many repositories, including Zenodo, will do this for you
7. Investigate EC infrastructure services and resources, e.g.
EUDAT, OpenAIRE Plus, FOSTER, etc…
UK funders
1. Public good
2. Preservation
3. Discovery
4. Confidentiality
5. First use
6. Recognition
7. Public funding
Six of the seven
RCUK councils
require data
management plans
(or equivalent), as
do Wellcome Trust,
Cancer Research
UK, and more…
EXPECTATIONS
1. Research organisations will promote internal awareness of these principles and expectations and ensure that their researchers and research students have a
general awareness of the regulatory environment and of the available exemptions which may be used, should the need arise, to justify the withholding of
research data;
2. Published research papers should include a short statement describing how and on what terms any supporting research data may be accessed.
3. Each research organisation will have specific policies and associated processes to maintain effective internal awareness of their publicly-funded research
data holdings and of requests by third parties to access such data; all of their researchers or research students funded by EPSRC will be required to comply
with research organisation policies in this area or, in exceptional circumstances, to provide justification of why this is not possible.
4. Publicly-funded research data that is not generated in digital format will be stored in a manner to facilitate it being shared in the event of a valid request for
access to the data being received (this expectation could be satisfied by implementing a policy to convert and store such data in digital format in a timely
manner);
5. Research organisations will ensure that appropriately structured metadata describing the research data they hold is published (normally within 12 months of
the data being generated) and made freely accessible on the internet; in each case the metadata must be sufficient to allow others to understand what
research data exists, why, when and how it was generated, and how to access it. Where the research data referred to in the metadata is a digital object it is
expected that the metadata will include use of a robust digital object identifier (For example as available through the DataCite organisation -
http://datacite.org).
6. Where access to the data is restricted the published metadata should also give the reason and summarise the conditions which must be satisfied for access to
be granted. For example ‘commercially confidential’ data, in which a business organisation has a legitimate interest, might be made available to others
subject to a suitable legally enforceable non-disclosure agreement.
7. Research organisations will ensure that EPSRC-funded research data is securely preserved for a minimum of 10-years from the date that any researcher
‘privileged access’ period expires or, if others have accessed the data, from last date on which access to the data was requested by a third party; all
reasonable steps will be taken to ensure that publicly-funded data is not held in any jurisdiction where the available legal safeguards provide lower levels of
protection than are available in the UK
8. Research organisations will ensure that effective data curation is provided throughout the full data lifecycle, with ‘data curation’ and ‘data lifecycle’ being
as defined by the Digital Curation Centre. The full range of responsibilities associated with data curation over the data lifecycle will be clearly allocated
within the research organisation, and where research data is subject to restricted access the research organisation will implement and manage appropriate
security controls; research organisations will particularly ensure that the quality assurance of their data curation processes is a specifically assigned
responsibility;
9. Research organisations will ensure adequate resources are provided to support the curation of publicly-funded research data; these resources will be
allocated from within their existing public funding streams, whether received from Research Councils as direct or indirect support for specific projects or
from higher education Funding Councils as block grants.
1. INTERNAL AWARENESS - of principles,
expectations, regulatory environment,
possible exemptions
2. ACCESS STATEMENT - included within
research papers
3. POLICIES AND PROCESSES - covering
maintenance and access requests
4. NON-DIGITAL DATA - strategy for access /
digitisation
5. METADATA PUBLICATION - within 12
months of data generation
6. RESTRICTIONS - list these within metadata
7. PRESERVATION - 10 years from date of last
access
8. CURATION - maintenance and security
9. RESOURCING - from existing funding
streams
RDM in other countries (i)
USA
• The National Science Foundation (NSF) announced a DMP requirement in 2010,
effective 2011
• White House Office of Science and Technology Policy requirement for DMPs
announced March 2013 (programmes awarding >$100m annually). White House
requirements include mechanisms covering compliance with plans and policies,
and also cover costs of implementing plans
AUSTRALIA
• In 2014 The Australian Research Council (ARC) released new instructions for
applications for Laureate Fellowships and Discovery Grants. Both include the
following requirements when describing a proposal…
• COMMUNICATION OF RESULTS: Outline plans for communicating the research
results to other researchers and the broader community, including scholarly
and public communication and dissemination
• MANAGEMENT OF DATA: Outline plans for the management of data produced as
a result of the proposed research, including but not limited to storage, access
and re-use arrangements
RDM in other countries (ii)
SOUTH AFRICA
• Announced in January 2015 that (from March 2015) “authors of research
papers generated from research either fully or partially funded by NRF, when
submitting and publishing in academic journals, should deposit their final peer-
reviewed manuscripts that have been accepted by the journals, to the
administering Institution Repository with an embargo period of no more than
12 months.”
• In addition, the data supporting the publication should be deposited in an
accredited Open Access repository, with the provision of a Digital Object
Identifier for future citation and referencing.
• The NRF encourages its stakeholder community, including NRF’s Business Units
and National Research Facilities, to:
• Formulate detailed policies on Open Access of publications and data from its funded
research;
• Establish Open Access repositories; and
• Support public access to the repositories through web search and retrieval according
to international standards and best practice.
RDM in Europe
• Horizon 2020 (FP8) features an Open Research Data pilot, and it
seems likely that it will become an across-the-board requirement
in FP9…
• It applies to data (and metadata) needed to validate scientific
results, which should be deposited in a dedicated data repository
• The Horizon 2020 Open Research Data pilot covers “Innovation
actions” and “Research and Innovation actions”, and involves
three iterations of Data Management Plan (DMP)
• 6 months after start of project, mid-project review, end-of-project
(final review)
• DMP contents
• Data types; Standards used; Sharing/making available; Curation and
preservation
• There are certain opt-out conditions
H2020 Open Data Pilot: specifics (ii)
STEP 1
• The data should be deposited, preferably in a dedicated research data
repository. These may be subject-based/thematic, institutional or centralised.
• EC suggests the Registry of Research Data Repositories (www.re3data.org) and
Databib (http://databib.org) for researchers looking to identify an appropriate
repository
• Open Access Infrastructure for Research in Europe (OpenAIRE) will also become
an entry point for linking publications to data.
STEP 2
• So far as possible, projects must then take measures to enable for third parties
to access, mine, exploit, reproduce and disseminate (free of charge for any
user) this research data.
• EC suggests attaching Creative Commons Licence (CC-BY or CC0) to the data
deposited (http://creativecommons.org/licenses/,
http://creativecommons.org/about/cc0).
• At the same time, projects should provide information via the chosen repository
about tools and instruments at the disposal of the beneficiaries and necessary
for validating the results, for instance specialised software or software code,
algorithms, analysis protocols, etc. Where possible, they should provide the
tools and instruments themselves.
H2020 Open Data Pilot: specifics (iii)
COSTS
Costs relating to the implementation of the pilot will be eligible. Specific
technical and professional support services will also be provided (e-
Infrastructures WP), e.g. EUDAT and OpenAIRE, alongside support measures
such as FOSTER.
OPT-OUTS
Opt outs are possible, either totally or partially. Projects may opt out of the
Pilot at any stage, for a variety of reasons, e.g.
• if participation in the Pilot on Open Research Data is incompatible with the
Horizon 2020 obligation to protect results if they can reasonably be
expected to be commercially or industrially exploited;
• confidentiality (e.g. security issues, protection of personal data);
• if participation in the Pilot on Open Research Data would jeopardise the
achievement of the main aim of the action;
• if the project will not generate / collect any research data;
• if there are other legitimate reasons to not take part in the Pilot (to be
declared at proposal stage)
Overview
1. Background and context
2. An introduction to Research Data Management (RDM)
a. What is RDM?
b. What are the main benefits?
c. What are the main problems?
3. RDM in practice
a. What does it mean for researchers?
b. Research data policies
4. Some useful resources
• Web-based tool to help researchers write and
maintain DMPs
• Provides funder questions and guidance
• Includes templates for all RCUK funders, and Horizon 2020
• Provides tailored help from universities
• Can include examples and suggest responses
• Free to use
• Mature (v1 launched April 2010)
• Code is Open Source (on GitHub)
• https://dmponline.dcc.ac.uk
DMPonline
• EUDAT offers common data services through a geographically distributed, resilient
network of 35 European organisations. These shared services and storage resources are
distributed across 15 European nations and data is stored alongside some of Europe’s
most powerful supercomputers.
• The EUDAT services address the full lifecycle of research data, covering both access and
deposit, from informal data sharing to long-term archiving, and addressing identification,
discoverability and computability of both long-tail and big data
• The vision is to enable European researchers and practitioners from any academic
discipline to preserve, find, access, and process data in a trusted environment, as part of
a Collaborative Data Infrastructure (CDI) conceived as a network of collaborating,
cooperating centres, combining the richness of numerous community-specific data
repositories with the permanence and persistence of some of Europe’s largest scientific
data centres
• Seeks to bridge the gap between research infrastructures and e-Infrastructures through
an active engagement strategy, using the communities in the consortium as EUDAT
beacons, and integrating others through innovative partnership approaches
• Jisc and DCC are partners, and we’re working to embed DCC’s DMPonline tool within the
EUDAT suite of services / infrastructure
EUDAT
Zenodo
• Zenodo is a free-to-use data archive, run
by the people at CERN
• It accepts any kind of data, from any
academic discipline
• It is generally preferable to store data in a
disciplinary data centre, but not all
scholarly subjects are equally well served
with data centres, so this may make for a
useful fallback option
• See http://zenodo.org/ for more details
Other data management resources (DCC)
• Book chapter
• Donnelly, M. (2012) “Data Management Plans
and Planning”, in Pryor (ed.) Managing
Research Data, London: Facet
• Guidance, e.g. “How-To Develop a Data
Management and Sharing Plan”
• DCC Checklist for a Data Management
Plan:
http://www.dcc.ac.uk/resources/data-
management-plans/checklist
• Links to all DCC resources via
http://www.dcc.ac.uk/resources/data-
management-plans
Data management resources (non-DCC)
• UKDA guidance and book
(http://data-
archive.ac.uk/media/2894/managi
ngsharing.pdf)
• Guidance from funders (ESRC and
NERC are particularly strong)
• Resources from other universities,
e.g. Bath, Bristol, Cambridge
Edinburgh, Glasgow, Oxford (to
name but a few)
OBJECTIVES
• To support different stakeholders, especially
younger researchers, in adopting open access in
the context of the European Research Area (ERA)
and in complying with the open access policies
and rules of participation set out for Horizon
2020
• To integrate open access principles and
practice in the current research workflow by
targeting the young researcher training
environment
• To strengthen institutional training capacity to
foster compliance with the open access policies
of the ERA and Horizon 2020 (beyond the FOSTER
project)
• To facilitate the adoption, reinforcement and
implementation of open access policies from
other European funders, in line with the EC’s
recommendation, in partnership with
PASTEUR4OA project
FacilitateOpenScienceTrainingforEuropeanResearch
The project
METHODS
• Identifying already existing content that can be
reused in the context of the training activities
and repackaging, reformatting them to be used
within FOSTER, and
developing/creating/enhancing contents as
required
• Developing the FOSTER Portal to support e-
learning, blended learning, self-learning,
dissemination of training materials/contents and
a Helpdesk
• Delivery of face-to-face training,
especially training trainers/multipliers who can
deliver further training and dissemination
activities, within institutions, nations or
disciplinary communities
• The EC is also funding other specific technical and
professional support services via the e-Infrastructures WP,
e.g. EUDAT and OpenAIRE
FacilitateOpenScienceTrainingforEuropeanResearch
The project
Thank you
• For more information about the
FOSTER project:
• Website: www.fosteropenscience.eu
• Principal investigator: Eloy Rodrigues
(eloy@sdum.uminho.pt)
• General enquiries: Gwen Franck
(gwen.franck@eifl.net)
• Twitter: @fosterscience
• My contact details:
• Email: martin.donnelly@ed.ac.uk
• Twitter: @mkdDCC
• Slideshare:
http://www.slideshare.net/martindo
nnelly
This work is licensed under the
Creative Commons Attribution
2.5 UK: Scotland License.
1 de 37

Recomendados

Data management plans and planning - a gentle introduction por
Data management plans and planning - a gentle introductionData management plans and planning - a gentle introduction
Data management plans and planning - a gentle introductionMartin Donnelly
1.1K visualizações33 slides
Developing a Data Management Plan por
Developing a Data Management PlanDeveloping a Data Management Plan
Developing a Data Management PlanMartin Donnelly
1.5K visualizações37 slides
Planning for Research Data Managment por
Planning for Research Data ManagmentPlanning for Research Data Managment
Planning for Research Data ManagmentDaniel Crane
382 visualizações65 slides
Introduction to Data Management Planning por
Introduction to Data Management PlanningIntroduction to Data Management Planning
Introduction to Data Management PlanningSarah Jones
1.3K visualizações24 slides
DMP health sciences por
DMP health sciencesDMP health sciences
DMP health sciencesSarah Jones
870 visualizações18 slides
Writing a successful data management plan with the DMPTool por
Writing a successful data management plan with the DMPToolWriting a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPToolkfear
574 visualizações42 slides

Mais conteúdo relacionado

Mais procurados

RDM and DMP intro por
RDM and DMP introRDM and DMP intro
RDM and DMP introSarah Jones
1.2K visualizações27 slides
Practical Strategies for Research Data Management por
Practical Strategies for Research Data ManagementPractical Strategies for Research Data Management
Practical Strategies for Research Data Managementdancrane_open
1.4K visualizações76 slides
Managing and sharing data por
Managing and sharing dataManaging and sharing data
Managing and sharing dataSarah Jones
1.7K visualizações47 slides
Llebot "Research Data Support for Researchers: Metadata, Challenges, and Oppo... por
Llebot "Research Data Support for Researchers: Metadata, Challenges, and Oppo...Llebot "Research Data Support for Researchers: Metadata, Challenges, and Oppo...
Llebot "Research Data Support for Researchers: Metadata, Challenges, and Oppo...National Information Standards Organization (NISO)
419 visualizações20 slides
RDM for librarians por
RDM for librariansRDM for librarians
RDM for librariansSarah Jones
1.6K visualizações56 slides
Praetzellis "Data Management Planning and Tools" por
Praetzellis "Data Management Planning and Tools"Praetzellis "Data Management Planning and Tools"
Praetzellis "Data Management Planning and Tools"National Information Standards Organization (NISO)
410 visualizações36 slides

Mais procurados(20)

RDM and DMP intro por Sarah Jones
RDM and DMP introRDM and DMP intro
RDM and DMP intro
Sarah Jones1.2K visualizações
Practical Strategies for Research Data Management por dancrane_open
Practical Strategies for Research Data ManagementPractical Strategies for Research Data Management
Practical Strategies for Research Data Management
dancrane_open1.4K visualizações
Managing and sharing data por Sarah Jones
Managing and sharing dataManaging and sharing data
Managing and sharing data
Sarah Jones1.7K visualizações
RDM for librarians por Sarah Jones
RDM for librariansRDM for librarians
RDM for librarians
Sarah Jones1.6K visualizações
Getting to grips with Research Data Management por IzzyChad
Getting to grips with Research Data ManagementGetting to grips with Research Data Management
Getting to grips with Research Data Management
IzzyChad1.7K visualizações
Funder requirements for Data Management Plans por Sherry Lake
Funder requirements for Data Management PlansFunder requirements for Data Management Plans
Funder requirements for Data Management Plans
Sherry Lake541 visualizações
RDM policy and recovering costs por Sarah Jones
RDM policy and recovering costsRDM policy and recovering costs
RDM policy and recovering costs
Sarah Jones957 visualizações
Rachel Bruce UK research and data management where are we now por Jisc
Rachel Bruce UK research and data management where are we nowRachel Bruce UK research and data management where are we now
Rachel Bruce UK research and data management where are we now
Jisc1.9K visualizações
Research Data Management, Challenges and Tools - Per Öster por LEARN Project
Research Data Management, Challenges and Tools - Per Öster Research Data Management, Challenges and Tools - Per Öster
Research Data Management, Challenges and Tools - Per Öster
LEARN Project905 visualizações
Sarah Jones RDM from a disciplinary perspective por Jisc
Sarah Jones RDM from a disciplinary perspectiveSarah Jones RDM from a disciplinary perspective
Sarah Jones RDM from a disciplinary perspective
Jisc2K visualizações
OU Library Research Support webinar: Data sharing por Daniel Crane
OU Library Research Support webinar: Data sharingOU Library Research Support webinar: Data sharing
OU Library Research Support webinar: Data sharing
Daniel Crane434 visualizações
What is-rdm por Sarah Jones
What is-rdmWhat is-rdm
What is-rdm
Sarah Jones831 visualizações
Supporting-DMPs por Sarah Jones
Supporting-DMPsSupporting-DMPs
Supporting-DMPs
Sarah Jones609 visualizações

Destaque

Digital Resources for Open Science por
Digital Resources for Open ScienceDigital Resources for Open Science
Digital Resources for Open ScienceMartin Donnelly
854 visualizações41 slides
Research Data in the Arts and Humanities: A Few Difficulties por
Research Data in the Arts and Humanities: A Few DifficultiesResearch Data in the Arts and Humanities: A Few Difficulties
Research Data in the Arts and Humanities: A Few DifficultiesMartin Donnelly
2.3K visualizações26 slides
Open Access and Open Data: what do I need to know (and do)? por
Open Access and Open Data: what do I need to know (and do)?Open Access and Open Data: what do I need to know (and do)?
Open Access and Open Data: what do I need to know (and do)?Martin Donnelly
2.9K visualizações46 slides
Data management planning: UK policies and beyond por
Data management planning: UK policies and beyondData management planning: UK policies and beyond
Data management planning: UK policies and beyondMartin Donnelly
685 visualizações43 slides
Digital Data Sharing: Opportunities and Challenges of Opening Research por
Digital Data Sharing: Opportunities and Challenges of Opening ResearchDigital Data Sharing: Opportunities and Challenges of Opening Research
Digital Data Sharing: Opportunities and Challenges of Opening ResearchMartin Donnelly
776 visualizações18 slides
Research Data in the Arts and Humanities: A Few Tricky Questions por
Research Data in the Arts and Humanities: A Few Tricky QuestionsResearch Data in the Arts and Humanities: A Few Tricky Questions
Research Data in the Arts and Humanities: A Few Tricky QuestionsMartin Donnelly
1.4K visualizações21 slides

Destaque(8)

Digital Resources for Open Science por Martin Donnelly
Digital Resources for Open ScienceDigital Resources for Open Science
Digital Resources for Open Science
Martin Donnelly854 visualizações
Research Data in the Arts and Humanities: A Few Difficulties por Martin Donnelly
Research Data in the Arts and Humanities: A Few DifficultiesResearch Data in the Arts and Humanities: A Few Difficulties
Research Data in the Arts and Humanities: A Few Difficulties
Martin Donnelly2.3K visualizações
Open Access and Open Data: what do I need to know (and do)? por Martin Donnelly
Open Access and Open Data: what do I need to know (and do)?Open Access and Open Data: what do I need to know (and do)?
Open Access and Open Data: what do I need to know (and do)?
Martin Donnelly2.9K visualizações
Data management planning: UK policies and beyond por Martin Donnelly
Data management planning: UK policies and beyondData management planning: UK policies and beyond
Data management planning: UK policies and beyond
Martin Donnelly685 visualizações
Digital Data Sharing: Opportunities and Challenges of Opening Research por Martin Donnelly
Digital Data Sharing: Opportunities and Challenges of Opening ResearchDigital Data Sharing: Opportunities and Challenges of Opening Research
Digital Data Sharing: Opportunities and Challenges of Opening Research
Martin Donnelly776 visualizações
Research Data in the Arts and Humanities: A Few Tricky Questions por Martin Donnelly
Research Data in the Arts and Humanities: A Few Tricky QuestionsResearch Data in the Arts and Humanities: A Few Tricky Questions
Research Data in the Arts and Humanities: A Few Tricky Questions
Martin Donnelly1.4K visualizações
Practical Research Data Management: tools and approaches, pre- and post-award por Martin Donnelly
Practical Research Data Management:  tools and approaches, pre- and post-awardPractical Research Data Management:  tools and approaches, pre- and post-award
Practical Research Data Management: tools and approaches, pre- and post-award
Martin Donnelly1K visualizações
PROPAGANDA DEVICES por Joanna Molabola
PROPAGANDA DEVICESPROPAGANDA DEVICES
PROPAGANDA DEVICES
Joanna Molabola389.8K visualizações

Similar a Open Access to Research Data: Challenges and Solutions

Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016) por
Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)
Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)dri_ireland
1.1K visualizações18 slides
Open Data - strategies for research data management & impact of best practices por
Open Data - strategies for research data management & impact of best practicesOpen Data - strategies for research data management & impact of best practices
Open Data - strategies for research data management & impact of best practicesMartin Donnelly
256 visualizações34 slides
Immersive informatics - research data management at Pitt iSchool and Carnegie... por
Immersive informatics - research data management at Pitt iSchool and Carnegie...Immersive informatics - research data management at Pitt iSchool and Carnegie...
Immersive informatics - research data management at Pitt iSchool and Carnegie...Keith Webster
1.4K visualizações58 slides
Survey of research data management practices up2010digschol2011 por
Survey of research data management practices up2010digschol2011Survey of research data management practices up2010digschol2011
Survey of research data management practices up2010digschol2011heila1
1.2K visualizações26 slides
RDM LIASA webinar por
RDM LIASA webinarRDM LIASA webinar
RDM LIASA webinarSarah Jones
3.6K visualizações63 slides
Creating a Data Management Plan for your Grant Application por
Creating a Data Management Plan for your Grant ApplicationCreating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationHistoric Environment Scotland
606 visualizações36 slides

Similar a Open Access to Research Data: Challenges and Solutions(20)

Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016) por dri_ireland
Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)
Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)
dri_ireland1.1K visualizações
Open Data - strategies for research data management & impact of best practices por Martin Donnelly
Open Data - strategies for research data management & impact of best practicesOpen Data - strategies for research data management & impact of best practices
Open Data - strategies for research data management & impact of best practices
Martin Donnelly256 visualizações
Immersive informatics - research data management at Pitt iSchool and Carnegie... por Keith Webster
Immersive informatics - research data management at Pitt iSchool and Carnegie...Immersive informatics - research data management at Pitt iSchool and Carnegie...
Immersive informatics - research data management at Pitt iSchool and Carnegie...
Keith Webster1.4K visualizações
Survey of research data management practices up2010digschol2011 por heila1
Survey of research data management practices up2010digschol2011Survey of research data management practices up2010digschol2011
Survey of research data management practices up2010digschol2011
heila11.2K visualizações
RDM LIASA webinar por Sarah Jones
RDM LIASA webinarRDM LIASA webinar
RDM LIASA webinar
Sarah Jones3.6K visualizações
Open Data Strategies and Research Data Realities por Martin Donnelly
Open Data Strategies and Research Data RealitiesOpen Data Strategies and Research Data Realities
Open Data Strategies and Research Data Realities
Martin Donnelly573 visualizações
Data management: The new frontier for libraries por LEARN Project
Data management: The new frontier for librariesData management: The new frontier for libraries
Data management: The new frontier for libraries
LEARN Project777 visualizações
How to overcome obstacles to data publication: Issues, requirements, and good... por ariadnenetwork
How to overcome obstacles to data publication: Issues, requirements, and good...How to overcome obstacles to data publication: Issues, requirements, and good...
How to overcome obstacles to data publication: Issues, requirements, and good...
ariadnenetwork1.1K visualizações
Open Data: Strategies for Research Data Management (and Planning) por Martin Donnelly
Open Data: Strategies for Research Data  Management (and Planning)Open Data: Strategies for Research Data  Management (and Planning)
Open Data: Strategies for Research Data Management (and Planning)
Martin Donnelly1K visualizações
Survey of research data management practices up2010 por heila1
Survey of research data management practices up2010Survey of research data management practices up2010
Survey of research data management practices up2010
heila1619 visualizações
Services, policy, guidance and training: Improving research data management a... por Robin Rice
Services, policy, guidance and training: Improving research data management a...Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...
Robin Rice811 visualizações
Research data management at TU Eindhoven por Leon Osinski
Research data management at TU EindhovenResearch data management at TU Eindhoven
Research data management at TU Eindhoven
Leon Osinski182 visualizações
Laying the Foundation: Establishing an institutional RDM Support Service for ... por GarethKnight
Laying the Foundation: Establishing an institutional RDM Support Service for ...Laying the Foundation: Establishing an institutional RDM Support Service for ...
Laying the Foundation: Establishing an institutional RDM Support Service for ...
GarethKnight571 visualizações
Supporting Research Data Management in UK Universities: the Jisc Managing Res... por L Molloy
Supporting Research Data Management in UK Universities: the Jisc Managing Res...Supporting Research Data Management in UK Universities: the Jisc Managing Res...
Supporting Research Data Management in UK Universities: the Jisc Managing Res...
L Molloy1.1K visualizações
Research data support: a growth area for academic libraries? por Robin Rice
Research data support: a growth area for academic libraries?Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries?
Robin Rice81 visualizações
Libraries and Research Data Management – What Works? Lessons Learned from the... por LIBER Europe
Libraries and Research Data Management – What Works? Lessons Learned from the...Libraries and Research Data Management – What Works? Lessons Learned from the...
Libraries and Research Data Management – What Works? Lessons Learned from the...
LIBER Europe847 visualizações
Supporting Research Data Management at the University of Stirling por Lisa Haddow
Supporting Research Data Management at the University of StirlingSupporting Research Data Management at the University of Stirling
Supporting Research Data Management at the University of Stirling
Lisa Haddow648 visualizações

Mais de Martin Donnelly

The Roots of DMPonline por
The Roots of DMPonlineThe Roots of DMPonline
The Roots of DMPonlineMartin Donnelly
137 visualizações13 slides
Horizon 2020 open access and open data mandates por
Horizon 2020 open access and open data mandatesHorizon 2020 open access and open data mandates
Horizon 2020 open access and open data mandatesMartin Donnelly
314 visualizações30 slides
Preparing your own data for future re-use: data management and the FAIR prin... por
Preparing your own data for future re-use:  data management and the FAIR prin...Preparing your own data for future re-use:  data management and the FAIR prin...
Preparing your own data for future re-use: data management and the FAIR prin...Martin Donnelly
611 visualizações34 slides
Open Science and Horizon 2020 por
Open Science and Horizon 2020Open Science and Horizon 2020
Open Science and Horizon 2020Martin Donnelly
698 visualizações20 slides
The Horizon2020 Open Data Pilot - OpenAIRE Webinar por
The Horizon2020 Open Data Pilot - OpenAIRE WebinarThe Horizon2020 Open Data Pilot - OpenAIRE Webinar
The Horizon2020 Open Data Pilot - OpenAIRE WebinarMartin Donnelly
1.6K visualizações24 slides
The Horizon 2020 Open Data Pilot por
The Horizon 2020 Open Data PilotThe Horizon 2020 Open Data Pilot
The Horizon 2020 Open Data PilotMartin Donnelly
391 visualizações24 slides

Mais de Martin Donnelly(18)

The Roots of DMPonline por Martin Donnelly
The Roots of DMPonlineThe Roots of DMPonline
The Roots of DMPonline
Martin Donnelly137 visualizações
Horizon 2020 open access and open data mandates por Martin Donnelly
Horizon 2020 open access and open data mandatesHorizon 2020 open access and open data mandates
Horizon 2020 open access and open data mandates
Martin Donnelly314 visualizações
Preparing your own data for future re-use: data management and the FAIR prin... por Martin Donnelly
Preparing your own data for future re-use:  data management and the FAIR prin...Preparing your own data for future re-use:  data management and the FAIR prin...
Preparing your own data for future re-use: data management and the FAIR prin...
Martin Donnelly611 visualizações
Open Science and Horizon 2020 por Martin Donnelly
Open Science and Horizon 2020Open Science and Horizon 2020
Open Science and Horizon 2020
Martin Donnelly698 visualizações
The Horizon2020 Open Data Pilot - OpenAIRE Webinar por Martin Donnelly
The Horizon2020 Open Data Pilot - OpenAIRE WebinarThe Horizon2020 Open Data Pilot - OpenAIRE Webinar
The Horizon2020 Open Data Pilot - OpenAIRE Webinar
Martin Donnelly1.6K visualizações
The Horizon 2020 Open Data Pilot por Martin Donnelly
The Horizon 2020 Open Data PilotThe Horizon 2020 Open Data Pilot
The Horizon 2020 Open Data Pilot
Martin Donnelly391 visualizações
Winning Horizon 2020 with Open Science por Martin Donnelly
Winning Horizon 2020 with Open ScienceWinning Horizon 2020 with Open Science
Winning Horizon 2020 with Open Science
Martin Donnelly817 visualizações
The FOSTER project - general overview por Martin Donnelly
The FOSTER project - general overviewThe FOSTER project - general overview
The FOSTER project - general overview
Martin Donnelly918 visualizações
Research Data Management for the Humanities and Social Sciences por Martin Donnelly
Research Data Management for the Humanities and Social SciencesResearch Data Management for the Humanities and Social Sciences
Research Data Management for the Humanities and Social Sciences
Martin Donnelly2K visualizações
Data Management Plans: a gentle introduction por Martin Donnelly
Data Management Plans: a gentle introductionData Management Plans: a gentle introduction
Data Management Plans: a gentle introduction
Martin Donnelly789 visualizações
Research Data Management: a gentle introduction for admin staff por Martin Donnelly
Research Data Management: a gentle introduction for admin staffResearch Data Management: a gentle introduction for admin staff
Research Data Management: a gentle introduction for admin staff
Martin Donnelly769 visualizações
Research Data Management: a gentle introduction por Martin Donnelly
Research Data Management: a gentle introductionResearch Data Management: a gentle introduction
Research Data Management: a gentle introduction
Martin Donnelly788 visualizações
Future agenda: repositories, and the research process por Martin Donnelly
Future agenda: repositories, and the research processFuture agenda: repositories, and the research process
Future agenda: repositories, and the research process
Martin Donnelly395 visualizações
Research data management: a tale of two paradigms: por Martin Donnelly
Research data management: a tale of two paradigms: Research data management: a tale of two paradigms:
Research data management: a tale of two paradigms:
Martin Donnelly738 visualizações
Research data management: definitions, drivers and resources por Martin Donnelly
Research data management: definitions, drivers and resourcesResearch data management: definitions, drivers and resources
Research data management: definitions, drivers and resources
Martin Donnelly1.2K visualizações
'Found' and 'after' - a short history of data reuse in the arts por Martin Donnelly
'Found' and 'after' - a short history of data reuse in the arts'Found' and 'after' - a short history of data reuse in the arts
'Found' and 'after' - a short history of data reuse in the arts
Martin Donnelly998 visualizações
Data management planning: the what, the why, the who, the how por Martin Donnelly
Data management planning: the what, the why, the who, the howData management planning: the what, the why, the who, the how
Data management planning: the what, the why, the who, the how
Martin Donnelly1.4K visualizações
DMP Online: update 2013 por Martin Donnelly
DMP Online: update 2013DMP Online: update 2013
DMP Online: update 2013
Martin Donnelly614 visualizações

Último

BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB... por
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...Nguyen Thanh Tu Collection
65 visualizações113 slides
When Sex Gets Complicated: Porn, Affairs, & Cybersex por
When Sex Gets Complicated: Porn, Affairs, & CybersexWhen Sex Gets Complicated: Porn, Affairs, & Cybersex
When Sex Gets Complicated: Porn, Affairs, & CybersexMarlene Maheu
67 visualizações73 slides
Narration lesson plan por
Narration lesson planNarration lesson plan
Narration lesson planTARIQ KHAN
58 visualizações11 slides
GSoC 2024 por
GSoC 2024GSoC 2024
GSoC 2024DeveloperStudentClub10
79 visualizações15 slides
The basics - information, data, technology and systems.pdf por
The basics - information, data, technology and systems.pdfThe basics - information, data, technology and systems.pdf
The basics - information, data, technology and systems.pdfJonathanCovena1
115 visualizações1 slide
Ch. 7 Political Participation and Elections.pptx por
Ch. 7 Political Participation and Elections.pptxCh. 7 Political Participation and Elections.pptx
Ch. 7 Political Participation and Elections.pptxRommel Regala
97 visualizações11 slides

Último(20)

BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB... por Nguyen Thanh Tu Collection
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...
Nguyen Thanh Tu Collection65 visualizações
When Sex Gets Complicated: Porn, Affairs, & Cybersex por Marlene Maheu
When Sex Gets Complicated: Porn, Affairs, & CybersexWhen Sex Gets Complicated: Porn, Affairs, & Cybersex
When Sex Gets Complicated: Porn, Affairs, & Cybersex
Marlene Maheu67 visualizações
Narration lesson plan por TARIQ KHAN
Narration lesson planNarration lesson plan
Narration lesson plan
TARIQ KHAN58 visualizações
The basics - information, data, technology and systems.pdf por JonathanCovena1
The basics - information, data, technology and systems.pdfThe basics - information, data, technology and systems.pdf
The basics - information, data, technology and systems.pdf
JonathanCovena1115 visualizações
Ch. 7 Political Participation and Elections.pptx por Rommel Regala
Ch. 7 Political Participation and Elections.pptxCh. 7 Political Participation and Elections.pptx
Ch. 7 Political Participation and Elections.pptx
Rommel Regala97 visualizações
How to empty an One2many field in Odoo por Celine George
How to empty an One2many field in OdooHow to empty an One2many field in Odoo
How to empty an One2many field in Odoo
Celine George65 visualizações
Women from Hackney’s History: Stoke Newington by Sue Doe por History of Stoke Newington
Women from Hackney’s History: Stoke Newington by Sue DoeWomen from Hackney’s History: Stoke Newington by Sue Doe
Women from Hackney’s History: Stoke Newington by Sue Doe
History of Stoke Newington150 visualizações
The Accursed House by Émile Gaboriau por DivyaSheta
The Accursed House  by Émile GaboriauThe Accursed House  by Émile Gaboriau
The Accursed House by Émile Gaboriau
DivyaSheta201 visualizações
Jibachha publishing Textbook.docx por DrJibachhaSahVetphys
Jibachha publishing Textbook.docxJibachha publishing Textbook.docx
Jibachha publishing Textbook.docx
DrJibachhaSahVetphys47 visualizações
Pharmaceutical Inorganic Chemistry Unit IVMiscellaneous compounds Expectorant... por Ms. Pooja Bhandare
Pharmaceutical Inorganic Chemistry Unit IVMiscellaneous compounds Expectorant...Pharmaceutical Inorganic Chemistry Unit IVMiscellaneous compounds Expectorant...
Pharmaceutical Inorganic Chemistry Unit IVMiscellaneous compounds Expectorant...
Ms. Pooja Bhandare93 visualizações
Sociology KS5 por WestHatch
Sociology KS5Sociology KS5
Sociology KS5
WestHatch70 visualizações
11.28.23 Social Capital and Social Exclusion.pptx por mary850239
11.28.23 Social Capital and Social Exclusion.pptx11.28.23 Social Capital and Social Exclusion.pptx
11.28.23 Social Capital and Social Exclusion.pptx
mary850239298 visualizações
Education and Diversity.pptx por DrHafizKosar
Education and Diversity.pptxEducation and Diversity.pptx
Education and Diversity.pptx
DrHafizKosar173 visualizações
Solar System and Galaxies.pptx por DrHafizKosar
Solar System and Galaxies.pptxSolar System and Galaxies.pptx
Solar System and Galaxies.pptx
DrHafizKosar91 visualizações
Create a Structure in VBNet.pptx por Breach_P
Create a Structure in VBNet.pptxCreate a Structure in VBNet.pptx
Create a Structure in VBNet.pptx
Breach_P75 visualizações

Open Access to Research Data: Challenges and Solutions

  • 1. Facilitate Open Science Training for European Research Open Access to Research Data: Challenges and Solutions Martin Donnelly Digital Curation Centre University of Edinburgh FOSTER event, National Library of Latvia Riga, 20 October 2015
  • 2. The Digital Curation Centre • The UK’s centre of expertise in digital preservation and data management, established 2004 • Provide guidance, training, tools and other services on all aspects of research data management • Organise national and international events and webinars (International Digital Curation Conference, Research Data Management Forum) • Principal audience is the UK higher education sector, but we increasingly work further afield (Europe, North America, South Africa…) • Now offering paid consultancy/training services
  • 3. Overview 1. Background and context 2. An introduction to Research Data Management (RDM) a. What is RDM? b. What are the main benefits? c. What are the main problems? 3. RDM in practice a. What does it mean for researchers? b. Research data policies 4. Some useful resources
  • 4. Overview 1. Background and context 2. An introduction to Research Data Management (RDM) a. What is RDM? b. What are the main benefits? c. What are the main problems? 3. RDM in practice a. What does it mean for researchers? b. Research data policies 4. Some useful resources
  • 5. Background and context • Research data management exists within a context of ever greater transparency, accessibility and accountability • The impetus for openness in research comes from two directions: • Ground-up – Open Access began in the High Energy Physics research community, which saw benefit in not waiting for publication before sharing research findings (and data / code) • Top-down – Government/funder support, increasing public and commercial engagement with research • The main goals of these developments are to lower barriers to accessing the outputs of publicly funded research (often called ‘science’ for short), to speed up the research process, and to strengthen the quality, integrity and longevity of the scholarly record…
  • 6. The old way of doing research 1. Researcher collects data (information) 2. Researcher interprets/synthesises data 3. Researcher writes paper based on data 4. Paper is published (and preserved) 5. Data is left to benign neglect, and eventually ceases to be accessible
  • 7. Without intervention, data + time = no data Vines et al. “examined the availability of data from 516 studies between 2 and 22 years old” - The odds of a data set being reported as extant fell by 17% per year - Broken e-mails and obsolete storage devices were the main obstacles to data sharing - Policies mandating data archiving at publication are clearly needed “The current system of leaving data with authors means that almost all of it is lost over time, unavailable for validation of the original results or to use for entirely new purposes” according to Timothy Vines, one of the researchers. This underscores the need for intentional management of data from all disciplines and opened our conversation on potential roles for librarians in this arena. (“80 Percent of Scientific Data Gone in 20 Years” HNGN, Dec. 20, 2013, http://www.hngn.com/articles/20083/20131220/80-percent- of-scientific-data-gone-in-20-years.htm.) Vines et al., The Availability of Research Data Declines Rapidly with Article Age, Current Biology (2014), http://dx.doi.org/10.1016/j.cub.2013.11.014
  • 8. The new way of doing research Plan Collect Assure Describe Preserve Discover Integrate Analyze PUBLISH …and RE-USE The DataONE lifecycle model
  • 9. Overview 1. Background and context 2. An introduction to Research Data Management (RDM) a. What is RDM? b. What are the main benefits? c. What are the main problems? 3. RDM in practice a. What does it mean for researchers? b. Research data policies 4. Some useful resources
  • 10. What is RDM? “the active management and appraisal of data over the lifecycle of scholarly and scientific interest” What sorts of activities? - Planning and describing data- related work before it takes place - Documenting your data so that others can find and understand it - Storing it safely during the project - Depositing it in a trusted archive at the end of the project - Linking publications to the datasets that underpin them Data management is a part of good research practice. - RCUK Policy and Code of Conduct on the Governance of Good Research Conduct
  • 11. RDM: who and how? • RDM is a hybrid activity, involving multiple stakeholder groups… • The researchers themselves • Research support personnel • Partners based in other institutions, commercial partners, etc • Data Management Planning (DMP) underpins and pulls together different strands of data management activities. DMP is the process of planning, describing and communicating the activities carried out during the research lifecycle in order to… • Keep sensitive data safe • Maximise data’s re-use potential • Support longer-term preservation • Data Management Plans are a means of communication, with contemporaries and potential future re-users alike…
  • 12. Benefits of RDM and data sharing • SPEED: The research process becomes faster • EFFICIENCY: Data collection can be funded once, and used many times for a variety of purposes • ACCESSIBILITY: Interested third parties can (where appropriate) access and build upon publicly-funded research resources with minimal barriers to access • IMPACT and LONGEVITY: Open publications and data receive more citations, over a longer period • TRANSPARENCY and QUALITY: The evidence that underpins research can be made open for anyone to scrutinise, and attempt to replicate findings. This leads to a more robust scholarly record
  • 13. “In genomics research, a large-scale analysis of data sharing shows that studies that made data available in repositories received 9% more citations, when controlling for other variables; and that whilst self-reuse citation declines steeply after two years, reuse by third parties increases even after six years.” (Piwowar and Vision, 2013) Van den Eynden, V. and Bishop, L. (2014). Incentives and motivations for sharing research data, a researcher’s perspective. A Knowledge Exchange Report, http://repository.jisc.ac.uk/5662/1/KE _report-incentives-for-sharing- researchdata.pdf Benefits of RDM: Impact and Longevity
  • 14. “Data is necessary for reproducibility of computational research, but an equal amount of concern should be directed at code sharing.” Victoria Stodden, “Innovation and Growth through Open Access to Scientific Research: Three Ideas for High-Impact Rule Changes” in Litan, Robert E. et al. Rules for Growth: Promoting Innovation and Growth Through Legal Reform. SSRN Scholarly Paper. Rochester, NY: Social Science Research Network, February 8, 2011. http://papers.ssrn.com/abstract=1757982. Benefits of RDM: Quality
  • 15. “Conservatively, we estimate that the value of data in Australia’s public research to be at least $1.9 billion and possibly up to $6 billion a year at current levels of expenditure and activity. Research data curation and sharing might be worth at least $1.8 billion and possibly up to $5.5 billion a year, of which perhaps $1.4 billion to $4.9 billion annually is yet to be realized.” • “Open Research Data”, Report to the Australian National Data Service (ANDS), November 2014 - John Houghton, Victoria Institute of Strategic Economic Studies & Nicholas Gruen, Lateral Economics Benefits of RDM: Financial
  • 16. J. Manyika et al. "Open data: Unlocking innovation and performance with liquid information" McKinsey Global Institute, October 2013
  • 17. “If we are going to wait five years for data to be released, the Arctic is going to be a very different place.” Bryn Nelson, Nature, 10 Sept 2009 http://www.nature.com/nature/jour nal/v461/n7261/index.html Benefits of RDM: Speed https://www.flickr.com/photos/gsfc/7348953774/ - CC-BY
  • 18. Why don’t we live in a data sharing utopia? Five main reasons… i. Lack of widespread understanding of the fundamental issues ii. Lack of joined-up thinking within institutions, countries, internationally… iii. Issues around ownership/privacy iv. Technical/financial limitations, and the need for selection and appraisal of data (which takes time, and costs money…) v. Issues around reward and recognition for researchers
  • 19. Overview 1. Background and context 2. An introduction to Research Data Management (RDM) a. What is RDM? b. What are the main benefits? c. What are the main problems? 3. RDM in practice a. What does it mean for researchers? b. Research data policies 4. Some useful resources
  • 20. What does it mean for researchers? • A disruption to previous working processes • Additional expectations / requirements from the funders (and sometimes their home institutions and publishers too) • But! It provides opportunities for new types of investigation • And leads to a more robust scholarly record
  • 21. What do researchers need to do? 1. Understand funders’ policies (e.g. EC H2020…) 2. Check your intended publisher’s OA policy (e.g. via Sherpa Romeo) 3. Create a data management plan (e.g. with DMPonline) 4. Decide which data to preserve using the DCC’s How-To guide and checklist, “Five Steps to Decide what Data to Keep” 5. Identify a long-term home for your data (e.g. via re3data.org) 6. Link your data to your publications with a persistent identifier (e.g. via DataCite) • N.B. Many repositories, including Zenodo, will do this for you 7. Investigate EC infrastructure services and resources, e.g. EUDAT, OpenAIRE Plus, FOSTER, etc…
  • 22. UK funders 1. Public good 2. Preservation 3. Discovery 4. Confidentiality 5. First use 6. Recognition 7. Public funding Six of the seven RCUK councils require data management plans (or equivalent), as do Wellcome Trust, Cancer Research UK, and more…
  • 23. EXPECTATIONS 1. Research organisations will promote internal awareness of these principles and expectations and ensure that their researchers and research students have a general awareness of the regulatory environment and of the available exemptions which may be used, should the need arise, to justify the withholding of research data; 2. Published research papers should include a short statement describing how and on what terms any supporting research data may be accessed. 3. Each research organisation will have specific policies and associated processes to maintain effective internal awareness of their publicly-funded research data holdings and of requests by third parties to access such data; all of their researchers or research students funded by EPSRC will be required to comply with research organisation policies in this area or, in exceptional circumstances, to provide justification of why this is not possible. 4. Publicly-funded research data that is not generated in digital format will be stored in a manner to facilitate it being shared in the event of a valid request for access to the data being received (this expectation could be satisfied by implementing a policy to convert and store such data in digital format in a timely manner); 5. Research organisations will ensure that appropriately structured metadata describing the research data they hold is published (normally within 12 months of the data being generated) and made freely accessible on the internet; in each case the metadata must be sufficient to allow others to understand what research data exists, why, when and how it was generated, and how to access it. Where the research data referred to in the metadata is a digital object it is expected that the metadata will include use of a robust digital object identifier (For example as available through the DataCite organisation - http://datacite.org). 6. Where access to the data is restricted the published metadata should also give the reason and summarise the conditions which must be satisfied for access to be granted. For example ‘commercially confidential’ data, in which a business organisation has a legitimate interest, might be made available to others subject to a suitable legally enforceable non-disclosure agreement. 7. Research organisations will ensure that EPSRC-funded research data is securely preserved for a minimum of 10-years from the date that any researcher ‘privileged access’ period expires or, if others have accessed the data, from last date on which access to the data was requested by a third party; all reasonable steps will be taken to ensure that publicly-funded data is not held in any jurisdiction where the available legal safeguards provide lower levels of protection than are available in the UK 8. Research organisations will ensure that effective data curation is provided throughout the full data lifecycle, with ‘data curation’ and ‘data lifecycle’ being as defined by the Digital Curation Centre. The full range of responsibilities associated with data curation over the data lifecycle will be clearly allocated within the research organisation, and where research data is subject to restricted access the research organisation will implement and manage appropriate security controls; research organisations will particularly ensure that the quality assurance of their data curation processes is a specifically assigned responsibility; 9. Research organisations will ensure adequate resources are provided to support the curation of publicly-funded research data; these resources will be allocated from within their existing public funding streams, whether received from Research Councils as direct or indirect support for specific projects or from higher education Funding Councils as block grants. 1. INTERNAL AWARENESS - of principles, expectations, regulatory environment, possible exemptions 2. ACCESS STATEMENT - included within research papers 3. POLICIES AND PROCESSES - covering maintenance and access requests 4. NON-DIGITAL DATA - strategy for access / digitisation 5. METADATA PUBLICATION - within 12 months of data generation 6. RESTRICTIONS - list these within metadata 7. PRESERVATION - 10 years from date of last access 8. CURATION - maintenance and security 9. RESOURCING - from existing funding streams
  • 24. RDM in other countries (i) USA • The National Science Foundation (NSF) announced a DMP requirement in 2010, effective 2011 • White House Office of Science and Technology Policy requirement for DMPs announced March 2013 (programmes awarding >$100m annually). White House requirements include mechanisms covering compliance with plans and policies, and also cover costs of implementing plans AUSTRALIA • In 2014 The Australian Research Council (ARC) released new instructions for applications for Laureate Fellowships and Discovery Grants. Both include the following requirements when describing a proposal… • COMMUNICATION OF RESULTS: Outline plans for communicating the research results to other researchers and the broader community, including scholarly and public communication and dissemination • MANAGEMENT OF DATA: Outline plans for the management of data produced as a result of the proposed research, including but not limited to storage, access and re-use arrangements
  • 25. RDM in other countries (ii) SOUTH AFRICA • Announced in January 2015 that (from March 2015) “authors of research papers generated from research either fully or partially funded by NRF, when submitting and publishing in academic journals, should deposit their final peer- reviewed manuscripts that have been accepted by the journals, to the administering Institution Repository with an embargo period of no more than 12 months.” • In addition, the data supporting the publication should be deposited in an accredited Open Access repository, with the provision of a Digital Object Identifier for future citation and referencing. • The NRF encourages its stakeholder community, including NRF’s Business Units and National Research Facilities, to: • Formulate detailed policies on Open Access of publications and data from its funded research; • Establish Open Access repositories; and • Support public access to the repositories through web search and retrieval according to international standards and best practice.
  • 26. RDM in Europe • Horizon 2020 (FP8) features an Open Research Data pilot, and it seems likely that it will become an across-the-board requirement in FP9… • It applies to data (and metadata) needed to validate scientific results, which should be deposited in a dedicated data repository • The Horizon 2020 Open Research Data pilot covers “Innovation actions” and “Research and Innovation actions”, and involves three iterations of Data Management Plan (DMP) • 6 months after start of project, mid-project review, end-of-project (final review) • DMP contents • Data types; Standards used; Sharing/making available; Curation and preservation • There are certain opt-out conditions
  • 27. H2020 Open Data Pilot: specifics (ii) STEP 1 • The data should be deposited, preferably in a dedicated research data repository. These may be subject-based/thematic, institutional or centralised. • EC suggests the Registry of Research Data Repositories (www.re3data.org) and Databib (http://databib.org) for researchers looking to identify an appropriate repository • Open Access Infrastructure for Research in Europe (OpenAIRE) will also become an entry point for linking publications to data. STEP 2 • So far as possible, projects must then take measures to enable for third parties to access, mine, exploit, reproduce and disseminate (free of charge for any user) this research data. • EC suggests attaching Creative Commons Licence (CC-BY or CC0) to the data deposited (http://creativecommons.org/licenses/, http://creativecommons.org/about/cc0). • At the same time, projects should provide information via the chosen repository about tools and instruments at the disposal of the beneficiaries and necessary for validating the results, for instance specialised software or software code, algorithms, analysis protocols, etc. Where possible, they should provide the tools and instruments themselves.
  • 28. H2020 Open Data Pilot: specifics (iii) COSTS Costs relating to the implementation of the pilot will be eligible. Specific technical and professional support services will also be provided (e- Infrastructures WP), e.g. EUDAT and OpenAIRE, alongside support measures such as FOSTER. OPT-OUTS Opt outs are possible, either totally or partially. Projects may opt out of the Pilot at any stage, for a variety of reasons, e.g. • if participation in the Pilot on Open Research Data is incompatible with the Horizon 2020 obligation to protect results if they can reasonably be expected to be commercially or industrially exploited; • confidentiality (e.g. security issues, protection of personal data); • if participation in the Pilot on Open Research Data would jeopardise the achievement of the main aim of the action; • if the project will not generate / collect any research data; • if there are other legitimate reasons to not take part in the Pilot (to be declared at proposal stage)
  • 29. Overview 1. Background and context 2. An introduction to Research Data Management (RDM) a. What is RDM? b. What are the main benefits? c. What are the main problems? 3. RDM in practice a. What does it mean for researchers? b. Research data policies 4. Some useful resources
  • 30. • Web-based tool to help researchers write and maintain DMPs • Provides funder questions and guidance • Includes templates for all RCUK funders, and Horizon 2020 • Provides tailored help from universities • Can include examples and suggest responses • Free to use • Mature (v1 launched April 2010) • Code is Open Source (on GitHub) • https://dmponline.dcc.ac.uk DMPonline
  • 31. • EUDAT offers common data services through a geographically distributed, resilient network of 35 European organisations. These shared services and storage resources are distributed across 15 European nations and data is stored alongside some of Europe’s most powerful supercomputers. • The EUDAT services address the full lifecycle of research data, covering both access and deposit, from informal data sharing to long-term archiving, and addressing identification, discoverability and computability of both long-tail and big data • The vision is to enable European researchers and practitioners from any academic discipline to preserve, find, access, and process data in a trusted environment, as part of a Collaborative Data Infrastructure (CDI) conceived as a network of collaborating, cooperating centres, combining the richness of numerous community-specific data repositories with the permanence and persistence of some of Europe’s largest scientific data centres • Seeks to bridge the gap between research infrastructures and e-Infrastructures through an active engagement strategy, using the communities in the consortium as EUDAT beacons, and integrating others through innovative partnership approaches • Jisc and DCC are partners, and we’re working to embed DCC’s DMPonline tool within the EUDAT suite of services / infrastructure EUDAT
  • 32. Zenodo • Zenodo is a free-to-use data archive, run by the people at CERN • It accepts any kind of data, from any academic discipline • It is generally preferable to store data in a disciplinary data centre, but not all scholarly subjects are equally well served with data centres, so this may make for a useful fallback option • See http://zenodo.org/ for more details
  • 33. Other data management resources (DCC) • Book chapter • Donnelly, M. (2012) “Data Management Plans and Planning”, in Pryor (ed.) Managing Research Data, London: Facet • Guidance, e.g. “How-To Develop a Data Management and Sharing Plan” • DCC Checklist for a Data Management Plan: http://www.dcc.ac.uk/resources/data- management-plans/checklist • Links to all DCC resources via http://www.dcc.ac.uk/resources/data- management-plans
  • 34. Data management resources (non-DCC) • UKDA guidance and book (http://data- archive.ac.uk/media/2894/managi ngsharing.pdf) • Guidance from funders (ESRC and NERC are particularly strong) • Resources from other universities, e.g. Bath, Bristol, Cambridge Edinburgh, Glasgow, Oxford (to name but a few)
  • 35. OBJECTIVES • To support different stakeholders, especially younger researchers, in adopting open access in the context of the European Research Area (ERA) and in complying with the open access policies and rules of participation set out for Horizon 2020 • To integrate open access principles and practice in the current research workflow by targeting the young researcher training environment • To strengthen institutional training capacity to foster compliance with the open access policies of the ERA and Horizon 2020 (beyond the FOSTER project) • To facilitate the adoption, reinforcement and implementation of open access policies from other European funders, in line with the EC’s recommendation, in partnership with PASTEUR4OA project FacilitateOpenScienceTrainingforEuropeanResearch The project
  • 36. METHODS • Identifying already existing content that can be reused in the context of the training activities and repackaging, reformatting them to be used within FOSTER, and developing/creating/enhancing contents as required • Developing the FOSTER Portal to support e- learning, blended learning, self-learning, dissemination of training materials/contents and a Helpdesk • Delivery of face-to-face training, especially training trainers/multipliers who can deliver further training and dissemination activities, within institutions, nations or disciplinary communities • The EC is also funding other specific technical and professional support services via the e-Infrastructures WP, e.g. EUDAT and OpenAIRE FacilitateOpenScienceTrainingforEuropeanResearch The project
  • 37. Thank you • For more information about the FOSTER project: • Website: www.fosteropenscience.eu • Principal investigator: Eloy Rodrigues (eloy@sdum.uminho.pt) • General enquiries: Gwen Franck (gwen.franck@eifl.net) • Twitter: @fosterscience • My contact details: • Email: martin.donnelly@ed.ac.uk • Twitter: @mkdDCC • Slideshare: http://www.slideshare.net/martindo nnelly This work is licensed under the Creative Commons Attribution 2.5 UK: Scotland License.

Notas do Editor

  1. 45 mins (inc. Q&A) Data Management in an Open World: the data sharing agenda, the policy landscape & some useful resources Audience: PhD students, mostly in social sciences / humanities – reuse material from De Gruyter blog post?
  2. Context: Open Knowledge Foundation, Creative Commons, G8 statement Open Science principles are an essential part of knowledge creation and sharing and innovation. They directly support researchers’ need for greater impact, optimum dissemination of research, while also enabling the engagement of citizen scientists and society at large on societal challenges. FOSTER aims to set in place sustainable mechanisms for EU researchers to integrate Open Science in their daily workflow, supporting researchers to optimizing their research visibility and impact and to facilitate the adoption of EU open access policies. Introduction What is Open Science? Why is it beneficial? What are some of the problems What does it mean in practice for researchers? OA and data sharing policies OA resources in detail (inc EU projects) Sherpa Romeo Jorum Pasteur4OA OpenAIRE Others? Data management tools/infrastructure in detail (inc EU projects) DMPonline EUDAT Zenodo Others? Joint Open Science support projects FOSTER Other training resources? Conclusions – take home messages
  3. Context: Open Knowledge Foundation, Creative Commons, G8 statement Open Science principles are an essential part of knowledge creation and sharing and innovation. They directly support researchers’ need for greater impact, optimum dissemination of research, while also enabling the engagement of citizen scientists and society at large on societal challenges. FOSTER aims to set in place sustainable mechanisms for EU researchers to integrate Open Science in their daily workflow, supporting researchers to optimizing their research visibility and impact and to facilitate the adoption of EU open access policies. Introduction What is Open Science? Why is it beneficial? What are some of the problems What does it mean in practice for researchers? OA and data sharing policies OA resources in detail (inc EU projects) Sherpa Romeo Jorum Pasteur4OA OpenAIRE Others? Data management tools/infrastructure in detail (inc EU projects) DMPonline EUDAT Zenodo Others? Joint Open Science support projects FOSTER Other training resources? Conclusions – take home messages
  4. Context: Open Knowledge Foundation, Creative Commons, G8 statement Open Science principles are an essential part of knowledge creation and sharing and innovation. They directly support researchers’ need for greater impact, optimum dissemination of research, while also enabling the engagement of citizen scientists and society at large on societal challenges. FOSTER aims to set in place sustainable mechanisms for EU researchers to integrate Open Science in their daily workflow, supporting researchers to optimizing their research visibility and impact and to facilitate the adoption of EU open access policies.
  5. INNOVATION: Enables new uses for publication (e.g. text mining and automation)
  6. Context: Open Knowledge Foundation, Creative Commons, G8 statement Open Science principles are an essential part of knowledge creation and sharing and innovation. They directly support researchers’ need for greater impact, optimum dissemination of research, while also enabling the engagement of citizen scientists and society at large on societal challenges. FOSTER aims to set in place sustainable mechanisms for EU researchers to integrate Open Science in their daily workflow, supporting researchers to optimizing their research visibility and impact and to facilitate the adoption of EU open access policies. Introduction What is Open Science? Why is it beneficial? What are some of the problems What does it mean in practice for researchers? OA and data sharing policies OA resources in detail (inc EU projects) Sherpa Romeo Jorum Pasteur4OA OpenAIRE Others? Data management tools/infrastructure in detail (inc EU projects) DMPonline EUDAT Zenodo Others? Joint Open Science support projects FOSTER Other training resources? Conclusions – take home messages
  7. Will talk about active management now, and appraisal a little later…
  8. Focus on data management – what is it, what activities are involved (and how do these affect different roles, e.g. researchers, PhD Students, librarians, data managers, research administrators, publishers, policy makers, funders, project managers.) – cite Vines et al.
  9. Note that even if data is not suitable for sharing/publication, it still needs active management!
  10. ACCESSIBILITY: Interested third parties can (where appropriate) access and build upon publicly-funded research resources with minimal barriers to access.
  11. TRANSPARENCY and QUALITY: The evidence that underpins research can be made open for anyone to scrutinise, and attempt to replicate findings. This leads to a more robust scholarly record - linking scholarly publications to the data which underpins them contributes to strengthening the longevity, utility and quality of both ‘DATA’ IS SHORTHAND FOR DATA, RECORDS, SOFTWARE, VIDEO, ETC AND MORE
  12. EFFICIENCY: Data collection can be funded once, and used many times for a variety of purposes.
  13. SPEED: the research and innovation process is now fastest in High Energy Physics due to the community practice of immediate data publication
  14. So, those are the benefits, but there are still barriers to this utopia…
  15. Context: Open Knowledge Foundation, Creative Commons, G8 statement Open Science principles are an essential part of knowledge creation and sharing and innovation. They directly support researchers’ need for greater impact, optimum dissemination of research, while also enabling the engagement of citizen scientists and society at large on societal challenges. FOSTER aims to set in place sustainable mechanisms for EU researchers to integrate Open Science in their daily workflow, supporting researchers to optimizing their research visibility and impact and to facilitate the adoption of EU open access policies. Introduction What is Open Science? Why is it beneficial? What are some of the problems What does it mean in practice for researchers? OA and data sharing policies OA resources in detail (inc EU projects) Sherpa Romeo Jorum Pasteur4OA OpenAIRE Others? Data management tools/infrastructure in detail (inc EU projects) DMPonline EUDAT Zenodo Others? Joint Open Science support projects FOSTER Other training resources? Conclusions – take home messages
  16. EPSRC is the odd one out – requiring an institutional infrastructure, rather than placing the onus directly on the researchers
  17. Interesting because H2020 explicitly requires collaboration, and private sector/SME partners etc (check detail on this). Also note the EU focus on societal benefits etc. Note that open data, in particular, is a ‘novelty’ in many member states
  18. Interesting because H2020 explicitly requires collaboration, and private sector/SME partners etc (check detail on this). Also note the EU focus on societal benefits etc. Note that open data, in particular, is a ‘novelty’ in many member states
  19. A detailed description and scope of the Open Research Data Pilot requirements is provided on the Participants’ Portal
  20. Context: Open Knowledge Foundation, Creative Commons, G8 statement Open Science principles are an essential part of knowledge creation and sharing and innovation. They directly support researchers’ need for greater impact, optimum dissemination of research, while also enabling the engagement of citizen scientists and society at large on societal challenges. FOSTER aims to set in place sustainable mechanisms for EU researchers to integrate Open Science in their daily workflow, supporting researchers to optimizing their research visibility and impact and to facilitate the adoption of EU open access policies. Introduction What is Open Science? Why is it beneficial? What are some of the problems What does it mean in practice for researchers? OA and data sharing policies OA resources in detail (inc EU projects) Sherpa Romeo Jorum Pasteur4OA OpenAIRE Others? Data management tools/infrastructure in detail (inc EU projects) DMPonline EUDAT Zenodo Others? Joint Open Science support projects FOSTER Other training resources? Conclusions – take home messages
  21. ONE LINE DECRIPTION OF WHAT THIS PROJECT DOES!
  22. The FOSTER project – what and how. FOSTER’s training strategy uses a combination of methods and activities, from face-to-face training, to the use of e-learning, blended and self-learning, as well as the dissemination of training materials/contents/curricula via a dedicated training portal, plus a helpdesk. Face-to-face trainings targets graduate schools in European universities and in particular will train trainers/teachers/multipliers that can conduct further training and dissemination activities in their institution, country and disciplinary community. FOSTER combines experiences and materials to showcase best practices, setting the scene for an active learning and teaching community for open access practices across Europe. The main outcomes of the project are: The FOSTER portal to host training courses and curricula; Facilitate the organisation of FOSTER training events and the creation of training content across Europe Identification of existing contents that can be reused in the context of the training activities and develop/create/ enhance contents if/where they are needed;
  23. The FOSTER project – what and how. FOSTER’s training strategy uses a combination of methods and activities, from face-to-face training, to the use of e-learning, blended and self-learning, as well as the dissemination of training materials/contents/curricula via a dedicated training portal, plus a helpdesk. Face-to-face trainings targets graduate schools in European universities and in particular will train trainers/teachers/multipliers that can conduct further training and dissemination activities in their institution, country and disciplinary community. FOSTER combines experiences and materials to showcase best practices, setting the scene for an active learning and teaching community for open access practices across Europe. The main outcomes of the project are: The FOSTER portal to host training courses and curricula; Facilitate the organisation of FOSTER training events and the creation of training content across Europe Identification of existing contents that can be reused in the context of the training activities and develop/create/ enhance contents if/where they are needed;
  24. Conclusion – take home messages and project contact details Open Science makes research stronger, more visible, more valuable to scholarly communities and society at large - this requires training at all stages of the research cycle to make it happen. OS can train future PhDs to be excellent at providing societal context of their research as well as be research excellent; OS can prepare PhD to deal with Research Excellence Framework (REF2020) criteria evolving towards “societal impact”, if they chose to pursue a career in research; OS practices directly prepare practitioners (and academia itself) to engage non-academic stakeholders (general public, policy makers, knowledge based SMEs); Soft-skills for Hard Impact: Broaden doctoral education from strict focus on “research excellence” to include soft-skills that deal with “societal impact of research”, so that future graduates have greater career options whether they stay in research, or not (this could also be a high-level, ambitious conclusion to end the 10 min with)