SlideShare uma empresa Scribd logo
1 de 25
Monitoring the FAIRness of
Data Sets - Introducing the
DANS Approach to FAIR Metrics
Elly Dijk & Peter Doorn, DANS
With thanks to Emily Thomas for some slides
www.dans.knaw.n
l
Open Science Monitor – A workshop organised by OpenAIRE,
EOSCpilot, Jisc, and DANS at the OPEN SCIENCE FAIR, Athens,
7 Sept 2017
What is FAIR data and why is it
important for Open Science?
• Open Science has become a policy priority: not only
publications need to be openly accessible, but also other
research outputs, especially research data
• Principle: open if possible, protected if necessary
• Legitimate restrictions to openness:
• to protect privacy
• data creator must be able to publish first
• Openness itself is not enough, data must also be:
• Findable F
• Accessible A
• Interoperable I
• Reusable R
Institute of
Dutch Academy
and Research
Funding
Organisation
(KNAW & NWO)
since 2005
First predecessor
dates back to
1964 (Steinmetz
Foundation),
Historical Data
Archive 1989
Mission: promote
and provide
permanent
access to digital
research
resources
DANS is about keeping data
FAIR
FAIR guiding principles
1. Findable – Easy to find by both humans and computer systems and based
on mandatory description of the metadata that allow the discovery
of interesting datasets;
2. Accessible – Stored for long term such that they can be easily accessed
and/or downloaded with well-defined license and access conditions (Open
Access when possible), whether at the level of metadata, or at the level of
the actual data content;
3. Interoperable – Ready to be combined with other datasets by humans as
well as computer systems;
4. Reusable – Ready to be used for future research and to be processed
further using computational methods.
Source:
Paper: http://www.nature.com/articles/sdata201618
Everybody loves FAIR!
Everybody wants to be FAIR… But what does that
mean? How to put the principles into practice?
Our struggle with FAIR
operationalization
To be Findable:
•F1. (meta)data are assigned a globally unique and
eternally persistent identifier.
•F2. data are described with rich metadata.
•F3. (meta)data are registered or indexed in a
searchable resource.
•F4. metadata specify the data identifier.
To be Accessible:
•A1 (meta)data are retrievable by their identifier
using a standardized communications protocol.
•A1.1 the protocol is open, free, and universally
implementable.
•A1.2 the protocol allows for an authentication and
authorization procedure, where necessary.
•A2 metadata are accessible, even when the data are
no longer available.
No rankings
Relation F2 – F4?
Relation A1 –
F2,3,4?
What if they never were?
Do access licenses
belong to Re-use?
Access criteria relate mainly to
machine access
Help-page: “The FAIR Data Principles explained”
Difficulties in FAIR
operationalization II
To be Interoperable:
• I1. (meta)data use a formal, accessible, shared, and
broadly applicable language for knowledge
representation.
• I2. (meta)data use vocabularies that follow FAIR
principles.
• I3. (meta)data include qualified references to other
(meta)data.
To be Re-usable:
• R1. meta(data) have a plurality of accurate and
relevant attributes.
• R1.1. (meta)data are released with a clear and
accessible data usage license.
• R1.2. (meta)data are associated with
their provenance.
• R1.3. (meta)data meet domain-relevant community
standards.
Droste effect:
self-reference
Which other metadata?
Is not R1 about
Findable again?
Are not Licenses about Acc
conditions?
What if a license says “Not accessible”?
Their? Whose?
Is not R1.3 about
Findable again?
For FAIR Help see: https://www.dtls.nl/fair-data/fair-principles-explained/
Our evaluation of the original
FAIR principles:
1. Dependencies among dimensions, difficulty to understand and measure the
criteria, no rank order from “low” to “high” FAIRness, grouping of criteria
under dimensions is disputable
2. Do we need or want additional dimensions, principles or criteria?
• Is “openness” a separate dimension, not included in FAIR?
• Is it desirable/possible to say something about “substantive” data quality, such as the
accuracy/precision or correctness of the data?
• What about the long-term access? For how long does data remain FAIR?
• Should data security be included?
3. Several FAIR criteria can be solved at the level of the repository
4. Do we need separate FAIR criteria for different disciplines?
• e.g. machine actionable data are more important in some fields than in other; note that
data accessibility by machines is partly defined by technical specs (A1), partly by licenses
(R1.1)
Therefore, DANS started work on a FAIR “light” implementation in the context
of data repositories compliant with the Data Seal of Approval
GO-FAIR Initiative:
FAIR Metrics Group
• Aim: to define metrics enabling both qualitative and quantitative
assessment of the degree to which online resources comply with
the 15 Principles of FAIR Data
• Founding Members:
• Mark Wilkinson, Universidad Politécnica de Madrid
• Susanna Sansone, University of Oxford
• Michel Dumontier, Maastricht University
• Peter Doorn, DANS
• Luiz Olavo Bonino, VU/DTL
• Erik Schultes, DTL
• https://www.dtls.nl/fair-data/fair-metrics-group/ and
http://fairmetrics.org/
FAIR Metrics Framework
www.eoscpilot.eu
The European Open Science Cloud for Research pilot project is funded by the
European Commission, DG Research & Innovation under contract no. 739563
11
FAIR Metrics Form
Metric Identifier
Metric Name
To which principle does it apply?
What is being measured?
Why should we measure it?
What must be provided?
How do we measure it?
What is a valid result?
For which digital resource(s) is this
relevant?
Examples of their application
across types of digital resource
Comment
www.eoscpilot.eu
The European Open Science Cloud for Research pilot project is funded by the
European Commission, DG Research & Innovation under contract no. 739563
12
• GOALS:
• Develop a broadly useable framework
and tool for FAIR assessment
• Harmonize current ongoing efforts
• Expected outcomes:
• Checklist of clearly defined metrics
• Indication of how these metrics can be
readily implemented (e.g. self-
reporting, automated)
• Proposal(s) for undertaking a FAIR
assessment of a digital resource
• One or more implementations for
performing an assessment
Can we do it?
FAIR Metrics Group
The DANS approach to FAIR
www.eoscpilot.eu
The European Open Science Cloud for Research pilot project is funded by the
European Commission, DG Research & Innovation under contract no. 739563
13
Data Seal of Approval Principles (for
data repositories) 2006
FAIR Principles (for data sets)
2014
data can be found on the internet Findable
data are accessible Accessible
data are in a usable format Interoperable
data are reliable Reusable
data can be referred to (citable)
The resemblance is not perfect:
• usable format (DSA) is an aspect of interoperability (FAIR)
• reliability (DSA) is a precondition for reusability (FAIR)
• FAIR explicitly addresses machine readability
A DSA certified repository already guarantees a baseline data quality level
All data sets in a
Trusted Repository
are FAIR, but some
are more FAIR than
others
Findable (defined by metadata (PID included) and documentation)
1. No PID nor metadata/documentation
2. PID without or with insufficient metadata
3. Sufficient/limited metadata without PID
4. PID with sufficient metadata
5. Extensive metadata and rich additional documentation available
Accessible (defined by presence of user license)
1. Metadata nor data are accessible
2. Metadata are accessible but data is not accessible (no clear terms of reuse in license)
3. User restrictions apply (i.e. privacy, commercial interests, embargo period)
4. Public access (after registration)
5. Open access unrestricted
Interoperable (defined by data format)
1. Proprietary (privately owned), non-open format data
2. Proprietary format, accepted by Certified Trustworthy Data Repository
3. Non-proprietary, open format = ‘preferred format’
4. As well as in the preferred format, data is standardised using a standard vocabulary
format (for the research field to which the data pertain)
5. Data additionally linked to other data to provide context
FAIR “Light” assessment
Towards a FAIR Framework?
Analogous to Certification Framework?
Formal
-----------------------------------
Extended
-----------------------------------------
Core
All noses in the same
direction?
DANS FAIR assessment
profile
• Developing the data assessment
tool: FAIRdat
• We want to create a badge system
using the FAIR principles to assess
data sets in a TDR
• Operationalise the original
principles to ensure no interactions
among dimensions to ease scoring
• Consider Reusability as the
resultant of the other three:
• the average FAIRness as an
indicator of data quality
• (F+A+I)/3=R
• Manual and automatic scoring
• New principles still in progress
F A I R
2 User Reviews
1 Archivist Assessment
24 Downloads
Ideas for our operationalization Solution
R1. (meta)data are richly described with a plurality of
accurate and relevant attributes
≈ F2: “data are described with
rich metadata” → F
R1.1. (meta)data are released with a clear and
accessible data usage license
→ A, aren’t licenses about access
conditions?
R1.2. (meta)data are associated with detailed provenance → F, appears as a criteria for
Findable metadata
R1.3. (meta)data meet domain-relevant community
standards
→ I, ≈ standardized vocabularies
Attempt to operationalize ‘R’
• Proposed solution
Creating
the assessment
tool FAIRdat
Accompanying
documentation
Future goals…
FAIRdat website
…FAIR database
• Identifier
• Name of Data Set
• Depositor / “owner”
• Repository
• Discipline
• …
• FAIR scores per item
• Total FAIR score
• Badge
Display FAIR badges in any repository (Zenodo,
Dataverse, Mendeley Data, figshare, B2SAFE, …)
Monitoring Open Science:
research data
•How can we use this tool for Monitoring Open Science?
• Do you think people are willing to assess research data in
repositories?
• And: Who will do the assessment? (depositors, data
experts at repositories, users)
• Using the FAIR metrics
• in own repository – harvested by OpenAIRE
• in FAIR database – harvested by OpenAIRE
www.eoscpilot.eu
The European Open Science Cloud for Research pilot project is funded by the
European Commission, DG Research & Innovation under contract no. 739563
24
Thank you for your attention!
• You’re invited to visit
the DANS/EOSCpilot
workshop and try
FAIRdat
• FAIR metrics: starring
your datasets
•Friday 8 September at
11:30h
•Book Castle Ground
Floor
• elly.dijk@dans.knaw.nl
• peter.doorn@dans.knaw.nl
• www.dans.knaw.nl
• www.eoscpilot.eu
• www.openaire.eu
www.eoscpilot.eu
The European Open Science Cloud for Research pilot project is funded by the
European Commission, DG Research & Innovation under contract no. 739563
25
FAIR Metrics:
STARRING
YOUR DATAF A I R
2 User Reviews
1 Archivist Assessment
24 Downloads

Mais conteúdo relacionado

Mais procurados

FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...
EUDAT
 

Mais procurados (20)

Webinar@AIMS_FAIR Principles and Data Management Planning
Webinar@AIMS_FAIR Principles and Data Management PlanningWebinar@AIMS_FAIR Principles and Data Management Planning
Webinar@AIMS_FAIR Principles and Data Management Planning
 
Horizon 2020 open access and open data mandates
Horizon 2020 open access and open data mandatesHorizon 2020 open access and open data mandates
Horizon 2020 open access and open data mandates
 
CARARE: Can I use this data? FAIR into practice
CARARE: Can I use this data? FAIR into practiceCARARE: Can I use this data? FAIR into practice
CARARE: Can I use this data? FAIR into practice
 
ANDS and Data Management
ANDS and Data ManagementANDS and Data Management
ANDS and Data Management
 
Increasing the Reputation of your Published Data on the Web
Increasing the Reputation of your Published Data on the WebIncreasing the Reputation of your Published Data on the Web
Increasing the Reputation of your Published Data on the Web
 
Open Data: Strategies for Research Data Management (and Planning)
Open Data: Strategies for Research Data  Management (and Planning)Open Data: Strategies for Research Data  Management (and Planning)
Open Data: Strategies for Research Data Management (and Planning)
 
Preparing Data for Sharing: The FAIR Principles
Preparing Data for Sharing: The FAIR PrinciplesPreparing Data for Sharing: The FAIR Principles
Preparing Data for Sharing: The FAIR Principles
 
FAIR data and data management
FAIR data and data managementFAIR data and data management
FAIR data and data management
 
Kr slides fair astronomy 20181019
Kr slides fair astronomy 20181019Kr slides fair astronomy 20181019
Kr slides fair astronomy 20181019
 
FAIR data overview
FAIR data overviewFAIR data overview
FAIR data overview
 
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
 
"Cool" metadata for FAIR data
"Cool" metadata for FAIR data"Cool" metadata for FAIR data
"Cool" metadata for FAIR data
 
Framework and Roadmap towards an Open Science Infrastructure/Simon Hodson
Framework and Roadmap towards an Open Science Infrastructure/Simon HodsonFramework and Roadmap towards an Open Science Infrastructure/Simon Hodson
Framework and Roadmap towards an Open Science Infrastructure/Simon Hodson
 
FAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech ProposalsFAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech Proposals
 
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
 
Mendeley Data FAIR hackathon
Mendeley Data FAIR hackathonMendeley Data FAIR hackathon
Mendeley Data FAIR hackathon
 
The Horizon 2020 Open Data Pilot - OpenAIRE webinar (Oct. 21 2014) by Sarah J...
The Horizon 2020 Open Data Pilot - OpenAIRE webinar (Oct. 21 2014) by Sarah J...The Horizon 2020 Open Data Pilot - OpenAIRE webinar (Oct. 21 2014) by Sarah J...
The Horizon 2020 Open Data Pilot - OpenAIRE webinar (Oct. 21 2014) by Sarah J...
 
A Finnish perspective on FAIRsFAIR outputs
A Finnish perspective on FAIRsFAIR outputsA Finnish perspective on FAIRsFAIR outputs
A Finnish perspective on FAIRsFAIR outputs
 
Why institutions need to raise their capabilities to support FAIR
Why institutions need to raise their capabilities to support FAIRWhy institutions need to raise their capabilities to support FAIR
Why institutions need to raise their capabilities to support FAIR
 

Semelhante a OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the DANS approach to FAIR metrics

Semelhante a OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the DANS approach to FAIR metrics (20)

The future of FAIR
The future of FAIRThe future of FAIR
The future of FAIR
 
FAIR data
FAIR dataFAIR data
FAIR data
 
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
 
FAIR data: what it means, how we achieve it, and the role of RDA
FAIR data: what it means, how we achieve it, and the role of RDAFAIR data: what it means, how we achieve it, and the role of RDA
FAIR data: what it means, how we achieve it, and the role of RDA
 
Findable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) dataFindable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) data
 
Data sharing in the Netherlands
Data sharing in the NetherlandsData sharing in the Netherlands
Data sharing in the Netherlands
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon Hodson
 
DTL Integrator's meeting
DTL Integrator's meetingDTL Integrator's meeting
DTL Integrator's meeting
 
PARTHENOS Common Policies and Implementation Strategies
PARTHENOS Common Policies and Implementation StrategiesPARTHENOS Common Policies and Implementation Strategies
PARTHENOS Common Policies and Implementation Strategies
 
VODAN Africa IN.pptx
VODAN Africa IN.pptxVODAN Africa IN.pptx
VODAN Africa IN.pptx
 
John morrissey c3 dis fair working data.pptx
John morrissey c3 dis fair working data.pptxJohn morrissey c3 dis fair working data.pptx
John morrissey c3 dis fair working data.pptx
 
Turning FAIR into Reality - Role for Libraries
Turning FAIR into Reality - Role for Libraries Turning FAIR into Reality - Role for Libraries
Turning FAIR into Reality - Role for Libraries
 
FAIR play?
FAIR play? FAIR play?
FAIR play?
 
Introduction to FAIR Data and Research Objects
Introduction to FAIR Data and Research ObjectsIntroduction to FAIR Data and Research Objects
Introduction to FAIR Data and Research Objects
 
FAIR-Principles-and-ELN.pdf
FAIR-Principles-and-ELN.pdfFAIR-Principles-and-ELN.pdf
FAIR-Principles-and-ELN.pdf
 
FAIR Ddata in trustworthy repositories: the basics
FAIR Ddata in trustworthy repositories: the basicsFAIR Ddata in trustworthy repositories: the basics
FAIR Ddata in trustworthy repositories: the basics
 
EOSC-Nordic: FAIR support for data repositories
EOSC-Nordic: FAIR support for data repositoriesEOSC-Nordic: FAIR support for data repositories
EOSC-Nordic: FAIR support for data repositories
 
Turning FAIR into Reality: Briefing on the EC’s report on FAIR data
Turning FAIR into Reality: Briefing on the EC’s report on FAIR dataTurning FAIR into Reality: Briefing on the EC’s report on FAIR data
Turning FAIR into Reality: Briefing on the EC’s report on FAIR data
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR Data
 

Mais de Open Science Fair

OSFair2017 Worksop | NUCLEUS project - Are you ready to perform in RRI ecosys...
OSFair2017 Worksop | NUCLEUS project - Are you ready to perform in RRI ecosys...OSFair2017 Worksop | NUCLEUS project - Are you ready to perform in RRI ecosys...
OSFair2017 Worksop | NUCLEUS project - Are you ready to perform in RRI ecosys...
Open Science Fair
 
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...
Open Science Fair
 
OSFair2017 Workshop | Research lifecycle in Arts, Humanities and Social Sciences
OSFair2017 Workshop | Research lifecycle in Arts, Humanities and Social SciencesOSFair2017 Workshop | Research lifecycle in Arts, Humanities and Social Sciences
OSFair2017 Workshop | Research lifecycle in Arts, Humanities and Social Sciences
Open Science Fair
 

Mais de Open Science Fair (20)

OSFair2017 workshop | Monitoring open science trends in europe
OSFair2017 workshop | Monitoring open science trends in europeOSFair2017 workshop | Monitoring open science trends in europe
OSFair2017 workshop | Monitoring open science trends in europe
 
OSFair2017 Worksop | NUCLEUS project - Are you ready to perform in RRI ecosys...
OSFair2017 Worksop | NUCLEUS project - Are you ready to perform in RRI ecosys...OSFair2017 Worksop | NUCLEUS project - Are you ready to perform in RRI ecosys...
OSFair2017 Worksop | NUCLEUS project - Are you ready to perform in RRI ecosys...
 
OSFair2017 Workshop | Data Analytics meets Social Sciences: New Frontiers of ...
OSFair2017 Workshop | Data Analytics meets Social Sciences: New Frontiers of ...OSFair2017 Workshop | Data Analytics meets Social Sciences: New Frontiers of ...
OSFair2017 Workshop | Data Analytics meets Social Sciences: New Frontiers of ...
 
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...
 
OSFair2017 Workshop | Research lifecycle in Arts, Humanities and Social Sciences
OSFair2017 Workshop | Research lifecycle in Arts, Humanities and Social SciencesOSFair2017 Workshop | Research lifecycle in Arts, Humanities and Social Sciences
OSFair2017 Workshop | Research lifecycle in Arts, Humanities and Social Sciences
 
OSFair2017 Workshop | Towards a Policy Framework for the European Open Scienc...
OSFair2017 Workshop | Towards a Policy Framework for the European Open Scienc...OSFair2017 Workshop | Towards a Policy Framework for the European Open Scienc...
OSFair2017 Workshop | Towards a Policy Framework for the European Open Scienc...
 
OSFair2017 Workshop | Big Mechanism: deep reading for cancer biology
OSFair2017 Workshop | Big Mechanism: deep reading for cancer biologyOSFair2017 Workshop | Big Mechanism: deep reading for cancer biology
OSFair2017 Workshop | Big Mechanism: deep reading for cancer biology
 
OSFair2017 Workshop | Text mining
OSFair2017 Workshop | Text miningOSFair2017 Workshop | Text mining
OSFair2017 Workshop | Text mining
 
OSFair2017 Workshop | EOSCpilot governance
OSFair2017 Workshop | EOSCpilot governanceOSFair2017 Workshop | EOSCpilot governance
OSFair2017 Workshop | EOSCpilot governance
 
OSFair2017 Workshop | Brokering services facilitating interoperability and da...
OSFair2017 Workshop | Brokering services facilitating interoperability and da...OSFair2017 Workshop | Brokering services facilitating interoperability and da...
OSFair2017 Workshop | Brokering services facilitating interoperability and da...
 
OSFair2017 Workshop | Service provisioning for excellent sciences
OSFair2017 Workshop | Service provisioning for excellent sciencesOSFair2017 Workshop | Service provisioning for excellent sciences
OSFair2017 Workshop | Service provisioning for excellent sciences
 
OSFair2017 Theatrical Workshop | Are you ready to perform in the rri ecosystem
OSFair2017 Theatrical Workshop | Are you ready to perform in the rri ecosystemOSFair2017 Theatrical Workshop | Are you ready to perform in the rri ecosystem
OSFair2017 Theatrical Workshop | Are you ready to perform in the rri ecosystem
 
OSFair2017 Theatrical Workshop | Nucleus H2020 EU project
OSFair2017 Theatrical Workshop | Nucleus H2020 EU projectOSFair2017 Theatrical Workshop | Nucleus H2020 EU project
OSFair2017 Theatrical Workshop | Nucleus H2020 EU project
 
OSFair2017 Workshop | Open Knowledge Maps, A visual interface to the world's ...
OSFair2017 Workshop | Open Knowledge Maps, A visual interface to the world's ...OSFair2017 Workshop | Open Knowledge Maps, A visual interface to the world's ...
OSFair2017 Workshop | Open Knowledge Maps, A visual interface to the world's ...
 
OSFair2017 Training | Reproducibility in critical care research
OSFair2017 Training | Reproducibility in critical care researchOSFair2017 Training | Reproducibility in critical care research
OSFair2017 Training | Reproducibility in critical care research
 
OSFair2017 Training | Big data and evidence-based medicine in Greece
OSFair2017 Training | Big data and evidence-based medicine in GreeceOSFair2017 Training | Big data and evidence-based medicine in Greece
OSFair2017 Training | Big data and evidence-based medicine in Greece
 
OSFair2017 Training | What is Open Science and why should I care?
OSFair2017 Training | What is Open Science and why should I care?OSFair2017 Training | What is Open Science and why should I care?
OSFair2017 Training | What is Open Science and why should I care?
 
OSFair2017 Training | OpenAIRE monitoring services, EC FP7 & H2020 & other na...
OSFair2017 Training | OpenAIRE monitoring services, EC FP7 & H2020 & other na...OSFair2017 Training | OpenAIRE monitoring services, EC FP7 & H2020 & other na...
OSFair2017 Training | OpenAIRE monitoring services, EC FP7 & H2020 & other na...
 
OSFair2017 Training | Designing & implementing open access, open data & open ...
OSFair2017 Training | Designing & implementing open access, open data & open ...OSFair2017 Training | Designing & implementing open access, open data & open ...
OSFair2017 Training | Designing & implementing open access, open data & open ...
 
OSFair2017 Training | Best practice in Open Science
OSFair2017 Training | Best practice in Open ScienceOSFair2017 Training | Best practice in Open Science
OSFair2017 Training | Best practice in Open Science
 

Último

Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
PirithiRaju
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
Areesha Ahmad
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
gindu3009
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Sérgio Sacani
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
Lokesh Kothari
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
AlMamun560346
 

Último (20)

Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 

OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the DANS approach to FAIR metrics

  • 1. Monitoring the FAIRness of Data Sets - Introducing the DANS Approach to FAIR Metrics Elly Dijk & Peter Doorn, DANS With thanks to Emily Thomas for some slides www.dans.knaw.n l Open Science Monitor – A workshop organised by OpenAIRE, EOSCpilot, Jisc, and DANS at the OPEN SCIENCE FAIR, Athens, 7 Sept 2017
  • 2. What is FAIR data and why is it important for Open Science? • Open Science has become a policy priority: not only publications need to be openly accessible, but also other research outputs, especially research data • Principle: open if possible, protected if necessary • Legitimate restrictions to openness: • to protect privacy • data creator must be able to publish first • Openness itself is not enough, data must also be: • Findable F • Accessible A • Interoperable I • Reusable R
  • 3. Institute of Dutch Academy and Research Funding Organisation (KNAW & NWO) since 2005 First predecessor dates back to 1964 (Steinmetz Foundation), Historical Data Archive 1989 Mission: promote and provide permanent access to digital research resources DANS is about keeping data FAIR
  • 4. FAIR guiding principles 1. Findable – Easy to find by both humans and computer systems and based on mandatory description of the metadata that allow the discovery of interesting datasets; 2. Accessible – Stored for long term such that they can be easily accessed and/or downloaded with well-defined license and access conditions (Open Access when possible), whether at the level of metadata, or at the level of the actual data content; 3. Interoperable – Ready to be combined with other datasets by humans as well as computer systems; 4. Reusable – Ready to be used for future research and to be processed further using computational methods. Source:
  • 6. Everybody loves FAIR! Everybody wants to be FAIR… But what does that mean? How to put the principles into practice?
  • 7. Our struggle with FAIR operationalization To be Findable: •F1. (meta)data are assigned a globally unique and eternally persistent identifier. •F2. data are described with rich metadata. •F3. (meta)data are registered or indexed in a searchable resource. •F4. metadata specify the data identifier. To be Accessible: •A1 (meta)data are retrievable by their identifier using a standardized communications protocol. •A1.1 the protocol is open, free, and universally implementable. •A1.2 the protocol allows for an authentication and authorization procedure, where necessary. •A2 metadata are accessible, even when the data are no longer available. No rankings Relation F2 – F4? Relation A1 – F2,3,4? What if they never were? Do access licenses belong to Re-use? Access criteria relate mainly to machine access Help-page: “The FAIR Data Principles explained”
  • 8. Difficulties in FAIR operationalization II To be Interoperable: • I1. (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation. • I2. (meta)data use vocabularies that follow FAIR principles. • I3. (meta)data include qualified references to other (meta)data. To be Re-usable: • R1. meta(data) have a plurality of accurate and relevant attributes. • R1.1. (meta)data are released with a clear and accessible data usage license. • R1.2. (meta)data are associated with their provenance. • R1.3. (meta)data meet domain-relevant community standards. Droste effect: self-reference Which other metadata? Is not R1 about Findable again? Are not Licenses about Acc conditions? What if a license says “Not accessible”? Their? Whose? Is not R1.3 about Findable again? For FAIR Help see: https://www.dtls.nl/fair-data/fair-principles-explained/
  • 9. Our evaluation of the original FAIR principles: 1. Dependencies among dimensions, difficulty to understand and measure the criteria, no rank order from “low” to “high” FAIRness, grouping of criteria under dimensions is disputable 2. Do we need or want additional dimensions, principles or criteria? • Is “openness” a separate dimension, not included in FAIR? • Is it desirable/possible to say something about “substantive” data quality, such as the accuracy/precision or correctness of the data? • What about the long-term access? For how long does data remain FAIR? • Should data security be included? 3. Several FAIR criteria can be solved at the level of the repository 4. Do we need separate FAIR criteria for different disciplines? • e.g. machine actionable data are more important in some fields than in other; note that data accessibility by machines is partly defined by technical specs (A1), partly by licenses (R1.1) Therefore, DANS started work on a FAIR “light” implementation in the context of data repositories compliant with the Data Seal of Approval
  • 10. GO-FAIR Initiative: FAIR Metrics Group • Aim: to define metrics enabling both qualitative and quantitative assessment of the degree to which online resources comply with the 15 Principles of FAIR Data • Founding Members: • Mark Wilkinson, Universidad Politécnica de Madrid • Susanna Sansone, University of Oxford • Michel Dumontier, Maastricht University • Peter Doorn, DANS • Luiz Olavo Bonino, VU/DTL • Erik Schultes, DTL • https://www.dtls.nl/fair-data/fair-metrics-group/ and http://fairmetrics.org/
  • 11. FAIR Metrics Framework www.eoscpilot.eu The European Open Science Cloud for Research pilot project is funded by the European Commission, DG Research & Innovation under contract no. 739563 11
  • 12. FAIR Metrics Form Metric Identifier Metric Name To which principle does it apply? What is being measured? Why should we measure it? What must be provided? How do we measure it? What is a valid result? For which digital resource(s) is this relevant? Examples of their application across types of digital resource Comment www.eoscpilot.eu The European Open Science Cloud for Research pilot project is funded by the European Commission, DG Research & Innovation under contract no. 739563 12 • GOALS: • Develop a broadly useable framework and tool for FAIR assessment • Harmonize current ongoing efforts • Expected outcomes: • Checklist of clearly defined metrics • Indication of how these metrics can be readily implemented (e.g. self- reporting, automated) • Proposal(s) for undertaking a FAIR assessment of a digital resource • One or more implementations for performing an assessment Can we do it? FAIR Metrics Group
  • 13. The DANS approach to FAIR www.eoscpilot.eu The European Open Science Cloud for Research pilot project is funded by the European Commission, DG Research & Innovation under contract no. 739563 13 Data Seal of Approval Principles (for data repositories) 2006 FAIR Principles (for data sets) 2014 data can be found on the internet Findable data are accessible Accessible data are in a usable format Interoperable data are reliable Reusable data can be referred to (citable) The resemblance is not perfect: • usable format (DSA) is an aspect of interoperability (FAIR) • reliability (DSA) is a precondition for reusability (FAIR) • FAIR explicitly addresses machine readability A DSA certified repository already guarantees a baseline data quality level
  • 14. All data sets in a Trusted Repository are FAIR, but some are more FAIR than others
  • 15. Findable (defined by metadata (PID included) and documentation) 1. No PID nor metadata/documentation 2. PID without or with insufficient metadata 3. Sufficient/limited metadata without PID 4. PID with sufficient metadata 5. Extensive metadata and rich additional documentation available Accessible (defined by presence of user license) 1. Metadata nor data are accessible 2. Metadata are accessible but data is not accessible (no clear terms of reuse in license) 3. User restrictions apply (i.e. privacy, commercial interests, embargo period) 4. Public access (after registration) 5. Open access unrestricted Interoperable (defined by data format) 1. Proprietary (privately owned), non-open format data 2. Proprietary format, accepted by Certified Trustworthy Data Repository 3. Non-proprietary, open format = ‘preferred format’ 4. As well as in the preferred format, data is standardised using a standard vocabulary format (for the research field to which the data pertain) 5. Data additionally linked to other data to provide context FAIR “Light” assessment
  • 16. Towards a FAIR Framework? Analogous to Certification Framework? Formal ----------------------------------- Extended ----------------------------------------- Core All noses in the same direction?
  • 17. DANS FAIR assessment profile • Developing the data assessment tool: FAIRdat • We want to create a badge system using the FAIR principles to assess data sets in a TDR • Operationalise the original principles to ensure no interactions among dimensions to ease scoring • Consider Reusability as the resultant of the other three: • the average FAIRness as an indicator of data quality • (F+A+I)/3=R • Manual and automatic scoring • New principles still in progress F A I R 2 User Reviews 1 Archivist Assessment 24 Downloads
  • 18. Ideas for our operationalization Solution R1. (meta)data are richly described with a plurality of accurate and relevant attributes ≈ F2: “data are described with rich metadata” → F R1.1. (meta)data are released with a clear and accessible data usage license → A, aren’t licenses about access conditions? R1.2. (meta)data are associated with detailed provenance → F, appears as a criteria for Findable metadata R1.3. (meta)data meet domain-relevant community standards → I, ≈ standardized vocabularies Attempt to operationalize ‘R’ • Proposed solution
  • 22. …FAIR database • Identifier • Name of Data Set • Depositor / “owner” • Repository • Discipline • … • FAIR scores per item • Total FAIR score • Badge
  • 23. Display FAIR badges in any repository (Zenodo, Dataverse, Mendeley Data, figshare, B2SAFE, …)
  • 24. Monitoring Open Science: research data •How can we use this tool for Monitoring Open Science? • Do you think people are willing to assess research data in repositories? • And: Who will do the assessment? (depositors, data experts at repositories, users) • Using the FAIR metrics • in own repository – harvested by OpenAIRE • in FAIR database – harvested by OpenAIRE www.eoscpilot.eu The European Open Science Cloud for Research pilot project is funded by the European Commission, DG Research & Innovation under contract no. 739563 24
  • 25. Thank you for your attention! • You’re invited to visit the DANS/EOSCpilot workshop and try FAIRdat • FAIR metrics: starring your datasets •Friday 8 September at 11:30h •Book Castle Ground Floor • elly.dijk@dans.knaw.nl • peter.doorn@dans.knaw.nl • www.dans.knaw.nl • www.eoscpilot.eu • www.openaire.eu www.eoscpilot.eu The European Open Science Cloud for Research pilot project is funded by the European Commission, DG Research & Innovation under contract no. 739563 25 FAIR Metrics: STARRING YOUR DATAF A I R 2 User Reviews 1 Archivist Assessment 24 Downloads