SlideShare uma empresa Scribd logo
1 de 28
Quality and
improving
interoperability
between language
resources:             http://www.datasealofapproval.org/
trust, process,
simplicity         Dirk.Roorda@dans.knaw.nl
                   coordinator infrastructure at
                   http://www.dans.knaw.nl
Quality and interoperability




    evolution
    hard-to-fake traits
    indicating fitness
    promote interoperability
Overview
 • Introduction and Theory
   • qualities
   • trust, simplicity
   • guidelines
 • Process and Demo
   • assessment and review
 • Discussion and Application
   • CLARIN centers
   • language resources
Scientific Quality
http://www.ploscompbiol.org/article/metrics/info:doi/10.1371/journal.pcbi.1000112
Scientific quality
  • transparent
    • from producer
    • through repository
    • to consumer
  • properties to guard
    • authenticity
    • integrity
    • provenance
Usage quality
 • data formats
   • usability
 • metadata
   • findability
   • intellegibility
Quality control
 • by the stakeholders
   • data producers
   • data custodians
   • date consumers
 • custodians = repositories
 • substantial role for repositories
   • guidelines for producers
   • agreements for consumers
Quality issues
 • metadata standards
    • CMDI and www.isocat.org
 • preferred formats
    • TEI, XML
 • referencing systems
    • persistent identifiers
 • long term preservation
    • after the live-environment has died off
 • interoperability
    • OAI-PMH
Quality issues
 • search engines
   • CLARIN search and develop
 • access rights
   • comply with privacy law, copyright law
   • respect people from which data is obtained
 • accountability
   • for all repository operations
Quality and Trust
 • imperfection lurks everywhere
 • trust works where certainty blocks
 • trust is a process
   • to greater quality
   • to better relationships
   • to more certainty
Quality and Simplicity
  reduce       organize
time learn differences
         context
  emotion         trust
        failure
focus:
  subtract what is obvious
   add what is meaningful
                             http://lawsofsimplicity.com/
Guidelines: producers
http://www.datasealofapproval.org/

    1.The data producer deposits the research data in
    a data repository with sufficient information for
    others to assess the scientific and scholarly quality
    of the research data and compliance with
    disciplinary and ethical norms.
    2. The data producer provides the research data in
    formats recommended by the data repository
    3. The data producer provides the research data
    together with the metadata requested by the data
    repository
Guidelines: consumers
http://www.datasealofapproval.org/

    14. The data consumer complies with access
    regulations set by the data repository
    15. The data consumer conforms to and agrees
    with any codes of conduct that are generally
    accepted in higher education and research for the
    exchange and proper use of knowledge and
    information
    16. The data consumer respects the applicable
    licenses of the data repository regarding the use of
    the research data
Guidelines: repositories
http://www.datasealofapproval.org/

4. The data repository has an explicit mission in the area
of digital archiving and promulgates it
5. The data repository uses due diligence to ensure
compliance with legal regulations and contracts
including, when applicable, regulations governing the
protection of human subjects.
6. The data repository applies documented processes
and procedures for managing data storage
7. The data repository has a plan for long-term
preservation of its digital assets
Guidelines: repositories
http://www.datasealofapproval.org/

8. Archiving takes place according to explicit workflows across the
data life cycle
9. The data repository assumes responsibility from the data
producers for access and availability of the digital objects
10. The data repository enables the users to utilize the research
data and refer to them
11. The data repository ensures the integrity of the digital objects
and the metadata
12. The data repository ensures the authenticity of the digital
objects and the metadata
13. The technical infrastructure explicitly supports the tasks and
functions described in internationally accepted archival standards
like OAIS
Guidelines: outsourcing
http://www.datasealofapproval.org/


 repositories may outsource digital preservation
 to specialist repositories
 • implement all except 4,6,7,8 and 13
 • store a copy of the data in another (TDR) that
     • has acquired the DSA logo
     • by implementing each of the sixteen guidelines
     • (including 4, 6, 7, 8 and 13).
Seal of Approvement
 • a repository shows it on its webpage
 • if conditions are fulfilled
 • as testified by
   • a self-assessment
   • with reviews
   • on a yearly basis
 • the exact level of compliance is
   • transparently published under the seal
Assessment and review
 minimum requirements
 threshold will go up
 as time proceeds


 score actions taken           comments                  issues
 *      nothing done           give a reason
 **     theoretical concept    point to initiation doc   describe main issues
 ***    implementation phase   point to definition doc   describe main issues
 ****   fully implemented      point to definition doc
 N/A    not applicable         give a reason
Organisation
 • repositories represented by a board
 • tools to facilitate the procedure
   • modifiaction record
 • the DSA website links to compliant
   repositories
CLARIN centres
 • A = provide infrastructure
    • managing the federation
 • B = provide services
    • data and webservices
 • C = provide metadata
    • harvestable metadata
 • R = respected = recognised
    • offer LRT resources in whatever form
 • E = external
    • offer non-LRT resources or services
       • identity federations
       • national libraries
Group assignment
 • P(roducers)
   • invent p-guidelines for B/C centers
 • R(epositories)
   • invent r-guidelines for A/B centers
 • C(onsumers)
   • invent c-guidelines for B/C/R centers
 Suggestions for
 • assessment
 • review
 • modification record
Wrap-up: P-Group

metadata about background

information about researchers

    who, why, publications

    DAI

    In IMDI it is difficult to update information, affiliation updates,
    use unique identifiers for participants in building a corpus, store records of people,
    and link from the metadata of resources to the records of people

using formats depending on formats
    formats maybe standardised, but not usable to researchers, I do not want to wrap
    my data in dead formats: the repositories should support innovation in this respect,
    when it is driven by researchers
Wrap up: C-group
goal is: finding info in a repository

we need:

    overview of access rights

    proper web-connection to the repository

    user-friendly interface

    low threshold for feedback for new features

         we should be part of the chain in the design of the access tools

GUIDELINES
   WE WANT ALL CENTERS IN THE CHAIN THAT PROVIDE US WITH THE
   INFORMATION WE NEED TO OFFER US TRANSPARENCY AND VERIFIABILITY
   ON HOW THEIR DATA IS OBTAINED, PROCESSED AND
    CONTROLLED/MANAGED
    WE WANT TOOLS WITH CLEAR COPYRIGHT PERMISSIONS THAT HAVE A
Wrap-up: R-group
we provide infrastructure and management for data

we want to standardize our stuff

we need knowledge, the right metadata of the stuff that is coming to us

we want the materials in the right format, allowing for some flexibility
retro-archiving: we offer tools for converting legacy data, so that producers may submit
raw materials

management of data concerning legal access

    protect the providers, so that the providers can trust the consumers: licensing forms

share knowledge about services we provide with

    potential users: people working in the field

    other repositories
we want a forum as an instrument for developing trust between producers and
consumers: the community becomes more transparent
Wrap-up: General

add weights to guidelines, in order to
declare some guidelines more important
than others.

Mais conteúdo relacionado

Mais procurados

II-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceII-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in Nice
Dr. Haxel Consult
 
TDWI Checklist Report: Active Data Archiving
TDWI Checklist Report:  Active Data ArchivingTDWI Checklist Report:  Active Data Archiving
TDWI Checklist Report: Active Data Archiving
RainStor
 

Mais procurados (20)

Mendeley Data FAIR hackathon
Mendeley Data FAIR hackathonMendeley Data FAIR hackathon
Mendeley Data FAIR hackathon
 
Data accessibilityandchallenges
Data accessibilityandchallengesData accessibilityandchallenges
Data accessibilityandchallenges
 
Core Trust Seal for Trustworthy Data Repositories, 2018-04-19
Core Trust Seal for Trustworthy Data Repositories, 2018-04-19Core Trust Seal for Trustworthy Data Repositories, 2018-04-19
Core Trust Seal for Trustworthy Data Repositories, 2018-04-19
 
ERA CoBioTech Data Management Webinar
ERA CoBioTech Data Management WebinarERA CoBioTech Data Management Webinar
ERA CoBioTech Data Management Webinar
 
White Manipulating Metadata to Enhance Access
White Manipulating Metadata to Enhance AccessWhite Manipulating Metadata to Enhance Access
White Manipulating Metadata to Enhance Access
 
How the Core Trust Seal (CTS) Enables FAIR Data
How the Core Trust Seal (CTS) Enables FAIR DataHow the Core Trust Seal (CTS) Enables FAIR Data
How the Core Trust Seal (CTS) Enables FAIR Data
 
How core trust seal enables FAIR data - Natalie Harrower
How core trust seal enables FAIR data - Natalie HarrowerHow core trust seal enables FAIR data - Natalie Harrower
How core trust seal enables FAIR data - Natalie Harrower
 
Rusbridge Feb 8 Improving Clarity around Continuing Access
Rusbridge Feb 8 Improving Clarity around Continuing AccessRusbridge Feb 8 Improving Clarity around Continuing Access
Rusbridge Feb 8 Improving Clarity around Continuing Access
 
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
 
"Cool" metadata for FAIR data
"Cool" metadata for FAIR data"Cool" metadata for FAIR data
"Cool" metadata for FAIR data
 
Updates on the FAIR Data Maturity Model RDA Working Group & the DG RTD FAIR i...
Updates on the FAIR Data Maturity Model RDA Working Group & the DG RTD FAIR i...Updates on the FAIR Data Maturity Model RDA Working Group & the DG RTD FAIR i...
Updates on the FAIR Data Maturity Model RDA Working Group & the DG RTD FAIR i...
 
EUDAT Research Data Management | www.eudat.eu |
EUDAT Research Data Management | www.eudat.eu | EUDAT Research Data Management | www.eudat.eu |
EUDAT Research Data Management | www.eudat.eu |
 
RDMRose 1.4 The research data lifecycle
RDMRose 1.4 The research data lifecycleRDMRose 1.4 The research data lifecycle
RDMRose 1.4 The research data lifecycle
 
II-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceII-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in Nice
 
TDWI Checklist Report: Active Data Archiving
TDWI Checklist Report:  Active Data ArchivingTDWI Checklist Report:  Active Data Archiving
TDWI Checklist Report: Active Data Archiving
 
Supporting Data Stewardship in the Solid Earth Sciences
Supporting Data Stewardship in the Solid Earth SciencesSupporting Data Stewardship in the Solid Earth Sciences
Supporting Data Stewardship in the Solid Earth Sciences
 
Persistent Identifiers in EUDAT services| www.eudat.eu |
Persistent Identifiers in EUDAT services| www.eudat.eu | Persistent Identifiers in EUDAT services| www.eudat.eu |
Persistent Identifiers in EUDAT services| www.eudat.eu |
 
Data Publishing Models by Sünje Dallmeier-Tiessen
Data Publishing Models by Sünje Dallmeier-TiessenData Publishing Models by Sünje Dallmeier-Tiessen
Data Publishing Models by Sünje Dallmeier-Tiessen
 
Leverage DSpace for an enterprise, mission critical platform
Leverage DSpace for an enterprise, mission critical platformLeverage DSpace for an enterprise, mission critical platform
Leverage DSpace for an enterprise, mission critical platform
 

Semelhante a 2010 CLARA Nijmegen - Data Seal of Approval tutorial

Scholze liber 2015-06-25_final
Scholze liber 2015-06-25_finalScholze liber 2015-06-25_final
Scholze liber 2015-06-25_final
Karlsruhe Institute of Technology (KIT)
 
PIDs and DOI registration with DataCite - IATUL Workshop 2013
PIDs and DOI registration with DataCite - IATUL Workshop 2013PIDs and DOI registration with DataCite - IATUL Workshop 2013
PIDs and DOI registration with DataCite - IATUL Workshop 2013
Frauke Ziedorn
 
Managing Your Research Data for Maximum Impact -Rob Daley 300616_Shared
Managing Your Research Data for Maximum Impact -Rob Daley 300616_SharedManaging Your Research Data for Maximum Impact -Rob Daley 300616_Shared
Managing Your Research Data for Maximum Impact -Rob Daley 300616_Shared
Rob Daley
 

Semelhante a 2010 CLARA Nijmegen - Data Seal of Approval tutorial (20)

FAIRsharing - ENVRI-FAIR Webinar
FAIRsharing - ENVRI-FAIR WebinarFAIRsharing - ENVRI-FAIR Webinar
FAIRsharing - ENVRI-FAIR Webinar
 
Data sharing in the Netherlands
Data sharing in the NetherlandsData sharing in the Netherlands
Data sharing in the Netherlands
 
Turning FAIR into Reality - Role for Libraries
Turning FAIR into Reality - Role for Libraries Turning FAIR into Reality - Role for Libraries
Turning FAIR into Reality - Role for Libraries
 
PARTHENOS Common Policies and Implementation Strategies
PARTHENOS Common Policies and Implementation StrategiesPARTHENOS Common Policies and Implementation Strategies
PARTHENOS Common Policies and Implementation Strategies
 
Scholze liber 2015-06-25_final
Scholze liber 2015-06-25_finalScholze liber 2015-06-25_final
Scholze liber 2015-06-25_final
 
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
 
Research Data Mangagement Essentials, 5th July 2017
Research Data Mangagement Essentials, 5th July 2017Research Data Mangagement Essentials, 5th July 2017
Research Data Mangagement Essentials, 5th July 2017
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
 
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
 
PIDs and DOI registration with DataCite - IATUL Workshop 2013
PIDs and DOI registration with DataCite - IATUL Workshop 2013PIDs and DOI registration with DataCite - IATUL Workshop 2013
PIDs and DOI registration with DataCite - IATUL Workshop 2013
 
Researcher KnowHow: Research Data Management
Researcher KnowHow: Research Data ManagementResearcher KnowHow: Research Data Management
Researcher KnowHow: Research Data Management
 
Managing Your Research Data for Maximum Impact -Rob Daley 300616_Shared
Managing Your Research Data for Maximum Impact -Rob Daley 300616_SharedManaging Your Research Data for Maximum Impact -Rob Daley 300616_Shared
Managing Your Research Data for Maximum Impact -Rob Daley 300616_Shared
 
DATAD-R: Criteria for Trusted African Institutional Repositories
DATAD-R: Criteria for Trusted African Institutional RepositoriesDATAD-R: Criteria for Trusted African Institutional Repositories
DATAD-R: Criteria for Trusted African Institutional Repositories
 
Architecture and Standards
Architecture and StandardsArchitecture and Standards
Architecture and Standards
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data ResourcesApplication of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
 
FAIRsharing - Mapping the Landscape of Databases, Repositories, Standards and...
FAIRsharing - Mapping the Landscape of Databases, Repositories, Standards and...FAIRsharing - Mapping the Landscape of Databases, Repositories, Standards and...
FAIRsharing - Mapping the Landscape of Databases, Repositories, Standards and...
 
Digital Curation 101 - Taster
Digital Curation 101 - TasterDigital Curation 101 - Taster
Digital Curation 101 - Taster
 
OU Library Research Support webinar: Working with research data
OU Library Research Support webinar: Working with research dataOU Library Research Support webinar: Working with research data
OU Library Research Support webinar: Working with research data
 

Mais de Dirk Roorda

Verbal Valency in Hebrew Verbs
Verbal Valency in Hebrew VerbsVerbal Valency in Hebrew Verbs
Verbal Valency in Hebrew Verbs
Dirk Roorda
 

Mais de Dirk Roorda (20)

TF-FAIR.pdf
TF-FAIR.pdfTF-FAIR.pdf
TF-FAIR.pdf
 
Textpy
TextpyTextpy
Textpy
 
General Missives
General MissivesGeneral Missives
General Missives
 
Text Display (when it gets tricky)
Text Display (when it gets tricky)Text Display (when it gets tricky)
Text Display (when it gets tricky)
 
Tf in-context
Tf in-contextTf in-context
Tf in-context
 
Quran and Text-Fabric
Quran and Text-FabricQuran and Text-Fabric
Quran and Text-Fabric
 
Ancient corpora analysis
Ancient corpora analysisAncient corpora analysis
Ancient corpora analysis
 
Qdf2tf
Qdf2tfQdf2tf
Qdf2tf
 
Text fabric
Text fabricText fabric
Text fabric
 
Verbal Valency in Hebrew Verbs
Verbal Valency in Hebrew VerbsVerbal Valency in Hebrew Verbs
Verbal Valency in Hebrew Verbs
 
Data management for researchers
Data management for researchersData management for researchers
Data management for researchers
 
Annotating the Hebrew Bible
Annotating the Hebrew BibleAnnotating the Hebrew Bible
Annotating the Hebrew Bible
 
20151111 utrecht ver theolbibliothecarissen
20151111 utrecht ver theolbibliothecarissen20151111 utrecht ver theolbibliothecarissen
20151111 utrecht ver theolbibliothecarissen
 
Text as Data: processing the Hebrew Bible
Text as Data: processing the Hebrew BibleText as Data: processing the Hebrew Bible
Text as Data: processing the Hebrew Bible
 
Datamanagement for Research: A Case Study
Datamanagement for Research: A Case StudyDatamanagement for Research: A Case Study
Datamanagement for Research: A Case Study
 
Award
AwardAward
Award
 
Datamanagement for Research: A Case Study
Datamanagement for Research: A Case StudyDatamanagement for Research: A Case Study
Datamanagement for Research: A Case Study
 
Hebrew Bible as Data: Laboratory, Sharing, Lessons
Hebrew Bible as Data: Laboratory, Sharing, LessonsHebrew Bible as Data: Laboratory, Sharing, Lessons
Hebrew Bible as Data: Laboratory, Sharing, Lessons
 
Laf fabric-dh benelux2014
Laf fabric-dh benelux2014Laf fabric-dh benelux2014
Laf fabric-dh benelux2014
 
Data Analysis in the Hebrew Bible
Data Analysis in the Hebrew BibleData Analysis in the Hebrew Bible
Data Analysis in the Hebrew Bible
 

Último

Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 

Último (20)

UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 

2010 CLARA Nijmegen - Data Seal of Approval tutorial

  • 1. Quality and improving interoperability between language resources: http://www.datasealofapproval.org/ trust, process, simplicity Dirk.Roorda@dans.knaw.nl coordinator infrastructure at http://www.dans.knaw.nl
  • 2. Quality and interoperability evolution hard-to-fake traits indicating fitness promote interoperability
  • 3. Overview • Introduction and Theory • qualities • trust, simplicity • guidelines • Process and Demo • assessment and review • Discussion and Application • CLARIN centers • language resources
  • 4.
  • 6. Scientific quality • transparent • from producer • through repository • to consumer • properties to guard • authenticity • integrity • provenance
  • 7. Usage quality • data formats • usability • metadata • findability • intellegibility
  • 8. Quality control • by the stakeholders • data producers • data custodians • date consumers • custodians = repositories • substantial role for repositories • guidelines for producers • agreements for consumers
  • 9. Quality issues • metadata standards • CMDI and www.isocat.org • preferred formats • TEI, XML • referencing systems • persistent identifiers • long term preservation • after the live-environment has died off • interoperability • OAI-PMH
  • 10. Quality issues • search engines • CLARIN search and develop • access rights • comply with privacy law, copyright law • respect people from which data is obtained • accountability • for all repository operations
  • 11. Quality and Trust • imperfection lurks everywhere • trust works where certainty blocks • trust is a process • to greater quality • to better relationships • to more certainty
  • 12. Quality and Simplicity reduce organize time learn differences context emotion trust failure focus: subtract what is obvious add what is meaningful http://lawsofsimplicity.com/
  • 13. Guidelines: producers http://www.datasealofapproval.org/ 1.The data producer deposits the research data in a data repository with sufficient information for others to assess the scientific and scholarly quality of the research data and compliance with disciplinary and ethical norms. 2. The data producer provides the research data in formats recommended by the data repository 3. The data producer provides the research data together with the metadata requested by the data repository
  • 14. Guidelines: consumers http://www.datasealofapproval.org/ 14. The data consumer complies with access regulations set by the data repository 15. The data consumer conforms to and agrees with any codes of conduct that are generally accepted in higher education and research for the exchange and proper use of knowledge and information 16. The data consumer respects the applicable licenses of the data repository regarding the use of the research data
  • 15. Guidelines: repositories http://www.datasealofapproval.org/ 4. The data repository has an explicit mission in the area of digital archiving and promulgates it 5. The data repository uses due diligence to ensure compliance with legal regulations and contracts including, when applicable, regulations governing the protection of human subjects. 6. The data repository applies documented processes and procedures for managing data storage 7. The data repository has a plan for long-term preservation of its digital assets
  • 16. Guidelines: repositories http://www.datasealofapproval.org/ 8. Archiving takes place according to explicit workflows across the data life cycle 9. The data repository assumes responsibility from the data producers for access and availability of the digital objects 10. The data repository enables the users to utilize the research data and refer to them 11. The data repository ensures the integrity of the digital objects and the metadata 12. The data repository ensures the authenticity of the digital objects and the metadata 13. The technical infrastructure explicitly supports the tasks and functions described in internationally accepted archival standards like OAIS
  • 17. Guidelines: outsourcing http://www.datasealofapproval.org/ repositories may outsource digital preservation to specialist repositories • implement all except 4,6,7,8 and 13 • store a copy of the data in another (TDR) that • has acquired the DSA logo • by implementing each of the sixteen guidelines • (including 4, 6, 7, 8 and 13).
  • 18. Seal of Approvement • a repository shows it on its webpage • if conditions are fulfilled • as testified by • a self-assessment • with reviews • on a yearly basis • the exact level of compliance is • transparently published under the seal
  • 19. Assessment and review minimum requirements threshold will go up as time proceeds score actions taken comments issues * nothing done give a reason ** theoretical concept point to initiation doc describe main issues *** implementation phase point to definition doc describe main issues **** fully implemented point to definition doc N/A not applicable give a reason
  • 20. Organisation • repositories represented by a board • tools to facilitate the procedure • modifiaction record • the DSA website links to compliant repositories
  • 21.
  • 22.
  • 23. CLARIN centres • A = provide infrastructure • managing the federation • B = provide services • data and webservices • C = provide metadata • harvestable metadata • R = respected = recognised • offer LRT resources in whatever form • E = external • offer non-LRT resources or services • identity federations • national libraries
  • 24. Group assignment • P(roducers) • invent p-guidelines for B/C centers • R(epositories) • invent r-guidelines for A/B centers • C(onsumers) • invent c-guidelines for B/C/R centers Suggestions for • assessment • review • modification record
  • 25. Wrap-up: P-Group metadata about background information about researchers who, why, publications DAI In IMDI it is difficult to update information, affiliation updates, use unique identifiers for participants in building a corpus, store records of people, and link from the metadata of resources to the records of people using formats depending on formats formats maybe standardised, but not usable to researchers, I do not want to wrap my data in dead formats: the repositories should support innovation in this respect, when it is driven by researchers
  • 26. Wrap up: C-group goal is: finding info in a repository we need: overview of access rights proper web-connection to the repository user-friendly interface low threshold for feedback for new features we should be part of the chain in the design of the access tools GUIDELINES WE WANT ALL CENTERS IN THE CHAIN THAT PROVIDE US WITH THE INFORMATION WE NEED TO OFFER US TRANSPARENCY AND VERIFIABILITY ON HOW THEIR DATA IS OBTAINED, PROCESSED AND CONTROLLED/MANAGED WE WANT TOOLS WITH CLEAR COPYRIGHT PERMISSIONS THAT HAVE A
  • 27. Wrap-up: R-group we provide infrastructure and management for data we want to standardize our stuff we need knowledge, the right metadata of the stuff that is coming to us we want the materials in the right format, allowing for some flexibility retro-archiving: we offer tools for converting legacy data, so that producers may submit raw materials management of data concerning legal access protect the providers, so that the providers can trust the consumers: licensing forms share knowledge about services we provide with potential users: people working in the field other repositories we want a forum as an instrument for developing trust between producers and consumers: the community becomes more transparent
  • 28. Wrap-up: General add weights to guidelines, in order to declare some guidelines more important than others.

Notas do Editor

  1. 1reduce (restrict to the most important issues, a few guidelines will do)2organize (group the guidelines in sections for producers, custodians, consumers)3time (save time by a smooth assessment process)4learn (use expertise in preservation)5differences (reintroduce complexity in a controlled way, because sometimes it is needed)6context (exploit knowledge of the community, requirements of the users)7emotion (do not make it purely bureaucratical, keep the feeling of value, enjoy good relationships with stakeholders)8trust (by default trust, but know where your undo button is, even against the ones you trust)9failure (learn from failures, improve the guidelines, the assessment procedures)10focus: subtract what is obvious, add what is meaningful (this is not about the data in bank accounts, nor highly sensitive medical data, nor company archives, but about research data: the scientific value is protected by the guidelines)