Open Data (and Software, and
other Research Artefacts):
A proper management
Seminar: Let’s Do it Together! How to implement
Open Science practices in Research Projects
Universidad Politécnica de Madrid
29/11/2019
With contributions from Esteban González, Daniel Garijo,
Idafen Santana, Olga Giraldo
Oscar Corcho
ocorcho@fi.upm.es
@ocorcho, @opencitydata_es
https://www.slideshare.com/ocorcho
License
• This work is licensed under the license
CC BY-NC-SA 4.0 International
• http://purl.org/NET/rdflicense/cc-by-nc-sa4.0
• You are free:
• to Share — to copy, distribute and transmit the work
• to Remix — to adapt the work
• Under the following conditions
• Attribution — You must attribute the work by inserting
• “[source Oscar Corcho]” at the footer of each reused slide
• a credits slide stating: “These slides are partially based on
“Open Data (and Software, and other Research Artefacts):
A proper management” by O. Corcho”
• Non-commercial
• Share-Alike
The key messages of my talk...
Open Science ≠ Open Access
Science is not only about papers (other objects exist)
Open Science = Open Access + Research Data Management
+ Research Object Management
We all need principled approaches and clear guidelines
(community or institution driven) to adopt an Open
Science approach
We expect this (non-extra) work to pay off in the future
Outline
• From Open (Government) Data to Open Science
• Our previous OEG-UPM research and development
to support Open Science practices
• Research Objects
• Systematic (Meta)Data Management in Research
• Ontology-based Representation of Laboratory Protocols
• Reproducibility of Computational Experiments
• Our (practical) understanding of Open Science and
current practices at OEG-UPM
What is Open (Government) Data?
• Open data is data that can be freely used, re-used
and redistributed by anyone - subject only, at most, to
the requirement to attribute and sharealike
• Key aspects:
• Availability and access: the data must be available as a
whole and at no more than a reasonable reproduction cost,
preferably by downloading over the Internet. The data must
also be available in a convenient and modifiable form.
• Re-use and redistribution: the data must be provided
under terms that permit re-use and redistribution including
the intermixing with other datasets.
• Universal participation: everyone must be able to use, re-
use and redistribute - there should be no discrimination
against fields of endeavour or against persons or groups
[source: Open Data Handbook, http://opendatahandbook.org/en/what-is-open-data/ ]
Relevant Legislation. Europe and Spain
• Open Access Initiative (2001). Scientific information; > 510 orgs
• Aarhus Convention (1998). Right to participate and access; 41
countries and the EU
• Convention on official documentation access (2009). 12 countries
• (Open Data and) PSI-reuse Directives (2003/98/EC, 2013/37/EU and
2019/1024)
• https://ec.europa.eu/digital-single-market/en/european-legislation-reuse-public-sector-information
• List of high-value datasets: geospatial, Earth Observation and environment, meteorological, statistics,
companies and company ownership, mobility
• Law 37/2007. PSI reuse (transposition of directive 2003/98/EC)
• Modified by Law 18/2015 (BOE 10/07/2015, directive 2013/37/EU)
• 2019/1024 Directive to be transposed by 16/07/2021
• Law 11/2007. Citizens' right of access to good-quality public services
• RD 4/2010 Esquema Nacional de Interoperabilidad
• Open standards, technology neutral, open source
• RD 1495/2011. Develops Law 37/2007 for national agencies
• Norma Técnica de Interoperabilidad (19/02/2013, BOE 4/3/2013)
[source: based on material from Antonio Rodríguez Pascual (CNIG)]
An Explosion of Open Data Portals
Some of our activities in Open (Government) Data
Culture (@BNE), Geography (@IGN), Meteorology (@AEMET)
Cities (@Zaragoza, Gob Aragón, Catalogues)
Host of esDBpedia
UNE 178301:2015, standard on Open Data for Smart Cities
However, today we are talking
about Open Science
Open Scientific Data vs Open Government Data (I)
• Is Open Data in Science actually much different from Open
Government Data?
• NO
• “freely used, re-used and redistributed by anyone - subject
only, at most, to the requirement to attribute and sharealike”
• Funders encourage the generation of open research data
• E.g., guidelines on FAIR Data Management in H2020:
http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-oa-data-mgt_en.pdf
• YES
• Not such a long history of legislation
• Initially, most work focused on open access (papers)
• Not only available for use and reuse, but also for reproducibility
• It is often not useful without the other research artefacts that
come with it (methods, software, protocols, papers)
Open Scientific Data vs Open Government Data (II)
The same explosion of Data Portals
• General-purpose
• Region-specific
• Domain-specific (e.g. Astronomy)
• Institution-specific
Open Scientific Data vs Open Government Data (III)
• And a good number of alternative technologies
• And a good number of metadata schemas
• DataCite
• CrossRef
• CKAN Metadata
• DDI4
• DCAT
• …
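To make one of the schemas listed above concrete, here is a minimal sketch of a dataset description that uses DCAT terms, serialised as JSON-LD from Python. Only the vocabulary terms (dcat:Dataset, dcat:Distribution, dcat:downloadURL, dcat:mediaType, dct:title, dct:description, dct:license) come from the W3C DCAT and Dublin Core vocabularies; the function name and every value in the usage example are hypothetical placeholders, not taken from the talk.

```python
import json

# Namespace IRIs of the W3C DCAT and Dublin Core Terms vocabularies.
CONTEXT = {
    "dcat": "http://www.w3.org/ns/dcat#",
    "dct": "http://purl.org/dc/terms/",
}


def describe_dataset(title, description, license_url, download_url, media_type_iri):
    """Build a minimal DCAT description: one dataset with one distribution.

    Hypothetical helper for illustration; a real catalogue record would
    normally carry more metadata (publisher, dates, keywords, ...).
    """
    return {
        "@context": CONTEXT,
        "@type": "dcat:Dataset",
        "dct:title": title,
        "dct:description": description,
        "dcat:distribution": [
            {
                "@type": "dcat:Distribution",
                "dct:license": {"@id": license_url},
                "dcat:downloadURL": {"@id": download_url},
                "dcat:mediaType": {"@id": media_type_iri},
            }
        ],
    }


if __name__ == "__main__":
    # All values below are placeholders.
    record = describe_dataset(
        title="Example sensor readings",
        description="Hypothetical dataset used only to illustrate DCAT.",
        license_url="https://creativecommons.org/licenses/by/4.0/",
        download_url="https://example.org/data/readings.csv",
        media_type_iri="https://www.iana.org/assignments/media-types/text/csv",
    )
    print(json.dumps(record, indent=2))
```

The same information could be expressed with DataCite or CKAN metadata; the point of the sketch is only that a dataset, its distribution and its licence are described as separate, linked things.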
Are we applying what we learned in Open Gov Data?
• Some of the same mistakes are being made
• Setting up a portal/infrastructure does not mean that you are
better than others
• Having more objects in your repository does not mean that
you are doing more or better Open Science
• No clear instructions on what to upload or not, and how to
ensure quality (except for mature domains or organisations)
• No clear governance (handled by researchers, by data centers
or by libraries?)
• And a few more things
• No clear relationship among all research artefacts
• No clear relationship between the Data Management Plans
and the way in which data is finally handled
How do we do Science? Main components
[source: Idafen Santana]
The life of our researchers at OEG-UPM
[Diagram: life cycle of a Research Object (RO); actors: Scientist, Librarian/Curator]
• Live RO
• “My supervisor calls me to report my work”: RO snapshot (<<copy>>);
permanent URI, some metadata, some curation; mostly private (for my group)
• “My supervisor calls me again and we decide to publish our RO+paper”:
RO snapshot (<<copy>>); permanent URI, some metadata, some curation;
mostly private (for my group and for paper reviewers)
• “Reviews received and final version published”: Archived RO
(<<copy, filter and curate>>); permanent URI, good metadata and curation;
mostly public
• “A new PhD student continues my work”: Live RO (<<copy>>)
• Versions are related through <<versionOf>>
A Research Object (RO) bundles and relates the digital resources of a
scientific experiment or investigation using standard mechanisms
(“tool middleware”)
http://www.w3.org/community/rosc/
http://www.researchobject.org/
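The idea of bundling resources and keeping track of snapshots can be sketched in a few lines of Python. This is an illustrative toy model under stated assumptions, not the Research Object specification: the class names, the status values and the choice that a snapshot points back to its source through version_of are all hypothetical.

```python
from dataclasses import dataclass, replace
from typing import Optional, Tuple


@dataclass(frozen=True)
class Resource:
    """One digital resource aggregated by a Research Object."""
    uri: str
    kind: str  # e.g. "dataset", "software", "protocol", "paper"


@dataclass(frozen=True)
class ResearchObject:
    """Toy model of an RO: a bundle of resources plus its life-cycle status."""
    uri: str
    resources: Tuple[Resource, ...] = ()
    status: str = "live"              # assumed values: "live", "snapshot", "archived"
    version_of: Optional[str] = None  # assumed direction: copy points to its source

    def aggregate(self, resource: Resource) -> "ResearchObject":
        """Return a new RO that also bundles `resource`."""
        return replace(self, resources=self.resources + (resource,))

    def snapshot(self, permanent_uri: str) -> "ResearchObject":
        """Copy the RO as it is now under a permanent URI."""
        return replace(self, uri=permanent_uri, status="snapshot", version_of=self.uri)

    def archive(self, permanent_uri: str, keep_kinds) -> "ResearchObject":
        """Copy, filter and mark as archived: keep only the listed kinds."""
        kept = tuple(r for r in self.resources if r.kind in keep_kinds)
        return replace(self, uri=permanent_uri, resources=kept,
                       status="archived", version_of=self.uri)


if __name__ == "__main__":
    # All URIs are placeholders.
    live = (ResearchObject("urn:example:ro:live")
            .aggregate(Resource("urn:example:data.csv", "dataset"))
            .aggregate(Resource("urn:example:notes.txt", "notes"))
            .aggregate(Resource("urn:example:paper.pdf", "paper")))
    snap = live.snapshot("urn:example:ro:snapshot-1")
    final = live.archive("urn:example:ro:archived", keep_kinds={"dataset", "paper"})
    print(snap.status, len(snap.resources), final.status, len(final.resources))
```

Immutable objects are used so that a snapshot can never change after it is taken, which mirrors the role of the permanent URI.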
Systematic (meta)data Management in Research
• Open (Research) Data portals
• Data Management
• Data Publication
• DOIs
• Sensor Data (photometers)
• Management
• Visualisation
» And Citizen Science
Laboratory protocols
PhD Thesis: SeMAntic RepresenTation for Experimental Protocols
Reproducibility of Computational Scientific Experiments
[Diagram labels: FORMER EQUIPMENT, ANNOTATE, SEMANTIC ANNOTATIONS, REPRODUCE,
EQUIVALENT EXECUTION ENVIRONMENT, CLOUD]
• Pegasus: Montage, SoyKB, Epigenomics
• Dispel4Py: Internal Extinction, Seismic Cross Correlation
• Makeflow: Blast
How do we do it at OEG-UPM?
• Which research artefacts do we handle at OEG-UPM?
• Papers (sure, let’s see the following talk by UPM’s library)
• Data Management Plans (DMPOnline; PaGoDa did not exist)
• Datasets
• Normally in GitHub, e.g. https://github.com/oeg-upm/btn100
• Software source code
• Normally in GitHub: http://www.github.com/oeg-upm
• Docker images, models and APIs
• Normally in DockerHub: https://hub.docker.com/u/oegupm/
• Ontologies, thesauri, etc.
• Normally in GitHub, e.g.,
https://github.com/CiudadesAbiertas/vocab-sector-publico-agenda-municipal
• And published online, e.g.,
http://vocab.ciudadesabiertas.es/def/sector-publico/agenda-municipal/
• …
And which are our (good) practices?
• Still missing many, but...
• When a research project or experiment starts, a new GitHub
repository is created
• The repository is connected to Zenodo, so as to get DOIs
and ensure archival
• Automated archival process after every release
• DOIs also added to the GitHub repository
• Our papers cite those DOIs
• Bit.ly, Dropbox, GDrive links, etc., are strictly prohibited in
our papers
• Zenodo community
• https://zenodo.org/communities/ontologyengineeringgroup/
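The rule about link shorteners and file-sharing links lends itself to an automated check before a paper is submitted. The following is a minimal sketch, assuming the paper is available as plain text (for example, its LaTeX source); the host list, the function name and the sample text are assumptions for illustration, not an existing OEG-UPM tool.

```python
import re
from urllib.parse import urlparse

# Hosts treated as non-archival. This list is an assumption derived from the
# services named in the slide; extend it to match local policy.
PROHIBITED_HOSTS = ("bit.ly", "dropbox.com", "drive.google.com")

# Rough URL matcher: stops at whitespace, quotes and brackets.
URL_PATTERN = re.compile(r"https?://[^\s<>\"'()\[\]{}]+")


def find_prohibited_links(text):
    """Return URLs in `text` hosted on a prohibited host or one of its subdomains."""
    hits = []
    for raw in URL_PATTERN.findall(text):
        url = raw.rstrip(".,;")  # drop sentence punctuation glued to the URL
        host = (urlparse(url).hostname or "").lower()
        if any(host == h or host.endswith("." + h) for h in PROHIBITED_HOSTS):
            hits.append(url)
    return hits


if __name__ == "__main__":
    # Placeholder text; the DOI below is not a real record.
    sample = (
        "Data: https://doi.org/10.5281/zenodo.0000000. "
        "Extra files: https://bit.ly/abc, https://www.dropbox.com/s/x/file.csv."
    )
    for link in find_prohibited_links(sample):
        print("Replace with a DOI or archived URL:", link)
```

A check like this could run in continuous integration on the paper's repository, so that non-archival links are caught before reviewers see them.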
The key messages of my talk...
Open Science ≠ Open Access
Science is not only about papers (other objects exist)
Open Science = Open Access + Research Data Management
+ Research Object Management
We all need principled approaches and clear guidelines
(community or institution driven) to adopt an Open
Science approach
We expect this (non-extra) work to pay off in the future
Mais conteúdo relacionado

Mais procurados

Proteomics public data resources: enabling "big data" analysis in proteomics
Proteomics public data resources: enabling "big data" analysis in proteomicsProteomics public data resources: enabling "big data" analysis in proteomics
Proteomics public data resources: enabling "big data" analysis in proteomicsJuan Antonio Vizcaino
 
SSSW2015 Data Workflow Tutorial
SSSW2015 Data Workflow TutorialSSSW2015 Data Workflow Tutorial
SSSW2015 Data Workflow TutorialSSSW
 
Proteomics and the "big data" trend: challenges and new possibilitites (Talk ...
Proteomics and the "big data" trend: challenges and new possibilitites (Talk ...Proteomics and the "big data" trend: challenges and new possibilitites (Talk ...
Proteomics and the "big data" trend: challenges and new possibilitites (Talk ...Juan Antonio Vizcaino
 
Experiences to learn from the MS proteomics field
Experiences to learn from the MS proteomics fieldExperiences to learn from the MS proteomics field
Experiences to learn from the MS proteomics fieldJuan Antonio Vizcaino
 
OpenAIRE services and tools for researchers/authors and projects (FOSTER work...
OpenAIRE services and tools for researchers/authors and projects (FOSTER work...OpenAIRE services and tools for researchers/authors and projects (FOSTER work...
OpenAIRE services and tools for researchers/authors and projects (FOSTER work...Pedro Príncipe
 
Open Science and European Access Policies in H2020
Open Science and European Access Policies in H2020 Open Science and European Access Policies in H2020
Open Science and European Access Policies in H2020 Reme Melero
 
GIS Day 2015: Geoinformatics, Open Source and Videos - a library perspective
GIS Day 2015: Geoinformatics, Open Source and Videos - a library perspectiveGIS Day 2015: Geoinformatics, Open Source and Videos - a library perspective
GIS Day 2015: Geoinformatics, Open Source and Videos - a library perspectivePeter Löwe
 
Introduction to open science
Introduction to open scienceIntroduction to open science
Introduction to open scienceReme Melero
 
Elephant in the Room: Scaling Storage for the HathiTrust Research Center
Elephant in the Room: Scaling Storage for the HathiTrust Research CenterElephant in the Room: Scaling Storage for the HathiTrust Research Center
Elephant in the Room: Scaling Storage for the HathiTrust Research CenterRobert H. McDonald
 
Keystone summer school 2015 paolo-missier-provenance
Keystone summer school 2015 paolo-missier-provenanceKeystone summer school 2015 paolo-missier-provenance
Keystone summer school 2015 paolo-missier-provenancePaolo Missier
 
OpenAIRE webinar: Horizon 2020 Open Science Policies and beyond, with Emilie ...
OpenAIRE webinar: Horizon 2020 Open Science Policies and beyond, with Emilie ...OpenAIRE webinar: Horizon 2020 Open Science Policies and beyond, with Emilie ...
OpenAIRE webinar: Horizon 2020 Open Science Policies and beyond, with Emilie ...OpenAIRE
 
Liber 2014 - Chain Reactions: TEL & RLUK on their Linked Open data.
Liber 2014 - Chain Reactions: TEL & RLUK on their Linked Open data.Liber 2014 - Chain Reactions: TEL & RLUK on their Linked Open data.
Liber 2014 - Chain Reactions: TEL & RLUK on their Linked Open data.Mike Mertens
 
Why do they call it Linked Data when they want to say...?
Why do they call it Linked Data when they want to say...?Why do they call it Linked Data when they want to say...?
Why do they call it Linked Data when they want to say...?Oscar Corcho
 
ContentMining and Copyright at CopyCamp2017
ContentMining and Copyright at CopyCamp2017ContentMining and Copyright at CopyCamp2017
ContentMining and Copyright at CopyCamp2017petermurrayrust
 
Search, Exploration and Analytics of Evolving Data
Search, Exploration and Analytics of Evolving DataSearch, Exploration and Analytics of Evolving Data
Search, Exploration and Analytics of Evolving DataNattiya Kanhabua
 
Open Research Data: Present and planned EC Policy, Jean-Claude Burgelman impl...
Open Research Data: Present and planned EC Policy, Jean-Claude Burgelman impl...Open Research Data: Present and planned EC Policy, Jean-Claude Burgelman impl...
Open Research Data: Present and planned EC Policy, Jean-Claude Burgelman impl...Platforma Otwartej Nauki
 
FAIRy stories: tales from building the FAIR Research Commons
FAIRy stories: tales from building the FAIR Research CommonsFAIRy stories: tales from building the FAIR Research Commons
FAIRy stories: tales from building the FAIR Research CommonsCarole Goble
 

Mais procurados (20)

Proteomics public data resources: enabling "big data" analysis in proteomics
Proteomics public data resources: enabling "big data" analysis in proteomicsProteomics public data resources: enabling "big data" analysis in proteomics
Proteomics public data resources: enabling "big data" analysis in proteomics
 
SSSW2015 Data Workflow Tutorial
SSSW2015 Data Workflow TutorialSSSW2015 Data Workflow Tutorial
SSSW2015 Data Workflow Tutorial
 
Proteomics and the "big data" trend: challenges and new possibilitites (Talk ...
Proteomics and the "big data" trend: challenges and new possibilitites (Talk ...Proteomics and the "big data" trend: challenges and new possibilitites (Talk ...
Proteomics and the "big data" trend: challenges and new possibilitites (Talk ...
 
Experiences to learn from the MS proteomics field
Experiences to learn from the MS proteomics fieldExperiences to learn from the MS proteomics field
Experiences to learn from the MS proteomics field
 
OpenAIRE services and tools for researchers/authors and projects (FOSTER work...
OpenAIRE services and tools for researchers/authors and projects (FOSTER work...OpenAIRE services and tools for researchers/authors and projects (FOSTER work...
OpenAIRE services and tools for researchers/authors and projects (FOSTER work...
 
Open Science and European Access Policies in H2020
Open Science and European Access Policies in H2020 Open Science and European Access Policies in H2020
Open Science and European Access Policies in H2020
 
GIS Day 2015: Geoinformatics, Open Source and Videos - a library perspective
GIS Day 2015: Geoinformatics, Open Source and Videos - a library perspectiveGIS Day 2015: Geoinformatics, Open Source and Videos - a library perspective
GIS Day 2015: Geoinformatics, Open Source and Videos - a library perspective
 
Introduction to open science
Introduction to open scienceIntroduction to open science
Introduction to open science
 
Elephant in the Room: Scaling Storage for the HathiTrust Research Center
Elephant in the Room: Scaling Storage for the HathiTrust Research CenterElephant in the Room: Scaling Storage for the HathiTrust Research Center
Elephant in the Room: Scaling Storage for the HathiTrust Research Center
 
A Clean Slate?
A Clean Slate?A Clean Slate?
A Clean Slate?
 
Keystone summer school 2015 paolo-missier-provenance
Keystone summer school 2015 paolo-missier-provenanceKeystone summer school 2015 paolo-missier-provenance
Keystone summer school 2015 paolo-missier-provenance
 
OpenAIRE webinar: Horizon 2020 Open Science Policies and beyond, with Emilie ...
OpenAIRE webinar: Horizon 2020 Open Science Policies and beyond, with Emilie ...OpenAIRE webinar: Horizon 2020 Open Science Policies and beyond, with Emilie ...
OpenAIRE webinar: Horizon 2020 Open Science Policies and beyond, with Emilie ...
 
Liber 2014 - Chain Reactions: TEL & RLUK on their Linked Open data.
Liber 2014 - Chain Reactions: TEL & RLUK on their Linked Open data.Liber 2014 - Chain Reactions: TEL & RLUK on their Linked Open data.
Liber 2014 - Chain Reactions: TEL & RLUK on their Linked Open data.
 
Implementing Linked Data in Low-Resource Conditions
Implementing Linked Data in Low-Resource ConditionsImplementing Linked Data in Low-Resource Conditions
Implementing Linked Data in Low-Resource Conditions
 
Bio2RDF @ DILS 2008
Bio2RDF @ DILS 2008Bio2RDF @ DILS 2008
Bio2RDF @ DILS 2008
 
Why do they call it Linked Data when they want to say...?
Why do they call it Linked Data when they want to say...?Why do they call it Linked Data when they want to say...?
Why do they call it Linked Data when they want to say...?
 
ContentMining and Copyright at CopyCamp2017
ContentMining and Copyright at CopyCamp2017ContentMining and Copyright at CopyCamp2017
ContentMining and Copyright at CopyCamp2017
 
Search, Exploration and Analytics of Evolving Data
Search, Exploration and Analytics of Evolving DataSearch, Exploration and Analytics of Evolving Data
Search, Exploration and Analytics of Evolving Data
 
Open Research Data: Present and planned EC Policy, Jean-Claude Burgelman impl...
Open Research Data: Present and planned EC Policy, Jean-Claude Burgelman impl...Open Research Data: Present and planned EC Policy, Jean-Claude Burgelman impl...
Open Research Data: Present and planned EC Policy, Jean-Claude Burgelman impl...
 
FAIRy stories: tales from building the FAIR Research Commons
FAIRy stories: tales from building the FAIR Research CommonsFAIRy stories: tales from building the FAIR Research Commons
FAIRy stories: tales from building the FAIR Research Commons
 

Semelhante a Open Data (and Software, and other Research Artefacts) - A proper management

OpenAIRE: eInfrastructure for Open Science
OpenAIRE: eInfrastructure for Open ScienceOpenAIRE: eInfrastructure for Open Science
OpenAIRE: eInfrastructure for Open ScienceOpenAIRE
 
Leadership in Open Access Arena in Turkey and Effect of OpenAIRE2020 Project
Leadership in Open Access Arena in Turkey and Effect of OpenAIRE2020 ProjectLeadership in Open Access Arena in Turkey and Effect of OpenAIRE2020 Project
Leadership in Open Access Arena in Turkey and Effect of OpenAIRE2020 ProjectGultekin Gurdal
 
Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...Peter Löwe
 
A funder’s perspective: Welcome from the EC, Caroline Colin (OpenAIRE worksho...
A funder’s perspective: Welcome from the EC, Caroline Colin (OpenAIRE worksho...A funder’s perspective: Welcome from the EC, Caroline Colin (OpenAIRE worksho...
A funder’s perspective: Welcome from the EC, Caroline Colin (OpenAIRE worksho...OpenAIRE
 
Research Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibilityResearch Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibilityOscar Corcho
 
Introducing the Whole Tale Project: Merging Science and Cyberinfrastructure P...
Introducing the Whole Tale Project: Merging Science and Cyberinfrastructure P...Introducing the Whole Tale Project: Merging Science and Cyberinfrastructure P...
Introducing the Whole Tale Project: Merging Science and Cyberinfrastructure P...Bertram Ludäscher
 
What does open science mean? A stakeholder perspective
What does open science mean? A stakeholder perspectiveWhat does open science mean? A stakeholder perspective
What does open science mean? A stakeholder perspectiveLIBER Europe
 
European Commission's Open Science Initiative: co-creating added value with data
European Commission's Open Science Initiative: co-creating added value with dataEuropean Commission's Open Science Initiative: co-creating added value with data
European Commission's Open Science Initiative: co-creating added value with dataEFSA EU
 
Research Data Management at Imperial College London
Research Data Management at Imperial College LondonResearch Data Management at Imperial College London
Research Data Management at Imperial College LondonSarah Anna Stewart
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...BigData_Europe
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation Research Data Alliance
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation Research Data Alliance
 
How practising open research can benefit you
How practising open research can benefit youHow practising open research can benefit you
How practising open research can benefit youUoLResearchSupport
 
Open Science policy: EC, ERC, Belspo, FWO
Open Science policy: EC, ERC, Belspo, FWOOpen Science policy: EC, ERC, Belspo, FWO
Open Science policy: EC, ERC, Belspo, FWOOpenAccessBelgium
 
Open science and its advocacy
Open science and its advocacyOpen science and its advocacy
Open science and its advocacySarah Jones
 
Next generation repositories
Next generation repositoriesNext generation repositories
Next generation repositoriesPaul Walk
 

Semelhante a Open Data (and Software, and other Research Artefacts) - A proper management (20)

Open Science
Open ScienceOpen Science
Open Science
 
OpenAIRE: eInfrastructure for Open Science
OpenAIRE: eInfrastructure for Open ScienceOpenAIRE: eInfrastructure for Open Science
OpenAIRE: eInfrastructure for Open Science
 
Data and Research Infrastructures and Open Science
Data and Research Infrastructures and Open ScienceData and Research Infrastructures and Open Science
Data and Research Infrastructures and Open Science
 
Leadership in Open Access Arena in Turkey and Effect of OpenAIRE2020 Project
Leadership in Open Access Arena in Turkey and Effect of OpenAIRE2020 ProjectLeadership in Open Access Arena in Turkey and Effect of OpenAIRE2020 Project
Leadership in Open Access Arena in Turkey and Effect of OpenAIRE2020 Project
 
Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...
 
A funder’s perspective: Welcome from the EC, Caroline Colin (OpenAIRE worksho...
A funder’s perspective: Welcome from the EC, Caroline Colin (OpenAIRE worksho...A funder’s perspective: Welcome from the EC, Caroline Colin (OpenAIRE worksho...
A funder’s perspective: Welcome from the EC, Caroline Colin (OpenAIRE worksho...
 
Research Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibilityResearch Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibility
 
Introducing the Whole Tale Project: Merging Science and Cyberinfrastructure P...
Introducing the Whole Tale Project: Merging Science and Cyberinfrastructure P...Introducing the Whole Tale Project: Merging Science and Cyberinfrastructure P...
Introducing the Whole Tale Project: Merging Science and Cyberinfrastructure P...
 
What does open science mean? A stakeholder perspective
What does open science mean? A stakeholder perspectiveWhat does open science mean? A stakeholder perspective
What does open science mean? A stakeholder perspective
 
European Commission's Open Science Initiative: co-creating added value with data
European Commission's Open Science Initiative: co-creating added value with dataEuropean Commission's Open Science Initiative: co-creating added value with data
European Commission's Open Science Initiative: co-creating added value with data
 
Research Data Management at Imperial College London
Research Data Management at Imperial College LondonResearch Data Management at Imperial College London
Research Data Management at Imperial College London
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
 
Intro-EOSC.pptx
Intro-EOSC.pptxIntro-EOSC.pptx
Intro-EOSC.pptx
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
 
How practising open research can benefit you
How practising open research can benefit youHow practising open research can benefit you
How practising open research can benefit you
 
Open Science policy: EC, ERC, Belspo, FWO
Open Science policy: EC, ERC, Belspo, FWOOpen Science policy: EC, ERC, Belspo, FWO
Open Science policy: EC, ERC, Belspo, FWO
 
Are Libraries Sustainable in a World of Free, Networked, Digital Information?
Are Libraries Sustainable in a World of Free, Networked, Digital Information?Are Libraries Sustainable in a World of Free, Networked, Digital Information?
Are Libraries Sustainable in a World of Free, Networked, Digital Information?
 
Open science and its advocacy
Open science and its advocacyOpen science and its advocacy
Open science and its advocacy
 
Next generation repositories
Next generation repositoriesNext generation repositories
Next generation repositories
 

Mais de Oscar Corcho

Organisational Interoperability in Practice at Universidad Politécnica de Madrid
Organisational Interoperability in Practice at Universidad Politécnica de MadridOrganisational Interoperability in Practice at Universidad Politécnica de Madrid
Organisational Interoperability in Practice at Universidad Politécnica de MadridOscar Corcho
 
Introducción a los Datos Abiertos - Open Data Day 2020
Introducción a los Datos Abiertos - Open Data Day 2020Introducción a los Datos Abiertos - Open Data Day 2020
Introducción a los Datos Abiertos - Open Data Day 2020Oscar Corcho
 
Adiós a los ficheros, hola a los grafos de conocimientos estadísticos
Adiós a los ficheros, hola a los grafos de conocimientos estadísticosAdiós a los ficheros, hola a los grafos de conocimientos estadísticos
Adiós a los ficheros, hola a los grafos de conocimientos estadísticosOscar Corcho
 
Ontology Engineering at Scale for Open City Data Sharing
Ontology Engineering at Scale for Open City Data SharingOntology Engineering at Scale for Open City Data Sharing
Ontology Engineering at Scale for Open City Data SharingOscar Corcho
 
Situación de las iniciativas de Open Data internacionales (y algunas recomen...
Situación de las iniciativas de Open Data internacionales (y algunas recomen...Situación de las iniciativas de Open Data internacionales (y algunas recomen...
Situación de las iniciativas de Open Data internacionales (y algunas recomen...Oscar Corcho
 
STARS4ALL - Contaminación Lumínica
STARS4ALL - Contaminación LumínicaSTARS4ALL - Contaminación Lumínica
STARS4ALL - Contaminación LumínicaOscar Corcho
 
Open Data (and Software, and other Research Artefacts) - A proper management

  • 1. Open Data (and Software, and other Research Artefacts): A proper management Seminar: Let’s Do it Together! How to implement Open Science practices in Research Projects Universidad Politécnica de Madrid 29/11/2019 With contributions from Esteban González, Daniel Garijo, Idafen Santana, Olga Giraldo Oscar Corcho ocorcho@fi.upm.es @ocorcho, @opencitydata_es https://www.slideshare.com/ocorcho
  • 2. License • This work is licensed under the license CC BY-NC-SA 4.0 International • http://purl.org/NET/rdflicense/cc-by-nc-sa4.0 • You are free: • to Share — to copy, distribute and transmit the work • to Remix — to adapt the work • Under the following conditions • Attribution — You must attribute the work by inserting • “[source Oscar Corcho]” at the footer of each reused slide • a credits slide stating: “These slides are partially based on “Open Data (and Software, and other Research Artefacts): A proper management” by O. Corcho” • Non-commercial • Share-Alike
  • 3. The key messages of my talk... Open Science ≠ Open Access Science is not only about papers (other objects exist) Open Science = Open Access + Research Data Management + Research Object Management We all need principled approaches and clear guidelines (community or institution driven) to adopt an Open Science approach We expect this (non-extra) work to pay off in the future
  • 4. Outline • From Open (Government) Data to Open Science • Our previous OEG-UPM research and development to support Open Science practices • Research Objects • Systematic (Meta)Data Management in Research • Ontology-based Representation of Laboratory Protocols • Reproducibility of Computational Experiments • Our (practical) understanding of Open Science and current practices at OEG-UPM
  • 5. Outline • From Open (Government) Data to Open Science • Our previous OEG-UPM research and development to support Open Science practices • Research Objects • Systematic (Meta)Data Management in Research • Ontology-based Representation of Laboratory Protocols • Reproducibility of Computational Experiments • Our (practical) understanding of Open Science and current practices at OEG-UPM
  • 6. What is Open (Government) Data? • Open data is data that can be freely used, re-used and redistributed by anyone - subject only, at most, to the requirement to attribute and sharealike • Key aspects: • Availability and access: the data must be available as a whole and at no more than a reasonable reproduction cost, preferably by downloading over the Internet. The data must also be available in a convenient and modifiable form. • Re-use and redistribution: the data must be provided under terms that permit re-use and redistribution including the intermixing with other datasets. • Universal participation: everyone must be able to use, re-use and redistribute - there should be no discrimination against fields of endeavour or against persons or groups [source: Open Data Handbook, http://opendatahandbook.org/en/what-is-open-data/ ]
  • 7. Relevant Legislation. Europe and Spain • Open Access Initiative (2001). Scientific information; > 510 orgs • Aarhus Convention (1998). Right to participate and access; 41 countries and the EU • Convention on official documentation access (2009). 12 countries • (Open Data and) PSI-reuse Directives (2003/98/EC, 2013/37/EU and 2019/1024) • https://ec.europa.eu/digital-single-market/en/european-legislation-reuse-public-sector-information • List of high-value datasets: geospatial, Earth Observation and environment, meteorological, statistics, companies and company ownership, mobility • Law 37/2007. PSI reuse (transposition of directive 2003/98/EC) • Modified in law 18/2015 (BOE 10/07/2015, directive 2013/37/EU) • 2019/1024 Directive to be transposed by 16/07/2021 • Law 11/2007. Citizen rights to access to good-quality public services • RD 4/2010 Esquema Nacional de Interoperabilidad • Open standards, technology neutral, open source • RD 1495/2011. It develops Law 37/2007 for national agencies • Norma Técnica de Interoperabilidad (19/02/2013, BOE 4/3/2013) [source: based on material from Antonio Rodríguez Pascual (CNIG)]
  • 8. An Explosion of Open Data Portals
  • 9. Some of our activities in Open (Government) Data Culture (@BNE) Geography (@IGN) Meteorology (@AEMET) Cities (@ Zaragoza, Gob Aragón, Catalogues) Host of esDBpedia UNE 178301:2015 Norm on Open Data for Smart Cities
  • 10. However, today we are talking about Open Science
  • 11. Open Scientific Data vs Open Government Data (I) • Is Open Data in Science actually much different from Open Government Data? • NO • “freely used, re-used and redistributed by anyone - subject only, at most, to the requirement to attribute and sharealike” • Funders encourage the generation of open research data • E.g., guidelines on FAIR Data Management H2020 http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-oa-data-mgt_en.pdf • YES • Not such a long history of legislation • Initially, most work focused on open access (papers) • Not only available for use and reuse, but also for reproducibility • It is often not useful without the rest of the research artefacts that come together with it (methods, software, protocols, papers)
  • 12. Open Scientific Data vs Open Government Data (II) The same explosion of Data Portals General-purpose Region-specific Domain-specific (e.g. Astronomy) Institution-specific
  • 13. Open Scientific Data vs Open Government Data (III) • And a good number of alternative technologies • And a good number of metadata schemas • DataCite • CrossRef • CKAN Metadata • DDI4 • DCAT • …
  • 14. Are we applying what we learned in Open Gov Data? • Some of the same mistakes are being made • Setting up a portal/infrastructure does not mean that you are better than others • Having more objects in your repository does not mean that you are doing more or better Open Science • No clear instructions on what to upload or not, and how to ensure quality (except for mature domains or organisations) • No clear governance (handled by researchers?, handled by data centers?, handled by libraries?) • And a few more things • No clear relationship among all research artefacts • No clear relationship between the Data Management Plans and the way in which data is finally handled
  • 15. Outline • From Open (Government) Data to Open Science • Our previous OEG-UPM research and development to support Open Science practices • Research Objects • Systematic (Meta)Data Management in Research • Ontology-based Representation of Laboratory Protocols • Reproducibility of Computational Experiments • Our (practical) understanding of Open Science and current practices at OEG-UPM
  • 16. How do we do Science? Main components [source: Idafen Santana]
  • 17. The life of our researchers at OEG-UPM [diagram] • Actors: Scientist, Librarian/Curator • Artefacts: Live RO; RO snapshot (permanent URI, some metadata, some curation, mostly private: for my group); RO snapshot (permanent URI, some metadata, some curation, mostly private: for my group and for paper reviewers); Archived RO (permanent URI, good metadata and curation, mostly public) • Relations shown: <<copy>>, <<copy, filter and curate>>, <<versionOf>> • Events: My supervisor calls me to report my work; My supervisor calls me again and we decide to publish our RO+paper; Reviews received and final version published; A new PhD student continues my work
  • 18. Research Objects: a Research Object bundles and relates digital resources of a scientific experiment or investigation using standard mechanisms, “tool middleware” http://www.w3.org/community/rosc/ http://www.researchobject.org/
  • 19. Systematic (meta)data Management in Research • Open (Research) Data portals • Data Management • Data Publication • DOIs • Sensor Data (photometers) • Management • Visualisation » And Citizen Science
  • 20. Laboratory protocols. PhD Thesis: SeMAntic RepresenTation for Experimental Protocols
  • 21. Reproducibility of Computational Scientific Experiments [diagram] • Workflow systems and example workflows: Pegasus (Montage, SoyKB, Epigenomics), Dispel4Py (Internal Extinction, Seismic Cross Correlation), Makeflow (Blast) • Steps: FORMER EQUIPMENT, ANNOTATE, SEMANTIC ANNOTATIONS, REPRODUCE, EQUIVALENT EXECUTION ENVIRONMENT (CLOUD)
  • 22. Outline • From Open (Government) Data to Open Science • Our previous OEG-UPM research and development to support Open Science practices • Research Objects • Systematic (Meta)Data Management in Research • Ontology-based Representation of Laboratory Protocols • Reproducibility of Computational Experiments • Our (practical) understanding of Open Science and current practices at OEG-UPM
  • 23. How do we do it at OEG-UPM? • Which research artefacts do we handle at OEG-UPM? • Papers (sure, let’s see the following talk by UPM’s library) • Data Management Plans (DMPOnline; PaGoDa did not exist) • Datasets • Normally in GitHub, e.g. https://github.com/oeg-upm/btn100 • Software source code • Normally in GitHub: http://www.github.com/oeg-upm • Docker images, models and APIs • Normally in DockerHub: https://hub.docker.com/u/oegupm/ • Ontologies, thesauri, etc. • Normally in GitHub, e.g., https://github.com/CiudadesAbiertas/vocab-sector-publico-agenda-municipal • And published online, e.g., http://vocab.ciudadesabiertas.es/def/sector-publico/agenda-municipal/ • …
  • 24. And which are our (good) practices? • Still missing many, but... • When a research project or experiment starts, a new GitHub repository is created • The repository is connected to Zenodo, so as to get DOIs and ensure archival • Automated archival process after every release • DOIs also added to the GitHub repository • Our papers cite those DOIs • Bit.ly, Dropbox, GDrive links, etc., are strictly prohibited in our papers • Zenodo community • https://zenodo.org/communities/ontologyengineeringgroup/
  • 25. The key messages of my talk... Open Science ≠ Open Access Science is not only about papers (other objects exist) Open Science = Open Access + Research Data Management + Research Object Management We all need principled approaches and clear guidelines (community or institution driven) to adopt an Open Science approach We expect this (non-extra) work to pay off in the future
  • 26. Open Data (and Software, and other Research Artefacts): A proper management Seminar: Let’s Do it Together! How to implement Open Science practices in Research Projects Universidad Politécnica de Madrid 29/11/2019 With contributions from Esteban González, Daniel Garijo, Idafen Santana, Olga Giraldo Oscar Corcho ocorcho@fi.upm.es @ocorcho, @opencitydata_es https://www.slideshare.com/ocorcho
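
One of the practices on slide 24 (no Bit.ly, Dropbox or GDrive links in papers) lends itself to an automated check. The following is only a minimal sketch of how such a check might look, not a tool used by OEG-UPM: the function name `find_ephemeral_links`, the host list and the URL pattern are all assumptions introduced here for illustration, and the DOI in the usage note is a placeholder.

```python
import re
from urllib.parse import urlparse

# Hypothetical host list, derived from the services named on slide 24.
EPHEMERAL_HOSTS = {"bit.ly", "dropbox.com", "drive.google.com", "docs.google.com"}

# Deliberately simple URL matcher: stops at whitespace, angle brackets,
# parentheses and square brackets.
URL_PATTERN = re.compile(r"https?://[^\s<>()\[\]]+")


def find_ephemeral_links(text):
    """Return the URLs in `text` whose host is on the ephemeral-host list."""
    flagged = []
    for raw_url in URL_PATTERN.findall(text):
        # Drop sentence punctuation that the pattern may have swallowed.
        url = raw_url.rstrip(".,;:\"'")
        host = (urlparse(url).hostname or "").lower()
        if host.startswith("www."):
            host = host[4:]
        # Match the host itself or any of its subdomains.
        if any(host == h or host.endswith("." + h) for h in EPHEMERAL_HOSTS):
            flagged.append(url)
    return flagged
```

Run over the text of a draft, for example `find_ephemeral_links("Data: https://doi.org/10.5281/zenodo.0000000, slides: https://bit.ly/abc")`, the function would flag only the shortened link and leave the DOI untouched. Matching on the parsed hostname rather than on a substring avoids flagging URLs that merely mention one of these services in their path.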

Editor's Notes

  1. http://opendatahandbook.org/en/what-is-open-data/
  2. To share your research materials (RO as a social object). To facilitate reproducibility and reuse of methods. To be recognized and cited (even for constituent resources). To preserve results and prevent decay (curation of workflow definition; using provenance for partial rerun). Middleware.