SlideShare a Scribd company logo
1 of 66
How to share useful data
Peter McQuilton
Biosharing.org
@drosophilic
Outline
• Data sharing
• Reusability and reproducibility
• How the lack of these affects scientific accountability and progress
• Experimental context
• What to report – what level of granularity
• How to report it – what format, structure
• Content standards
• How to find them
• Complying with repositories, funders and publishers
Outline
• Data sharing
• Reusability and reproducibility
• How the lack of these affects scientific accountability and progress
• Experimental context
• What to report – what level of granularity
• How to report it – what format, structure
• Content standards
• How to find them
• Complying with repositories, funders and publishers
Research data life cycle
Image credit to:
Credit to: ttps://projects.ac/blog/five-top-reasons-to-protect-your-data-and-practise-safe-science/ 2014
Better data = better science
A community mobilization for “openness”
image by Greg Emmerich
http://discovery.urlibraries.org/ https://okfn.org
Open data
is a means to do
better science
more efficiently
http://pantonprinciples.org
https://creativecommons.org
Growing movement for FAIR data and research
outputs
But in all fairness, not much data is FAIR!
But in all fairness, not much data is FAIR!
But in all fairness, not much data is FAIR!
“Reproducing the method took several months of effort, and
required using new versions and new software that posed
challenges to reconstructing and validating the results”
Unfairness in both experimental and computation
areas
• Not always well cited, stored
o Software, codes, workflows are hard(er) to get hold of
• Poorly described for third party reuse
o Different level of detail and annotation
• Curation activities are perceived as time consuming
o Collection and harmonization of detailed methods and
experimental steps is rushed at the publication stage
Not very FAIR: low findability and
understandability
• Effectively document your data so that it can be understood
in the future
• Periodically move data to new storage media (drives
degrade over time)
• Keep more than one copy of data (local and cloud)
• Migrate data to new software versions
• Use a well documented and supported format
Ideally this should be covered in a data management plan at
the start of a project, so that you can factor any associated
time and resources into your budget.
What can I do to ensure my data are
shareable/usable in the future?
Outline
• Data sharing
• Reusability and reproducibility
• How the lack of these affects scientific accountability and progress
• Experimental context - standards
• What to report – what level of granularity
• How to report it – what format, structure
• Content standards
• How to find them
• Complying with repositories, funders and publishers
Do you know what this is?
LS1_C2_LD_TP2_P1 file1-fastq.gz
…how NOT to report the experimental
information!
LS1_C2_LD_TP2_P1 file1-fastq.gz
…how NOT to report the experimental
information!
Sample name (?!) Data file
LS1_C2_LD_TP2_P1 file1-fastq.gz
We need to clearly describe the information
• LS1 liver sample 1
• C2 compound 2
• LD low dose
• TP2 time point 2
• P1 protocol 1
• file1-fastq.gz compressed data file for sequence
information corresponding to this
sample
Sample name (?!) Data file
LS1_C2_LD_TP2_P1 file1-fastq.gz
Without context data is meaningless
Without context data is meaningless
Without context data is meaningless
Without context data is meaningless
• We need to report sufficient
information to reuse the dataset
• We must strike a balance between
depth and breadth of information
Information intensive experiments
Information intensive experiments
• Not too much
• Not too little
• ….just right
Seven week old C57BL/6N mice were treated
with low-fat diet.
Liver was dissected out, hepatocytes prepared…
From natural language to ‘computable’ concepts
Age value?
Unit?
Strain name
Subject of the experiment
Type of diet and
experimental condition
Anatomy part
Seven week old C57BL/6N mice were treated
with low-fat diet.
Liver was dissected out, hepatocytes prepared …
From natural language to ‘computable’ concepts
Age value
Unit
Strain name?
Subject of the experiment?
Type of diet and
experimental condition
Anatomy part
Seven week old C57BL/6N mice were treated
with low-fat diet.
Liver was dissected out, hepatocytes prepared …
From natural language to ‘computable’ concepts
Age value
Unit
Strain name
Subject of the experiment
Type of diet and
experimental condition?
Anatomy part
Seven week old C57BL/6N mice were treated
with low-fat diet.
Liver was dissected out, hepatocytes prepared …
From natural language to ‘computable’ concepts
Age value
Unit
Strain name
Subject of the experiment
Type of diet and
experimental condition
Anatomy part?
Seven week old C57BL/6N mice were treated
with low-fat diet.
Liver was dissected out, hepatocytes prepared …
From natural language to ‘computable’ concepts
Age value
Unit
Strain name
Subject of the experiment
Type of diet and
experimental condition
Anatomy part
Seven week old C57BL/6N mice were treated
with low-fat diet.
Liver was dissected out, hepatocytes prepared …
From natural language to ‘computable’ concepts
Age value
Unit
Strain name
Subject of the experiment
Type of diet and
experimental condition
Anatomy part
Seven week old C57BL/6N mice were treated
with low-fat diet.
Liver was dissected out, hepatocytes prepared …
From natural language to ‘computable’ concepts
Type of protocol – cell preparation
Type of protocol - sample treatment
Type of protocol – liver preparation
How do you know what to report, or how to
structure it?
• Data/content standards:
• Structure, enrich and report the description of the
datasets and the experimental context under which they
were produced
• Facilitate the discovery, sharing, understanding and
reuse of datasets
Outline
• Data sharing
• Reusability and reproducibility
• How the lack of these affects scientific accountability and progress
• Experimental context
• What to report – what level of granularity
• How to report it – what format, structure
• Content standards
• How to find them
• Complying with repositories, funders and publishers
193
85
346
miame
MIAPA
MIRIAM
MIQAS
MIX
MIGEN
ARRIVE
MIAPE
MIASE
MIQE
MISFISHIE….
REMARK
CONSORT
MAGE-Tab
GCDML
SRAxml
SOFT
FASTA
DICOM
MzML
SBRML
SEDML…
GELML
ISA-Tab
CML
MITAB
AAO
CHEBI
OBI
PATO ENVO
MOD
BTO
IDO…
TEDDY
PRO
XAO
DO
VO
There are over 600 content standards in the life sciences
de jure de facto
grass-roots
groups
standard
organizations
Nanotechnology Working Group
Community mobilisation to develop content
standards
Databases have their own standards, e.g. at EBI:
Enablers: to better describe, share and query data
Enablers: to better describe, share and query data
• Minimum information
reporting requirements, or
checklists
o Report the same core,
essential information
• Minimum information
reporting requirements, or
checklists
o Report the same core,
essential information
• Controlled vocabularies, taxonomies,
thesauri, ontologies etc.
o Use the same word and refer to the same
‘thing’
Enablers: to better describe, share and query data
• Minimum information
reporting requirements, or
checklists
o Report the same core,
essential information
• Controlled vocabularies, taxonomies,
thesauri, ontologies etc.
o Use the same word and refer to the same
‘thing’
• Conceptual model,
conceptual schema, or
exchange formats
o Allow data to flow from one
system to another
Enablers: to better describe, share and query data
A web-based, curated and searchable registry ensuring that biological
standards and databases are registered, informative and discoverable; also
monitoring the development and evolution of standards, their use in databases
and the adoption of both in data policies.
Researchers, developers and curators lack support and guidance on how to best navigate and select
content standards, understand their maturity, or find databases that implement them;
Funders, journals and librarians do not have enough information to make informed decisions on which
content standards or database to recommended in policies, or fund or implement
Our mission: To help people make the right choice
Three interlinked registries
Work out which format your data should be in for
submission to a particular database
STANDARD DATABASE
Standards and databases (and policies) cross-linked
From simple and advanced searches
The International Conference on Systems Biology (ICSB), 22-28 August, 2008 Susanna-Assunta
Sansone www.ebi.ac.uk/net-project
Search and filter to find what is relevant to your type of data
The International Conference on Systems Biology (ICSB), 22-28 August, 2008 Susanna-Assunta
Sansone www.ebi.ac.uk/net-project
Tracking evolution, e.g. deprecations and
substitutions
The International Conference on Systems Biology (ICSB), 22-28 August, 2008 Susanna-Assunta
Sansone www.ebi.ac.uk/net-project
Tracking evolution, e.g. deprecations and
substitutions
Create your own Collection
The International Conference on Systems Biology (ICSB), 22-28 August, 2008 Susanna-Assunta
Sansone www.ebi.ac.uk/net-project
5
3
User profiles populated from ORCID...
The International Conference on Systems Biology (ICSB), 22-28 August, 2008 Susanna-Assunta
Sansone www.ebi.ac.uk/net-project
5
4
... credit for creating, contributing to, maintaining standards, databases and
policies
Ownership of open standards can be problematic in
broad, grass-root collaborations
It requires improved models, to encourage
maintenance of and contributions to these
efforts, rewards and incentives need to be
identified for all contributors to supporting the
continued development of standards
What you can do with BioSharing…
“Which standard should I use for this data, considering I’d
like to publish in journal X?
“Are we using the most up-to-date version of this standard?”
“My data is in X format, which databases take that format?
How can you use community-standards?
model and related
formats
These tools and formats will help you to:
The International Conference on Systems Biology (ICSB), 22-28 August, 2008 Susanna-Assunta
Sansone www.ebi.ac.uk/net-project
ISA powers data collection, curation resources and repositories, e.g.:
ISA
model and related
formats
1
Create template(s) to fit the type of
experiments to be described
Create templates detailing the steps to
be reported for different investigations,
complying to community standards in
e.g. configuring the value(s) allowed for
each field to be
• text (with/without regular expressions),
• ontology terms,
• numbers etc.
We have ‘ready to use’ community
standards compliant configurations
and can create more according to
user needs
• The ISA model records the data’s provenance, how it was generated and
where it is located.
• Published Data Descriptors are indexed in all major bibliographic indexing
services (incl. PubMed)
• However, accompanying every Data Descriptor article there are metadata files,
specifically created to aid discovery and understanding of the data itself.
• Using the ISA (Investigation, Study, Assay) model, these metadata files
provide a machine readable overview of the study that generated the data.
• Filter datasets by
data repository or
metadata
• Boolean searches
• Future enhancements:
- Statistics
- Richer queries based
on semantics of the data
ISA-explorer: A demo tool for discovering and exploring Scientific
Data’s ISA-tab metadata
ISA-explorer: A demo tool for discovering and exploring Scientific
Data’s ISA-tab metadata
Visualise the data
associated with
a paper
http://tinyurl.com/isaexplorer
• Reusability and reproducibility
o Is pivotal to drive science and discoveries
o Do your best to make your digital research outputs FAIR
• Experimental context
o Report the experimental context of your findings
o Do to your data what you wish that others would do to theirs
• Content standards
o Continuously evolving
o Make use of tools implementing standards, such as ISAtools
o Use biosharing.org to explore repositories, standards and policies
Summary
Acknowledgements
Find the right database for your data, and which data standard to
use – https://www.biosharing.org
Checking your data conforms to a standard, or making your own
templates – http://www.isa-tools.org
Where to keep research data: DCC checklist for evaluating data
repositories (DCC) - http://tinyurl.com/DCCResearchData
How and why you should manage your research data (JISC) -
http://tinyurl.com/JISCDMP
Useful links
How to share useful data

More Related Content

What's hot

Franz sterner tdwg 2016 new power balance needed for trustworthy biodiversity...
Franz sterner tdwg 2016 new power balance needed for trustworthy biodiversity...Franz sterner tdwg 2016 new power balance needed for trustworthy biodiversity...
Franz sterner tdwg 2016 new power balance needed for trustworthy biodiversity...taxonbytes
 
Structured Data & the Future of Educational Material
Structured Data & the Future of Educational MaterialStructured Data & the Future of Educational Material
Structured Data & the Future of Educational MaterialPaul Groth
 
CDL Tools for DataCite 2014
CDL Tools for DataCite 2014CDL Tools for DataCite 2014
CDL Tools for DataCite 2014Carly Strasser
 
Funders and Publishers: Agents of Change
Funders and Publishers: Agents of ChangeFunders and Publishers: Agents of Change
Funders and Publishers: Agents of ChangeCarly Strasser
 
The Dryad Digital Repository: Published evolutionary data as part of the gre...
The Dryad Digital Repository: Published evolutionary data as part of the gre...The Dryad Digital Repository: Published evolutionary data as part of the gre...
The Dryad Digital Repository: Published evolutionary data as part of the gre...Todd Vision
 
Franz ludaescher tdwg 2016 an update on taxonomic concept reasoning
Franz ludaescher tdwg 2016 an update on taxonomic concept reasoningFranz ludaescher tdwg 2016 an update on taxonomic concept reasoning
Franz ludaescher tdwg 2016 an update on taxonomic concept reasoningtaxonbytes
 
Genome sharing projects around the world nijmegen oct 29 - 2015
Genome sharing projects around the world   nijmegen oct 29 - 2015Genome sharing projects around the world   nijmegen oct 29 - 2015
Genome sharing projects around the world nijmegen oct 29 - 2015Fiona Nielsen
 
Data for Science: How Elsevier is using data science to empower researchers
Data for Science: How Elsevier is using data science to empower researchersData for Science: How Elsevier is using data science to empower researchers
Data for Science: How Elsevier is using data science to empower researchersPaul Groth
 
Small Science: First Impressions of Curation Needs. Presentation at Digital L...
Small Science: First Impressions of Curation Needs. Presentation at Digital L...Small Science: First Impressions of Curation Needs. Presentation at Digital L...
Small Science: First Impressions of Curation Needs. Presentation at Digital L...Sarah Shreeves
 
Developing data services: a tale from two Oregon universities
Developing data services: a tale from two Oregon universitiesDeveloping data services: a tale from two Oregon universities
Developing data services: a tale from two Oregon universitiesAmanda Whitmire
 
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015Carly Strasser
 
Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...
Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...
Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...GigaScience, BGI Hong Kong
 
Laurie Goodman at #SSPBoston: Article+Data+Tools Reproducibility, Reuse, & Ra...
Laurie Goodman at #SSPBoston: Article+Data+ToolsReproducibility, Reuse, & Ra...Laurie Goodman at #SSPBoston: Article+Data+ToolsReproducibility, Reuse, & Ra...
Laurie Goodman at #SSPBoston: Article+Data+Tools Reproducibility, Reuse, & Ra...GigaScience, BGI Hong Kong
 
Information systems on fish and marine genetic resources
Information systems on fish and marine genetic resourcesInformation systems on fish and marine genetic resources
Information systems on fish and marine genetic resourcesapaari
 
Introduction to open access and how it helps in your research and increases t...
Introduction to open access and how it helps in your research and increases t...Introduction to open access and how it helps in your research and increases t...
Introduction to open access and how it helps in your research and increases t...Iryna Kuchma
 
Information architecture at Elsevier
Information architecture at ElsevierInformation architecture at Elsevier
Information architecture at ElsevierPaul Groth
 
Is that a scientific report or just some cool pictures from the lab? Reproduc...
Is that a scientific report or just some cool pictures from the lab? Reproduc...Is that a scientific report or just some cool pictures from the lab? Reproduc...
Is that a scientific report or just some cool pictures from the lab? Reproduc...Greg Landrum
 
The Fourth Paradigm - Deltares Data Science Day, 31 October 2014
The Fourth Paradigm - Deltares Data Science Day, 31 October 2014The Fourth Paradigm - Deltares Data Science Day, 31 October 2014
The Fourth Paradigm - Deltares Data Science Day, 31 October 2014Microsoft Azure for Research
 
Peter Li: GigaDB and Galaxy - revolutionizing data dissemination, organizatio...
Peter Li: GigaDB and Galaxy - revolutionizing data dissemination, organizatio...Peter Li: GigaDB and Galaxy - revolutionizing data dissemination, organizatio...
Peter Li: GigaDB and Galaxy - revolutionizing data dissemination, organizatio...GigaScience, BGI Hong Kong
 

What's hot (20)

Franz sterner tdwg 2016 new power balance needed for trustworthy biodiversity...
Franz sterner tdwg 2016 new power balance needed for trustworthy biodiversity...Franz sterner tdwg 2016 new power balance needed for trustworthy biodiversity...
Franz sterner tdwg 2016 new power balance needed for trustworthy biodiversity...
 
Structured Data & the Future of Educational Material
Structured Data & the Future of Educational MaterialStructured Data & the Future of Educational Material
Structured Data & the Future of Educational Material
 
CDL Tools for DataCite 2014
CDL Tools for DataCite 2014CDL Tools for DataCite 2014
CDL Tools for DataCite 2014
 
Funders and Publishers: Agents of Change
Funders and Publishers: Agents of ChangeFunders and Publishers: Agents of Change
Funders and Publishers: Agents of Change
 
The Dryad Digital Repository: Published evolutionary data as part of the gre...
The Dryad Digital Repository: Published evolutionary data as part of the gre...The Dryad Digital Repository: Published evolutionary data as part of the gre...
The Dryad Digital Repository: Published evolutionary data as part of the gre...
 
Franz ludaescher tdwg 2016 an update on taxonomic concept reasoning
Franz ludaescher tdwg 2016 an update on taxonomic concept reasoningFranz ludaescher tdwg 2016 an update on taxonomic concept reasoning
Franz ludaescher tdwg 2016 an update on taxonomic concept reasoning
 
Genome sharing projects around the world nijmegen oct 29 - 2015
Genome sharing projects around the world   nijmegen oct 29 - 2015Genome sharing projects around the world   nijmegen oct 29 - 2015
Genome sharing projects around the world nijmegen oct 29 - 2015
 
Data for Science: How Elsevier is using data science to empower researchers
Data for Science: How Elsevier is using data science to empower researchersData for Science: How Elsevier is using data science to empower researchers
Data for Science: How Elsevier is using data science to empower researchers
 
Small Science: First Impressions of Curation Needs. Presentation at Digital L...
Small Science: First Impressions of Curation Needs. Presentation at Digital L...Small Science: First Impressions of Curation Needs. Presentation at Digital L...
Small Science: First Impressions of Curation Needs. Presentation at Digital L...
 
Developing data services: a tale from two Oregon universities
Developing data services: a tale from two Oregon universitiesDeveloping data services: a tale from two Oregon universities
Developing data services: a tale from two Oregon universities
 
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
 
Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...
Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...
Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...
 
Laurie Goodman at #SSPBoston: Article+Data+Tools Reproducibility, Reuse, & Ra...
Laurie Goodman at #SSPBoston: Article+Data+ToolsReproducibility, Reuse, & Ra...Laurie Goodman at #SSPBoston: Article+Data+ToolsReproducibility, Reuse, & Ra...
Laurie Goodman at #SSPBoston: Article+Data+Tools Reproducibility, Reuse, & Ra...
 
Information systems on fish and marine genetic resources
Information systems on fish and marine genetic resourcesInformation systems on fish and marine genetic resources
Information systems on fish and marine genetic resources
 
METRO RDM Webinar
METRO RDM WebinarMETRO RDM Webinar
METRO RDM Webinar
 
Introduction to open access and how it helps in your research and increases t...
Introduction to open access and how it helps in your research and increases t...Introduction to open access and how it helps in your research and increases t...
Introduction to open access and how it helps in your research and increases t...
 
Information architecture at Elsevier
Information architecture at ElsevierInformation architecture at Elsevier
Information architecture at Elsevier
 
Is that a scientific report or just some cool pictures from the lab? Reproduc...
Is that a scientific report or just some cool pictures from the lab? Reproduc...Is that a scientific report or just some cool pictures from the lab? Reproduc...
Is that a scientific report or just some cool pictures from the lab? Reproduc...
 
The Fourth Paradigm - Deltares Data Science Day, 31 October 2014
The Fourth Paradigm - Deltares Data Science Day, 31 October 2014The Fourth Paradigm - Deltares Data Science Day, 31 October 2014
The Fourth Paradigm - Deltares Data Science Day, 31 October 2014
 
Peter Li: GigaDB and Galaxy - revolutionizing data dissemination, organizatio...
Peter Li: GigaDB and Galaxy - revolutionizing data dissemination, organizatio...Peter Li: GigaDB and Galaxy - revolutionizing data dissemination, organizatio...
Peter Li: GigaDB and Galaxy - revolutionizing data dissemination, organizatio...
 

Viewers also liked

Computational Thinking
Computational ThinkingComputational Thinking
Computational Thinkingshowslidedump
 
Big data in biology
Big data in biologyBig data in biology
Big data in biologyOmkar Reddy
 
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | EUDAT
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data ManagementJamie Bisset
 
Why the world needs phenopacketeers, and how to be one
Why the world needs phenopacketeers, and how to be oneWhy the world needs phenopacketeers, and how to be one
Why the world needs phenopacketeers, and how to be onemhaendel
 
Graph Databases: Trends in the Web of Data
Graph Databases: Trends in the Web of DataGraph Databases: Trends in the Web of Data
Graph Databases: Trends in the Web of DataMarko Rodriguez
 

Viewers also liked (6)

Computational Thinking
Computational ThinkingComputational Thinking
Computational Thinking
 
Big data in biology
Big data in biologyBig data in biology
Big data in biology
 
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
 
Why the world needs phenopacketeers, and how to be one
Why the world needs phenopacketeers, and how to be oneWhy the world needs phenopacketeers, and how to be one
Why the world needs phenopacketeers, and how to be one
 
Graph Databases: Trends in the Web of Data
Graph Databases: Trends in the Web of DataGraph Databases: Trends in the Web of Data
Graph Databases: Trends in the Web of Data
 

Similar to How to share useful data

NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataNIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataSusanna-Assunta Sansone
 
NC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
NC3Rs Publication Bias workshop - Sansone - Better Data = Better ScienceNC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
NC3Rs Publication Bias workshop - Sansone - Better Data = Better ScienceSusanna-Assunta Sansone
 
Oxford DTP - Sansone curation tools - Dec 2014
Oxford DTP - Sansone curation tools - Dec 2014Oxford DTP - Sansone curation tools - Dec 2014
Oxford DTP - Sansone curation tools - Dec 2014Susanna-Assunta Sansone
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 Scott Edmunds
 
Data Management for Research (New Faculty Orientation)
Data Management for Research (New Faculty Orientation)Data Management for Research (New Faculty Orientation)
Data Management for Research (New Faculty Orientation)aaroncollie
 
Scientific Data overview of Data Descriptors - WT Data-Literature integration...
Scientific Data overview of Data Descriptors - WT Data-Literature integration...Scientific Data overview of Data Descriptors - WT Data-Literature integration...
Scientific Data overview of Data Descriptors - WT Data-Literature integration...Susanna-Assunta Sansone
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 
2018 Bio-IT World Agile in Wet Labs Speeds Big Data
2018 Bio-IT World Agile in Wet Labs Speeds Big Data2018 Bio-IT World Agile in Wet Labs Speeds Big Data
2018 Bio-IT World Agile in Wet Labs Speeds Big DataBruce Kozuma
 
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014Susanna-Assunta Sansone
 
Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014Susanna-Assunta Sansone
 
BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG ...
BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG ...BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG ...
BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG ...Peter McQuilton
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data ChallengesPhilip Bourne
 
BioSharing - mapping the landscape of Standards, Databases and Data policies ...
BioSharing - mapping the landscape of Standards, Databases and Data policies ...BioSharing - mapping the landscape of Standards, Databases and Data policies ...
BioSharing - mapping the landscape of Standards, Databases and Data policies ...Peter McQuilton
 
Records professionals and Research Data - a new role?
Records professionals and Research Data - a new role?Records professionals and Research Data - a new role?
Records professionals and Research Data - a new role?Rebecca Grant
 
California Ocean Science Trust " Building a Sustainable Knowledge Base for ...
California Ocean Science Trust " Building a Sustainable Knowledge Base for ...California Ocean Science Trust " Building a Sustainable Knowledge Base for ...
California Ocean Science Trust " Building a Sustainable Knowledge Base for ...Tom Moritz
 
Metadata challenges research and re-usable data - BioSharing, ISA and STATO
Metadata challenges research and re-usable data - BioSharing, ISA and STATOMetadata challenges research and re-usable data - BioSharing, ISA and STATO
Metadata challenges research and re-usable data - BioSharing, ISA and STATOAlejandra Gonzalez-Beltran
 
SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...
SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...
SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...Susanna-Assunta Sansone
 
Best Practice in Data Management and Sharing
Best Practice in Data Management and Sharing Best Practice in Data Management and Sharing
Best Practice in Data Management and Sharing Mojtaba Lotfaliany
 
RDA Publishing Workflows
RDA Publishing WorkflowsRDA Publishing Workflows
RDA Publishing WorkflowsPeter McQuilton
 

Similar to How to share useful data (20)

NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataNIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
 
NC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
NC3Rs Publication Bias workshop - Sansone - Better Data = Better ScienceNC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
NC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
 
Oxford DTP - Sansone curation tools - Dec 2014
Oxford DTP - Sansone curation tools - Dec 2014Oxford DTP - Sansone curation tools - Dec 2014
Oxford DTP - Sansone curation tools - Dec 2014
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9
 
Data Management for Research (New Faculty Orientation)
Data Management for Research (New Faculty Orientation)Data Management for Research (New Faculty Orientation)
Data Management for Research (New Faculty Orientation)
 
Scientific Data overview of Data Descriptors - WT Data-Literature integration...
Scientific Data overview of Data Descriptors - WT Data-Literature integration...Scientific Data overview of Data Descriptors - WT Data-Literature integration...
Scientific Data overview of Data Descriptors - WT Data-Literature integration...
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
2018 Bio-IT World Agile in Wet Labs Speeds Big Data
2018 Bio-IT World Agile in Wet Labs Speeds Big Data2018 Bio-IT World Agile in Wet Labs Speeds Big Data
2018 Bio-IT World Agile in Wet Labs Speeds Big Data
 
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
 
Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014
 
BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG ...
BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG ...BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG ...
BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG ...
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data Challenges
 
BioSharing - mapping the landscape of Standards, Databases and Data policies ...
BioSharing - mapping the landscape of Standards, Databases and Data policies ...BioSharing - mapping the landscape of Standards, Databases and Data policies ...
BioSharing - mapping the landscape of Standards, Databases and Data policies ...
 
Records professionals and Research Data - a new role?
Records professionals and Research Data - a new role?Records professionals and Research Data - a new role?
Records professionals and Research Data - a new role?
 
California Ocean Science Trust " Building a Sustainable Knowledge Base for ...
California Ocean Science Trust " Building a Sustainable Knowledge Base for ...California Ocean Science Trust " Building a Sustainable Knowledge Base for ...
California Ocean Science Trust " Building a Sustainable Knowledge Base for ...
 
Metadata challenges research and re-usable data - BioSharing, ISA and STATO
Metadata challenges research and re-usable data - BioSharing, ISA and STATOMetadata challenges research and re-usable data - BioSharing, ISA and STATO
Metadata challenges research and re-usable data - BioSharing, ISA and STATO
 
SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...
SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...
SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...
 
Enhance your rese​arch impact through open science
Enhance your rese​arch impact through open scienceEnhance your rese​arch impact through open science
Enhance your rese​arch impact through open science
 
Best Practice in Data Management and Sharing
Best Practice in Data Management and Sharing Best Practice in Data Management and Sharing
Best Practice in Data Management and Sharing
 
RDA Publishing Workflows
RDA Publishing WorkflowsRDA Publishing Workflows
RDA Publishing Workflows
 

More from Peter McQuilton

terms4FAIRskills - RDA VP17 - April 2021
terms4FAIRskills - RDA VP17 - April 2021terms4FAIRskills - RDA VP17 - April 2021
terms4FAIRskills - RDA VP17 - April 2021Peter McQuilton
 
RDA P16 - Repository Selection Criteria - Funders IG Breakout 8
RDA P16 - Repository Selection Criteria - Funders IG Breakout 8 RDA P16 - Repository Selection Criteria - Funders IG Breakout 8
RDA P16 - Repository Selection Criteria - Funders IG Breakout 8 Peter McQuilton
 
FAIRsharing: more than a registry
FAIRsharing: more than a registryFAIRsharing: more than a registry
FAIRsharing: more than a registryPeter McQuilton
 
FAIRsharing - ENVRI-FAIR Webinar
FAIRsharing - ENVRI-FAIR WebinarFAIRsharing - ENVRI-FAIR Webinar
FAIRsharing - ENVRI-FAIR WebinarPeter McQuilton
 
FAIR StRePo - GO TRAIN Workshop, Hamburg, November 2019
FAIR StRePo - GO TRAIN Workshop, Hamburg, November 2019FAIR StRePo - GO TRAIN Workshop, Hamburg, November 2019
FAIR StRePo - GO TRAIN Workshop, Hamburg, November 2019Peter McQuilton
 
FAIRsharing - connecting standards, repositories and data policies across agr...
FAIRsharing - connecting standards, repositories and data policies across agr...FAIRsharing - connecting standards, repositories and data policies across agr...
FAIRsharing - connecting standards, repositories and data policies across agr...Peter McQuilton
 
FAIRsharing - manually curated metadata on standards, repositories and data p...
FAIRsharing - manually curated metadata on standards, repositories and data p...FAIRsharing - manually curated metadata on standards, repositories and data p...
FAIRsharing - manually curated metadata on standards, repositories and data p...Peter McQuilton
 
Making Repositories FAIR (via metadata in FAIRsharing.org
Making Repositories FAIR (via metadata in FAIRsharing.orgMaking Repositories FAIR (via metadata in FAIRsharing.org
Making Repositories FAIR (via metadata in FAIRsharing.orgPeter McQuilton
 
Bridging Semantics and Repositories
Bridging Semantics and RepositoriesBridging Semantics and Repositories
Bridging Semantics and RepositoriesPeter McQuilton
 
FAIRsharing - Mapping the Landscape of Databases, Repositories, Standards and...
FAIRsharing - Mapping the Landscape of Databases, Repositories, Standards and...FAIRsharing - Mapping the Landscape of Databases, Repositories, Standards and...
FAIRsharing - Mapping the Landscape of Databases, Repositories, Standards and...Peter McQuilton
 
RDA UK - FAIRsharing WG output
RDA UK - FAIRsharing WG outputRDA UK - FAIRsharing WG output
RDA UK - FAIRsharing WG outputPeter McQuilton
 
FAIRsharing and Engineering Research Data Management
FAIRsharing and Engineering Research Data ManagementFAIRsharing and Engineering Research Data Management
FAIRsharing and Engineering Research Data ManagementPeter McQuilton
 
FAIRsharing presentation to IUPAC Workshop
FAIRsharing presentation to IUPAC WorkshopFAIRsharing presentation to IUPAC Workshop
FAIRsharing presentation to IUPAC WorkshopPeter McQuilton
 
RDA Data Innovation Forum: FAIRsharing.org, an output of the joint RDA/Force ...
RDA Data Innovation Forum: FAIRsharing.org, an output of the joint RDA/Force ...RDA Data Innovation Forum: FAIRsharing.org, an output of the joint RDA/Force ...
RDA Data Innovation Forum: FAIRsharing.org, an output of the joint RDA/Force ...Peter McQuilton
 
ELIXIR Standards and Formats: ISA Tools and FAIRsharing
ELIXIR Standards and Formats: ISA Tools and FAIRsharingELIXIR Standards and Formats: ISA Tools and FAIRsharing
ELIXIR Standards and Formats: ISA Tools and FAIRsharingPeter McQuilton
 
FAIR landscape in ELIXIR: FAIR metrics and other initiatives
FAIR landscape in ELIXIR: FAIR metrics and other initiativesFAIR landscape in ELIXIR: FAIR metrics and other initiatives
FAIR landscape in ELIXIR: FAIR metrics and other initiativesPeter McQuilton
 
FAIRsharing Presentation at the EOSCpilot data interoperability technical wor...
FAIRsharing Presentation at the EOSCpilot data interoperability technical wor...FAIRsharing Presentation at the EOSCpilot data interoperability technical wor...
FAIRsharing Presentation at the EOSCpilot data interoperability technical wor...Peter McQuilton
 
FAIRsharing Keynote - International Workshop on Sharing, Citation and Publica...
FAIRsharing Keynote - International Workshop on Sharing, Citation and Publica...FAIRsharing Keynote - International Workshop on Sharing, Citation and Publica...
FAIRsharing Keynote - International Workshop on Sharing, Citation and Publica...Peter McQuilton
 
FAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology AgencyFAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology AgencyPeter McQuilton
 
RDA BioSharing WG/ELIXIR Session Montreal 2017
RDA BioSharing WG/ELIXIR Session Montreal 2017RDA BioSharing WG/ELIXIR Session Montreal 2017
RDA BioSharing WG/ELIXIR Session Montreal 2017Peter McQuilton
 

More from Peter McQuilton (20)

terms4FAIRskills - RDA VP17 - April 2021
terms4FAIRskills - RDA VP17 - April 2021terms4FAIRskills - RDA VP17 - April 2021
terms4FAIRskills - RDA VP17 - April 2021
 
RDA P16 - Repository Selection Criteria - Funders IG Breakout 8
RDA P16 - Repository Selection Criteria - Funders IG Breakout 8 RDA P16 - Repository Selection Criteria - Funders IG Breakout 8
RDA P16 - Repository Selection Criteria - Funders IG Breakout 8
 
FAIRsharing: more than a registry
FAIRsharing: more than a registryFAIRsharing: more than a registry
FAIRsharing: more than a registry
 
FAIRsharing - ENVRI-FAIR Webinar
FAIRsharing - ENVRI-FAIR WebinarFAIRsharing - ENVRI-FAIR Webinar
FAIRsharing - ENVRI-FAIR Webinar
 
FAIR StRePo - GO TRAIN Workshop, Hamburg, November 2019
FAIR StRePo - GO TRAIN Workshop, Hamburg, November 2019FAIR StRePo - GO TRAIN Workshop, Hamburg, November 2019
FAIR StRePo - GO TRAIN Workshop, Hamburg, November 2019
 
FAIRsharing - connecting standards, repositories and data policies across agr...
FAIRsharing - connecting standards, repositories and data policies across agr...FAIRsharing - connecting standards, repositories and data policies across agr...
FAIRsharing - connecting standards, repositories and data policies across agr...
 
FAIRsharing - manually curated metadata on standards, repositories and data p...
FAIRsharing - manually curated metadata on standards, repositories and data p...FAIRsharing - manually curated metadata on standards, repositories and data p...
FAIRsharing - manually curated metadata on standards, repositories and data p...
 
Making Repositories FAIR (via metadata in FAIRsharing.org
Making Repositories FAIR (via metadata in FAIRsharing.orgMaking Repositories FAIR (via metadata in FAIRsharing.org
Making Repositories FAIR (via metadata in FAIRsharing.org
 
Bridging Semantics and Repositories
Bridging Semantics and RepositoriesBridging Semantics and Repositories
Bridging Semantics and Repositories
 
FAIRsharing - Mapping the Landscape of Databases, Repositories, Standards and...
FAIRsharing - Mapping the Landscape of Databases, Repositories, Standards and...FAIRsharing - Mapping the Landscape of Databases, Repositories, Standards and...
FAIRsharing - Mapping the Landscape of Databases, Repositories, Standards and...
 
RDA UK - FAIRsharing WG output
RDA UK - FAIRsharing WG outputRDA UK - FAIRsharing WG output
RDA UK - FAIRsharing WG output
 
FAIRsharing and Engineering Research Data Management
FAIRsharing and Engineering Research Data ManagementFAIRsharing and Engineering Research Data Management
FAIRsharing and Engineering Research Data Management
 
FAIRsharing presentation to IUPAC Workshop
FAIRsharing presentation to IUPAC WorkshopFAIRsharing presentation to IUPAC Workshop
FAIRsharing presentation to IUPAC Workshop
 
RDA Data Innovation Forum: FAIRsharing.org, an output of the joint RDA/Force ...
RDA Data Innovation Forum: FAIRsharing.org, an output of the joint RDA/Force ...RDA Data Innovation Forum: FAIRsharing.org, an output of the joint RDA/Force ...
RDA Data Innovation Forum: FAIRsharing.org, an output of the joint RDA/Force ...
 
ELIXIR Standards and Formats: ISA Tools and FAIRsharing
ELIXIR Standards and Formats: ISA Tools and FAIRsharingELIXIR Standards and Formats: ISA Tools and FAIRsharing
ELIXIR Standards and Formats: ISA Tools and FAIRsharing
 
FAIR landscape in ELIXIR: FAIR metrics and other initiatives
FAIR landscape in ELIXIR: FAIR metrics and other initiativesFAIR landscape in ELIXIR: FAIR metrics and other initiatives
FAIR landscape in ELIXIR: FAIR metrics and other initiatives
 
FAIRsharing Presentation at the EOSCpilot data interoperability technical wor...
FAIRsharing Presentation at the EOSCpilot data interoperability technical wor...FAIRsharing Presentation at the EOSCpilot data interoperability technical wor...
FAIRsharing Presentation at the EOSCpilot data interoperability technical wor...
 
FAIRsharing Keynote - International Workshop on Sharing, Citation and Publica...
FAIRsharing Keynote - International Workshop on Sharing, Citation and Publica...FAIRsharing Keynote - International Workshop on Sharing, Citation and Publica...
FAIRsharing Keynote - International Workshop on Sharing, Citation and Publica...
 
FAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology AgencyFAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology Agency
 
RDA BioSharing WG/ELIXIR Session Montreal 2017
RDA BioSharing WG/ELIXIR Session Montreal 2017RDA BioSharing WG/ELIXIR Session Montreal 2017
RDA BioSharing WG/ELIXIR Session Montreal 2017
 

Recently uploaded

MK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxMK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxUnduhUnggah1
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
LLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGILLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGIThomas Poetter
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...ttt fff
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一F sss
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 

Recently uploaded (20)

MK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxMK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docx
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
LLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGILLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGI
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 

How to share useful data

  • 1. How to share useful data Peter McQuilton Biosharing.org @drosophilic
  • 2. Outline • Data sharing • Reusability and reproducibility • How the lack of these affects scientific accountability and progress • Experimental context • What to report – what level of granularity • How to report it – what format, structure • Content standards • How to find them • Complying with repositories, funders and publishers
  • 3. Outline • Data sharing • Reusability and reproducibility • How the lack of these affects scientific accountability and progress • Experimental context • What to report – what level of granularity • How to report it – what format, structure • Content standards • How to find them • Complying with repositories, funders and publishers
  • 4. Research data life cycle Image credit to:
  • 6. A community mobilization for “openness” image by Greg Emmerich http://discovery.urlibraries.org/ https://okfn.org Open data is a means to do better science more efficiently http://pantonprinciples.org https://creativecommons.org
  • 7. Growing movement for FAIR data and research outputs
  • 8. But in all fairness, not much data is FAIR!
  • 9. But in all fairness, not much data is FAIR!
  • 10. But in all fairness, not much data is FAIR!
  • 11. “Reproducing the method took several months of effort, and required using new versions and new software that posed challenges to reconstructing and validating the results” Unfairness in both experimental and computation areas
  • 12. • Not always well cited, stored o Software, codes, workflows are hard(er) to get hold of • Poorly described for third party reuse o Different level of detail and annotation • Curation activities are perceived as time consuming o Collection and harmonization of detailed methods and experimental steps is rushed at the publication stage Not very FAIR: low findability and understandability
  • 13. • Effectively document your data so that it can be understood in the future • Periodically move data to new storage media (drives degrade over time) • Keep more than one copy of data (local and cloud) • Migrate data to new software versions • Use a well documented and supported format Ideally this should be covered in a data management plan at the start of a project, so that you can factor any associated time and resources into your budget. What can I do to ensure my data are shareable/usable in the future?
  • 14. Outline • Data sharing • Reusability and reproducibility • How the lack of these affects scientific accountability and progress • Experimental context - standards • What to report – what level of granularity • How to report it – what format, structure • Content standards • How to find them • Complying with repositories, funders and publishers
  • 15. Do you know what this is? LS1_C2_LD_TP2_P1 file1-fastq.gz
  • 16. …how NOT to report the experimental information! LS1_C2_LD_TP2_P1 file1-fastq.gz
  • 17. …how NOT to report the experimental information! Sample name (?!) Data file LS1_C2_LD_TP2_P1 file1-fastq.gz
  • 18. We need to clearly describe the information • LS1 liver sample 1 • C2 compound 2 • LD low dose • TP2 time point 2 • P1 protocol 1 • file1-fastq.gz compressed data file for sequence information corresponding to this sample Sample name (?!) Data file LS1_C2_LD_TP2_P1 file1-fastq.gz
  • 19. Without context data is meaningless
  • 20. Without context data is meaningless
  • 21. Without context data is meaningless
  • 22. Without context data is meaningless
  • 23. • We need to report sufficient information to reuse the dataset • We must strike a balance between depth and breadth of information Information intensive experiments
  • 24. Information intensive experiments • Not too much • Not too little • ….just right
  • 25. Seven week old C57BL/6N mice were treated with low-fat diet. Liver was dissected out, hepatocytes prepared… From natural language to ‘computable’ concepts
  • 26. Age value? Unit? Strain name Subject of the experiment Type of diet and experimental condition Anatomy part Seven week old C57BL/6N mice were treated with low-fat diet. Liver was dissected out, hepatocytes prepared … From natural language to ‘computable’ concepts
  • 27. Age value Unit Strain name? Subject of the experiment? Type of diet and experimental condition Anatomy part Seven week old C57BL/6N mice were treated with low-fat diet. Liver was dissected out, hepatocytes prepared … From natural language to ‘computable’ concepts
  • 28. Age value Unit Strain name Subject of the experiment Type of diet and experimental condition? Anatomy part Seven week old C57BL/6N mice were treated with low-fat diet. Liver was dissected out, hepatocytes prepared … From natural language to ‘computable’ concepts
  • 29. Age value Unit Strain name Subject of the experiment Type of diet and experimental condition Anatomy part? Seven week old C57BL/6N mice were treated with low-fat diet. Liver was dissected out, hepatocytes prepared … From natural language to ‘computable’ concepts
  • 30. Age value Unit Strain name Subject of the experiment Type of diet and experimental condition Anatomy part Seven week old C57BL/6N mice were treated with low-fat diet. Liver was dissected out, hepatocytes prepared … From natural language to ‘computable’ concepts
  • 31. Age value Unit Strain name Subject of the experiment Type of diet and experimental condition Anatomy part Seven week old C57BL/6N mice were treated with low-fat diet. Liver was dissected out, hepatocytes prepared … From natural language to ‘computable’ concepts Type of protocol – cell preparation Type of protocol - sample treatment Type of protocol – liver preparation
  • 32. How do you know what to report, or how to structure it? • Data/content standards: • Structure, enrich and report the description of the datasets and the experimental context under which they were produced • Facilitate the discovery, sharing, understanding and reuse of datasets
  • 33. Outline • Data sharing • Reusability and reproducibility • How the lack of these affects scientific accountability and progress • Experimental context • What to report – what level of granularity • How to report it – what format, structure • Content standards • How to find them • Complying with repositories, funders and publishers
  • 35. de jure de facto grass-roots groups standard organizations Nanotechnology Working Group Community mobilisation to develop content standards
  • 36. Databases have their own standards, e.g. at EBI:
  • 37. Enablers: to better describe, share and query data
  • 38. Enablers: to better describe, share and query data • Minimum information reporting requirements, or checklists o Report the same core, essential information
  • 39. • Minimum information reporting requirements, or checklists o Report the same core, essential information • Controlled vocabularies, taxonomies, thesauri, ontologies etc. o Use the same word and refer to the same ‘thing’ Enablers: to better describe, share and query data
  • 40. • Minimum information reporting requirements, or checklists o Report the same core, essential information • Controlled vocabularies, taxonomies, thesauri, ontologies etc. o Use the same word and refer to the same ‘thing’ • Conceptual model, conceptual schema, or exchange formats o Allow data to flow from one system to another Enablers: to better describe, share and query data
  • 41. A web-based, curated and searchable registry ensuring that biological standards and databases are registered, informative and discoverable; also monitoring the development and evolution of standards, their use in databases and the adoption of both in data policies.
  • 42. Researchers, developers and curators lack support and guidance on how to best navigate and select content standards, understand their maturity, or find databases that implement them; Funders, journals and librarians do not have enough information to make informed decisions on which content standards or database to recommended in policies, or fund or implement Our mission: To help people make the right choice
  • 44. Work out which format your data should be in for submission to a particular database
  • 45. STANDARD DATABASE Standards and databases (and policies) cross-linked
  • 46. From simple and advanced searches
  • 47. The International Conference on Systems Biology (ICSB), 22-28 August, 2008 Susanna-Assunta Sansone www.ebi.ac.uk/net-project Search and filter to find what is relevant to your type of data
  • 48. The International Conference on Systems Biology (ICSB), 22-28 August, 2008 Susanna-Assunta Sansone www.ebi.ac.uk/net-project Tracking evolution, e.g. deprecations and substitutions
  • 49. The International Conference on Systems Biology (ICSB), 22-28 August, 2008 Susanna-Assunta Sansone www.ebi.ac.uk/net-project Tracking evolution, e.g. deprecations and substitutions
  • 50. Create your own Collection
  • 51.
  • 52.
  • 53. The International Conference on Systems Biology (ICSB), 22-28 August, 2008 Susanna-Assunta Sansone www.ebi.ac.uk/net-project 5 3 User profiles populated from ORCID...
  • 54. The International Conference on Systems Biology (ICSB), 22-28 August, 2008 Susanna-Assunta Sansone www.ebi.ac.uk/net-project 5 4 ... credit for creating, contributing to, maintaining standards, databases and policies Ownership of open standards can be problematic in broad, grass-root collaborations It requires improved models, to encourage maintenance of and contributions to these efforts, rewards and incentives need to be identified for all contributors to supporting the continued development of standards
  • 55. What you can do with BioSharing… “Which standard should I use for this data, considering I’d like to publish in journal X? “Are we using the most up-to-date version of this standard?” “My data is in X format, which databases take that format?
  • 56. How can you use community-standards? model and related formats These tools and formats will help you to:
  • 57. The International Conference on Systems Biology (ICSB), 22-28 August, 2008 Susanna-Assunta Sansone www.ebi.ac.uk/net-project ISA powers data collection, curation resources and repositories, e.g.: ISA model and related formats
  • 58.
  • 59. 1 Create template(s) to fit the type of experiments to be described Create templates detailing the steps to be reported for different investigations, complying to community standards in e.g. configuring the value(s) allowed for each field to be • text (with/without regular expressions), • ontology terms, • numbers etc. We have ‘ready to use’ community standards compliant configurations and can create more according to user needs
  • 60. • The ISA model records the data’s provenance, how it was generated and where it is located. • Published Data Descriptors are indexed in all major bibliographic indexing services (incl. PubMed) • However, accompanying every Data Descriptor article there are metadata files, specifically created to aid discovery and understanding of the data itself. • Using the ISA (Investigation, Study, Assay) model, these metadata files provide a machine readable overview of the study that generated the data.
  • 61. • Filter datasets by data repository or metadata • Boolean searches • Future enhancements: - Statistics - Richer queries based on semantics of the data ISA-explorer: A demo tool for discovering and exploring Scientific Data’s ISA-tab metadata
  • 62. ISA-explorer: A demo tool for discovering and exploring Scientific Data’s ISA-tab metadata Visualise the data associated with a paper http://tinyurl.com/isaexplorer
  • 63. • Reusability and reproducibility o Is pivotal to drive science and discoveries o Do your best to make your digital research outputs FAIR • Experimental context o Report the experimental context of your findings o Do to your data what you wish that others would do to theirs • Content standards o Continuously evolving o Make use of tools implementing standards, such as ISAtools o Use biosharing.org to explore repositories, standards and policies Summary
  • 65. Find the right database for your data, and which data standard to use – https://www.biosharing.org Checking your data conforms to a standard, or making your own templates – http://www.isa-tools.org Where to keep research data: DCC checklist for evaluating data repositories (DCC) - http://tinyurl.com/DCCResearchData How and why you should manage your research data (JISC) - http://tinyurl.com/JISCDMP Useful links