A 30 minute webinar presented on behalf of the RDA/Force11 BioSharing WG, covering our work to map data standards, databases, and data policies in the life, biomedical and environmental sciences.
RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences
1. Webinar
Mapping the landscape of data standards, data repositories and
data policies: Use case – The Life Sciences (biomed, agri, environ)
Peter McQuilton, PhD
On behalf of the joint RDA/Force11 BioSharing WG
2. What were the aims of the RDA/Force11
BioSharing WG
• To develop guidelines for linking information on
databases, content standards and journal and
funder data policies in the life sciences
• To develop a curated registry (running since 2011),
to access and cross-search this information, such
that a variety of stakeholders can make decisions
on which standards and databases to use or
endorse
https://www.biosharing.org
3. What were the aims of the RDA/Force11
BioSharing WG
• To develop guidelines for linking information on
databases, content standards and journal and
funder data policies in the life sciences
• To develop a curated registry (running since 2011),
to access and cross-search this information, such
that a variety of stakeholders can make decisions
on which standards and databases to use or
endorse
https://www.biosharing.orghttps://www.FAIRsharing.org
In preparation over the summer
4.
5. • A web-based, curated and searchable portal that monitors the
development and evolution of standards, their use in
databases and the adoption of both in data policies,
to inform and educate
6. Data policies Databases
Content standards
Formats Terminologies Guidelines
Models/Formats = Conceptual
model, conceptual schema,
exchange formats
Terminologies = Controlled
vocabularies, taxonomies,
thesauri, ontologies etc.
Guidelines = Minimum information
reporting requirements, checklists
Mapping a complex and evolving
landscape
7. Content standards
Data policies Databases
Formats Terminologies Guidelines
By grass-roots
groups, e.g.:
By standard
organizations, e.g.:
Mapping a complex and evolving
landscape
8. Formats Terminologies Guidelines
225
115
554
source source
source
MIAME
MIRIAM
MIQAS
MIX
MIGEN
ARRIVE
MIAPE
MIASE
MIQE
MISFISHIE….
REMARK
CONSORT
SRAxml
SOFT FASTA
DICOM
MzML
SBRML
SEDML…
GELML
ISA
CML
MITAB
AAO
CHEBIOBI
PATO ENVO
MOD
BTO
IDO…
TEDDY
PRO
XAO
DO
VO
Content standards in numbers
Statistics as of May 2017
14. My funder’s data policy recommends the use of established
standards, but which are widely endorsed and applicable to
my crop data?
Am I using the most up-to-date
version of this terminology to
annotate cell-based assays?
I understand this format has been
deprecated; what has it been replaced by
and is it appropriate for this data?
Which are the mature standards
and standards-compliant databases
that we should recommend to our
authors?
Helping users make the right decision
20. Ready for use, implementation, or recommendation
In development
Status uncertain
Deprecated as subsumed or superseded
Manually curated and verified
by the community behind each
resource
Mapping a complex and evolving landscape
– indicators of status
21. Collections group together one or
more types of resource by
domain, project or organization.
Recommendations are a core-set
of resources that are selected and
recommended by a funder or
journal data policy.
All collections and
recommendations created
between BioSharing and the
collection/recommendation
owner.
Grouping records
25. “The interactive browser will allow us to discover which databases and editorial standards are not currently included
in our author guidelines, enabling us to regularly monitor and refine our policies as appropriate, in support of our
mission to help our authors enhance the reproducibility of their work.” – Holly, F1000
26. Collections group together one or
more types of resource by
domain, project or organization.
Recommendations are a core-set
of resources that are selected and
recommended by a funder or
journal data policy.
All collections and
recommendations created
between BioSharing and the
collection/recommendation
owner.
Grouping records
27.
28. Standard developing groups:Journal, publishers:
Cross-links, data exchange:
Societies and organisations: Institutional RDM services:
Projects, programmes:
Working with the community/adopters