A 45 minute webinar presented to the AMIA (American Medical Informatics Association - www.amia.org) in May 2016 on BioSharing, a curated, searchable portal of inter-related data standards, databases, and policies in the life, environmental and biomedical sciences. We cover how we describe standards, how one can search using our simple, advanced and faceted search, how our wizard can guide you, and how our recommendations from journal data policies can aid your selection of metadata standards and repositories for your data.
7. A B C D E
1 Group1 Group2
2 Day 0
3 Sodium 139 142
4 Potassium 3.3 4.8
5 Chloride 100 108
6 BUN 18 18
7 Creatine 1.2 1.2
8 Uric acid 5.5* 6.2*
9 Day 7
10 Sodium 140 146
11 Potassium 3.4 5.1
12 Chloride 97 108
S1Sh.cuo
Credit to: Iain Hrynaszkiewicz
Sharing starts with good metadata…
8. A B C D E
1 Group1 Group2
2 Day 0
3 Sodium 139 142
4 Potassium 3.3 4.8
5 Chloride 100 108
6 BUN 18 18
7 Creatine 1.2 1.2
8 Uric acid 5.5* 6.2*
9 Day 7
10 Sodium 140 146
11 Potassium 3.4 5.1
12 Chloride 97 108
S1Sh.cuo Meaningless
column titles
Special characters can
cause text mining
errors
No units
Unhelpful
document name
Undefined
abbreviation
Formatting for
information that
should be in
metadata
Credit to: Iain Hrynaszkiewicz
…. which this isn’t...
9. A B C D E F
1 Parameter Day Control Treated Units P
2 Sodium 0 139 142 mEq/l 0.82
3 Sodium 7 140 146 mEq/l 0.70
4 Sodium 14 140 158 mEq/l 0.03
5 Sodium 21 143 160 mEq/l 0.02
6 Potassium 0 3.3 4.8 mEq/l 0.06
7 Potassium 7 3.4 5.1 mEq/l 0.07
8 Potassium 14 3.7 4.7 mEq/l 0.10
9 Potassium 21 3.1 3.6 mEq/l 0.52
10 Chloride 0 100 108 mEq/l 0.56
11 Chloride 7 97 108 mEq/l 0.68
12 Chloride 14 101 106 mEq/l 0.79
Table_S1_Shanghai_blood.xls
Credit to: Iain Hrynaszkiewicz
…. This is much clearer!
10. Seven week old C57BL/6N mice were treated
with low-fat diet.
Liver was dissected out, hepatocytes prepared…
From natural language to
structured data
11. Age value
Unit
Strain name
Subject of the experiment
Type of diet and
experimental condition
Anatomy part
Seven week old C57BL/6N mice were treated
with low-fat diet.
Liver was dissected out, hepatocytes prepared …
Type of protocol – cell preparation
Type of protocol - sample treatment
Type of protocol – liver preparation
From natural language to
structured data
12. • Data/content standards:
• Structure, enrich and report the description of the
datasets and the experimental context under which they
were produced
• Facilitate the discovery, sharing, understanding and
reuse of datasets
Data has to be structured for sharing
– we need standards
13. de jure de facto
grass-roots
groups
standard
organizations
Nanotechnology Working Group
Community mobilisation to develop
content standards
Formats Terminologies Guidelines
19. Mapping the landscape of ‘standards’ in the life, environmental and
biomedical sciences
Mapping the landscape of ‘standards’ in the life, environmental and
biomedical sciences
1,400 records and growing
What is BioSharing?
A web-based, curated and searchable portal that monitors the development
and evolution of standards, their use in databases and the adoption of both in
data policies, to inform and educate the user community.
20. Mapping the landscape of ‘standards’ in the life, environmental and
biomedical sciences
Mapping the landscape of ‘standards’ in the life, environmental and
biomedical sciences
What is BioSharing?
Launched in 2011, as an evolution of the MIBBI portal (2008-2011)
Manually curated
Community driven
Growing userbase and visibility
1,400 records and growing
34. The International Conference on Systems Biology (ICSB), 22-28 August,2008 Susanna-Assunta
Sansone www.ebi.ac.uk/net-project
Search, filter, and refine using our faceted
search
Search, filter, and refine using our faceted
search
48. BioSharing – what we do
Inform – what’s out there, which databases use
which standards. Map the landscape.
Educate– what databases are recommended by your
funder, or journal of choice, which standards should
you be using, which standards and databases should
you recommend? Explore the landscape.