Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017
1. Susanna-Assunta Sansone,
Associate Director, Oxford e-Research Centre,
University of Oxford, UK
dx.doi.org/10.6084/m9.figshare.4055496.v1
@biosharing
bioCADDIE – DATS and CDEs Workshop, Bethesda, 8 May 2017
3. Minimum information reporting
requirements, checklists
o Report the same core,
essential information
o e.g. MIAME guidelines
Controlled vocabularies, taxonomies,
thesauri, ontologies etc.
o Unambiguous identification and
definition of concepts
o e.g. Gene Ontology
Conceptual model, schema,
exchange formats etc
o Define the structure and
interrelation of information,
and the transmission format
o e.g. FASTA Formats Terminologies Guidelines
Types of content standards
Common
Data
Elements
4. de jure de facto
grass-roots
groups
standard
organizations
Nanotechnology Working Group
Formats Terminologies Guidelines
Community-driven efforts, just few examples
5. Formats Terminologies Guidelines
224
115
500+
source source
source
MIAME
MIRIAM
MIQAS
MIX
MIGEN
ARRIVE
MIAPE
MIASE
MIQE
MISFISHIE….
REMARK
CONSORT
SRAxml
SOFT FASTA
DICOM
MzML
SBRML
SEDML…
GELML
ISA
CML
MITAB
AAO
CHEBIOBI
PATO ENVO
MOD
BTO
IDO…
TEDDY
PRO
XAO
DO
VO
Content standards in numbers
9. Data policies by
funders, journals and
other organizations
Content standards
Formats Terminologies Guidelines
Map this complex and evolving landscape
Databases
10. Data policies by
funders, journals and
other organizations
Databases
Content standards
Formats Terminologies Guidelines
Using indicators to describe ‘status’
Ready for use, implementation, or recommendation
In development
Status uncertain
Deprecated as subsumed or superseded
All records are manually curated
in-house and verified by the
community behind each resource
18. Technologically-delineated
views of the world
Biologically-delineated
views of the world
Generic features (‘common core’)
- description of source biomaterial
- experimental design components
Arrays
Scanning Arrays &
Scanning
Columns
Gels
MS MS
FTIR
NMR
Columns
transcriptomics
proteomics
metabolomics
plant biology
epidemiology
microbiology
Duplications & lack of interoperability among standards
19. Arrays
Scanning Arrays &
Scanning
Columns
Gels
MS MS
FTIR
NMR
Columns
transcriptomics
proteomics
metabolomics
plant biology
epidemiology
microbiology
Hard to use them in combinations, e.g. to represent:
Proteomics-based gut microbiota profiling
Proteomics and metabolomics based gut
microbiota profiling
20. Arrays
Scanning Arrays &
Scanning
Columns
Gels
MS MS
FTIR
NMR
Columns
transcriptomics
proteomics
metabolomics
plant biology
epidemiology
microbiology
Enhancing modularization
Proteomics-based gut microbiota profiling
Proteomics and metabolomics based gut
microbiota profiling
21. Arrays
Scanning Arrays &
Scanning
Columns
Gels
MS MS
FTIR
NMR
Columns
transcriptomics
proteomics
metabolomics
plant biology
epidemiology
microbiology
Proteomics-based gut microbiota profiling
Proteomics and metabolomics based gut
microbiota profiling
Enhancing modularization