SlideShare uma empresa Scribd logo
1 de 60
Investigating Term Reuse and Overlap
in Biomedical Ontologies
International Conference on Biomedical Ontology
Lisbon, 27th -30th July 2015
MAU LI K R. K AM D AR , TANI A TUDORACHE A N D MARK A . MUS E N
Are we there yet?
C0011849Diabetes
Mellitus
Diabetes
Mellitus
Unified Medical Language System (UMLS)
SNOMEDCT ICD9CM
C0011849Diabetes
Mellitus
Diabetes
Mellitus
Unified Medical Language System (UMLS)
Open Biomedical Ontologies (OBO) Foundry
SNOMEDCT ICD9CM
Binding to RNA
(GRO#BindingToRNA)
GO:0003723
IRI xref
RNA Binding
(GO:0003723)
Gene Expression
Ontology (GEXO)
Gene Regulation
Ontology (GEXO)
Gene Ontology (GO)
Ghazvinian, Amir, et al. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
OBO Reuse vs Overlap in 2010
Ghazvinian, Amir, et al. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
OBO Reuse vs Overlap in 2010
Same IRI
Ghazvinian, Amir, et al. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
OBO Reuse vs Overlap in 2010
Same IRI
Intent for
Reuse
Ghazvinian, Amir, et al. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
OBO Reuse vs Overlap in 2010
Xref
mapping
Same IRI
Intent for
Reuse
Ghazvinian, Amir, et al. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
OBO Reuse vs Overlap in 2010
September 2009
Ghazvinian, Amir, et al. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
OBO Reuse vs Overlap in 2010
September 2010
Key Findings
Key Findings
 ~3% Term Reuse
 Only popular or upper-
level ontologies reused
 14.4% Term Overlap
Key Findings
 ~3% Term Reuse
 Only popular or upper-
level ontologies reused
 14.4% Term Overlap
 Semantically-similar
terms reused together
 Similarity metric for a
Recommender system
BioPortal Import Plugin
DOG4DAG
Ontofox Web tool
Neurological Disease Ontology
Neurological Disease Ontology
OBI
Reuse of an Ontology
Neurological Disease Ontology
Reuse of Terms
OGMS
Neurological Disease Ontology
NDO
Key Findings
 ~3% Term Reuse
 Only popular or upper-
level ontologies reused
 14.4% Term Overlap
 Semantically-similar
terms reused together
 Similarity metric for a
Recommender system
BioPortal
N-triples dump
Biomedical
Ontologies
Terms, Labels,
xrefs, CUIs
Xref ReuseIRI Reuse CUI Reuse
Clustering Determine
Source Ontology
Term Overlap
Analysis
509 ontologies
377 ontologies
Remove ontology views
5,718,276 class terms
Label
normalization
Source-Target
Ontology pairs
>35% reuse
for ontology
reuse
14.4% Naïve Term Overlap!
• Normalized String Matching on
Term Labels
14.4%
(823621)
156/377 ontologies reuse no terms from other ontologies!
<5% of Terms reused from other Ontologies!
>
IRI Reuse
156/377 ontologies reuse no terms from other ontologies!
<5% of Terms reused from other Ontologies!
>
IRI Reuse
156/377 ontologies reuse no terms from other ontologies!
<5% of Terms reused from other Ontologies!
>
IRI Reuse
315/377 ontologies xref link to no terms from other ontologies!
<5% of Terms reused from other Ontologies!
>
Xref Reuse
263/377 ontologies have no terms reused by other ontologies!
Reuse from a small set of ontologies only!
>
IRI Reuse
286/377 ontologies have no terms xref linked by other ontologies!
Reuse from a small set of ontologies only!
>
Xref Reuse
0-5% of total terms reused explicitly or using
xref, with >150 ontologies showing 0% reuse.
Average Term Reuse ~ 3%
Reuse from a small set of ontologies only with
terms from >250 ontologies never reused
>100% term reuse from some ontologies! Why?
0
10
20
30
40
50
60
70
80
90
100 BFO
GO
IAO
OBI
PATO
CHEBI
CL
NCBITAXON
UO
SO
UBERON
CARO
NCIT
FMA
MP
SNOMEDCT
NumberofOntologiesReusingTerms(#)
Ontologies
>100% terms reused from some ontologies!
xref Reuse (No.
of Ontologies
IRI Reuse (No. of
Ontologies)
0
10
20
30
40
50
60
70
80
90
100 1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
NumberofOntologiesReusingTerms(#)
Ontologies
>100% terms reused from some ontologies!
% of Terms
reused IRIs
% of Terms
reused xref
BFO:10
1/39
… Reuse from a small set of popular or upper-
level ontologies only with terms from >250
ontologies never reused
>100% terms reused w.r.t current version of the
BFO, PATO, CARO, UO, SO ontologies!
Needs rigorous analysis through term overlap …
1
10
100
1000
10000
100000
1000000
ICD10PCS
HCPCS
NCBITAXON
LOINC
MESH
HL7
ICD10CM
OMIM
RXNORM
CPT
PDQ
MEDDRA
ICD9CM
NDDF
ICPC
ICPC2P
MDDB
NDFRT
SNOMEDCT
VANDF
CRISP
RCD
MEDLINEPLUS
SNMI
COSTART
WHO-ART
Procedural Terminologies do not share CUIs!
CUIs shared
0 Terminologies
CUI Reuse
NumberofTerms(LogScale)
1
10
100
1000
10000
100000
1000000
ICD10PCS
HCPCS
NCBITAXON
LOINC
MESH
HL7
ICD10CM
OMIM
RXNORM
CPT
PDQ
MEDDRA
ICD9CM
NDDF
ICPC
ICPC2P
MDDB
NDFRT
SNOMEDCT
VANDF
CRISP
RCD
MEDLINEPLUS
SNMI
COSTART
WHO-ART
Procedural Terminologies do not share CUIs!
CUIs shared
1-5 Terminologies
CUI Reuse
NumberofTerms(LogScale)
1
10
100
1000
10000
100000
1000000
ICD10PCS
HCPCS
NCBITAXON
LOINC
MESH
HL7
ICD10CM
OMIM
RXNORM
CPT
PDQ
MEDDRA
ICD9CM
NDDF
ICPC
ICPC2P
MDDB
NDFRT
SNOMEDCT
VANDF
CRISP
RCD
MEDLINEPLUS
SNMI
COSTART
WHO-ART
Procedural Terminologies do not share CUIs!
CUIs shared
6-10 Terminologies
CUI Reuse
NumberofTerms(LogScale)
1
10
100
1000
10000
100000
1000000
ICD10PCS
HCPCS
NCBITAXON
LOINC
MESH
HL7
ICD10CM
OMIM
RXNORM
CPT
PDQ
MEDDRA
ICD9CM
NDDF
ICPC
ICPC2P
MDDB
NDFRT
SNOMEDCT
VANDF
CRISP
RCD
MEDLINEPLUS
SNMI
COSTART
WHO-ART
Procedural Terminologies do not share CUIs!
CUIs shared
11-15 Terminologies
CUI Reuse
NumberofTerms(LogScale)
1
10
100
1000
10000
100000
1000000
ICD10PCS
HCPCS
NCBITAXON
LOINC
MESH
HL7
ICD10CM
OMIM
RXNORM
CPT
PDQ
MEDDRA
ICD9CM
NDDF
ICPC
ICPC2P
MDDB
NDFRT
SNOMEDCT
VANDF
CRISP
RCD
MEDLINEPLUS
SNMI
COSTART
WHO-ART
Procedural Terminologies do not share CUIs!
CUIs shared
16-20 Terminologies
CUI Reuse
NumberofTerms(LogScale)
1
10
100
1000
10000
100000
1000000
ICD10PCS
HCPCS
NCBITAXON
LOINC
MESH
HL7
ICD10CM
OMIM
RXNORM
CPT
PDQ
MEDDRA
ICD9CM
NDDF
ICPC
ICPC2P
MDDB
NDFRT
SNOMEDCT
VANDF
CRISP
RCD
MEDLINEPLUS
SNMI
COSTART
WHO-ART
Procedural Terminologies do not share CUIs!
CUIs sharedCUI Reuse
NumberofTerms(LogScale)
1
10
100
1000
10000
100000
1000000
ICD10PCS
HCPCS
NCBITAXON
LOINC
MESH
HL7
ICD10CM
OMIM
RXNORM
CPT
PDQ
MEDDRA
ICD9CM
NDDF
ICPC
ICPC2P
MDDB
NDFRT
SNOMEDCT
VANDF
CRISP
RCD
MEDLINEPLUS
SNMI
COSTART
WHO-ART
Procedural Terminologies do not share CUIs!
CUIs sharedCUI Reuse
NumberofTerms(LogScale)
Minimum sharing of CUIs, especially across
UMLS Procedural Terminologies
- ICD10PCS, HCPCS and CPT
Several unique terms introduced as we migrate
from ICD9CM -> ICD10CM, leading to decrease
in Term reuse.
Should there actually be Term Reuse?
Overlap decreases using correct representations!
14.4%
(823621)
• Normalized String Matching on Term Labels
13.2%
(752,176)
• Removing Explicitly Reused Terms
10.8%
(617509)
• Removing Terms Mapped to the same UMLS CUI
1.6%
(93,650)
• Removing almost-similar terms (same identifier
and source ontology but different representation)
Average 3% Term reuse across ontologies using
any method, yet a 14.4% naïve Term overlap!
Term overlap decreases substantially on
removing almost similar terms …
Examples for almost similar terms?
Version 1.0/Version1.1
Subcellular Anatomy Ontology (SAO)
Suggested Ontology for Pharmacogenomics (SOPHARM)
Intent
Different
Versions
BFO
NCIT
Different
Notations
FMA
Different
Namespaces
MESH
SNOMEDCT
Ontology Engineers show an intent for reuse!
Intent
Different
Versions
BFO
NCIT
Different
Notations
FMA
Different
Namespaces
MESH
SNOMEDCT
NCIT:C53037/NCIT:Cerebral_Vein
Cigarette Smoke Exposure (CSEO)
Sage Bionetworks Synapse (SYN)
Ontology Engineers show an intent for reuse!
OBO:FMA_31396
OBO:owlapi/fma#FMA_31396
OBO:owl/FMA#FMA_31396
OBO:fma#Cartilage_of_inferior_surface …
Ontology Engineers show an intent for reuse!
Intent
Different
Versions
BFO
NCIT
Different
Notations
FMA
Different
Namespaces
MESH
SNOMEDCT
http://purl.bioontology.org/ontology/MESH
http://phenomebrowser.net/ontologies/mesh/mesh.owl
Intent
Different
Versions
BFO
NCIT
Different
Notations
FMA
Different
Namespaces
MESH
SNOMEDCT
Ontology Engineers show an intent for reuse!
Intent
Different
Versions
BFO
NCIT
Different
Notations
FMA
Different
Namespaces
MESH
SNOMEDCT
http://ihtsdo.org/snomedct/
http://purl.bioontology.org/ontology/SNOMEDCT
Ontology Engineers show an intent for reuse!
Different versions, notations, namespaces
• >100% Reuse of few source ontologies
• Increase in Term Overlap
Incorrect representations without mappings do
not provide advantages of Term Reuse!
Key Findings
 ~3% Term Reuse
 Only popular or upper-
level ontologies reused
 14.4% Term Overlap
 Semantically-similar
terms reused together
 Similarity metric for a
Recommender system
Onto 1 Onto 2 Onto 3 Onto 4 Onto 5 Onto 6 Onto 7
Term 1 1 1 1 0 0 0 0
Term 2 0 0 0 1 1 0 0
Term 3 0 0 0 0 0 1 1
Term 4 1 1 0 0 1 0 0
Term 5 1 1 1 0 0 0 1
Term 6 0 0 0 1 1 1 0
Term 7 0 0 1 0 1 0 0
Term-
Ontology
Matrix
K-modes
Clustering
Term-Term
Affinity
Matrix
Spectral
Clustering
Understanding how Term Reuse Occurs
Term-
Ontology
Matrix
K-modes
Clustering
Term-Term
Affinity
Matrix
Spectral
Clustering
Understanding how Term Reuse Occurs
Term-
Ontology
Matrix
K-modes
Clustering
Term-Term
Affinity
Matrix
Spectral
Clustering
Understanding how Term Reuse Occurs
• Weighted Similarity Score between Term pairs
– Shared Ontologies
– Jaccard Semantic Similarity Score
– CUI Hierarchy from UMLS Metathesaurus
Semantically-similar terms are reused together!
Semantic Similarity < 0.9
Cluster Size
Semantic Similarity > 0.9
Semantically-similar terms are reused together!
Semantic Similarity > 0.9
Semantically-similar terms are reused together!
Semantic Similarity > 0.9
Semantic-similar terms (Parent-child or siblings)
are reused together …
Similarity Metric and BioPortal can be used to
provide recommendations to ontology
developers through a Web Protégé plugin!
Challenges to Term Reuse
• Substantial term overlap but less than 5% reuse.
• Lexically-similar terms may represent different concepts (e.g.,
anatomical concepts between ZFA and XAO).
• Lexically-different terms may represent same concepts (e.g.
myocardium and cardiac muscle)
• Same terms use different IRI representations, and without explicit
CUI or xref mappings.
• Lack of guidelines and semi-automated tools.
Future Work: WebProtégé Plugin
Term reuse recommendations using
Item-based Collaborative Filtering method.
Two-fold (A Posteriori and User-Centered) Evaluation
GO:0033036
GO:0008104
GO:1902432 GO:1903260
GO:0061472
GO:0090174
GO:0071850
GO:0044770
GO:0044839
GO:0045786
GO:0007050
GO:0044843 GO:1902969 GO:0036226
- Still far from achieving ideal term reuse, beyond upper
level and popular ontologies
- Newer ontologies added in BioPortal
- Without strict guidelines and semi-automated tools,
we will deviate more away …
The Road Ahead …
Acknowledgments
Musen Lab, Stanford
BMI PhD Program, Stanford
US NIH Grants
GM086587
GM103316
maulikrk@stanford.edu
http://stanford.edu/~maulikrk/data/OntologyReuse

Mais conteúdo relacionado

Destaque

Basic Life Support CPR Training by Mrs. Harriette Adams
Basic Life Support CPR Training by Mrs. Harriette AdamsBasic Life Support CPR Training by Mrs. Harriette Adams
Basic Life Support CPR Training by Mrs. Harriette Adams
Atlantic Training, LLC.
 

Destaque (11)

PÕLVNEMISEST TULENEV ÜLALPIDAMISKOHUSTUS JA ELATIS LAPSELE EHK ALIMENDID
PÕLVNEMISEST TULENEV ÜLALPIDAMISKOHUSTUS JA ELATIS LAPSELE EHK ALIMENDIDPÕLVNEMISEST TULENEV ÜLALPIDAMISKOHUSTUS JA ELATIS LAPSELE EHK ALIMENDID
PÕLVNEMISEST TULENEV ÜLALPIDAMISKOHUSTUS JA ELATIS LAPSELE EHK ALIMENDID
 
Master in Finance
Master in FinanceMaster in Finance
Master in Finance
 
OLSR setup
OLSR setup OLSR setup
OLSR setup
 
Hands only CPR
Hands only CPRHands only CPR
Hands only CPR
 
バス列の現状(2016ORFバージョン)
バス列の現状(2016ORFバージョン)バス列の現状(2016ORFバージョン)
バス列の現状(2016ORFバージョン)
 
Open Science and ORCID (in Japanese)
Open Science and ORCID (in Japanese)Open Science and ORCID (in Japanese)
Open Science and ORCID (in Japanese)
 
Excelのどうでもよいtipsの紹介
Excelのどうでもよいtipsの紹介Excelのどうでもよいtipsの紹介
Excelのどうでもよいtipsの紹介
 
Towards Knowledge-Enabled Society
Towards Knowledge-Enabled SocietyTowards Knowledge-Enabled Society
Towards Knowledge-Enabled Society
 
【UDC2015】アイデア 061 tenbin
【UDC2015】アイデア 061 tenbin【UDC2015】アイデア 061 tenbin
【UDC2015】アイデア 061 tenbin
 
Basic Life Support CPR Training by Mrs. Harriette Adams
Basic Life Support CPR Training by Mrs. Harriette AdamsBasic Life Support CPR Training by Mrs. Harriette Adams
Basic Life Support CPR Training by Mrs. Harriette Adams
 
"What Is RUS?" - Requisite Unifying Structure (RUS) - Requisite Technology (2...
"What Is RUS?" - Requisite Unifying Structure (RUS) - Requisite Technology (2..."What Is RUS?" - Requisite Unifying Structure (RUS) - Requisite Technology (2...
"What Is RUS?" - Requisite Unifying Structure (RUS) - Requisite Technology (2...
 

Semelhante a Investigating Term Reuse and Overlap in Biomedical Ontologies

GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...
Neo4j
 
Uberon EBI industry workshop
Uberon EBI industry workshopUberon EBI industry workshop
Uberon EBI industry workshop
Chris Mungall
 
II-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceII-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in Nice
Dr. Haxel Consult
 
Venkatesan bosc2010 onto-toolkit
Venkatesan bosc2010 onto-toolkitVenkatesan bosc2010 onto-toolkit
Venkatesan bosc2010 onto-toolkit
BOSC 2010
 

Semelhante a Investigating Term Reuse and Overlap in Biomedical Ontologies (20)

BMI 201 - Investigating Term Reuse and Overlap in Biomedical Ontologies
BMI 201 - Investigating Term Reuse and Overlap in Biomedical OntologiesBMI 201 - Investigating Term Reuse and Overlap in Biomedical Ontologies
BMI 201 - Investigating Term Reuse and Overlap in Biomedical Ontologies
 
Mikel egana itbam_2010_ogo_system
Mikel egana itbam_2010_ogo_systemMikel egana itbam_2010_ogo_system
Mikel egana itbam_2010_ogo_system
 
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...
 
Introduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental BiologyIntroduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental Biology
 
Chibucos annot go_final
Chibucos annot go_finalChibucos annot go_final
Chibucos annot go_final
 
Ontologies: Necessary, but not sufficient
Ontologies: Necessary, but not sufficientOntologies: Necessary, but not sufficient
Ontologies: Necessary, but not sufficient
 
Uberon EBI industry workshop
Uberon EBI industry workshopUberon EBI industry workshop
Uberon EBI industry workshop
 
Semantic Technologies at FAO
Semantic Technologies at FAOSemantic Technologies at FAO
Semantic Technologies at FAO
 
OBO Foundry
OBO FoundryOBO Foundry
OBO Foundry
 
Ontologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontologyOntologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontology
 
Fao Semantics Related Projects
Fao Semantics Related ProjectsFao Semantics Related Projects
Fao Semantics Related Projects
 
II-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceII-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in Nice
 
NCBI Boot Camp for Beginners Slides
NCBI Boot Camp for Beginners SlidesNCBI Boot Camp for Beginners Slides
NCBI Boot Camp for Beginners Slides
 
ICAR2016 TAIR talk
ICAR2016 TAIR talkICAR2016 TAIR talk
ICAR2016 TAIR talk
 
Venkatesan bosc2010 onto-toolkit
Venkatesan bosc2010 onto-toolkitVenkatesan bosc2010 onto-toolkit
Venkatesan bosc2010 onto-toolkit
 
NCBO BioPortal SPARQL Endpoint - The Quad Economy of a Semantic Web Ontology ...
NCBO BioPortal SPARQL Endpoint - The Quad Economy of a Semantic Web Ontology ...NCBO BioPortal SPARQL Endpoint - The Quad Economy of a Semantic Web Ontology ...
NCBO BioPortal SPARQL Endpoint - The Quad Economy of a Semantic Web Ontology ...
 
Ensembl Browser Workshop
Ensembl Browser WorkshopEnsembl Browser Workshop
Ensembl Browser Workshop
 
Connecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics InstituteConnecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics Institute
 
Semantics of and for the diversity of life:
 Opportunities and perils of tryi...
Semantics of and for the diversity of life:
 Opportunities and perils of tryi...Semantics of and for the diversity of life:
 Opportunities and perils of tryi...
Semantics of and for the diversity of life:
 Opportunities and perils of tryi...
 
SMBE 2015: Expression STRs
SMBE 2015: Expression STRsSMBE 2015: Expression STRs
SMBE 2015: Expression STRs
 

Mais de Maulik Kamdar

Analyzing User Interactions with Biomedical Ontologies: A Visual Perspective
Analyzing User Interactions with Biomedical Ontologies: A Visual PerspectiveAnalyzing User Interactions with Biomedical Ontologies: A Visual Perspective
Analyzing User Interactions with Biomedical Ontologies: A Visual Perspective
Maulik Kamdar
 
ReVeaLD: A user-driven domain-specific interactive search platform for biomed...
ReVeaLD: A user-driven domain-specific interactive search platform for biomed...ReVeaLD: A user-driven domain-specific interactive search platform for biomed...
ReVeaLD: A user-driven domain-specific interactive search platform for biomed...
Maulik Kamdar
 
ReVeaLD: A User-driven Domain Specific Interactive Search Platform for Biomed...
ReVeaLD: A User-driven Domain Specific Interactive Search Platform for Biomed...ReVeaLD: A User-driven Domain Specific Interactive Search Platform for Biomed...
ReVeaLD: A User-driven Domain Specific Interactive Search Platform for Biomed...
Maulik Kamdar
 

Mais de Maulik Kamdar (16)

Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...
Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...
Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...
 
Text Snippets to Corroborate Medical Relations: An Unsupervised Approach usin...
Text Snippets to Corroborate Medical Relations: An Unsupervised Approach usin...Text Snippets to Corroborate Medical Relations: An Unsupervised Approach usin...
Text Snippets to Corroborate Medical Relations: An Unsupervised Approach usin...
 
Invited Talk at NASA Ames Research Center
Invited Talk at NASA Ames Research CenterInvited Talk at NASA Ames Research Center
Invited Talk at NASA Ames Research Center
 
Mechanism-Based Pharmacovigilance Over the Life-Sciences Linked-Open-Data Cloud
Mechanism-Based Pharmacovigilance Over the Life-Sciences Linked-Open-Data CloudMechanism-Based Pharmacovigilance Over the Life-Sciences Linked-Open-Data Cloud
Mechanism-Based Pharmacovigilance Over the Life-Sciences Linked-Open-Data Cloud
 
Analyzing User Interactions with Biomedical Ontologies: A Visual Perspective
Analyzing User Interactions with Biomedical Ontologies: A Visual PerspectiveAnalyzing User Interactions with Biomedical Ontologies: A Visual Perspective
Analyzing User Interactions with Biomedical Ontologies: A Visual Perspective
 
BiOnIC: A Catalog of User Interactions with Biomedical Ontologies
BiOnIC: A Catalog of User Interactions with Biomedical OntologiesBiOnIC: A Catalog of User Interactions with Biomedical Ontologies
BiOnIC: A Catalog of User Interactions with Biomedical Ontologies
 
Preproposal Talk
Preproposal TalkPreproposal Talk
Preproposal Talk
 
Graph Analytics in Pharmacology over the Web of Life Sciences Linked Open Data
Graph Analytics in Pharmacology over the Web of Life Sciences Linked Open DataGraph Analytics in Pharmacology over the Web of Life Sciences Linked Open Data
Graph Analytics in Pharmacology over the Web of Life Sciences Linked Open Data
 
BMI Research in Progress - Thursday talk
BMI Research in Progress - Thursday talkBMI Research in Progress - Thursday talk
BMI Research in Progress - Thursday talk
 
PRISM: A data-driven platform for monitoring mental health
PRISM: A data-driven platform for monitoring mental healthPRISM: A data-driven platform for monitoring mental health
PRISM: A data-driven platform for monitoring mental health
 
Integrating Wearables and User Interaction Patterns to Monitor Mental Health
Integrating Wearables and User Interaction Patterns to Monitor Mental HealthIntegrating Wearables and User Interaction Patterns to Monitor Mental Health
Integrating Wearables and User Interaction Patterns to Monitor Mental Health
 
Current advances to bridge the usability-expressivity gap in biomedical seman...
Current advances to bridge the usability-expressivity gap in biomedical seman...Current advances to bridge the usability-expressivity gap in biomedical seman...
Current advances to bridge the usability-expressivity gap in biomedical seman...
 
GenomeSnip: Fragmenting the Genomic Wheel to augment discovery in cancer rese...
GenomeSnip: Fragmenting the Genomic Wheel to augment discovery in cancer rese...GenomeSnip: Fragmenting the Genomic Wheel to augment discovery in cancer rese...
GenomeSnip: Fragmenting the Genomic Wheel to augment discovery in cancer rese...
 
Isolation and characterization of an extracellular antifungal protein from an...
Isolation and characterization of an extracellular antifungal protein from an...Isolation and characterization of an extracellular antifungal protein from an...
Isolation and characterization of an extracellular antifungal protein from an...
 
ReVeaLD: A user-driven domain-specific interactive search platform for biomed...
ReVeaLD: A user-driven domain-specific interactive search platform for biomed...ReVeaLD: A user-driven domain-specific interactive search platform for biomed...
ReVeaLD: A user-driven domain-specific interactive search platform for biomed...
 
ReVeaLD: A User-driven Domain Specific Interactive Search Platform for Biomed...
ReVeaLD: A User-driven Domain Specific Interactive Search Platform for Biomed...ReVeaLD: A User-driven Domain Specific Interactive Search Platform for Biomed...
ReVeaLD: A User-driven Domain Specific Interactive Search Platform for Biomed...
 

Último

➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
amitlee9823
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
amitlee9823
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
AroojKhan71
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
only4webmaster01
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
amitlee9823
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 

Último (20)

April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 

Investigating Term Reuse and Overlap in Biomedical Ontologies

Notas do Editor

  1. To support the interoperability, the Unified Medical Language System (UMLS) uses the notion of a Concept Unique Identifier (CUI) to map terms with similar meaning in different terminologies
  2. To support the interoperability, the Unified Medical Language System (UMLS) uses the notion of a Concept Unique Identifier (CUI) to map terms with similar meaning in different terminologies
  3. Ghazvinian, Amir, Natalya Fridman Noy, and Mark A. Musen. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
  4. Ghazvinian, Amir, Natalya Fridman Noy, and Mark A. Musen. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
  5. Ghazvinian, Amir, Natalya Fridman Noy, and Mark A. Musen. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
  6. Ghazvinian, Amir, Natalya Fridman Noy, and Mark A. Musen. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
  7. Ghazvinian, Amir, Natalya Fridman Noy, and Mark A. Musen. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
  8. Ghazvinian, Amir, Natalya Fridman Noy, and Mark A. Musen. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
  9. Contributions: A set of descriptive statistics describing the level of reuse in biomedical ontologies stored in BioPortal, An interactive visualization technique for displaying the reuse dependencies among biomedical ontologies A clustering method to help identify patterns of reuse using semantic similarity between the terms A discussion on the state and challenges of reuse in biomedical ontologies and development of a semi-automated tool enabling reuse
  10. Contributions: A set of descriptive statistics describing the level of reuse in biomedical ontologies stored in BioPortal, An interactive visualization technique for displaying the reuse dependencies among biomedical ontologies A clustering method to help identify patterns of reuse using semantic similarity between the terms A discussion on the state and challenges of reuse in biomedical ontologies and development of a semi-automated tool enabling reuse
  11. Contributions: A set of descriptive statistics describing the level of reuse in biomedical ontologies stored in BioPortal, An interactive visualization technique for displaying the reuse dependencies among biomedical ontologies A clustering method to help identify patterns of reuse using semantic similarity between the terms A discussion on the state and challenges of reuse in biomedical ontologies and development of a semi-automated tool enabling reuse
  12. Dresden Ontology Generator for Directed Acyclic Graphs MIREOT Principles
  13. Dresden Ontology Generator for Directed Acyclic Graphs MIREOT Principles
  14. Dresden Ontology Generator for Directed Acyclic Graphs MIREOT Principles
  15. Contributions: A set of descriptive statistics describing the level of reuse in biomedical ontologies stored in BioPortal, An interactive visualization technique for displaying the reuse dependencies among biomedical ontologies A clustering method to help identify patterns of reuse using semantic similarity between the terms A discussion on the state and challenges of reuse in biomedical ontologies and development of a semi-automated tool enabling reuse
  16. 175,347 terms (3.1%) were explicitly shared using the same IRIs. Source ontology for all but 37 terms, whose ontologies were not present in BioPortal (e.g., owl:Thing and time#datetimedescription). After removing the imported ontology terms (term reuse > 35% threshold), only 59,618 terms (1.1%) were actually reused We found a total of 4,370,350 xref axioms across all the BioPortal ontologies. After extracting xrefs, which assert equivalence between BioPortal ontology terms, we found 171,069 ‘outlinking’ terms (3.9%) xref-linked to 386,442 `inlinking' terms (8.84%)
  17. 175,347 terms (3.1%) were explicitly shared using the same IRIs. Source ontology for all but 37 terms, whose ontologies were not present in BioPortal (e.g., owl:Thing and time#datetimedescription). After removing the imported ontology terms (term reuse > 35% threshold), only 59,618 terms (1.1%) were actually reused We found a total of 4,370,350 xref axioms across all the BioPortal ontologies. After extracting xrefs, which assert equivalence between BioPortal ontology terms, we found 171,069 ‘outlinking’ terms (3.9%) xref-linked to 386,442 `inlinking' terms (8.84%)
  18. 175,347 terms (3.1%) were explicitly shared using the same IRIs. Source ontology for all but 37 terms, whose ontologies were not present in BioPortal (e.g., owl:Thing and time#datetimedescription). After removing the imported ontology terms (term reuse > 35% threshold), only 59,618 terms (1.1%) were actually reused We found a total of 4,370,350 xref axioms across all the BioPortal ontologies. After extracting xrefs, which assert equivalence between BioPortal ontology terms, we found 171,069 ‘outlinking’ terms (3.9%) xref-linked to 386,442 `inlinking' terms (8.84%)
  19. 175,347 terms (3.1%) were explicitly shared using the same IRIs. Source ontology for all but 37 terms, whose ontologies were not present in BioPortal (e.g., owl:Thing and time#datetimedescription). After removing the imported ontology terms (term reuse > 35% threshold), only 59,618 terms (1.1%) were actually reused We found a total of 4,370,350 xref axioms across all the BioPortal ontologies. After extracting xrefs, which assert equivalence between BioPortal ontology terms, we found 171,069 ‘outlinking’ terms (3.9%) xref-linked to 386,442 `inlinking' terms (8.84%)
  20. BFO, PATO(Phenotypic Quality Ontology) CARO (Core anatomy reference ontology), UO (Units of Measurement) and SO (Sequence and Cell Feature Types ontology)
  21. BFO, PATO(Phenotypic Quality Ontology) CARO (Core anatomy reference ontology), UO (Units of Measurement) and SO (Sequence and Cell Feature Types ontology)
  22. Healthcare Common Procedure Coding System (HCPCS) Current Procedural Terminology
  23. Healthcare Common Procedure Coding System (HCPCS) Current Procedural Terminology
  24. Healthcare Common Procedure Coding System (HCPCS) Current Procedural Terminology
  25. Healthcare Common Procedure Coding System (HCPCS) Current Procedural Terminology
  26. Healthcare Common Procedure Coding System (HCPCS) Current Procedural Terminology
  27. Healthcare Common Procedure Coding System (HCPCS) Current Procedural Terminology
  28. Healthcare Common Procedure Coding System (HCPCS) Current Procedural Terminology
  29. Executing normalised string matching on the term labels, we found a term overlap of 823,621 shared term labels (14.4%). Removing explicitly-reused terms, list reduced to 752,176 labels (13.2%). Removing terms mapped to the same UMLS CUI, list reduced to 617,509 labels (10.8%). On extracting the resource identifier from each term IRI, we removed terms with almost similar term IRIs (same identifier and source ontology, but a different or incorrect representation) List reduced to 93,650 term labels (1.6%). The last step does not represent actual reuse between ontologies, but rather that ontology developers showed an intention to reuse terms, but used different and sometimes incorrect term representations (discussed below)
  30. SO (
  31. SO (
  32. SO (
  33. SO (
  34. SO (
  35. Contributions: A set of descriptive statistics describing the level of reuse in biomedical ontologies stored in BioPortal, An interactive visualization technique for displaying the reuse dependencies among biomedical ontologies A clustering method to help identify patterns of reuse using semantic similarity between the terms A discussion on the state and challenges of reuse in biomedical ontologies and development of a semi-automated tool enabling reuse
  36. Term-ontology matrix. The rows contain the explicitly-reused terms and the columns contain the ontology in which the term appears. Sparse K-means algorithm with the Gap-Estimate method (K=6) For each pair of terms in each cluster, we compute similarity scores. Use spectral clustering method with the term-term affinity matrix.
  37. Term-ontology matrix. The rows contain the explicitly-reused terms and the columns contain the ontology in which the term appears. Sparse K-means algorithm with the Gap-Estimate method (K=6) For each pair of terms in each cluster, we compute similarity scores. Use spectral clustering method with the term-term affinity matrix.
  38. Term-ontology matrix. The rows contain the explicitly-reused terms and the columns contain the ontology in which the term appears. Sparse K-means algorithm with the Gap-Estimate method (K=6) For each pair of terms in each cluster, we compute similarity scores. Use spectral clustering method with the term-term affinity matrix.
  39. Reuse dependencies could guide term reuse based on the structure of ontologies in related domains. Identifying reuse patterns and providing personalized recommendations could help increase term reuse.
  40. Item-based Collaborative Filtering Method (used by Amazon) to provide term reuse recommendations to users through a Web Protégé Plugin, and also allow automated updating. Two-fold Evaluation a posteriori: check if the term-reuse recommendations match those actually reused by users, as analyzed from the logs user-centered: monitoring term reuse when developers build an ontology combining existing ontologies, and surveys