SlideShare a Scribd company logo
1 of 28
Semantic Web Development for
Traditional Chinese Medicine


Tong Yu, Zhejiang University. China.
July. 15th, 2008, IAAI, Chicago, Il.
Outline

• Overview of TCM Semantic Web
• Ontology Engineering and Reuse
• Semantic Mapping and Integration
• Semantic Query, Search and
  Navigation
• Semantic Graph Mining
• Summary
The Semantic Web:
            “A Giant Graph of Things”
• Based on the Internet and the Web
• Formal Semantics
  o Use URIs as names for things
  o RDF information about things
    available through HTTP URIs
  o Use RDF statements for
    semantic links between things
• Global network of databases
Project Background

• Preservation and
  Modernization of
  TCM
• Integrative Medicine
• Connecting the data
• A Student Project
The ultimate vision:
Make a connection between TCM and modern medicine
User Scenario
TCM Ontology Platform

• Domain Categorization
  o   The current TCM ontology contains 15 major
      categories for each sub-domain.
• Ontology Structure
  o   A typing system as a concept hierarchy
  o   A semantic network defining the associations between concepts
• Scale
  o   More than 20,000 classes and 100,000 instances
      defined in the current ontology
• Access Control
  o   layered privilege mechanism that defines users as
      reader, editor, checker and administrator.
• Service
  o   Web APIs for ontology-based applications.
The 15 Categories Defined
in the TCM Ontology
Ajax-based Ontology
Viewer and Editor
Ontology Fusion and
Reuse
Visualized Mapper:
An Eclipse Plugin
Visualized Mapper:
                  The Ajax-based Tool




•   http://ec2-67-202-22-44.compute-1.amazonaws.com:8890/demo/mapper/
Semantic Search Portal
Semantic Graph Mining

• We envision that intelligent agents could work on
  the Semantic Web of structured data, and assist
  their masters to solve problems, who can
  o   discover important Web resources
  o   discern latent semantic associations
  o   interpret interesting graph patterns.
• Existing methods of data mining, especially
  graph mining, can be adopted to implement
  these intelligent agents.
• We propose a methodology, called Semantic
  Graph Mining (SGM), for building agents that
  discover knowledge on the Semantic Web.
TCM Semantic Graph
The Network of Herbs
The Process for Analyzing
         the Network of Herbs

– Data Modeling
– Data
  Transformation &
  Integration :
– Entity
  Disambiguation
– Interaction
  Identification
– Network Mapping
– Network Analysis
Semantic Graph
                 Resource Importance
• the in-degree centrality CI of a resource is measured by the
  weighted sum of statements with the resource as object, and the
  out-degree centrality is measured by the weighted sum of
  statements with the resource as subject.
Semantic Graph
                  Resource Importance
• The Closeness Centrality of a resource r is defined as the inverse of
  the sum of the distance from r to all other resources.
Semantic Graph
                  Resource Importance
• The Betweenness Centrality of a resource r is defined as the ratio of
  shortest paths across the resource in the graph.
Frequent Semantic
      Subgraph
SG1          SG2




             SG5      SG4




SG3
Frequent Semantic
Subgraph
Pattern Interpretation
Interactive Mining of TCM
Knowledge
Conclusion

• We took the first systematic approach to leverage the
  progress of Biomedical Informatics to address the
  modernization of TCM.
• Domain experts evaluate the platform’s major technical
  features as original and productive in Drug Safety and
  Efficacy analysis.
• This case study demonstrates the Semantic Web’s
  advantages in representation, integration, and
  discovery of knowledge with complex domain models.
• Contributes to the Preservation and Modernization of
  TCM as intangible cultural heritage.
Reference

• TCM Ontology Engineering and Reuse
   o Y. Mao, et al. Dynamic Sub-Ontology Evolution for Traditional Chinese
     Medicine Web Ontology. Journal of Biomedical Informatics, 2008 (In
     progress)
   o Y. Mao et al. Sub-Ontology Based Resource Management for Web-based
     e-Learning. IEEE Transactions on Knowledge and Data Engineering, 2008
     (In Progress)
• Data Mapping and Integration
   o Zhao-hui Wu, Hua-jun Chen. 2008. Semantic Grid:Model,
     Methodology,and Applications (Monograph). Co-published by Zhejiang
     University Press and Springer-Verlag GmbH.
   o Huajun Chen et al. RDF/RDFS-based Relational Database Integration.
     ICDE 2006
   o Huajun Chen et al.From Legacy Relational Databases to the Semantic
     Web: an In-Use Application for Traditional Chinese Medicine. ISWC 2006.
Reference

• Semantic Search, Query, and Navigation
   o Huajun Chen et al. Towards semantic e-science for traditional chinese
     medicine. BMC Bioinformatics, 8(Suppl 3):56, 2007.
• Knowledge Discovery for TCM
   o Yi Feng et al. Knowledge discovery in traditional Chinese medicine: State
     of the art and perspectives, AI in Medicine, 38(3): 219-236, 2006.
   o Xuezhong Zhou et al. Integrative mining of traditional Chinese medicine
     literature and MEDLINE for functional gene networks. AI in Medicine
     (2007) 41, 87—104.
• Semantic Graph Mining
   o Tong Yu et al. Semantic Graph Mining for Biomedical Complex Network
     Analysis. WWW ’08 Workshops: HCLS.
   o Huajun Chen et al. Semantic Graph Mining for Biomedical Complex
     Network Analysis. Brief. In Bioinformatics ( In progress).
Thanks for your time!

More Related Content

What's hot

Fairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesFairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesPistoia Alliance
 
Standardization of the HIPC Data Templates: The Story So Far
Standardization of the HIPC Data Templates: The Story So FarStandardization of the HIPC Data Templates: The Story So Far
Standardization of the HIPC Data Templates: The Story So FarAhmad C. Bukhari
 
Research on ontology based information retrieval techniques
Research on ontology based information retrieval techniquesResearch on ontology based information retrieval techniques
Research on ontology based information retrieval techniquesKausar Mukadam
 
Presentation_euroCRIS_ES
Presentation_euroCRIS_ESPresentation_euroCRIS_ES
Presentation_euroCRIS_ESEd Simons
 
AnIML: A New Analytical Data Standard
AnIML: A New Analytical Data StandardAnIML: A New Analytical Data Standard
AnIML: A New Analytical Data StandardStuart Chalk
 
Evolution and state-of-the art of Altmetric research: Insights from network a...
Evolution and state-of-the art of Altmetric research: Insights from network a...Evolution and state-of-the art of Altmetric research: Insights from network a...
Evolution and state-of-the art of Altmetric research: Insights from network a...Aravind Sesagiri Raamkumar
 
The need for a transparent data supply chain
The need for a transparent data supply chainThe need for a transparent data supply chain
The need for a transparent data supply chainPaul Groth
 
Bibliometric - MIT MetaResources
Bibliometric - MIT MetaResourcesBibliometric - MIT MetaResources
Bibliometric - MIT MetaResourcesMicah Altman
 
Indexing Automated Vs Automatic Galvan1
Indexing Automated Vs Automatic   Galvan1Indexing Automated Vs Automatic   Galvan1
Indexing Automated Vs Automatic Galvan1CorinaF
 
Clinical modelling with openEHR Archetypes
Clinical modelling with openEHR ArchetypesClinical modelling with openEHR Archetypes
Clinical modelling with openEHR ArchetypesKoray Atalag
 
Genome science intermine
Genome science intermineGenome science intermine
Genome science intermineELIXIR UK
 
Multidimensioal database
Multidimensioal  databaseMultidimensioal  database
Multidimensioal databaseTPO TPO
 
Linkages to EHRs and Related Standards. What can we learn from the Parallel U...
Linkages to EHRs and Related Standards. What can we learn from the Parallel U...Linkages to EHRs and Related Standards. What can we learn from the Parallel U...
Linkages to EHRs and Related Standards. What can we learn from the Parallel U...Koray Atalag
 
KDD18- contex-aware academic collaboration recommendation
KDD18- contex-aware academic collaboration recommendationKDD18- contex-aware academic collaboration recommendation
KDD18- contex-aware academic collaboration recommendationMohsenFarahani8
 
BMS Literature Searching
BMS Literature SearchingBMS Literature Searching
BMS Literature SearchingJamie Halstead
 
Annotating research resources with rrid’s
Annotating research resources with rrid’sAnnotating research resources with rrid’s
Annotating research resources with rrid’sMaryann Martone
 

What's hot (19)

Fairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesFairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matrices
 
Text mining
Text miningText mining
Text mining
 
Standardization of the HIPC Data Templates: The Story So Far
Standardization of the HIPC Data Templates: The Story So FarStandardization of the HIPC Data Templates: The Story So Far
Standardization of the HIPC Data Templates: The Story So Far
 
Research on ontology based information retrieval techniques
Research on ontology based information retrieval techniquesResearch on ontology based information retrieval techniques
Research on ontology based information retrieval techniques
 
Presentation_euroCRIS_ES
Presentation_euroCRIS_ESPresentation_euroCRIS_ES
Presentation_euroCRIS_ES
 
Jonathan Breeze, Symplectic
Jonathan Breeze, SymplecticJonathan Breeze, Symplectic
Jonathan Breeze, Symplectic
 
AnIML: A New Analytical Data Standard
AnIML: A New Analytical Data StandardAnIML: A New Analytical Data Standard
AnIML: A New Analytical Data Standard
 
Evolution and state-of-the art of Altmetric research: Insights from network a...
Evolution and state-of-the art of Altmetric research: Insights from network a...Evolution and state-of-the art of Altmetric research: Insights from network a...
Evolution and state-of-the art of Altmetric research: Insights from network a...
 
The need for a transparent data supply chain
The need for a transparent data supply chainThe need for a transparent data supply chain
The need for a transparent data supply chain
 
Bibliometric - MIT MetaResources
Bibliometric - MIT MetaResourcesBibliometric - MIT MetaResources
Bibliometric - MIT MetaResources
 
Indexing Automated Vs Automatic Galvan1
Indexing Automated Vs Automatic   Galvan1Indexing Automated Vs Automatic   Galvan1
Indexing Automated Vs Automatic Galvan1
 
Clinical modelling with openEHR Archetypes
Clinical modelling with openEHR ArchetypesClinical modelling with openEHR Archetypes
Clinical modelling with openEHR Archetypes
 
Ir 01
Ir   01Ir   01
Ir 01
 
Genome science intermine
Genome science intermineGenome science intermine
Genome science intermine
 
Multidimensioal database
Multidimensioal  databaseMultidimensioal  database
Multidimensioal database
 
Linkages to EHRs and Related Standards. What can we learn from the Parallel U...
Linkages to EHRs and Related Standards. What can we learn from the Parallel U...Linkages to EHRs and Related Standards. What can we learn from the Parallel U...
Linkages to EHRs and Related Standards. What can we learn from the Parallel U...
 
KDD18- contex-aware academic collaboration recommendation
KDD18- contex-aware academic collaboration recommendationKDD18- contex-aware academic collaboration recommendation
KDD18- contex-aware academic collaboration recommendation
 
BMS Literature Searching
BMS Literature SearchingBMS Literature Searching
BMS Literature Searching
 
Annotating research resources with rrid’s
Annotating research resources with rrid’sAnnotating research resources with rrid’s
Annotating research resources with rrid’s
 

Similar to Semantic Web Development for Traditional Chinese Medicine

FedCentric_Presentation
FedCentric_PresentationFedCentric_Presentation
FedCentric_PresentationYatpang Cheung
 
INTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGY
INTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGYINTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGY
INTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGYcscpconf
 
Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Lucy McKenna
 
2015 GU-ICBI Poster (third printing)
2015 GU-ICBI Poster (third printing)2015 GU-ICBI Poster (third printing)
2015 GU-ICBI Poster (third printing)Michael Atkins
 
UNIT - 5: Data Warehousing and Data Mining
UNIT - 5: Data Warehousing and Data MiningUNIT - 5: Data Warehousing and Data Mining
UNIT - 5: Data Warehousing and Data MiningNandakumar P
 
Knowledge Management in the AI Driven Scintific System
Knowledge Management in the AI Driven Scintific SystemKnowledge Management in the AI Driven Scintific System
Knowledge Management in the AI Driven Scintific SystemSubhasis Dasgupta
 
From Semantic Grid To Knowledge Service
From Semantic Grid To Knowledge ServiceFrom Semantic Grid To Knowledge Service
From Semantic Grid To Knowledge Serviceguest10cfd4d
 
Data-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystemData-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystemMaryann Martone
 
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Amit Sheth
 
An Improved Mining Of Biomedical Data From Web Documents Using Clustering
An Improved Mining Of Biomedical Data From Web Documents Using ClusteringAn Improved Mining Of Biomedical Data From Web Documents Using Clustering
An Improved Mining Of Biomedical Data From Web Documents Using ClusteringKelly Lipiec
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...William Gunn
 
Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003robertstevens65
 
Semantic Search of E-Learning Documents Using Ontology Based System
Semantic Search of E-Learning Documents Using Ontology Based SystemSemantic Search of E-Learning Documents Using Ontology Based System
Semantic Search of E-Learning Documents Using Ontology Based Systemijcnes
 
Enabling knowledge management in the Agronomic Domain
Enabling knowledge management in the Agronomic DomainEnabling knowledge management in the Agronomic Domain
Enabling knowledge management in the Agronomic DomainPierre Larmande
 
Semantic Data Retrieval: Search, Ranking, and Summarization
Semantic Data Retrieval: Search, Ranking, and SummarizationSemantic Data Retrieval: Search, Ranking, and Summarization
Semantic Data Retrieval: Search, Ranking, and SummarizationGong Cheng
 

Similar to Semantic Web Development for Traditional Chinese Medicine (20)

FedCentric_Presentation
FedCentric_PresentationFedCentric_Presentation
FedCentric_Presentation
 
INTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGY
INTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGYINTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGY
INTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGY
 
Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...
 
Semantic Technologies for Big Sciences including Astrophysics
Semantic Technologies for Big Sciences including AstrophysicsSemantic Technologies for Big Sciences including Astrophysics
Semantic Technologies for Big Sciences including Astrophysics
 
2015 GU-ICBI Poster (third printing)
2015 GU-ICBI Poster (third printing)2015 GU-ICBI Poster (third printing)
2015 GU-ICBI Poster (third printing)
 
UNIT - 5: Data Warehousing and Data Mining
UNIT - 5: Data Warehousing and Data MiningUNIT - 5: Data Warehousing and Data Mining
UNIT - 5: Data Warehousing and Data Mining
 
Knowledge Management in the AI Driven Scintific System
Knowledge Management in the AI Driven Scintific SystemKnowledge Management in the AI Driven Scintific System
Knowledge Management in the AI Driven Scintific System
 
From Semantic Grid To Knowledge Service
From Semantic Grid To Knowledge ServiceFrom Semantic Grid To Knowledge Service
From Semantic Grid To Knowledge Service
 
Data-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystemData-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystem
 
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
 
M045067275
M045067275M045067275
M045067275
 
An Improved Mining Of Biomedical Data From Web Documents Using Clustering
An Improved Mining Of Biomedical Data From Web Documents Using ClusteringAn Improved Mining Of Biomedical Data From Web Documents Using Clustering
An Improved Mining Of Biomedical Data From Web Documents Using Clustering
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
 
Neuroscience as networked science
Neuroscience as networked scienceNeuroscience as networked science
Neuroscience as networked science
 
Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003
 
Semantic Search of E-Learning Documents Using Ontology Based System
Semantic Search of E-Learning Documents Using Ontology Based SystemSemantic Search of E-Learning Documents Using Ontology Based System
Semantic Search of E-Learning Documents Using Ontology Based System
 
Enabling knowledge management in the Agronomic Domain
Enabling knowledge management in the Agronomic DomainEnabling knowledge management in the Agronomic Domain
Enabling knowledge management in the Agronomic Domain
 
Martone grethe
Martone gretheMartone grethe
Martone grethe
 
Presentationonline
PresentationonlinePresentationonline
Presentationonline
 
Semantic Data Retrieval: Search, Ranking, and Summarization
Semantic Data Retrieval: Search, Ranking, and SummarizationSemantic Data Retrieval: Search, Ranking, and Summarization
Semantic Data Retrieval: Search, Ranking, and Summarization
 

Semantic Web Development for Traditional Chinese Medicine

  • 1. Semantic Web Development for Traditional Chinese Medicine Tong Yu, Zhejiang University. China. July. 15th, 2008, IAAI, Chicago, Il.
  • 2. Outline • Overview of TCM Semantic Web • Ontology Engineering and Reuse • Semantic Mapping and Integration • Semantic Query, Search and Navigation • Semantic Graph Mining • Summary
  • 3. The Semantic Web: “A Giant Graph of Things” • Based on the Internet and the Web • Formal Semantics o Use URIs as names for things o RDF information about things available through HTTP URIs o Use RDF statements for semantic links between things • Global network of databases
  • 4. Project Background • Preservation and Modernization of TCM • Integrative Medicine • Connecting the data • A Student Project
  • 5. The ultimate vision: Make a connection between TCM and modern medicine
  • 7. TCM Ontology Platform • Domain Categorization o The current TCM ontology contains 15 major categories for each sub-domain. • Ontology Structure o A typing system as a concept hierarchy o A semantic network defining the associations between concepts • Scale o More than 20,000 classes and 100,000 instances defined in the current ontology • Access Control o layered privilege mechanism that defines users as reader, editor, checker and administrator. • Service o Web APIs for ontology-based applications.
  • 8. The 15 Categories Defined in the TCM Ontology
  • 12. Visualized Mapper: The Ajax-based Tool • http://ec2-67-202-22-44.compute-1.amazonaws.com:8890/demo/mapper/
  • 14. Semantic Graph Mining • We envision that intelligent agents could work on the Semantic Web of structured data, and assist their masters to solve problems, who can o discover important Web resources o discern latent semantic associations o interpret interesting graph patterns. • Existing methods of data mining, especially graph mining, can be adopted to implement these intelligent agents. • We propose a methodology, called Semantic Graph Mining (SGM), for building agents that discover knowledge on the Semantic Web.
  • 16. The Network of Herbs
  • 17. The Process for Analyzing the Network of Herbs – Data Modeling – Data Transformation & Integration : – Entity Disambiguation – Interaction Identification – Network Mapping – Network Analysis
  • 18. Semantic Graph Resource Importance • the in-degree centrality CI of a resource is measured by the weighted sum of statements with the resource as object, and the out-degree centrality is measured by the weighted sum of statements with the resource as subject.
  • 19. Semantic Graph Resource Importance • The Closeness Centrality of a resource r is defined as the inverse of the sum of the distance from r to all other resources.
  • 20. Semantic Graph Resource Importance • The Betweenness Centrality of a resource r is defined as the ratio of shortest paths across the resource in the graph.
  • 21. Frequent Semantic Subgraph SG1 SG2 SG5 SG4 SG3
  • 24. Interactive Mining of TCM Knowledge
  • 25. Conclusion • We took the first systematic approach to leverage the progress of Biomedical Informatics to address the modernization of TCM. • Domain experts evaluate the platform’s major technical features as original and productive in Drug Safety and Efficacy analysis. • This case study demonstrates the Semantic Web’s advantages in representation, integration, and discovery of knowledge with complex domain models. • Contributes to the Preservation and Modernization of TCM as intangible cultural heritage.
  • 26. Reference • TCM Ontology Engineering and Reuse o Y. Mao, et al. Dynamic Sub-Ontology Evolution for Traditional Chinese Medicine Web Ontology. Journal of Biomedical Informatics, 2008 (In progress) o Y. Mao et al. Sub-Ontology Based Resource Management for Web-based e-Learning. IEEE Transactions on Knowledge and Data Engineering, 2008 (In Progress) • Data Mapping and Integration o Zhao-hui Wu, Hua-jun Chen. 2008. Semantic Grid:Model, Methodology,and Applications (Monograph). Co-published by Zhejiang University Press and Springer-Verlag GmbH. o Huajun Chen et al. RDF/RDFS-based Relational Database Integration. ICDE 2006 o Huajun Chen et al.From Legacy Relational Databases to the Semantic Web: an In-Use Application for Traditional Chinese Medicine. ISWC 2006.
  • 27. Reference • Semantic Search, Query, and Navigation o Huajun Chen et al. Towards semantic e-science for traditional chinese medicine. BMC Bioinformatics, 8(Suppl 3):56, 2007. • Knowledge Discovery for TCM o Yi Feng et al. Knowledge discovery in traditional Chinese medicine: State of the art and perspectives, AI in Medicine, 38(3): 219-236, 2006. o Xuezhong Zhou et al. Integrative mining of traditional Chinese medicine literature and MEDLINE for functional gene networks. AI in Medicine (2007) 41, 87—104. • Semantic Graph Mining o Tong Yu et al. Semantic Graph Mining for Biomedical Complex Network Analysis. WWW ’08 Workshops: HCLS. o Huajun Chen et al. Semantic Graph Mining for Biomedical Complex Network Analysis. Brief. In Bioinformatics ( In progress).