SlideShare uma empresa Scribd logo
1 de 35
Baixar para ler offline
Using Text to Build Semantics
Networks for Pharmacogenomics


                  George Karystianis


   Adrien Coulet, Nigam Shah, Yael Garten, Mark Musen, Russ B. Altman

                 Journal of Biomedical informatics (2010)
Motivation
●   Manually crafted rules to define relationships
    between entities.
        –   Limited scope domains.
●   Pharmacogenomics.
        –   Semantic complexity.
●   Enhance the PharmaGKB.
●   Large size of literature.
●   NLP techniques promising.
                                                 2
Aim
●   Automatic relationship extraction.
●   Entity mapping in a schema.
        –   Semantic network structure.
●   Curation of PGx knowledge.
●   Resource for knowledge discovery.



                                          3
However...



             4
What is the meaning of
Pharmacogenomics?




                         5
Pharmacogenomics (1)


Pharmaco       Genomics       PGx

 Φάρμακο         Γίνομαι




                                    6
Pharmacogenomics (2)
●   How genetic variation influences drug
    response in patients.
●   Most of this knowledge presented in binary
    relationships.

                        R(a,b)


         Relationship     Subject   Object
                                                 7
Is This Something New?
●   Co-occurrence approach:      Complex relationship
       –   Pharmexpresso.        semantics.

       –   Tri-co-occurrences.   Manual relationship
                                 evaluation.




●   Syntactic parser approach:   Explicit relationship
                                 identification.
       –   OpenDMAP.
                                 Large pattern sets.
       –   Vocabularies.
                                 Stable ontologies.
                                                         8
So...



Gene-disease networks           Molecular interaction networks




Drug-disease networks       Regular gene expression networks

                                                                 9
Method Overview
                              Ontology




MEDLINE
Abstracts
             Dependency
              Graphs of
              Sentences
                          R
                                         PGx network




                                                  10
1a. Sentence Parsing
●   Implementation of lexicons for sentence
    retrieval.
●   Stanford Parser.
●   Focused on sentences with at least 2 key PGX
    entities.



                                              11
1b. Sentence Parsing
●   Querying the sentence index using seeds.
        –   particular terms corresponding to recognized entities.
        –   focus on gene-drug/gene-phenotype pairs.
●   Reducing set/size of parse trees.
●   Parse trees -> dependency graphs.
        –   rooted, oriented, labelled, easy to read, process,
              understand than parse trees.


                                                                 12
Parsing Example
“Several single nucleotide polymorphisms (SNPs) in VKORC1 are associated
              with warfarin dose across the normal dose range”




                                                                           13
Dependency Graph




                   14
2a. Relation Extraction
●   Sentence analysis for raw relationship
    extraction.
●   Seed recognition:
       –   through PharmGKB lexicons.
●   Seed expansion:
       –   edge traversal of DG to see if the seed is a key entity
             or a modified entity.


                                                               15
Dependencies for Seed
     Expansion



 ●   Expand the seed
 ●   End the expansion
 ●   Interrupt the expansion
                               16
2b. Relation Extraction
●   Seed coupling
       –   Two seeds wend with a normalised verb.
       –   Relationship creation.




                                                    17
2c. Relation Extraction
●   Evaluation of precision:
        –   manual precision evaluation of extracting raw
             relationships.
        –   random selection of 220 raw relationships.
        –   classification-complete and true, incomplete and true,
              false.




                                                               18
3. Ontology Construction
●   Identification of R types.
●   Hierarchical organisation of R types and E.
        –   4 lists: most frequent, the most frequent modified
              entities by genes, drugs, phenotype.
●   Refine choice available.




                                                                 19
4a. Relationship Normalization
●   Application of ontology to relationship
    instances.
●   Creation of set of normalised relationships.
●   Normalization of entity names:
        –   modified entity name returned in normalized form
             according to ontology.
        –   Decomposition of modified entity to iterate for the
             construction of normalised form.

                                                                  20
Example




          21
Example
●   Seed: VKORC1_polymorphisms.
●   Seed concept: Gene.
●   Next word: polymorphism.
        –   refers to a concept modified by Gene.
        –   synonym of the concept “variant”.
●   Normalised word:
        –   VKORC1_variant.


                                                    22
4b. Relation Normalization
●   Normalization of relationship types.
        –   search for a role label which matches the relationship.
        –   the identifier of the corresponding role is the
              normalized type.
        –   creation of knowledge base of PGX relationships.




                                                               23
Did it work?
●   Input:
        –   17.396.436 MEDLINE abstracts
●   Sentences:
        –   87.806.828.
●   Sentences with pairs of PGx entities:
        –   295.569.
●   After pruning:
        –   41.134 raw relationships, 21.050 gene-drug pair,
              20.084 gene-phenotype pair.                      24
25
Results
●   The 200 most frequent raw relationship types:
        –   80% of the extracted relationships.
●   Creation of an ontology:
        –   200 most frequent relationship types and modified
              entities called PHARE-PHArmacogenomics
              RElationships.
        –   237 concepts and 76 roles.



                                                                26
Results (2)




              27
Results (3)




              28
29
30
Discussion (1)
●   Identification of both PGx entities.
●   Identification of PGx modified entities.
●   Use of key entity lexicons for discovery and
    normalization of modified entities.
●   Record and recognition of modified entities
    under very general textual conditions.
●   Flexible, precise method.
                                                   31
Discussion (2)
●   Concern: lower recall due to the large corpus
    size.
        –   improve precision with full text parsing.
●   Applicable to other domains.
        –   Human effort required for the ontology creation.




                                                               32
Conclusions (1)
●   New method for PGX relationship extraction.
●   Use of key PGX entities to identify modified
    entities.
●   Capture and normalization of raw
    relationships.
●   Automatic labelling of parsed sentences.


                                                   33
Conclusions (2)
●   Creation of a knowledge base.
●   Creation of relationship summaries between:
       –   Genes, drugs, phenotypes.
●   Novel approach for PGX text processing.




                                              34
Questions?
         (in French ^_^)


                           Questions?


質問 ?

                           Ερωτήσεις;
Preguntas?
                                        35

Mais conteúdo relacionado

Semelhante a Using text to build semantic networks for pharmacaogenomics2

Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicshemantbreeder
 
GeneArt® services - Gene synthesis through protein production
GeneArt® services - Gene synthesis through protein productionGeneArt® services - Gene synthesis through protein production
GeneArt® services - Gene synthesis through protein productionThermo Fisher Scientific
 
bioinformatics simple
bioinformatics simple bioinformatics simple
bioinformatics simple nadeem akhter
 
Bio-IT 2010 Genome Commons
Bio-IT 2010 Genome CommonsBio-IT 2010 Genome Commons
Bio-IT 2010 Genome CommonsReece Hart
 
Research report (alternative splicing, protein structure; retinitis pigmentosa)
Research report (alternative splicing, protein structure; retinitis pigmentosa)Research report (alternative splicing, protein structure; retinitis pigmentosa)
Research report (alternative splicing, protein structure; retinitis pigmentosa)avalgar
 
Reproducibility in cheminformatics and computational chemistry research: cert...
Reproducibility in cheminformatics and computational chemistry research: cert...Reproducibility in cheminformatics and computational chemistry research: cert...
Reproducibility in cheminformatics and computational chemistry research: cert...Greg Landrum
 
Apollo Introduction for i5K Groups 2015-10-07
Apollo Introduction for i5K Groups 2015-10-07Apollo Introduction for i5K Groups 2015-10-07
Apollo Introduction for i5K Groups 2015-10-07Monica Munoz-Torres
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformaticsphilmaweb
 
SOT short course on computational toxicology
SOT short course on computational toxicology SOT short course on computational toxicology
SOT short course on computational toxicology Sean Ekins
 
SBML (the Systems Biology Markup Language), model databases, and other resources
SBML (the Systems Biology Markup Language), model databases, and other resourcesSBML (the Systems Biology Markup Language), model databases, and other resources
SBML (the Systems Biology Markup Language), model databases, and other resourcesMike Hucka
 
Functional genomics,Pharmaco genomics, and Meta genomics.
Functional genomics,Pharmaco genomics, and Meta genomics.Functional genomics,Pharmaco genomics, and Meta genomics.
Functional genomics,Pharmaco genomics, and Meta genomics.sangeeta jadav
 

Semelhante a Using text to build semantic networks for pharmacaogenomics2 (20)

Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
GeneArt® services - Gene synthesis through protein production
GeneArt® services - Gene synthesis through protein productionGeneArt® services - Gene synthesis through protein production
GeneArt® services - Gene synthesis through protein production
 
Slides 0
Slides 0Slides 0
Slides 0
 
Drug design
Drug designDrug design
Drug design
 
presentation
presentationpresentation
presentation
 
bioinformatics simple
bioinformatics simple bioinformatics simple
bioinformatics simple
 
Bio-IT 2010 Genome Commons
Bio-IT 2010 Genome CommonsBio-IT 2010 Genome Commons
Bio-IT 2010 Genome Commons
 
Research report (alternative splicing, protein structure; retinitis pigmentosa)
Research report (alternative splicing, protein structure; retinitis pigmentosa)Research report (alternative splicing, protein structure; retinitis pigmentosa)
Research report (alternative splicing, protein structure; retinitis pigmentosa)
 
Reproducibility in cheminformatics and computational chemistry research: cert...
Reproducibility in cheminformatics and computational chemistry research: cert...Reproducibility in cheminformatics and computational chemistry research: cert...
Reproducibility in cheminformatics and computational chemistry research: cert...
 
Applied Bioinformatics Assignment 5docx
Applied Bioinformatics Assignment  5docxApplied Bioinformatics Assignment  5docx
Applied Bioinformatics Assignment 5docx
 
functional genomics.ppt
functional genomics.pptfunctional genomics.ppt
functional genomics.ppt
 
Apollo Introduction for i5K Groups 2015-10-07
Apollo Introduction for i5K Groups 2015-10-07Apollo Introduction for i5K Groups 2015-10-07
Apollo Introduction for i5K Groups 2015-10-07
 
CADD Lecture
CADD LectureCADD Lecture
CADD Lecture
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
genomic comparison
genomic comparison genomic comparison
genomic comparison
 
Apollo Workshop at KSU 2015
Apollo Workshop at KSU 2015Apollo Workshop at KSU 2015
Apollo Workshop at KSU 2015
 
SOT short course on computational toxicology
SOT short course on computational toxicology SOT short course on computational toxicology
SOT short course on computational toxicology
 
Genomics
GenomicsGenomics
Genomics
 
SBML (the Systems Biology Markup Language), model databases, and other resources
SBML (the Systems Biology Markup Language), model databases, and other resourcesSBML (the Systems Biology Markup Language), model databases, and other resources
SBML (the Systems Biology Markup Language), model databases, and other resources
 
Functional genomics,Pharmaco genomics, and Meta genomics.
Functional genomics,Pharmaco genomics, and Meta genomics.Functional genomics,Pharmaco genomics, and Meta genomics.
Functional genomics,Pharmaco genomics, and Meta genomics.
 

Último

Hematology and Immunology - Leukocytes Functions
Hematology and Immunology - Leukocytes FunctionsHematology and Immunology - Leukocytes Functions
Hematology and Immunology - Leukocytes FunctionsMedicoseAcademics
 
High Profile Call Girls Mavalli - 7001305949 | 24x7 Service Available Near Me
High Profile Call Girls Mavalli - 7001305949 | 24x7 Service Available Near MeHigh Profile Call Girls Mavalli - 7001305949 | 24x7 Service Available Near Me
High Profile Call Girls Mavalli - 7001305949 | 24x7 Service Available Near Menarwatsonia7
 
Case Report Peripartum Cardiomyopathy.pptx
Case Report Peripartum Cardiomyopathy.pptxCase Report Peripartum Cardiomyopathy.pptx
Case Report Peripartum Cardiomyopathy.pptxNiranjan Chavan
 
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...narwatsonia7
 
Kolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Kolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowKolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Kolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowNehru place Escorts
 
Call Girls Service Nandiambakkam | 7001305949 At Low Cost Cash Payment Booking
Call Girls Service Nandiambakkam | 7001305949 At Low Cost Cash Payment BookingCall Girls Service Nandiambakkam | 7001305949 At Low Cost Cash Payment Booking
Call Girls Service Nandiambakkam | 7001305949 At Low Cost Cash Payment BookingNehru place Escorts
 
Housewife Call Girls Hsr Layout - Call 7001305949 Rs-3500 with A/C Room Cash ...
Housewife Call Girls Hsr Layout - Call 7001305949 Rs-3500 with A/C Room Cash ...Housewife Call Girls Hsr Layout - Call 7001305949 Rs-3500 with A/C Room Cash ...
Housewife Call Girls Hsr Layout - Call 7001305949 Rs-3500 with A/C Room Cash ...narwatsonia7
 
Book Call Girls in Kasavanahalli - 7001305949 with real photos and phone numbers
Book Call Girls in Kasavanahalli - 7001305949 with real photos and phone numbersBook Call Girls in Kasavanahalli - 7001305949 with real photos and phone numbers
Book Call Girls in Kasavanahalli - 7001305949 with real photos and phone numbersnarwatsonia7
 
Dwarka Sector 6 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few Cl...
Dwarka Sector 6 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few Cl...Dwarka Sector 6 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few Cl...
Dwarka Sector 6 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few Cl...rajnisinghkjn
 
Pharmaceutical Marketting: Unit-5, Pricing
Pharmaceutical Marketting: Unit-5, PricingPharmaceutical Marketting: Unit-5, Pricing
Pharmaceutical Marketting: Unit-5, PricingArunagarwal328757
 
call girls in munirka DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in munirka  DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️call girls in munirka  DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in munirka DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️saminamagar
 
Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...
Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...
Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...rajnisinghkjn
 
See the 2,456 pharmacies on the National E-Pharmacy Platform
See the 2,456 pharmacies on the National E-Pharmacy PlatformSee the 2,456 pharmacies on the National E-Pharmacy Platform
See the 2,456 pharmacies on the National E-Pharmacy PlatformKweku Zurek
 
Call Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking ModelsMumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking Modelssonalikaur4
 
Call Girls Service Noida Maya 9711199012 Independent Escort Service Noida
Call Girls Service Noida Maya 9711199012 Independent Escort Service NoidaCall Girls Service Noida Maya 9711199012 Independent Escort Service Noida
Call Girls Service Noida Maya 9711199012 Independent Escort Service NoidaPooja Gupta
 
Statistical modeling in pharmaceutical research and development.
Statistical modeling in pharmaceutical research and development.Statistical modeling in pharmaceutical research and development.
Statistical modeling in pharmaceutical research and development.ANJALI
 
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...narwatsonia7
 
VIP Call Girls Lucknow Nandini 7001305949 Independent Escort Service Lucknow
VIP Call Girls Lucknow Nandini 7001305949 Independent Escort Service LucknowVIP Call Girls Lucknow Nandini 7001305949 Independent Escort Service Lucknow
VIP Call Girls Lucknow Nandini 7001305949 Independent Escort Service Lucknownarwatsonia7
 
High Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service Jaipur
High Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service JaipurHigh Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service Jaipur
High Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service Jaipurparulsinha
 

Último (20)

Hematology and Immunology - Leukocytes Functions
Hematology and Immunology - Leukocytes FunctionsHematology and Immunology - Leukocytes Functions
Hematology and Immunology - Leukocytes Functions
 
High Profile Call Girls Mavalli - 7001305949 | 24x7 Service Available Near Me
High Profile Call Girls Mavalli - 7001305949 | 24x7 Service Available Near MeHigh Profile Call Girls Mavalli - 7001305949 | 24x7 Service Available Near Me
High Profile Call Girls Mavalli - 7001305949 | 24x7 Service Available Near Me
 
Case Report Peripartum Cardiomyopathy.pptx
Case Report Peripartum Cardiomyopathy.pptxCase Report Peripartum Cardiomyopathy.pptx
Case Report Peripartum Cardiomyopathy.pptx
 
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
 
Kolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Kolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowKolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Kolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
 
Call Girls Service Nandiambakkam | 7001305949 At Low Cost Cash Payment Booking
Call Girls Service Nandiambakkam | 7001305949 At Low Cost Cash Payment BookingCall Girls Service Nandiambakkam | 7001305949 At Low Cost Cash Payment Booking
Call Girls Service Nandiambakkam | 7001305949 At Low Cost Cash Payment Booking
 
Housewife Call Girls Hsr Layout - Call 7001305949 Rs-3500 with A/C Room Cash ...
Housewife Call Girls Hsr Layout - Call 7001305949 Rs-3500 with A/C Room Cash ...Housewife Call Girls Hsr Layout - Call 7001305949 Rs-3500 with A/C Room Cash ...
Housewife Call Girls Hsr Layout - Call 7001305949 Rs-3500 with A/C Room Cash ...
 
Book Call Girls in Kasavanahalli - 7001305949 with real photos and phone numbers
Book Call Girls in Kasavanahalli - 7001305949 with real photos and phone numbersBook Call Girls in Kasavanahalli - 7001305949 with real photos and phone numbers
Book Call Girls in Kasavanahalli - 7001305949 with real photos and phone numbers
 
Dwarka Sector 6 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few Cl...
Dwarka Sector 6 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few Cl...Dwarka Sector 6 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few Cl...
Dwarka Sector 6 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few Cl...
 
Pharmaceutical Marketting: Unit-5, Pricing
Pharmaceutical Marketting: Unit-5, PricingPharmaceutical Marketting: Unit-5, Pricing
Pharmaceutical Marketting: Unit-5, Pricing
 
call girls in munirka DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in munirka  DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️call girls in munirka  DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in munirka DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
 
Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...
Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...
Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...
 
See the 2,456 pharmacies on the National E-Pharmacy Platform
See the 2,456 pharmacies on the National E-Pharmacy PlatformSee the 2,456 pharmacies on the National E-Pharmacy Platform
See the 2,456 pharmacies on the National E-Pharmacy Platform
 
Call Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service Available
 
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking ModelsMumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
 
Call Girls Service Noida Maya 9711199012 Independent Escort Service Noida
Call Girls Service Noida Maya 9711199012 Independent Escort Service NoidaCall Girls Service Noida Maya 9711199012 Independent Escort Service Noida
Call Girls Service Noida Maya 9711199012 Independent Escort Service Noida
 
Statistical modeling in pharmaceutical research and development.
Statistical modeling in pharmaceutical research and development.Statistical modeling in pharmaceutical research and development.
Statistical modeling in pharmaceutical research and development.
 
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
 
VIP Call Girls Lucknow Nandini 7001305949 Independent Escort Service Lucknow
VIP Call Girls Lucknow Nandini 7001305949 Independent Escort Service LucknowVIP Call Girls Lucknow Nandini 7001305949 Independent Escort Service Lucknow
VIP Call Girls Lucknow Nandini 7001305949 Independent Escort Service Lucknow
 
High Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service Jaipur
High Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service JaipurHigh Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service Jaipur
High Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service Jaipur
 

Using text to build semantic networks for pharmacaogenomics2

  • 1. Using Text to Build Semantics Networks for Pharmacogenomics George Karystianis Adrien Coulet, Nigam Shah, Yael Garten, Mark Musen, Russ B. Altman Journal of Biomedical informatics (2010)
  • 2. Motivation ● Manually crafted rules to define relationships between entities. – Limited scope domains. ● Pharmacogenomics. – Semantic complexity. ● Enhance the PharmaGKB. ● Large size of literature. ● NLP techniques promising. 2
  • 3. Aim ● Automatic relationship extraction. ● Entity mapping in a schema. – Semantic network structure. ● Curation of PGx knowledge. ● Resource for knowledge discovery. 3
  • 5. What is the meaning of Pharmacogenomics? 5
  • 6. Pharmacogenomics (1) Pharmaco Genomics PGx Φάρμακο Γίνομαι 6
  • 7. Pharmacogenomics (2) ● How genetic variation influences drug response in patients. ● Most of this knowledge presented in binary relationships. R(a,b) Relationship Subject Object 7
  • 8. Is This Something New? ● Co-occurrence approach: Complex relationship – Pharmexpresso. semantics. – Tri-co-occurrences. Manual relationship evaluation. ● Syntactic parser approach: Explicit relationship identification. – OpenDMAP. Large pattern sets. – Vocabularies. Stable ontologies. 8
  • 9. So... Gene-disease networks Molecular interaction networks Drug-disease networks Regular gene expression networks 9
  • 10. Method Overview Ontology MEDLINE Abstracts Dependency Graphs of Sentences R PGx network 10
  • 11. 1a. Sentence Parsing ● Implementation of lexicons for sentence retrieval. ● Stanford Parser. ● Focused on sentences with at least 2 key PGX entities. 11
  • 12. 1b. Sentence Parsing ● Querying the sentence index using seeds. – particular terms corresponding to recognized entities. – focus on gene-drug/gene-phenotype pairs. ● Reducing set/size of parse trees. ● Parse trees -> dependency graphs. – rooted, oriented, labelled, easy to read, process, understand than parse trees. 12
  • 13. Parsing Example “Several single nucleotide polymorphisms (SNPs) in VKORC1 are associated with warfarin dose across the normal dose range” 13
  • 15. 2a. Relation Extraction ● Sentence analysis for raw relationship extraction. ● Seed recognition: – through PharmGKB lexicons. ● Seed expansion: – edge traversal of DG to see if the seed is a key entity or a modified entity. 15
  • 16. Dependencies for Seed Expansion ● Expand the seed ● End the expansion ● Interrupt the expansion 16
  • 17. 2b. Relation Extraction ● Seed coupling – Two seeds wend with a normalised verb. – Relationship creation. 17
  • 18. 2c. Relation Extraction ● Evaluation of precision: – manual precision evaluation of extracting raw relationships. – random selection of 220 raw relationships. – classification-complete and true, incomplete and true, false. 18
  • 19. 3. Ontology Construction ● Identification of R types. ● Hierarchical organisation of R types and E. – 4 lists: most frequent, the most frequent modified entities by genes, drugs, phenotype. ● Refine choice available. 19
  • 20. 4a. Relationship Normalization ● Application of ontology to relationship instances. ● Creation of set of normalised relationships. ● Normalization of entity names: – modified entity name returned in normalized form according to ontology. – Decomposition of modified entity to iterate for the construction of normalised form. 20
  • 21. Example 21
  • 22. Example ● Seed: VKORC1_polymorphisms. ● Seed concept: Gene. ● Next word: polymorphism. – refers to a concept modified by Gene. – synonym of the concept “variant”. ● Normalised word: – VKORC1_variant. 22
  • 23. 4b. Relation Normalization ● Normalization of relationship types. – search for a role label which matches the relationship. – the identifier of the corresponding role is the normalized type. – creation of knowledge base of PGX relationships. 23
  • 24. Did it work? ● Input: – 17.396.436 MEDLINE abstracts ● Sentences: – 87.806.828. ● Sentences with pairs of PGx entities: – 295.569. ● After pruning: – 41.134 raw relationships, 21.050 gene-drug pair, 20.084 gene-phenotype pair. 24
  • 25. 25
  • 26. Results ● The 200 most frequent raw relationship types: – 80% of the extracted relationships. ● Creation of an ontology: – 200 most frequent relationship types and modified entities called PHARE-PHArmacogenomics RElationships. – 237 concepts and 76 roles. 26
  • 29. 29
  • 30. 30
  • 31. Discussion (1) ● Identification of both PGx entities. ● Identification of PGx modified entities. ● Use of key entity lexicons for discovery and normalization of modified entities. ● Record and recognition of modified entities under very general textual conditions. ● Flexible, precise method. 31
  • 32. Discussion (2) ● Concern: lower recall due to the large corpus size. – improve precision with full text parsing. ● Applicable to other domains. – Human effort required for the ontology creation. 32
  • 33. Conclusions (1) ● New method for PGX relationship extraction. ● Use of key PGX entities to identify modified entities. ● Capture and normalization of raw relationships. ● Automatic labelling of parsed sentences. 33
  • 34. Conclusions (2) ● Creation of a knowledge base. ● Creation of relationship summaries between: – Genes, drugs, phenotypes. ● Novel approach for PGX text processing. 34
  • 35. Questions? (in French ^_^) Questions? 質問 ? Ερωτήσεις; Preguntas? 35