SlideShare uma empresa Scribd logo
1 de 37
VAAST
Deciphering Genetic Disease with Next-Generation
Sequencing
Barry Moore, M.S.
Research Scientist
Department of Human Genetics
Department of Biomedical Informatics
Outline

 The VAAST Analysis Pipeline
 Ogden Syndrome: Application of VAAST to a Genetic Disease
 of Unknown Cause
 The Future of VAAST Development
$10,000,000
Venter Genome



                $1,000,000
                  Watson




                             $5,000
                              You?
Next Generation Sequencing




                                             Disease


                                             Healthy

     geneA   geneB   geneX   geneY   geneZ
Variant
 Variant     Annotation
Annotation   Tool


             Variant
 Variant     Selection
Selection    Tool

             Variant
 Variant     Annotation
             Analysis
 Analysis    Search
             Tool
GVF

VAAST Pipeline                                             3.5 Million
                                                            Variants

 Reference                       VAT                       Reference
  Genome                 (Variant Annotation Tool)          Genes
   Fasta                                                         GFF3


           Annotated          Annotated              Annotated
                   GVF
            Variants           Variants               Variants



                                VST
                         (Variant Selection Tool)




                  CDR
                               Merged
                             Variant Sets
GVF

VAAST Pipeline                        Variant Effect
                                                        3.5 Million
                                      •sequence_variant
                                                         Variants
                                         •gene_variant
 Reference                     VAT                           Reference
                                         •five_prime_UTR_variant
  Genome Type
    Variant                                                    Genes
                                         •three_prime_UTR_variant
                        (Variant Annotation Tool)
    •sequence_alteration
   Fasta                                 •exon_variant          GFF3
    •deletion                            •splice_region_variant
    •insertion                           •splice_donor_variant
    •duplication
         Annotated            Annotated  •splice_acceptor_variant
                                                    Annotated
    •inversion     GVF                   •intron_variant
          Variants
    •substitution
                               Variants               Variants
                                         •coding_sequence_variant
    •SNV                                 •stop_retained
    •MNP                                 •stop_lost
    •complex substitution                •stop_gained
    •translocation            VST        •synonymous_codon
                                         •non_synonymous_codon
                         (Variant Selection Tool)
                                         •amino_acid_substitution
                                         •frameshift_variant
                                         •inframe_variant

                  CDR
                             Merged
                           Variant Sets
GVF

VAAST Pipeline                        Variant Effect
                                                        3.5 Million
                                      •sequence_variant
                                                         Variants
                                         •gene_variant
 Reference                     VAT                           Reference
                                         •five_prime_UTR_variant
  Genome Type
    Variant                                                    Genes
                                         •three_prime_UTR_variant
                        (Variant Annotation Tool)
    •sequence_alteration
   Fasta                                 •exon_variant          GFF3
    •deletion                            •splice_region_variant
    •insertion                           •splice_donor_variant
    •duplication
         Annotated            Annotated  •splice_acceptor_variant
                                                    Annotated
    •inversion     GVF                   •intron_variant
          Variants
    •substitution
                               Variants               Variants
                                         •coding_sequence_variant
    •SNV                                 •stop_retained
    •MNP                                 •stop_lost
    •complex substitution                •stop_gained
    •translocation            VST        •synonymous_codon
                                         •non_synonymous_codon
                         (Variant Selection Tool)
                                         •amino_acid_substitution
                                         •frameshift_variant
                                         •inframe_variant

                  CDR
                             Merged
                           Variant Sets
CDR                       CDR

Background                  Target
 Genomes                   Genomes



             VAAST


             Prioritized
             Candidate
              Genes
               VAAST
               Report
Key Features of VAAST

• Probabilistic
• Feature Based
• Both Allele and AAS Frequencies
• Considers Inheritance Model
• Fast
• Standardized Ontology Based Format
• Modular and Flexible in Design
VAAST Uses Variant Frequencies in a
Probabilistic Fashion

         Likelihood Ratio Test

                   Maximum Likelihood
                    of the Null Model
                     (No Difference)
                   Maximum Likelihood
                  of the Alternate Model
                   (There is Difference)
VAAST Uses Variant Frequencies in a
Probabilistic Fashion
VAAST Uses Variant Frequencies in a
Probabilistic Fashion
•   VAAST gives us the likelihood of the composite genotype
    at GENE X in the target given the background.

•   Do allele frequencies differ between Background and
    Target genomes within a given gene or feature?

•   Composite likelihood calculation assumes independence
    across sites. To control for LD, statistical significance is
    estimated by permutation test.

•   Multiple test correction for number of features (~20,000)
    is two orders of magnitude better than for the number of
    variants (~3,500,000).
Noise Decreases Dramatically with
Increasing Number of Genomes
            1 genome target
         1 genome background
1 genome target
10 genome background
1 genome target
250 genome background
1 genome target
250 genome background
       Trio Data
Alleles Responsible for Miller
    Syndrome in Utah Kindred
               CHR 16: DHODH                               CHR 5: DNAH5
                Mom     Dad                                Mom     Dad
            G:R                                        R:Q
                                     G:A                                    R:
                                                                                 *

                Son         Daughter                       Son        Daughter
            G:R            G:R                         R:Q            R:Q
                                                                 R:         R:
                      G:A            G:A
                                                                      *          *

•Ng et al, Nature Genetics 42, 30–35 (2010) doi:10.1038/ng.499
•Roach, et al, Science , 328 636, 2101
Schematic of VAAST Analysis of Utah
Miller Kindred Using a Single Quartet



                               DHODH


DNAH5
Average Rank for 100 Dominant and
Recessive Diseases
                           1300
   Ave. rank genome-wide


                                                            SIZE OF CASE COHORT
                           1100
                                                                 2 allele copies
                           900
                                                                 4 allele copies
                           700
                                                                 6 allele copies
                           500


                           300
                                  156                 132
                           100          21   9               8      3

                           -100
                                   DOMINANT             RECESSIVE
                           -300


                           -500         443 genomes in background
Impact of Missing Data
                          4000


                          3500
                                               2 of 6 allele copies
  Ave. rank genome-wide




                          3000
                                               4 of 6 allele copies
                          2500
                                               6 of 6 allele copies
                          2000


                          1500


                          1000
                                 639
                           500                                    373
                                          61
                                                                            21
                                                    9                               3
                             0


                          -500
                                       DOMINANT                         RECESSIVE

                                           443 genomes in background
Outline

 The VAAST Analysis Pipeline
 Ogden Syndrome: Application of VAAST to a Genetic
 Disease of Unknown Cause

 The Future of VAAST Development
An Rare X-linked Mendelian Disorder

•   A Utah family coming to the
    University Hospital for 20+
    years
•   About half of the male offspring
    die around 1 year of age
•   Aged appearance
•   Craniofacial anomalies
•   Hypotonia
•   Global developmental delays
•   Cardiac arrhythmias
Four Affected Boys over Two
      Generations
  I



 II



III
Exome Sequencing
 •   Agilent SureSelect In-Solution X Chromosome Capture
 •   Covaris S series Sonication (150-200 bp)
 •   76 bp single-end reads on one lane each of the
     IlluminaGAIIx



Variant Calling
 •    Sequence alignment with bwa
 •    Remove duplicate reads with PICARD
 •    Realign indel regions with GATK
 •    Variant calling with Samtools, GATK
Identifying Candidate Genes

 VAAST Identifies NAA10 as Candidate Gene
    •   About 20 min. run time
    •   3 candidate genes (NAA10 ranked 2) proband only
    •   1 candidate gene (NAA10) with pedigree
Additional Analyses

 •   Microarray based CNV analysis
     •   No likely causal variants found
 •   Sanger sequencing confirmation
     •   Variant segregates perfectly with disease in 13
         family members
 •   Haplotype sharing (STR genotyping)
     •   ~11 MB shared between two affected boys
 •   A second family discovered – same mutation
 •   IBD relatedness analysis – independent mutational
     events
N(alpha)-acetyltransferase
 • N-alpha-acetylation is one of the most common protein
      modifications that occurs during protein synthesis.
  •   NatA (catalytic subunit NAA10 (hARD1)
  •   Eight exons, Crick strand, highly conserved
  •   A:G transition causes p.Ser37Pro
Functional Analyses
 •   Quantitative in vitro N-terminal acetylation assay (RP-
     HPLC).
 •   Four peptide substrates previously shown to be
     acetylated by NatA (NAA10)
 •   Assays indicate loss-of-function allele.
Functional Analyses
VAAST in Summary

•   Probabilistic Disease Gene Finder
•   Feature Based not Variant Based
•   Both Allele and AAS Frequencies
•   Considers Inheritance Model
•   As few as two target genomes can be sufficient to
    identify causative gene.
•   Background Genomes are “Reusable”
•   Not Limited to Human Analyses
VAAST: Future Directions

 •   Indel support
 •   Splice-site
 •   No-call support
 •   Pedigree support
 •   Phylogenetic conservation
Acknowledgements
VAAST Development Ogden
•Chad Huff        Syndrome                •Thomas Arnesen
•HaoHu              •John Carey           •Rune Evjenth
•Lynn Jorde         •Steven Chin          •Johan R. Lillehaug
•Barry Moore        •Heidi Deborah Fain
•Martin Reese       •Gholson Lyon         •Leslie G. Biesecker
•Marc Singleton     •John Optiz           •Jennifer J.
•Jinchuan Xing      •Theodore J. Pysher   Johnston
•Mark Yandell       •Alan Rope            •Cathy A. Stevens
Yandell Lab         •Reid Robison
                    •Sarah T. South       •Brian Dalley
•Michael Campbell                         •Tao Jiang
•Daniel Ence                              •JeffereySwensen
                    •Chad Huff
•Guozhen Fan
                    •Evan Johnson
•Steven Flygare                           •HakonHakonarson
                    •Barry Moore
•HaoHu                                    •Lynn B. Jorde
                    •Christa Schank
•Zev Kronenberg                           •Mark Yandell
                    •Kai Wang
•Barry Moore
                    •Jinchuan Xing
•Marc Singleton
•Robert Ross
•Mark Yandell
Acknowledgements

Mais conteúdo relacionado

Último

Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPirithiRaju
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024innovationoecd
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...lizamodels9
 
Forensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptxForensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptxkumarsanjai28051
 
Carbon Dioxide Capture and Storage (CSS)
Carbon Dioxide Capture and Storage (CSS)Carbon Dioxide Capture and Storage (CSS)
Carbon Dioxide Capture and Storage (CSS)Tamer Koksalan, PhD
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensorsonawaneprad
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringPrajakta Shinde
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologycaarthichand2003
 
Four Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.pptFour Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.pptJoemSTuliba
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxpriyankatabhane
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxJorenAcuavera1
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRlizamodels9
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naJASISJULIANOELYNV
 
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxmaryFF1
 
The dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxThe dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxEran Akiva Sinbar
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxNandakishor Bhaurao Deshmukh
 
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)riyaescorts54
 
basic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomybasic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomyDrAnita Sharma
 

Último (20)

Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
 
Forensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptxForensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptx
 
Carbon Dioxide Capture and Storage (CSS)
Carbon Dioxide Capture and Storage (CSS)Carbon Dioxide Capture and Storage (CSS)
Carbon Dioxide Capture and Storage (CSS)
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensor
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical Engineering
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technology
 
Four Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.pptFour Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.ppt
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptx
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by na
 
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
 
The dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxThe dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptx
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
 
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
 
basic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomybasic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomy
 

Destaque

PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at WorkGetSmarter
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...DevGAMM Conference
 
Barbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationBarbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationErica Santiago
 

Destaque (20)

PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
 
Barbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationBarbie - Brand Strategy Presentation
Barbie - Brand Strategy Presentation
 

VAAST: Deciphering Genetic Disease with Next-Generation Sequencing

  • 1. VAAST Deciphering Genetic Disease with Next-Generation Sequencing Barry Moore, M.S. Research Scientist Department of Human Genetics Department of Biomedical Informatics
  • 2. Outline  The VAAST Analysis Pipeline  Ogden Syndrome: Application of VAAST to a Genetic Disease of Unknown Cause  The Future of VAAST Development
  • 3. $10,000,000 Venter Genome $1,000,000 Watson $5,000 You?
  • 4. Next Generation Sequencing Disease Healthy geneA geneB geneX geneY geneZ
  • 5. Variant Variant Annotation Annotation Tool Variant Variant Selection Selection Tool Variant Variant Annotation Analysis Analysis Search Tool
  • 6. GVF VAAST Pipeline 3.5 Million Variants Reference VAT Reference Genome (Variant Annotation Tool) Genes Fasta GFF3 Annotated Annotated Annotated GVF Variants Variants Variants VST (Variant Selection Tool) CDR Merged Variant Sets
  • 7. GVF VAAST Pipeline Variant Effect 3.5 Million •sequence_variant Variants •gene_variant Reference VAT Reference •five_prime_UTR_variant Genome Type Variant Genes •three_prime_UTR_variant (Variant Annotation Tool) •sequence_alteration Fasta •exon_variant GFF3 •deletion •splice_region_variant •insertion •splice_donor_variant •duplication Annotated Annotated •splice_acceptor_variant Annotated •inversion GVF •intron_variant Variants •substitution Variants Variants •coding_sequence_variant •SNV •stop_retained •MNP •stop_lost •complex substitution •stop_gained •translocation VST •synonymous_codon •non_synonymous_codon (Variant Selection Tool) •amino_acid_substitution •frameshift_variant •inframe_variant CDR Merged Variant Sets
  • 8. GVF VAAST Pipeline Variant Effect 3.5 Million •sequence_variant Variants •gene_variant Reference VAT Reference •five_prime_UTR_variant Genome Type Variant Genes •three_prime_UTR_variant (Variant Annotation Tool) •sequence_alteration Fasta •exon_variant GFF3 •deletion •splice_region_variant •insertion •splice_donor_variant •duplication Annotated Annotated •splice_acceptor_variant Annotated •inversion GVF •intron_variant Variants •substitution Variants Variants •coding_sequence_variant •SNV •stop_retained •MNP •stop_lost •complex substitution •stop_gained •translocation VST •synonymous_codon •non_synonymous_codon (Variant Selection Tool) •amino_acid_substitution •frameshift_variant •inframe_variant CDR Merged Variant Sets
  • 9. CDR CDR Background Target Genomes Genomes VAAST Prioritized Candidate Genes VAAST Report
  • 10. Key Features of VAAST • Probabilistic • Feature Based • Both Allele and AAS Frequencies • Considers Inheritance Model • Fast • Standardized Ontology Based Format • Modular and Flexible in Design
  • 11. VAAST Uses Variant Frequencies in a Probabilistic Fashion Likelihood Ratio Test Maximum Likelihood of the Null Model (No Difference) Maximum Likelihood of the Alternate Model (There is Difference)
  • 12. VAAST Uses Variant Frequencies in a Probabilistic Fashion
  • 13. VAAST Uses Variant Frequencies in a Probabilistic Fashion • VAAST gives us the likelihood of the composite genotype at GENE X in the target given the background. • Do allele frequencies differ between Background and Target genomes within a given gene or feature? • Composite likelihood calculation assumes independence across sites. To control for LD, statistical significance is estimated by permutation test. • Multiple test correction for number of features (~20,000) is two orders of magnitude better than for the number of variants (~3,500,000).
  • 14. Noise Decreases Dramatically with Increasing Number of Genomes 1 genome target 1 genome background
  • 15. 1 genome target 10 genome background
  • 16. 1 genome target 250 genome background
  • 17. 1 genome target 250 genome background Trio Data
  • 18.
  • 19. Alleles Responsible for Miller Syndrome in Utah Kindred CHR 16: DHODH CHR 5: DNAH5 Mom Dad Mom Dad G:R R:Q G:A R: * Son Daughter Son Daughter G:R G:R R:Q R:Q R: R: G:A G:A * * •Ng et al, Nature Genetics 42, 30–35 (2010) doi:10.1038/ng.499 •Roach, et al, Science , 328 636, 2101
  • 20. Schematic of VAAST Analysis of Utah Miller Kindred Using a Single Quartet DHODH DNAH5
  • 21. Average Rank for 100 Dominant and Recessive Diseases 1300 Ave. rank genome-wide SIZE OF CASE COHORT 1100 2 allele copies 900 4 allele copies 700 6 allele copies 500 300 156 132 100 21 9 8 3 -100 DOMINANT RECESSIVE -300 -500 443 genomes in background
  • 22. Impact of Missing Data 4000 3500 2 of 6 allele copies Ave. rank genome-wide 3000 4 of 6 allele copies 2500 6 of 6 allele copies 2000 1500 1000 639 500 373 61 21 9 3 0 -500 DOMINANT RECESSIVE 443 genomes in background
  • 23. Outline  The VAAST Analysis Pipeline  Ogden Syndrome: Application of VAAST to a Genetic Disease of Unknown Cause  The Future of VAAST Development
  • 24. An Rare X-linked Mendelian Disorder • A Utah family coming to the University Hospital for 20+ years • About half of the male offspring die around 1 year of age • Aged appearance • Craniofacial anomalies • Hypotonia • Global developmental delays • Cardiac arrhythmias
  • 25. Four Affected Boys over Two Generations I II III
  • 26. Exome Sequencing • Agilent SureSelect In-Solution X Chromosome Capture • Covaris S series Sonication (150-200 bp) • 76 bp single-end reads on one lane each of the IlluminaGAIIx Variant Calling • Sequence alignment with bwa • Remove duplicate reads with PICARD • Realign indel regions with GATK • Variant calling with Samtools, GATK
  • 27. Identifying Candidate Genes VAAST Identifies NAA10 as Candidate Gene • About 20 min. run time • 3 candidate genes (NAA10 ranked 2) proband only • 1 candidate gene (NAA10) with pedigree
  • 28. Additional Analyses • Microarray based CNV analysis • No likely causal variants found • Sanger sequencing confirmation • Variant segregates perfectly with disease in 13 family members • Haplotype sharing (STR genotyping) • ~11 MB shared between two affected boys • A second family discovered – same mutation • IBD relatedness analysis – independent mutational events
  • 29. N(alpha)-acetyltransferase • N-alpha-acetylation is one of the most common protein modifications that occurs during protein synthesis. • NatA (catalytic subunit NAA10 (hARD1) • Eight exons, Crick strand, highly conserved • A:G transition causes p.Ser37Pro
  • 30. Functional Analyses • Quantitative in vitro N-terminal acetylation assay (RP- HPLC). • Four peptide substrates previously shown to be acetylated by NatA (NAA10) • Assays indicate loss-of-function allele.
  • 32.
  • 33. VAAST in Summary • Probabilistic Disease Gene Finder • Feature Based not Variant Based • Both Allele and AAS Frequencies • Considers Inheritance Model • As few as two target genomes can be sufficient to identify causative gene. • Background Genomes are “Reusable” • Not Limited to Human Analyses
  • 34. VAAST: Future Directions • Indel support • Splice-site • No-call support • Pedigree support • Phylogenetic conservation
  • 35.
  • 36. Acknowledgements VAAST Development Ogden •Chad Huff Syndrome •Thomas Arnesen •HaoHu •John Carey •Rune Evjenth •Lynn Jorde •Steven Chin •Johan R. Lillehaug •Barry Moore •Heidi Deborah Fain •Martin Reese •Gholson Lyon •Leslie G. Biesecker •Marc Singleton •John Optiz •Jennifer J. •Jinchuan Xing •Theodore J. Pysher Johnston •Mark Yandell •Alan Rope •Cathy A. Stevens Yandell Lab •Reid Robison •Sarah T. South •Brian Dalley •Michael Campbell •Tao Jiang •Daniel Ence •JeffereySwensen •Chad Huff •Guozhen Fan •Evan Johnson •Steven Flygare •HakonHakonarson •Barry Moore •HaoHu •Lynn B. Jorde •Christa Schank •Zev Kronenberg •Mark Yandell •Kai Wang •Barry Moore •Jinchuan Xing •Marc Singleton •Robert Ross •Mark Yandell

Notas do Editor

  1. I’m going to begin the discussion of VAAST with a simple description of how the pipeline runs
  2. Numerator = Null Model (No Difference)Denominator = Alternate Model (Difference)
  3. The maximum likelihood of the null model over the maximum likelihood of the alternate model - weighted by the frequency of the AAS in the healthy dataset over the frequency of that AAS in a disease datasetn=frequency of that AAS in the background p=estimated probability of...B=T=Y=X=a=frequency of this AAS in OMIM
  4. The maximum likelihood of the null model over the maximum likelihood of the alternate model - weighted by the frequency of the AAS in the healthy dataset over the frequency of that AAS in a disease datasetn=frequency of that AAS in the background p=estimated probability of...B=T=Y=X=a=frequency of this AAS in OMIM
  5. Miller Syndrome
  6. A-G TransitionSerine to Proline
  7. Thank the family