SlideShare uma empresa Scribd logo
1 de 23
Baixar para ler offline
Using VarSeq
to Improve
Variant Analysis
Research
June 10, 2015
G Bryce Christensen
Director of Services
Use the Questions pane in
your GoToWebinar window
Questions during
the presentation
Agenda
What makes a damaging variant?
VarSeq Interactive Demonstration
2
3
4
QC Considerations
Variant analysis workflows1
What is VarSeq?
VarSeq
Simple
Flexible
Scalable
 Variant annotation, filtering
and ranking
 Repeatable workflows
 Rich visualizations with
GenomeBrowse
integration
 Powerful GUI and
command-line interfaces
Workflow Development Process in VarSeq
1. Begin from one or many VCF files
2. Annotate variants using public data sources curated by Golden Helix and/or
annotate with custom data sources.
3. Run additional computation algorithms
- Allele counts, genotype zygosity, gene list matching, etc
4. Construct filter chain to identify candidate variants
- May use combinations of logical operators in filters
- May have multiple independent filter chains and/or endpoints
5. Process results
- Gene Ranking with PhoRank
- Review variant QC
- Vizualization with GenomeBrowse
- Commit variants to local database
- Etc.
Annotations are the key
 Good variant analysis
begins with accurate
annotations.
 Golden Helix invests
extensive time and effort
in validating and
maintaining data sources.
 Annotation data sources
may be used for either
quality control or analytic
purposes.
Defining Deleteriousness
 What makes a variant potentially damaging?
 Start by defining the search space:
- Rare, non-synonymous, homozygous variants?
- DeNovo mutations in highly conserved genes?
- Splice-site mutations?
- Etc.
 Review annotations for remaining variants to
identify causal candidates
 Which annotations to use?
Variant Classification
 VarSeq classifies variants into
20+ different categories
 The categories are further
grouped as:
- Loss of Function
- Missense
- Other
 Choice of gene transcript
reference
- RefSeq
- Ensembl
- Others
ClinVar
 ClinVar is a public archive of
variants evaluated for potential
causal relationships to diseases
 Submissions from many
sources, including major clinical
laboratories
 Over 100k records
 Updated monthly
Functional Predictions
 Functional predictions use algorithms to determine the expected
consequence of variants (or the resulting amino acid substitutions).
 dbNSFP
- The Database for NonSynonymous Functional Predictions (dbNSFP) is a
free tool developed by Dr. Xiaoming Liu.
- Catalogs pre-computed conservation and functional prediction scores for all possible
missense SNVs in the genome
- Methods include SIFT, PolyPhen-2, MutationTaster, MutationAssessor, FATHMM, more
 dbscSNV
- Companion to dbNSFP that scores variants in splice consensus regions
- Variants in these regions may disrupt normal gene expression and/or function
 dbNSFP and dbscSNV are both accessible in VarSeq
Variant/Gene Ranking
 PhoRank algorithm in VarSeq uses HPO and GO terminology to
score relationships between genes and phenotypes
 Very useful to prioritize a long list of variants for individual review
 Based on PHEVOR method.
QC Considerations
 Variant QC
 Rare variants deserve special
attention
 VCF/BAM Data:
- Depth - DP
- Quality - GQ
- Strand bias
- Etc.
 Public Annotations:
- “Mappability”
Mappability Annotations
 The human reference genome has
assembly gaps and other “difficult”
regions
 NGS technology sequences short
DNA fragments which are the aligned
to the reference genome
- Most sequences are aligned correctly
- Some sequences can’t be aligned uniquely
- Some sequences may be incorrectly aligned
 Luckily, we can predict many of the
trouble spots
Segmental Duplications
 Segmental duplications are a common confounder
 UCSC “Genomic Super Dups” annotation available through VarSeq
 Recent Example (below):
- Apparent UPD feature in family trio was determined to be an artifact of seg. duplication
- Large chromosome segment duplicated elsewhere with >98% similarity
Emerging Standards
 Several organizations working on best
practices guidelines for genome
mappability
- 1000 Genomes Project
- Genome in a Bottle Consortium
- Global Alliance for Genomics and Health (GA4GH)
- National Institute of Standards and Technology
 Downloadable annotations available for
many types of features:
- Mappability by read length
- High G-C content regions
- Low complexity
- Segmental duplications
- Etc.
Example: 1kG Low Complexity Regions
Example: GA4GH 150-bp Mappability
VarSeq Demonstration Data
 Exome sequencing of five individuals from family with familial cardiac
conduction disease (CCD)
 Raw sequence data obtained from SRA
Workflow Discussion Points
 Male-to-male
transmission makes X-
linked model unlikely
 May follow dominant or
recessive transmission
 Inherited forms of CCD
are rare
 Family has East Asian
ancestry
[Demonstration]
Why VarSeq?
VarSeq
Simple
Flexible
Scalable
 Variant annotation, filtering
and ranking
 Exploratory analysis
 Powerful GUI with
immediate feedback
 Rich visualizations with
GenomeBrowse
integration
Questions or
more info:
 Email
info@goldenhelix.com
 Request an evaluation of
the software at
www.goldenhelix.com
Questions?
Use the Questions pane in
your GoToWebinar window

Mais conteúdo relacionado

Mais procurados

Genetic manipulation of stay-green traits for croop imporvement
Genetic manipulation of stay-green traits for croop imporvement Genetic manipulation of stay-green traits for croop imporvement
Genetic manipulation of stay-green traits for croop imporvement Shantanu Das
 
FINAL POSTER
FINAL POSTERFINAL POSTER
FINAL POSTERRyan Foo
 
RNA-seq: A High-resolution View of the Transcriptome
RNA-seq: A High-resolution View of the TranscriptomeRNA-seq: A High-resolution View of the Transcriptome
RNA-seq: A High-resolution View of the TranscriptomeSean Davis
 
RNA-seq Data Analysis Overview
RNA-seq Data Analysis OverviewRNA-seq Data Analysis Overview
RNA-seq Data Analysis OverviewSean Davis
 
Next Generation Sequencing file Formats ( 2017 )
Next Generation Sequencing file Formats ( 2017 )Next Generation Sequencing file Formats ( 2017 )
Next Generation Sequencing file Formats ( 2017 )Pierre Lindenbaum
 
RSEM and DE packages
RSEM and DE packagesRSEM and DE packages
RSEM and DE packagesRavi Gandham
 
Donor-derived cell-free DNA(dd-cfDNA), Moh'd sharshir
Donor-derived cell-free DNA(dd-cfDNA), Moh'd sharshirDonor-derived cell-free DNA(dd-cfDNA), Moh'd sharshir
Donor-derived cell-free DNA(dd-cfDNA), Moh'd sharshirMoh'd sharshir
 
Pharmacogenetics and Warfarin
Pharmacogenetics and WarfarinPharmacogenetics and Warfarin
Pharmacogenetics and WarfarinAndrew Guvetis
 
The Needleman Wunsch algorithm
The Needleman Wunsch algorithmThe Needleman Wunsch algorithm
The Needleman Wunsch algorithmavrilcoghlan
 
Isolation of promoter and other regulatory elements.pptx
Isolation of promoter and other regulatory elements.pptxIsolation of promoter and other regulatory elements.pptx
Isolation of promoter and other regulatory elements.pptxShailendraPandey20
 
Sequence assembly
Sequence assemblySequence assembly
Sequence assemblyRamya P
 
Lecture 9 slides: Machine learning for Protein Structure ...
Lecture 9 slides: Machine learning for Protein Structure ...Lecture 9 slides: Machine learning for Protein Structure ...
Lecture 9 slides: Machine learning for Protein Structure ...butest
 
techniques used in Metabolite profiling of bryophytes ppt
techniques used in Metabolite  profiling of bryophytes ppttechniques used in Metabolite  profiling of bryophytes ppt
techniques used in Metabolite profiling of bryophytes pptUnnatiChopra1
 
Whole exome sequencing data analysis.pptx
Whole exome sequencing data analysis.pptxWhole exome sequencing data analysis.pptx
Whole exome sequencing data analysis.pptxHaibo Liu
 
Paired-end alignments in sequence graphs
Paired-end alignments in sequence graphsPaired-end alignments in sequence graphs
Paired-end alignments in sequence graphsChirag Jain
 
Introduction to RNA-seq
Introduction to RNA-seqIntroduction to RNA-seq
Introduction to RNA-seqPaul Gardner
 

Mais procurados (20)

Genetic manipulation of stay-green traits for croop imporvement
Genetic manipulation of stay-green traits for croop imporvement Genetic manipulation of stay-green traits for croop imporvement
Genetic manipulation of stay-green traits for croop imporvement
 
NGS: Mapping and de novo assembly
NGS: Mapping and de novo assemblyNGS: Mapping and de novo assembly
NGS: Mapping and de novo assembly
 
FINAL POSTER
FINAL POSTERFINAL POSTER
FINAL POSTER
 
RNA-seq: A High-resolution View of the Transcriptome
RNA-seq: A High-resolution View of the TranscriptomeRNA-seq: A High-resolution View of the Transcriptome
RNA-seq: A High-resolution View of the Transcriptome
 
RNA-seq Data Analysis Overview
RNA-seq Data Analysis OverviewRNA-seq Data Analysis Overview
RNA-seq Data Analysis Overview
 
Pharmacogenomics
PharmacogenomicsPharmacogenomics
Pharmacogenomics
 
Next Generation Sequencing file Formats ( 2017 )
Next Generation Sequencing file Formats ( 2017 )Next Generation Sequencing file Formats ( 2017 )
Next Generation Sequencing file Formats ( 2017 )
 
RSEM and DE packages
RSEM and DE packagesRSEM and DE packages
RSEM and DE packages
 
Donor-derived cell-free DNA(dd-cfDNA), Moh'd sharshir
Donor-derived cell-free DNA(dd-cfDNA), Moh'd sharshirDonor-derived cell-free DNA(dd-cfDNA), Moh'd sharshir
Donor-derived cell-free DNA(dd-cfDNA), Moh'd sharshir
 
Pharmacogenetics and Warfarin
Pharmacogenetics and WarfarinPharmacogenetics and Warfarin
Pharmacogenetics and Warfarin
 
The Needleman Wunsch algorithm
The Needleman Wunsch algorithmThe Needleman Wunsch algorithm
The Needleman Wunsch algorithm
 
Pharmacogenomics
Pharmacogenomics Pharmacogenomics
Pharmacogenomics
 
SNP Genotyping Technologies
SNP Genotyping TechnologiesSNP Genotyping Technologies
SNP Genotyping Technologies
 
Isolation of promoter and other regulatory elements.pptx
Isolation of promoter and other regulatory elements.pptxIsolation of promoter and other regulatory elements.pptx
Isolation of promoter and other regulatory elements.pptx
 
Sequence assembly
Sequence assemblySequence assembly
Sequence assembly
 
Lecture 9 slides: Machine learning for Protein Structure ...
Lecture 9 slides: Machine learning for Protein Structure ...Lecture 9 slides: Machine learning for Protein Structure ...
Lecture 9 slides: Machine learning for Protein Structure ...
 
techniques used in Metabolite profiling of bryophytes ppt
techniques used in Metabolite  profiling of bryophytes ppttechniques used in Metabolite  profiling of bryophytes ppt
techniques used in Metabolite profiling of bryophytes ppt
 
Whole exome sequencing data analysis.pptx
Whole exome sequencing data analysis.pptxWhole exome sequencing data analysis.pptx
Whole exome sequencing data analysis.pptx
 
Paired-end alignments in sequence graphs
Paired-end alignments in sequence graphsPaired-end alignments in sequence graphs
Paired-end alignments in sequence graphs
 
Introduction to RNA-seq
Introduction to RNA-seqIntroduction to RNA-seq
Introduction to RNA-seq
 

Destaque

Authoring Clinical Reports in VarSeq
Authoring Clinical Reports in VarSeqAuthoring Clinical Reports in VarSeq
Authoring Clinical Reports in VarSeqGolden Helix Inc
 
Clinical Reporting Made Easy
Clinical Reporting Made EasyClinical Reporting Made Easy
Clinical Reporting Made EasyGolden Helix Inc
 
Lucid what really matters to families
Lucid what really matters to familiesLucid what really matters to families
Lucid what really matters to familieslucidpeople
 
Personalized Medicine through Tumor Sequencing
Personalized Medicine through Tumor SequencingPersonalized Medicine through Tumor Sequencing
Personalized Medicine through Tumor SequencingGolden Helix Inc
 
Advancing Agrigenomic Discoveries with Sequencing and GWAS Research
Advancing Agrigenomic Discoveries with Sequencing and GWAS ResearchAdvancing Agrigenomic Discoveries with Sequencing and GWAS Research
Advancing Agrigenomic Discoveries with Sequencing and GWAS ResearchGolden Helix Inc
 
B2e11e1b9030c180a860a8450d5eceff
B2e11e1b9030c180a860a8450d5eceffB2e11e1b9030c180a860a8450d5eceff
B2e11e1b9030c180a860a8450d5eceffPimpaka Khampin
 
Izabela lewicka kołobrzeg
Izabela lewicka kołobrzegIzabela lewicka kołobrzeg
Izabela lewicka kołobrzegIzabela Lewicka
 
Pharmacogenomic Prediction of Antracycline-induced Cardiotoxicity
Pharmacogenomic Prediction of Antracycline-induced CardiotoxicityPharmacogenomic Prediction of Antracycline-induced Cardiotoxicity
Pharmacogenomic Prediction of Antracycline-induced CardiotoxicityGolden Helix Inc
 
Pentyrch bowling club annual tour 2014 Tour Programme
Pentyrch bowling club annual tour 2014 Tour ProgrammePentyrch bowling club annual tour 2014 Tour Programme
Pentyrch bowling club annual tour 2014 Tour Programmekrakoweric
 
Αρραβώνας- Γάμος στο Σκοπό Ανατολικής Θράκης
Αρραβώνας- Γάμος στο Σκοπό Ανατολικής ΘράκηςΑρραβώνας- Γάμος στο Σκοπό Ανατολικής Θράκης
Αρραβώνας- Γάμος στο Σκοπό Ανατολικής ΘράκηςΚατερίνα Καραμπαΐρη
 
Geography definitions
Geography definitionsGeography definitions
Geography definitionsjeon1009
 
Dars e-hadith-volume005
Dars e-hadith-volume005Dars e-hadith-volume005
Dars e-hadith-volume005Hammadia
 
Leadership for Affordable Housing Evaluation Study
Leadership for Affordable Housing Evaluation StudyLeadership for Affordable Housing Evaluation Study
Leadership for Affordable Housing Evaluation Studymjbinstitute
 
336bad9a270ac2f1456caebe75899ceb
336bad9a270ac2f1456caebe75899ceb336bad9a270ac2f1456caebe75899ceb
336bad9a270ac2f1456caebe75899cebPimpaka Khampin
 
Cancer Workflows in VarSeq
Cancer Workflows in VarSeqCancer Workflows in VarSeq
Cancer Workflows in VarSeqGolden Helix Inc
 

Destaque (20)

Authoring Clinical Reports in VarSeq
Authoring Clinical Reports in VarSeqAuthoring Clinical Reports in VarSeq
Authoring Clinical Reports in VarSeq
 
Clinical Reporting Made Easy
Clinical Reporting Made EasyClinical Reporting Made Easy
Clinical Reporting Made Easy
 
Έθιμα της περιοχής μας
Έθιμα της περιοχής μαςΈθιμα της περιοχής μας
Έθιμα της περιοχής μας
 
Lucid what really matters to families
Lucid what really matters to familiesLucid what really matters to families
Lucid what really matters to families
 
Personalized Medicine through Tumor Sequencing
Personalized Medicine through Tumor SequencingPersonalized Medicine through Tumor Sequencing
Personalized Medicine through Tumor Sequencing
 
Advancing Agrigenomic Discoveries with Sequencing and GWAS Research
Advancing Agrigenomic Discoveries with Sequencing and GWAS ResearchAdvancing Agrigenomic Discoveries with Sequencing and GWAS Research
Advancing Agrigenomic Discoveries with Sequencing and GWAS Research
 
B2e11e1b9030c180a860a8450d5eceff
B2e11e1b9030c180a860a8450d5eceffB2e11e1b9030c180a860a8450d5eceff
B2e11e1b9030c180a860a8450d5eceff
 
τα δικαιώματα του παιδιού
τα δικαιώματα του παιδιούτα δικαιώματα του παιδιού
τα δικαιώματα του παιδιού
 
ใบงานCom 2-8
ใบงานCom 2-8ใบงานCom 2-8
ใบงานCom 2-8
 
Izabela lewicka kołobrzeg
Izabela lewicka kołobrzegIzabela lewicka kołobrzeg
Izabela lewicka kołobrzeg
 
συνομιλία τηλέμαχου αθηνάς
συνομιλία τηλέμαχου  αθηνάςσυνομιλία τηλέμαχου  αθηνάς
συνομιλία τηλέμαχου αθηνάς
 
Pharmacogenomic Prediction of Antracycline-induced Cardiotoxicity
Pharmacogenomic Prediction of Antracycline-induced CardiotoxicityPharmacogenomic Prediction of Antracycline-induced Cardiotoxicity
Pharmacogenomic Prediction of Antracycline-induced Cardiotoxicity
 
Pentyrch bowling club annual tour 2014 Tour Programme
Pentyrch bowling club annual tour 2014 Tour ProgrammePentyrch bowling club annual tour 2014 Tour Programme
Pentyrch bowling club annual tour 2014 Tour Programme
 
Αρραβώνας- Γάμος στο Σκοπό Ανατολικής Θράκης
Αρραβώνας- Γάμος στο Σκοπό Ανατολικής ΘράκηςΑρραβώνας- Γάμος στο Σκοπό Ανατολικής Θράκης
Αρραβώνας- Γάμος στο Σκοπό Ανατολικής Θράκης
 
Geography definitions
Geography definitionsGeography definitions
Geography definitions
 
Dars e-hadith-volume005
Dars e-hadith-volume005Dars e-hadith-volume005
Dars e-hadith-volume005
 
Leadership for Affordable Housing Evaluation Study
Leadership for Affordable Housing Evaluation StudyLeadership for Affordable Housing Evaluation Study
Leadership for Affordable Housing Evaluation Study
 
336bad9a270ac2f1456caebe75899ceb
336bad9a270ac2f1456caebe75899ceb336bad9a270ac2f1456caebe75899ceb
336bad9a270ac2f1456caebe75899ceb
 
Аудит
АудитАудит
Аудит
 
Cancer Workflows in VarSeq
Cancer Workflows in VarSeqCancer Workflows in VarSeq
Cancer Workflows in VarSeq
 

Semelhante a Using VarSeq to Improve Variant Analysis Research Workflows

Rare Variant Analysis Workflows: Analyzing NGS Data in Large Cohorts
Rare Variant Analysis Workflows: Analyzing NGS Data in Large CohortsRare Variant Analysis Workflows: Analyzing NGS Data in Large Cohorts
Rare Variant Analysis Workflows: Analyzing NGS Data in Large CohortsGolden Helix Inc
 
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...GenomeInABottle
 
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...Golden Helix Inc
 
Exploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVS
Exploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVSExploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVS
Exploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVSGolden Helix Inc
 
Knowing Your NGS Upstream: Alignment and Variants
Knowing Your NGS Upstream: Alignment and VariantsKnowing Your NGS Upstream: Alignment and Variants
Knowing Your NGS Upstream: Alignment and VariantsGolden Helix Inc
 
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511GenomeInABottle
 
Tools for Transcriptome Data Analysis
Tools for Transcriptome Data AnalysisTools for Transcriptome Data Analysis
Tools for Transcriptome Data AnalysisSANJANA PANDEY
 
GIAB for AMP GeT-RM Forum
GIAB for AMP GeT-RM ForumGIAB for AMP GeT-RM Forum
GIAB for AMP GeT-RM ForumGenomeInABottle
 
Kim Pruitt trainingbiocuration2015
Kim Pruitt trainingbiocuration2015Kim Pruitt trainingbiocuration2015
Kim Pruitt trainingbiocuration2015Kim D. Pruitt
 
2012 10-24 - ngs webinar
2012 10-24 - ngs webinar2012 10-24 - ngs webinar
2012 10-24 - ngs webinarElsa von Licy
 
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016Prof. Wim Van Criekinge
 
GIAB Integrating multiple technologies to form benchmark SVs 180517
GIAB Integrating multiple technologies to form benchmark SVs 180517GIAB Integrating multiple technologies to form benchmark SVs 180517
GIAB Integrating multiple technologies to form benchmark SVs 180517GenomeInABottle
 
Golden Helix's End-to-End Solution for Clinical Labs
Golden Helix's End-to-End Solution for Clinical LabsGolden Helix's End-to-End Solution for Clinical Labs
Golden Helix's End-to-End Solution for Clinical LabsGolden Helix
 
Giab for jax long read 190917
Giab for jax long read 190917Giab for jax long read 190917
Giab for jax long read 190917GenomeInABottle
 
XabTracker & SeqAgent: Integrated LIMS & Sequence Analysis Tools for Antibody...
XabTracker & SeqAgent: Integrated LIMS & Sequence Analysis Tools for Antibody...XabTracker & SeqAgent: Integrated LIMS & Sequence Analysis Tools for Antibody...
XabTracker & SeqAgent: Integrated LIMS & Sequence Analysis Tools for Antibody...Mark Evans
 
2015 TriCon - Clinical Grade Annotations - Public Data Resources for Interpre...
2015 TriCon - Clinical Grade Annotations - Public Data Resources for Interpre...2015 TriCon - Clinical Grade Annotations - Public Data Resources for Interpre...
2015 TriCon - Clinical Grade Annotations - Public Data Resources for Interpre...Gabe Rudy
 
Genome in a bottle for amp GeT-RM 181030
Genome in a bottle for amp GeT-RM 181030Genome in a bottle for amp GeT-RM 181030
Genome in a bottle for amp GeT-RM 181030GenomeInABottle
 
Annotation capabilities
Annotation capabilitiesAnnotation capabilities
Annotation capabilitiesGolden Helix
 
Dgaston dec-06-2012
Dgaston dec-06-2012Dgaston dec-06-2012
Dgaston dec-06-2012Dan Gaston
 
GIAB-GRC workshop oct2015 giab introduction 151005
GIAB-GRC workshop oct2015 giab introduction 151005GIAB-GRC workshop oct2015 giab introduction 151005
GIAB-GRC workshop oct2015 giab introduction 151005GenomeInABottle
 

Semelhante a Using VarSeq to Improve Variant Analysis Research Workflows (20)

Rare Variant Analysis Workflows: Analyzing NGS Data in Large Cohorts
Rare Variant Analysis Workflows: Analyzing NGS Data in Large CohortsRare Variant Analysis Workflows: Analyzing NGS Data in Large Cohorts
Rare Variant Analysis Workflows: Analyzing NGS Data in Large Cohorts
 
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
 
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
 
Exploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVS
Exploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVSExploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVS
Exploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVS
 
Knowing Your NGS Upstream: Alignment and Variants
Knowing Your NGS Upstream: Alignment and VariantsKnowing Your NGS Upstream: Alignment and Variants
Knowing Your NGS Upstream: Alignment and Variants
 
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
 
Tools for Transcriptome Data Analysis
Tools for Transcriptome Data AnalysisTools for Transcriptome Data Analysis
Tools for Transcriptome Data Analysis
 
GIAB for AMP GeT-RM Forum
GIAB for AMP GeT-RM ForumGIAB for AMP GeT-RM Forum
GIAB for AMP GeT-RM Forum
 
Kim Pruitt trainingbiocuration2015
Kim Pruitt trainingbiocuration2015Kim Pruitt trainingbiocuration2015
Kim Pruitt trainingbiocuration2015
 
2012 10-24 - ngs webinar
2012 10-24 - ngs webinar2012 10-24 - ngs webinar
2012 10-24 - ngs webinar
 
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
 
GIAB Integrating multiple technologies to form benchmark SVs 180517
GIAB Integrating multiple technologies to form benchmark SVs 180517GIAB Integrating multiple technologies to form benchmark SVs 180517
GIAB Integrating multiple technologies to form benchmark SVs 180517
 
Golden Helix's End-to-End Solution for Clinical Labs
Golden Helix's End-to-End Solution for Clinical LabsGolden Helix's End-to-End Solution for Clinical Labs
Golden Helix's End-to-End Solution for Clinical Labs
 
Giab for jax long read 190917
Giab for jax long read 190917Giab for jax long read 190917
Giab for jax long read 190917
 
XabTracker & SeqAgent: Integrated LIMS & Sequence Analysis Tools for Antibody...
XabTracker & SeqAgent: Integrated LIMS & Sequence Analysis Tools for Antibody...XabTracker & SeqAgent: Integrated LIMS & Sequence Analysis Tools for Antibody...
XabTracker & SeqAgent: Integrated LIMS & Sequence Analysis Tools for Antibody...
 
2015 TriCon - Clinical Grade Annotations - Public Data Resources for Interpre...
2015 TriCon - Clinical Grade Annotations - Public Data Resources for Interpre...2015 TriCon - Clinical Grade Annotations - Public Data Resources for Interpre...
2015 TriCon - Clinical Grade Annotations - Public Data Resources for Interpre...
 
Genome in a bottle for amp GeT-RM 181030
Genome in a bottle for amp GeT-RM 181030Genome in a bottle for amp GeT-RM 181030
Genome in a bottle for amp GeT-RM 181030
 
Annotation capabilities
Annotation capabilitiesAnnotation capabilities
Annotation capabilities
 
Dgaston dec-06-2012
Dgaston dec-06-2012Dgaston dec-06-2012
Dgaston dec-06-2012
 
GIAB-GRC workshop oct2015 giab introduction 151005
GIAB-GRC workshop oct2015 giab introduction 151005GIAB-GRC workshop oct2015 giab introduction 151005
GIAB-GRC workshop oct2015 giab introduction 151005
 

Mais de Golden Helix Inc

Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...
Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...
Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...Golden Helix Inc
 
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...Golden Helix Inc
 
SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...
SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...
SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...Golden Helix Inc
 
Two Clinical Workflows - From Unfiltered Variants to a Clinical Report
Two Clinical Workflows - From Unfiltered Variants to a Clinical ReportTwo Clinical Workflows - From Unfiltered Variants to a Clinical Report
Two Clinical Workflows - From Unfiltered Variants to a Clinical ReportGolden Helix Inc
 
The Molecular Sciences Made Personal
The Molecular Sciences Made PersonalThe Molecular Sciences Made Personal
The Molecular Sciences Made PersonalGolden Helix Inc
 
Prediction and Meta-Analysis
Prediction and Meta-AnalysisPrediction and Meta-Analysis
Prediction and Meta-AnalysisGolden Helix Inc
 
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeqIntroducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeqGolden Helix Inc
 
MM-KBAC – Using Mixed Models to Adjust for Population Structure in a Rare-var...
MM-KBAC – Using Mixed Models to Adjust for Population Structure in a Rare-var...MM-KBAC – Using Mixed Models to Adjust for Population Structure in a Rare-var...
MM-KBAC – Using Mixed Models to Adjust for Population Structure in a Rare-var...Golden Helix Inc
 
Getting Started with VSWarehouse - The User Experience
Getting Started with VSWarehouse - The User ExperienceGetting Started with VSWarehouse - The User Experience
Getting Started with VSWarehouse - The User ExperienceGolden Helix Inc
 
Using WES in Distant Relationships to Identify Cardiomyopathy Genes
Using WES in Distant Relationships to Identify Cardiomyopathy GenesUsing WES in Distant Relationships to Identify Cardiomyopathy Genes
Using WES in Distant Relationships to Identify Cardiomyopathy GenesGolden Helix Inc
 
Using Clinical Reports as a part of a Gene Panel Pipeline
Using Clinical Reports as a part of a Gene Panel PipelineUsing Clinical Reports as a part of a Gene Panel Pipeline
Using Clinical Reports as a part of a Gene Panel PipelineGolden Helix Inc
 
Investigating Shared Additive Genetic Variation for Alcohol Dependence
Investigating Shared Additive Genetic Variation for Alcohol DependenceInvestigating Shared Additive Genetic Variation for Alcohol Dependence
Investigating Shared Additive Genetic Variation for Alcohol DependenceGolden Helix Inc
 
Population Structure & Genetic Improvement in Livestock
Population Structure & Genetic Improvement in LivestockPopulation Structure & Genetic Improvement in Livestock
Population Structure & Genetic Improvement in LivestockGolden Helix Inc
 
GWAS in a model organism: Arabidopsis thaliana
GWAS in a model organism: Arabidopsis thalianaGWAS in a model organism: Arabidopsis thaliana
GWAS in a model organism: Arabidopsis thalianaGolden Helix Inc
 
MM - KBAC: Using mixed models to adjust for population structure in a rare-va...
MM - KBAC: Using mixed models to adjust for population structure in a rare-va...MM - KBAC: Using mixed models to adjust for population structure in a rare-va...
MM - KBAC: Using mixed models to adjust for population structure in a rare-va...Golden Helix Inc
 
New Study Identifies High-Risk Variants Associated with Autism Spectrum Disor...
New Study Identifies High-Risk Variants Associated with Autism Spectrum Disor...New Study Identifies High-Risk Variants Associated with Autism Spectrum Disor...
New Study Identifies High-Risk Variants Associated with Autism Spectrum Disor...Golden Helix Inc
 

Mais de Golden Helix Inc (20)

Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...
Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...
Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...
 
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
 
SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...
SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...
SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...
 
Two Clinical Workflows - From Unfiltered Variants to a Clinical Report
Two Clinical Workflows - From Unfiltered Variants to a Clinical ReportTwo Clinical Workflows - From Unfiltered Variants to a Clinical Report
Two Clinical Workflows - From Unfiltered Variants to a Clinical Report
 
The Molecular Sciences Made Personal
The Molecular Sciences Made PersonalThe Molecular Sciences Made Personal
The Molecular Sciences Made Personal
 
Prediction and Meta-Analysis
Prediction and Meta-AnalysisPrediction and Meta-Analysis
Prediction and Meta-Analysis
 
A Walk Through GWAS
A Walk Through GWASA Walk Through GWAS
A Walk Through GWAS
 
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeqIntroducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
 
MM-KBAC – Using Mixed Models to Adjust for Population Structure in a Rare-var...
MM-KBAC – Using Mixed Models to Adjust for Population Structure in a Rare-var...MM-KBAC – Using Mixed Models to Adjust for Population Structure in a Rare-var...
MM-KBAC – Using Mixed Models to Adjust for Population Structure in a Rare-var...
 
Getting Started with VSWarehouse - The User Experience
Getting Started with VSWarehouse - The User ExperienceGetting Started with VSWarehouse - The User Experience
Getting Started with VSWarehouse - The User Experience
 
Custom Family Workflows
Custom Family WorkflowsCustom Family Workflows
Custom Family Workflows
 
Using WES in Distant Relationships to Identify Cardiomyopathy Genes
Using WES in Distant Relationships to Identify Cardiomyopathy GenesUsing WES in Distant Relationships to Identify Cardiomyopathy Genes
Using WES in Distant Relationships to Identify Cardiomyopathy Genes
 
Using Clinical Reports as a part of a Gene Panel Pipeline
Using Clinical Reports as a part of a Gene Panel PipelineUsing Clinical Reports as a part of a Gene Panel Pipeline
Using Clinical Reports as a part of a Gene Panel Pipeline
 
Investigating Shared Additive Genetic Variation for Alcohol Dependence
Investigating Shared Additive Genetic Variation for Alcohol DependenceInvestigating Shared Additive Genetic Variation for Alcohol Dependence
Investigating Shared Additive Genetic Variation for Alcohol Dependence
 
CNV Analysis in VarSeq
CNV Analysis in VarSeqCNV Analysis in VarSeq
CNV Analysis in VarSeq
 
Beagle Imputation in SVS
Beagle Imputation in SVSBeagle Imputation in SVS
Beagle Imputation in SVS
 
Population Structure & Genetic Improvement in Livestock
Population Structure & Genetic Improvement in LivestockPopulation Structure & Genetic Improvement in Livestock
Population Structure & Genetic Improvement in Livestock
 
GWAS in a model organism: Arabidopsis thaliana
GWAS in a model organism: Arabidopsis thalianaGWAS in a model organism: Arabidopsis thaliana
GWAS in a model organism: Arabidopsis thaliana
 
MM - KBAC: Using mixed models to adjust for population structure in a rare-va...
MM - KBAC: Using mixed models to adjust for population structure in a rare-va...MM - KBAC: Using mixed models to adjust for population structure in a rare-va...
MM - KBAC: Using mixed models to adjust for population structure in a rare-va...
 
New Study Identifies High-Risk Variants Associated with Autism Spectrum Disor...
New Study Identifies High-Risk Variants Associated with Autism Spectrum Disor...New Study Identifies High-Risk Variants Associated with Autism Spectrum Disor...
New Study Identifies High-Risk Variants Associated with Autism Spectrum Disor...
 

Último

Loudspeaker- direct radiating type and horn type.pptx
Loudspeaker- direct radiating type and horn type.pptxLoudspeaker- direct radiating type and horn type.pptx
Loudspeaker- direct radiating type and horn type.pptxpriyankatabhane
 
bonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girlsbonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girlshansessene
 
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep LearningCombining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learningvschiavoni
 
The Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and FunctionThe Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and FunctionJadeNovelo1
 
final waves properties grade 7 - third quarter
final waves properties grade 7 - third quarterfinal waves properties grade 7 - third quarter
final waves properties grade 7 - third quarterHanHyoKim
 
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests GlycosidesGLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests GlycosidesNandakishor Bhaurao Deshmukh
 
whole genome sequencing new and its types including shortgun and clone by clone
whole genome sequencing new  and its types including shortgun and clone by clonewhole genome sequencing new  and its types including shortgun and clone by clone
whole genome sequencing new and its types including shortgun and clone by clonechaudhary charan shingh university
 
FBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptxFBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptxPayal Shrivastava
 
DNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptxDNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptxGiDMOh
 
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024Jene van der Heide
 
Abnormal LFTs rate of deco and NAFLD.pptx
Abnormal LFTs rate of deco and NAFLD.pptxAbnormal LFTs rate of deco and NAFLD.pptx
Abnormal LFTs rate of deco and NAFLD.pptxzeus70441
 
How we decide powerpoint presentation.pptx
How we decide powerpoint presentation.pptxHow we decide powerpoint presentation.pptx
How we decide powerpoint presentation.pptxJosielynTars
 
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Christina Parmionova
 
Forensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptxForensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptxkumarsanjai28051
 
Observational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsObservational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsSérgio Sacani
 
Gas-ExchangeS-in-Plants-and-Animals.pptx
Gas-ExchangeS-in-Plants-and-Animals.pptxGas-ExchangeS-in-Plants-and-Animals.pptx
Gas-ExchangeS-in-Plants-and-Animals.pptxGiovaniTrinidad
 
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...D. B. S. College Kanpur
 
well logging & petrophysical analysis.pptx
well logging & petrophysical analysis.pptxwell logging & petrophysical analysis.pptx
well logging & petrophysical analysis.pptxzaydmeerab121
 

Último (20)

Loudspeaker- direct radiating type and horn type.pptx
Loudspeaker- direct radiating type and horn type.pptxLoudspeaker- direct radiating type and horn type.pptx
Loudspeaker- direct radiating type and horn type.pptx
 
bonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girlsbonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girls
 
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep LearningCombining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
 
The Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and FunctionThe Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and Function
 
final waves properties grade 7 - third quarter
final waves properties grade 7 - third quarterfinal waves properties grade 7 - third quarter
final waves properties grade 7 - third quarter
 
Let’s Say Someone Did Drop the Bomb. Then What?
Let’s Say Someone Did Drop the Bomb. Then What?Let’s Say Someone Did Drop the Bomb. Then What?
Let’s Say Someone Did Drop the Bomb. Then What?
 
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests GlycosidesGLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
 
whole genome sequencing new and its types including shortgun and clone by clone
whole genome sequencing new  and its types including shortgun and clone by clonewhole genome sequencing new  and its types including shortgun and clone by clone
whole genome sequencing new and its types including shortgun and clone by clone
 
FBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptxFBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptx
 
DNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptxDNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptx
 
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
 
Abnormal LFTs rate of deco and NAFLD.pptx
Abnormal LFTs rate of deco and NAFLD.pptxAbnormal LFTs rate of deco and NAFLD.pptx
Abnormal LFTs rate of deco and NAFLD.pptx
 
How we decide powerpoint presentation.pptx
How we decide powerpoint presentation.pptxHow we decide powerpoint presentation.pptx
How we decide powerpoint presentation.pptx
 
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
 
Forensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptxForensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptx
 
Observational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsObservational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive stars
 
Gas-ExchangeS-in-Plants-and-Animals.pptx
Gas-ExchangeS-in-Plants-and-Animals.pptxGas-ExchangeS-in-Plants-and-Animals.pptx
Gas-ExchangeS-in-Plants-and-Animals.pptx
 
PLASMODIUM. PPTX
PLASMODIUM. PPTXPLASMODIUM. PPTX
PLASMODIUM. PPTX
 
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
 
well logging & petrophysical analysis.pptx
well logging & petrophysical analysis.pptxwell logging & petrophysical analysis.pptx
well logging & petrophysical analysis.pptx
 

Using VarSeq to Improve Variant Analysis Research Workflows

  • 1. Using VarSeq to Improve Variant Analysis Research June 10, 2015 G Bryce Christensen Director of Services
  • 2. Use the Questions pane in your GoToWebinar window Questions during the presentation
  • 3. Agenda What makes a damaging variant? VarSeq Interactive Demonstration 2 3 4 QC Considerations Variant analysis workflows1
  • 4. What is VarSeq? VarSeq Simple Flexible Scalable  Variant annotation, filtering and ranking  Repeatable workflows  Rich visualizations with GenomeBrowse integration  Powerful GUI and command-line interfaces
  • 5. Workflow Development Process in VarSeq 1. Begin from one or many VCF files 2. Annotate variants using public data sources curated by Golden Helix and/or annotate with custom data sources. 3. Run additional computation algorithms - Allele counts, genotype zygosity, gene list matching, etc 4. Construct filter chain to identify candidate variants - May use combinations of logical operators in filters - May have multiple independent filter chains and/or endpoints 5. Process results - Gene Ranking with PhoRank - Review variant QC - Vizualization with GenomeBrowse - Commit variants to local database - Etc.
  • 6. Annotations are the key  Good variant analysis begins with accurate annotations.  Golden Helix invests extensive time and effort in validating and maintaining data sources.  Annotation data sources may be used for either quality control or analytic purposes.
  • 7. Defining Deleteriousness  What makes a variant potentially damaging?  Start by defining the search space: - Rare, non-synonymous, homozygous variants? - DeNovo mutations in highly conserved genes? - Splice-site mutations? - Etc.  Review annotations for remaining variants to identify causal candidates  Which annotations to use?
  • 8. Variant Classification  VarSeq classifies variants into 20+ different categories  The categories are further grouped as: - Loss of Function - Missense - Other  Choice of gene transcript reference - RefSeq - Ensembl - Others
  • 9. ClinVar  ClinVar is a public archive of variants evaluated for potential causal relationships to diseases  Submissions from many sources, including major clinical laboratories  Over 100k records  Updated monthly
  • 10. Functional Predictions  Functional predictions use algorithms to determine the expected consequence of variants (or the resulting amino acid substitutions).  dbNSFP - The Database for NonSynonymous Functional Predictions (dbNSFP) is a free tool developed by Dr. Xiaoming Liu. - Catalogs pre-computed conservation and functional prediction scores for all possible missense SNVs in the genome - Methods include SIFT, PolyPhen-2, MutationTaster, MutationAssessor, FATHMM, more  dbscSNV - Companion to dbNSFP that scores variants in splice consensus regions - Variants in these regions may disrupt normal gene expression and/or function  dbNSFP and dbscSNV are both accessible in VarSeq
  • 11. Variant/Gene Ranking  PhoRank algorithm in VarSeq uses HPO and GO terminology to score relationships between genes and phenotypes  Very useful to prioritize a long list of variants for individual review  Based on PHEVOR method.
  • 12. QC Considerations  Variant QC  Rare variants deserve special attention  VCF/BAM Data: - Depth - DP - Quality - GQ - Strand bias - Etc.  Public Annotations: - “Mappability”
  • 13. Mappability Annotations  The human reference genome has assembly gaps and other “difficult” regions  NGS technology sequences short DNA fragments which are the aligned to the reference genome - Most sequences are aligned correctly - Some sequences can’t be aligned uniquely - Some sequences may be incorrectly aligned  Luckily, we can predict many of the trouble spots
  • 14. Segmental Duplications  Segmental duplications are a common confounder  UCSC “Genomic Super Dups” annotation available through VarSeq  Recent Example (below): - Apparent UPD feature in family trio was determined to be an artifact of seg. duplication - Large chromosome segment duplicated elsewhere with >98% similarity
  • 15. Emerging Standards  Several organizations working on best practices guidelines for genome mappability - 1000 Genomes Project - Genome in a Bottle Consortium - Global Alliance for Genomics and Health (GA4GH) - National Institute of Standards and Technology  Downloadable annotations available for many types of features: - Mappability by read length - High G-C content regions - Low complexity - Segmental duplications - Etc.
  • 16. Example: 1kG Low Complexity Regions
  • 17. Example: GA4GH 150-bp Mappability
  • 18. VarSeq Demonstration Data  Exome sequencing of five individuals from family with familial cardiac conduction disease (CCD)  Raw sequence data obtained from SRA
  • 19. Workflow Discussion Points  Male-to-male transmission makes X- linked model unlikely  May follow dominant or recessive transmission  Inherited forms of CCD are rare  Family has East Asian ancestry
  • 21. Why VarSeq? VarSeq Simple Flexible Scalable  Variant annotation, filtering and ranking  Exploratory analysis  Powerful GUI with immediate feedback  Rich visualizations with GenomeBrowse integration
  • 22. Questions or more info:  Email info@goldenhelix.com  Request an evaluation of the software at www.goldenhelix.com
  • 23. Questions? Use the Questions pane in your GoToWebinar window