SlideShare uma empresa Scribd logo
1 de 26
Next Generation Sequencing
for
Cancer Genomics 101
2017/02/10
Ino de Bruijn
Jorge Reis-Filho Lab
Content
• DNA sequencing
– Targets
• DNA, genes, exons, introns
• RNA
• How do we analyze NGS data?
– Genetic changes
– Mutational Signatures
– RNASeq
Sequencing DNA in the modern era
• DNA Sequencing is to convert real world DNA to digital DNA
• In 1980s
– Sanger sequencing
– Compare short regions of DNA
• Possible by hand
• In mid 2000s
– Parallelization of sequencing
reactions
– Generates billions of DNA reads
• DNA read: short stretch of DNA
– Compare whole genomes
• Impossible by hand
CACGTCTAAGGGCGAAGAGCTGACTGCTTTTTT
Targeting parts of the genome
• Human genome has 3 billion bases
• Be cost effective:
– Focus on part of genome related to your
subject
What is a gene?
• Human genome 3 billion bases
– 23 Chromosomes
– Certain stretches of DNA are code for proteins
which perform a wide variety of functions in
your body (~20,000 in total)
Gene to protein
Exons comprise only
~1.5% of genome
Different targets
• Whole Genome Sequencing (WGS)
– E.g. Rearrangements outside genes
• Whole Exome Sequencing (WES)
– E.g. Gene Discovery (Rare/unknown tumors)
• Custom target
– MSK-IMPACT (Integrated Mutation Profiling
of Actionable CancerTargets)
• 410 genes related to cancer
• >15K patents profiled at MSKCC
– https://cbioportal.mskcc.org/study?id=mskimpact
NGS Principles
NGS Principles - Coverage
Sequence same part many times:
Coverage is number of times a base is covered by a read
NGS Principles - Coverage
• Not all reads retrieved are correct
– Many errors when sequencing
• DNA Library prep protocol
• Sequencing error rate
• Sequencing groups of cells
– Certain genetic changes only in small fraction
of cells
• Need to sequence the same part multiple
times to get confidence
– Amount depends on analysis & expectation
How to analyze the NGS data?
• Some might guess this is where the
bioinformatician comes in…
How to analyze the NGS data?
• Some might guess this is where the
bioinformatician comes in…
Too late - the bioinformatician should have
been helping you design the experiment 
How to analyze NGS data?
• Tons of different options
– What is the research question?
• Common analysis: identify genetic changes
in the tumor
Identify the genetic changes
Meyerson et al. Nat Rev Genet 2010
Identify the genetic changes
• Compare against reference human
genome
– Gives both germline and somatic mutations
• How to differentiate?
– Databases with common germline variants misses many
• Somatic mutations
– Take DNA from normal cells and tumor cells
– Filter mutations in normal
Identify somatic mutations
Identify mutations
• Automated pipelines to do this
– Example: Mutation calling tools take into
account
• Number of reads having the mutation versus all
reads (Mutation Allelic Fraction (MAF))
• Coverage at that position
• Read quality score
• If calling somatic mutations
– Mutation in the normal
• Every parameter makes assumptions about
the data – communicate the goal of the
project
Categorize Mutations
• Silent/Nonsilent
– Does the mutation alter phenotype?
• In exonic region
– Synonymous: Amino Acid Code stay the same
– Nonsynonymous: Changes Amino Acid Code
of protein
Categorize Mutations
• Oncogenesis
– Oncogenes (the gas)
• Cell growth
• Activation causes cancer
– Tumor Suppressor Genes (the breaks)
• DNA repair, slow down cell division
• Loss of function causes cancer
– Two Hit Hypothesis (Knudson 1971)
Mutational Signatures
• Find activated mutational processes
• Use the identified SNVs (single nucleotide
variants) to determine
– Use 1 base context on both 5’ and 3’ side
• .C. > .T.
• 6 base transition classes
– C>A, C>G, C>T,T>A,T>C,T>G
• 4 possible bases on both sides
• Total: 6 * 4 * 4 = 96 possible transitions
Mutational Signatures
Alexandrov, L.B. et al. Nature 2013
Biological processes generating
somatic mutations in cancer samples
Dataset:
4,938,362 mutations from 7,042
cancers
Aging signature Defective DNA MMR signature POLE signature
I T
CG>TA transitions at NpCpG
I, indels; T, transcriptional strand bias
CG>TA transitions at NpCpG
CG>AT transversions at CpCpC
C>A transversions at TpCpT; T>G at TpTpT
Copy Number Analysis
(amplification)
(gain)
(neutral)
(loss)
(deletion)
Relative
copy
number:
RNA-Seq Analysis
• Gene expression
– Find low or highly expressed genes
Breast Invasive Carcinoma (TCGA, Nature 2012)
AMP+Upreg AMP Upreg
RNA-Seq Analysis
• Gene expression as prognosis indicator
Verhaak et al (JCI, 2013)
RNA-Seq Analysis
• Fusion gene detection
– E.g.TMPRSS2:ERGa (50% prostate cancers)
debruiji@mskcc.org

Mais conteúdo relacionado

Mais procurados

Next generation sequencing methods
Next generation sequencing methods Next generation sequencing methods
Next generation sequencing methods Mrinal Vashisth
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencingUzma Jabeen
 
Introduction to Next-Generation Sequencing (NGS) Technology
Introduction to Next-Generation Sequencing (NGS) TechnologyIntroduction to Next-Generation Sequencing (NGS) Technology
Introduction to Next-Generation Sequencing (NGS) TechnologyQIAGEN
 
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression AnalysisSo you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression AnalysisUniversity of California, Davis
 
Variant calling and how to prioritize somatic mutations and inheritated varia...
Variant calling and how to prioritize somatic mutations and inheritated varia...Variant calling and how to prioritize somatic mutations and inheritated varia...
Variant calling and how to prioritize somatic mutations and inheritated varia...Vall d'Hebron Institute of Research (VHIR)
 
Transcript detection in RNAseq
Transcript detection in RNAseqTranscript detection in RNAseq
Transcript detection in RNAseqDenis C. Bauer
 
High Throughput Sequencing Technologies: On the path to the $0* genome
High Throughput Sequencing Technologies: On the path to the $0* genomeHigh Throughput Sequencing Technologies: On the path to the $0* genome
High Throughput Sequencing Technologies: On the path to the $0* genomeBrian Krueger
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomicsAjit Shinde
 
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...VHIR Vall d’Hebron Institut de Recerca
 
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...VHIR Vall d’Hebron Institut de Recerca
 
RNA-seq: Mapping and quality control - part 3
RNA-seq: Mapping and quality control - part 3RNA-seq: Mapping and quality control - part 3
RNA-seq: Mapping and quality control - part 3BITS
 
Transcriptomics: A time efficient tool for crop improvement
Transcriptomics: A time efficient tool for crop improvementTranscriptomics: A time efficient tool for crop improvement
Transcriptomics: A time efficient tool for crop improvementSajid Sheikh
 
GBS: Genotyping by sequencing
GBS: Genotyping by sequencingGBS: Genotyping by sequencing
GBS: Genotyping by sequencingsampath perumal
 
Workshop NGS data analysis - 1
Workshop NGS data analysis - 1Workshop NGS data analysis - 1
Workshop NGS data analysis - 1Maté Ongenaert
 

Mais procurados (20)

New generation Sequencing
New generation Sequencing New generation Sequencing
New generation Sequencing
 
Data analysis pipelines for NGS applications
Data analysis pipelines for NGS applicationsData analysis pipelines for NGS applications
Data analysis pipelines for NGS applications
 
Next generation sequencing methods
Next generation sequencing methods Next generation sequencing methods
Next generation sequencing methods
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
Introduction to Next-Generation Sequencing (NGS) Technology
Introduction to Next-Generation Sequencing (NGS) TechnologyIntroduction to Next-Generation Sequencing (NGS) Technology
Introduction to Next-Generation Sequencing (NGS) Technology
 
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression AnalysisSo you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
 
Variant calling and how to prioritize somatic mutations and inheritated varia...
Variant calling and how to prioritize somatic mutations and inheritated varia...Variant calling and how to prioritize somatic mutations and inheritated varia...
Variant calling and how to prioritize somatic mutations and inheritated varia...
 
Transcript detection in RNAseq
Transcript detection in RNAseqTranscript detection in RNAseq
Transcript detection in RNAseq
 
High Throughput Sequencing Technologies: On the path to the $0* genome
High Throughput Sequencing Technologies: On the path to the $0* genomeHigh Throughput Sequencing Technologies: On the path to the $0* genome
High Throughput Sequencing Technologies: On the path to the $0* genome
 
Rna seq pipeline
Rna seq pipelineRna seq pipeline
Rna seq pipeline
 
Exome sequence analysis
Exome sequence analysisExome sequence analysis
Exome sequence analysis
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
 
RNA-seq Analysis
RNA-seq AnalysisRNA-seq Analysis
RNA-seq Analysis
 
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
 
RNA-seq: Mapping and quality control - part 3
RNA-seq: Mapping and quality control - part 3RNA-seq: Mapping and quality control - part 3
RNA-seq: Mapping and quality control - part 3
 
Transcriptomics: A time efficient tool for crop improvement
Transcriptomics: A time efficient tool for crop improvementTranscriptomics: A time efficient tool for crop improvement
Transcriptomics: A time efficient tool for crop improvement
 
GBS: Genotyping by sequencing
GBS: Genotyping by sequencingGBS: Genotyping by sequencing
GBS: Genotyping by sequencing
 
Workshop NGS data analysis - 1
Workshop NGS data analysis - 1Workshop NGS data analysis - 1
Workshop NGS data analysis - 1
 
Snp genotyping
Snp genotypingSnp genotyping
Snp genotyping
 

Semelhante a 20170209 ngs for_cancer_genomics_101

SNPs analysis methods
SNPs analysis methodsSNPs analysis methods
SNPs analysis methodshad89
 
Genomics and proteomics (Bioinformatics)
Genomics and proteomics (Bioinformatics)Genomics and proteomics (Bioinformatics)
Genomics and proteomics (Bioinformatics)Sijo A
 
CancerBioinformatics_DataTypesResources_012215.pptx
CancerBioinformatics_DataTypesResources_012215.pptxCancerBioinformatics_DataTypesResources_012215.pptx
CancerBioinformatics_DataTypesResources_012215.pptxQiZhi2
 
Comparative and functional genomics
Comparative and functional genomicsComparative and functional genomics
Comparative and functional genomicsJalormi Parekh
 
Microarry andd NGS.pdf
Microarry andd NGS.pdfMicroarry andd NGS.pdf
Microarry andd NGS.pdfnedalalazzwy
 
Gene expression introduction
Gene expression introductionGene expression introduction
Gene expression introductionSetia Pramana
 
Introduction to epigenetics and study design
Introduction to epigenetics and study designIntroduction to epigenetics and study design
Introduction to epigenetics and study designamlbinder
 
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...Intel IT Center
 
Next Generation Sequencing
Next Generation SequencingNext Generation Sequencing
Next Generation Sequencingshinycthomas
 
1.introduction to genetic engineering and restriction enzymes
1.introduction to genetic engineering and restriction enzymes1.introduction to genetic engineering and restriction enzymes
1.introduction to genetic engineering and restriction enzymesGetachew Birhanu
 
Genomics and bioinformatics
Genomics and bioinformatics Genomics and bioinformatics
Genomics and bioinformatics Senthil Natesan
 

Semelhante a 20170209 ngs for_cancer_genomics_101 (20)

SNPs analysis methods
SNPs analysis methodsSNPs analysis methods
SNPs analysis methods
 
Genomics and proteomics (Bioinformatics)
Genomics and proteomics (Bioinformatics)Genomics and proteomics (Bioinformatics)
Genomics and proteomics (Bioinformatics)
 
Microbial physiology in genomic era
Microbial physiology in genomic eraMicrobial physiology in genomic era
Microbial physiology in genomic era
 
CancerBioinformatics_DataTypesResources_012215.pptx
CancerBioinformatics_DataTypesResources_012215.pptxCancerBioinformatics_DataTypesResources_012215.pptx
CancerBioinformatics_DataTypesResources_012215.pptx
 
Comparative and functional genomics
Comparative and functional genomicsComparative and functional genomics
Comparative and functional genomics
 
Genomics types
Genomics typesGenomics types
Genomics types
 
Microarry andd NGS.pdf
Microarry andd NGS.pdfMicroarry andd NGS.pdf
Microarry andd NGS.pdf
 
Gene expression introduction
Gene expression introductionGene expression introduction
Gene expression introduction
 
Introduction to epigenetics and study design
Introduction to epigenetics and study designIntroduction to epigenetics and study design
Introduction to epigenetics and study design
 
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
 
Next Generation Sequencing
Next Generation SequencingNext Generation Sequencing
Next Generation Sequencing
 
microarrary
microarrarymicroarrary
microarrary
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
1.introduction to genetic engineering and restriction enzymes
1.introduction to genetic engineering and restriction enzymes1.introduction to genetic engineering and restriction enzymes
1.introduction to genetic engineering and restriction enzymes
 
Genomics and bioinformatics
Genomics and bioinformatics Genomics and bioinformatics
Genomics and bioinformatics
 
genesequencing-200105073623 (1).pdf
genesequencing-200105073623 (1).pdfgenesequencing-200105073623 (1).pdf
genesequencing-200105073623 (1).pdf
 
Gene Sequencing
Gene SequencingGene Sequencing
Gene Sequencing
 
Gene cloning
Gene cloningGene cloning
Gene cloning
 
Ngs presentation
Ngs presentationNgs presentation
Ngs presentation
 
Dna chip
Dna chipDna chip
Dna chip
 

Último

Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLkantirani197
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Monika Rani
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Servicemonikaservice1
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryAlex Henderson
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptxAlMamun560346
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑Damini Dixit
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxFarihaAbdulRasheed
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and ClassificationsAreesha Ahmad
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICEayushi9330
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfrohankumarsinghrore1
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedDelhi Call girls
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Silpa
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 

Último (20)

Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 

20170209 ngs for_cancer_genomics_101

  • 1. Next Generation Sequencing for Cancer Genomics 101 2017/02/10 Ino de Bruijn Jorge Reis-Filho Lab
  • 2. Content • DNA sequencing – Targets • DNA, genes, exons, introns • RNA • How do we analyze NGS data? – Genetic changes – Mutational Signatures – RNASeq
  • 3. Sequencing DNA in the modern era • DNA Sequencing is to convert real world DNA to digital DNA • In 1980s – Sanger sequencing – Compare short regions of DNA • Possible by hand • In mid 2000s – Parallelization of sequencing reactions – Generates billions of DNA reads • DNA read: short stretch of DNA – Compare whole genomes • Impossible by hand CACGTCTAAGGGCGAAGAGCTGACTGCTTTTTT
  • 4. Targeting parts of the genome • Human genome has 3 billion bases • Be cost effective: – Focus on part of genome related to your subject
  • 5. What is a gene? • Human genome 3 billion bases – 23 Chromosomes – Certain stretches of DNA are code for proteins which perform a wide variety of functions in your body (~20,000 in total)
  • 6. Gene to protein Exons comprise only ~1.5% of genome
  • 7. Different targets • Whole Genome Sequencing (WGS) – E.g. Rearrangements outside genes • Whole Exome Sequencing (WES) – E.g. Gene Discovery (Rare/unknown tumors) • Custom target – MSK-IMPACT (Integrated Mutation Profiling of Actionable CancerTargets) • 410 genes related to cancer • >15K patents profiled at MSKCC – https://cbioportal.mskcc.org/study?id=mskimpact
  • 9. NGS Principles - Coverage Sequence same part many times: Coverage is number of times a base is covered by a read
  • 10. NGS Principles - Coverage • Not all reads retrieved are correct – Many errors when sequencing • DNA Library prep protocol • Sequencing error rate • Sequencing groups of cells – Certain genetic changes only in small fraction of cells • Need to sequence the same part multiple times to get confidence – Amount depends on analysis & expectation
  • 11. How to analyze the NGS data? • Some might guess this is where the bioinformatician comes in…
  • 12. How to analyze the NGS data? • Some might guess this is where the bioinformatician comes in… Too late - the bioinformatician should have been helping you design the experiment 
  • 13. How to analyze NGS data? • Tons of different options – What is the research question? • Common analysis: identify genetic changes in the tumor
  • 14. Identify the genetic changes Meyerson et al. Nat Rev Genet 2010
  • 15. Identify the genetic changes • Compare against reference human genome – Gives both germline and somatic mutations • How to differentiate? – Databases with common germline variants misses many • Somatic mutations – Take DNA from normal cells and tumor cells – Filter mutations in normal
  • 17. Identify mutations • Automated pipelines to do this – Example: Mutation calling tools take into account • Number of reads having the mutation versus all reads (Mutation Allelic Fraction (MAF)) • Coverage at that position • Read quality score • If calling somatic mutations – Mutation in the normal • Every parameter makes assumptions about the data – communicate the goal of the project
  • 18. Categorize Mutations • Silent/Nonsilent – Does the mutation alter phenotype? • In exonic region – Synonymous: Amino Acid Code stay the same – Nonsynonymous: Changes Amino Acid Code of protein
  • 19. Categorize Mutations • Oncogenesis – Oncogenes (the gas) • Cell growth • Activation causes cancer – Tumor Suppressor Genes (the breaks) • DNA repair, slow down cell division • Loss of function causes cancer – Two Hit Hypothesis (Knudson 1971)
  • 20. Mutational Signatures • Find activated mutational processes • Use the identified SNVs (single nucleotide variants) to determine – Use 1 base context on both 5’ and 3’ side • .C. > .T. • 6 base transition classes – C>A, C>G, C>T,T>A,T>C,T>G • 4 possible bases on both sides • Total: 6 * 4 * 4 = 96 possible transitions
  • 21. Mutational Signatures Alexandrov, L.B. et al. Nature 2013 Biological processes generating somatic mutations in cancer samples Dataset: 4,938,362 mutations from 7,042 cancers Aging signature Defective DNA MMR signature POLE signature I T CG>TA transitions at NpCpG I, indels; T, transcriptional strand bias CG>TA transitions at NpCpG CG>AT transversions at CpCpC C>A transversions at TpCpT; T>G at TpTpT
  • 23. RNA-Seq Analysis • Gene expression – Find low or highly expressed genes Breast Invasive Carcinoma (TCGA, Nature 2012) AMP+Upreg AMP Upreg
  • 24. RNA-Seq Analysis • Gene expression as prognosis indicator Verhaak et al (JCI, 2013)
  • 25. RNA-Seq Analysis • Fusion gene detection – E.g.TMPRSS2:ERGa (50% prostate cancers)