Bioc4010 Sample Questions:

1. A) What is the base call accuracy of a base in an Illumina sequenced short
read with a Q value of 20?
B) Is this better or worse than a Q value of 10?

Answer: A) An error probability of 1 in 100, i.e. 99% call accuracy.
B) Better. Q10 corresponds to an error probability of 1 in 10, i.e. 90% call accuracy.


Formula: Q = -10 log10(P), where P is the probability that the base call is incorrect
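
The formula can be checked numerically with a minimal sketch like the one below. The function names are illustrative only and are not taken from the course material; the code simply inverts Q = -10 log10(P) to recover the error probability and the call accuracy.

```python
import math


def phred_to_error_prob(q: float) -> float:
    """Invert Q = -10 * log10(P) to get the error probability P."""
    return 10 ** (-q / 10)


def error_prob_to_phred(p: float) -> float:
    """Apply Q = -10 * log10(P) directly."""
    return -10 * math.log10(p)


def call_accuracy(q: float) -> float:
    """Base call accuracy is one minus the error probability."""
    return 1 - phred_to_error_prob(q)


if __name__ == "__main__":
    for q in (10, 20, 30):
        print(f"Q{q}: P(error) = {phred_to_error_prob(q):.4f}, "
              f"accuracy = {call_accuracy(q):.2%}")
```

Each step of 10 in Q lowers the error probability by a factor of ten, which is why Q20 is better than Q10.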


2. What two primary advantages does exome sequencing provide over whole
genome sequencing?

Answer: Cost and data reduction. Exome capture limits the sequencing to known
protein-coding genes and some miRNAs.


3. Split the string ‘CAPTAINKIRK’ into its suffixes and sort them to form its suffix array.

Answer:

AINKIRK
APTAINKIRK
CAPTAINKIRK
INKIRK
IRK
K
KIRK
NKIRK
PTAINKIRK
RK
TAINKIRK
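
The sorted list above can be reproduced with a short sketch. It uses a naive construction (sort the start positions by the suffix that begins there), which is fine for a short string but is not how genome-scale indexes are built. The function name is illustrative.

```python
def suffix_array(text: str) -> list[int]:
    """Return the start positions of all suffixes of text, in sorted suffix order.

    Naive construction: materialises each suffix as the sort key, so it is
    only suitable for short strings.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


text = "CAPTAINKIRK"
sa = suffix_array(text)
suffixes = [text[i:] for i in sa]

if __name__ == "__main__":
    for position, suffix in zip(sa, suffixes):
        print(f"{position:2d}  {suffix}")
```

Strictly speaking, the suffix array is the list of start positions; the sorted suffixes are what those positions point to.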

4. Given a base-quality score threshold of Q30, the following short read
alignment, and the reference sequence (bottom line), what is the genotype
(two alleles, e.g. G/C) at the indicated position? The base quality at that
position is listed to the right of each read.




AGCTCCCAGGGTCCAG                   Q29
          GTCCAGTCTCGGTT           Q40
      CAGGGTCCAGTC                 Q47
           TCCAGTCTCGGTTCCATC      Q35
    CCCAGGGCCCAG                   Q50
        GGGTCCAGTCTC               Q31
   TCCCAGGGCC                      Q10
       AGGGTCCAGT                  Q45
 GCTCCCAGGGCCCAGTCT                Q46
  CTCCCAGGGCCC                     Q33
     CCAGGGTCCAGTC                 Q38
 GCTCCCAGGGCCCAGTCTCGG             Q41
      CAGGGTCCAGTCTCG              Q15
AGCTCCCAGGGTCCAGTCTCGGTTCCATCTA
           *

Answer: Discard the reads whose base quality at the position is below Q30, then
count the reference and alternate bases among the remaining reads (T = 6, C = 4).
The genotype called is therefore T/C (heterozygous).
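
The counting step can be sketched as follows. Assumptions: each read is stored as (0-based offset on the reference, sequence, base quality at the marked position); the offsets were obtained by matching each read against the reference, including the two reads whose indentation is missing in the listing; and reporting the two most frequent bases as the genotype is a deliberately simple stand-in for a real variant caller.

```python
from collections import Counter

REFERENCE = "AGCTCCCAGGGTCCAGTCTCGGTTCCATCTA"
POSITION = 11  # 0-based index of the starred reference base (a T)

# (offset on reference, read sequence, base quality at POSITION)
READS = [
    (0, "AGCTCCCAGGGTCCAG", 29),
    (10, "GTCCAGTCTCGGTT", 40),
    (6, "CAGGGTCCAGTC", 47),
    (11, "TCCAGTCTCGGTTCCATC", 35),
    (4, "CCCAGGGCCCAG", 50),
    (8, "GGGTCCAGTCTC", 31),
    (3, "TCCCAGGGCC", 10),
    (7, "AGGGTCCAGT", 45),
    (1, "GCTCCCAGGGCCCAGTCT", 46),
    (2, "CTCCCAGGGCCC", 33),
    (5, "CCAGGGTCCAGTC", 38),
    (1, "GCTCCCAGGGCCCAGTCTCGG", 41),
    (6, "CAGGGTCCAGTCTCG", 15),
]


def count_bases(reads, position, min_quality=30):
    """Count the bases seen at position, skipping reads below min_quality."""
    counts = Counter()
    for offset, sequence, quality in reads:
        index = position - offset
        if quality >= min_quality and 0 <= index < len(sequence):
            counts[sequence[index]] += 1
    return counts


counts = count_bases(READS, POSITION)
# Naive call: report the two most frequently observed bases.
genotype = "/".join(base for base, _ in counts.most_common(2))

if __name__ == "__main__":
    print(dict(counts), genotype)
```

Three reads (Q29, Q10, Q15) fall below the threshold, leaving ten reads that support the call.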


5. Sort the following types of genetic variants into the categories: Potentially
Disease Causing, Unlikely to be Disease Causing

1. Splice Site
2. Non-Synonymous
3. Synonymous
4. Frameshift Indel
5. Stop Loss
6. Stop Gain
7. Intronic (Non-Splice Site)
8. Intergenic


Answer:

Potentially Disease Causing: 1, 2, 4, 5, 6
Unlikely to be Disease Causing: 3, 7, 8

6. What is the primary motivation for using “next gen” sequencing methods
and modern genomics approaches to diagnosing human genetic diseases?

Answer: Cost

7. What does the base quality of a sequencing read tell you?

Answer: The base quality encodes the probability that the base call is incorrect.
(The base call accuracy is also an acceptable answer.)

8. What problem does binary search address?

Answer: Efficiently searching a sorted index of a genome (for example, a suffix array)
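
As an illustration, the sketch below runs a binary search over the suffix array of a short string to find every occurrence of a pattern. It assumes a naively built suffix array and uses illustrative function names; real aligners rely on more compact indexes, but the search idea is the same: because the suffixes are sorted, each comparison halves the range that can still contain a match.

```python
def suffix_array(text: str) -> list[int]:
    """Naive suffix array: start positions ordered by the suffix they begin."""
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_occurrences(text: str, sa: list[int], pattern: str) -> list[int]:
    """Return all start positions of pattern in text via two binary searches."""
    m = len(pattern)

    # First search: leftmost suffix whose first m characters are >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo

    # Second search: leftmost suffix whose first m characters are > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid

    # Every suffix between the two bounds starts with pattern.
    return sorted(sa[start:lo])


text = "CAPTAINKIRK"
sa = suffix_array(text)

if __name__ == "__main__":
    for pattern in ("K", "IRK", "AI", "Z"):
        print(pattern, find_occurrences(text, sa, pattern))
```

Each lookup needs on the order of log2(n) comparisons of the suffix array instead of a scan over all n positions.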
