SlideShare uma empresa Scribd logo
1 de 25
1
Introduction
• Comparative genomics is a large-scale, holistic approach that
compares two or more genomes to discover the similarities and
differences between the genomes and to study the biology of the
individual genomes
• The subject of comparative genomics impinges on
– Evolutionary biology and phylogenetic reconstructions of the tree of life,
– Drug discovery programs,
– Function predictions of hypothetical proteins
– Identification of genes, regulatory motifs and other non-coding DNA
motifs
– Genome flux and dynamics
2
Computational tool for genome-scale
sequence alignment
• The first step in comparative genomics analysis is often the
alignment of two genome sequences
• It is a technically challenging problem
Algorithms/tools URL
BLASTN and
MEGABLAST
http://www.ncbi.nlm.nih.gov/BLAST/
GLASS http://crossspecies.lcs.mit.edu/
MUMmer http://www.tigr.org/software/mummer/
PatternHunter http://www.bioinformaticssolutions.com/
products/ph.php
PipMaker http://bio.cse.psu.edu/pipmaker/
VISTA http://www-gsd.lbl.gov/vista
WABA http://www.cse.ucsc.edu/kent/xenoAli/
3
L. Wei et. al. 2002
Related terms..
• Homology
• Homologous
• Orthologous
• Paralogous
• Xenologous
• Analogoues
• Horizontal gene transfer
4
• Homology is the relationship of any two characters (such as two proteins that
have similar sequences) that have descended, usually through divergence, from
a common ancestral character
• Homologues are thus components or characters (such as genes/proteins with
similar sequences) that can be attributed to a common ancestor of the two
organisms during evolution. Homologues can either be orthologues,
paralogues, or xenologues
• Orthologues are homologues that have evolved from a common ancestral gene
by speciation. They usually have similar functions
• Paralogues are homologues that are related or produced by duplication within a
genome. They often have evolved to perform different functions
• Xenologues are homologues that are related by an interspecies (horizontal
transfer) of the genetic material for one of the homologues
• Horizontal (Lateral) Gene Transfer is the movement of genetic material
between species (or genus) other than by vertical descent.
5
6
7
Methods for comparative genomics
• Comparative analysis of genome structure
• Comparative analysis of coding regions
• Comparative analysis of non-coding regions
8
Comparative analysis of genome structure
• Analysis of the global structure of genomes, such as nucleotide
composition, syntenic relationships, and gene ordering offer insight
into the similarities and differences between genomes.
• This provide information on the organization and evolution of the
genomes, and highlight the unique features of individual genomes
• The structure of different genomes can be compared at three levels:
– Overall nucleotide statistics,
– Genome structure at DNA level, and
– Genome structure at gene level.
9
Comparison of overall nucleotide statistics
• Overall nucleotide statistics, such as
– Genome size,
– Overall (G+C) content,
– Regions of different (G+C) content,
– Genome signature such as codon usage biases,
– Amino acid usage biases, and the ratio of observed di-
nucleotide frequency and
– The expected frequency given random nucleotide
distribution
• These all present a global view of the similarities and
differences of the genomes.
10
Comparison of genome structure at DNA level
• Chromosomal breakage and exchange of chromosomal
fragments are common mode of gene evolution. They can be
studied by comparing genome structures at DNA level.
– Identification of conserved synteny and genome
rearrangement events
– Analysis of breakpoints
– Analysis of content and distribution of DNA repeats
11
Comparison of genome structure at gene level
• Chromosomal breakage and exchange of chromosomal
fragments cause disruption of gene order
• Therefore gene order correlates with evolutionary distance
between genomes
12
Comparative analysis of coding regions
13
Number of algorithms that have been use in comparative genomics to aid function
prediction of genes.
Identification of gene-coding regions
comparison of gene content
comparison of protein content
Comparative genome based function prediction
Identification of gene-coding regions
• The analysis and comparison of the coding regions starts with the gene
identification algorithm that is used to infer what portions of the genomic
sequence actively code for genes.
• There are four basic approaches for gene identification
L. Wei et. al. 2002
14
Comparison of gene content
• After the predicted gene set is generated, it is very interesting
and important to compare the content of genes across genomes
• The first statistics to compare is the estimated total number of
genes in a genome, elucidate the similarities and differences
between the genomes include percentage of the genome that
code for genes, distribution of coding regions across the
genome (a.k.a. gene density), average gene length, codon
usage
• This is often done using a pairwise sequence comparison tool
such as BLASTN or TBLASTX
15
Comparison of protein content
• A second level of analysis that can be performed is to compare
the set of gene products (protein) between the genomes, which
has been termed ‘‘comparative proteomics”
• It is important to compare the protein contents in critical
pathways and important functional categories across genomes
• Two widely used resources for pathways and functional
categories are the KEGG pathway database and the Gene
Ontology (GO) hierarchy
L. Wei et. al. 2002
16
cntd…
• Interesting statistics to compare include
– Level of sequence identity between orthologous pairs
across genome
– Paralogous pairs within genome,
– Number of replicated copies in corresponding paralog
families
– Functions of the paralogs
– Locations of members of paralog families across the
genome
17
Comparative genomics-based function
prediction
• functional assignment of genes in a non similarity-based
manner
• This rely on the basic premise that genes; that are functionally
related, are genes that are closely associated across genomes in
some form
• This include three methods:
– Co-conservation across genomes
– Conservation of gene clusters and genomic context across species
– Physical fusion of functionally linked genes across species (Domain
fusion analysis)
18
Comparative analysis of noncoding regions
• Noncoding regions of the genome gained a lot of attention in
recent years because of its predicted role in regulation of
transcription, DNA replication, and other biological functions
• This approach is based on the presumption that selective
pressure causes regulatory elements to evolve at a slower rate
than that of non regulatory sequences in the non coding
regions
19
Insights into Genome Fluxes and the Processes of Evolution
• From an evolutionary biology perspective, whole genome
comparisons provide molecular insights into the processes of
evolution that include the molecular events responsible for the
variations and fluxes that occur through a genome. These
include processes like, inversions, translocations, deletions,
duplications and insertions.
20
The Impact of Comparative Genomics in Phylogenetic Analysis
Schematic depiction of Microsporidia's phylogenetic position based on Small Subunit RNA
(SSU rRNA) as an early branching eukaryote that evolved prior to the acquisiton of
mitochondria, and it's subsequent placement based on a composite gene phylogeny where it
was placed closer to fungi. The latter placement has been confirmed by the complete
sequenceof the micro-sporidia, Encephalitozoon cuniculi, where despite the absence of
mitochondria, the presence of several mitochondrial genes could be observed.
21
Comparative Genomics in Drug Discovery
• Comparative genomic studies throw important light on the
pathogenesis of organisms, throwing up opportunities for
therapeutic intervention as well as help in understanding and
identifying disease genes
• One of the most important fallouts of comparative analyses at a
genome-wide scale is in the ability to identify and develop novel
drug targets
• If one is looking for antibacterial, antifungal, or antiprotozoal
proteins to be used as targets, comparative genome analysis can
reveal virulence genes, uncharacterized essential genes, species-
specific genes, organism-specific genes, while ensuring that the
chosen genes have no homologues in humans
22
Comparative genomics in drug discovery programs. A flow chart
diagram explaining how comparative genomics can facilitate drug
discovery programs for the discovery of new antimicrobials
23
Looking beyond…
• As comparative genomics moves from between kingdoms to
between genus to between species analysis, the next step is to
carry out comparisions between individuals or strains that are
members of a particular species
• This would allow us to investigate variations at the individual
level and to enable one to determine the propensity of an
individual to respond to a drug or to come down with a disease
or infection
24
25

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

Scoring matrices
Scoring matricesScoring matrices
Scoring matrices
 
SAGE (Serial analysis of Gene Expression)
SAGE (Serial analysis of Gene Expression)SAGE (Serial analysis of Gene Expression)
SAGE (Serial analysis of Gene Expression)
 
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICSSTRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
 
Genome annotation 2013
Genome annotation 2013Genome annotation 2013
Genome annotation 2013
 
Protein structure visualization tools-RASMOL
Protein structure visualization tools-RASMOLProtein structure visualization tools-RASMOL
Protein structure visualization tools-RASMOL
 
Express sequence tags
Express sequence tagsExpress sequence tags
Express sequence tags
 
Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment   Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment
 
Swiss prot database
Swiss prot databaseSwiss prot database
Swiss prot database
 
NCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology InformationNCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology Information
 
BLAST
BLASTBLAST
BLAST
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
 
Sequence alignment global vs. local
Sequence alignment  global vs. localSequence alignment  global vs. local
Sequence alignment global vs. local
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
methods for protein structure prediction
methods for protein structure predictionmethods for protein structure prediction
methods for protein structure prediction
 
Presentation on marker genes
Presentation on marker genesPresentation on marker genes
Presentation on marker genes
 
Finding ORF
Finding ORFFinding ORF
Finding ORF
 
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCINGDNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
 
PIR- Protein Information Resource
PIR- Protein Information ResourcePIR- Protein Information Resource
PIR- Protein Information Resource
 
Protein protein interaction
Protein protein interactionProtein protein interaction
Protein protein interaction
 

Destaque

Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
Amol Kunde
 
Comparative genomics and proteomics
Comparative genomics and proteomicsComparative genomics and proteomics
Comparative genomics and proteomics
Nikhil Aggarwal
 
Bioinformatics and functional genomics
Bioinformatics and functional genomicsBioinformatics and functional genomics
Bioinformatics and functional genomics
Aisha Kalsoom
 

Destaque (20)

Comparative genomics presentation
Comparative genomics presentationComparative genomics presentation
Comparative genomics presentation
 
What is comparative genomics
What is comparative genomicsWhat is comparative genomics
What is comparative genomics
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Comparative genomics and proteomics
Comparative genomics and proteomicsComparative genomics and proteomics
Comparative genomics and proteomics
 
genomic comparison
genomic comparison genomic comparison
genomic comparison
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Genomics
GenomicsGenomics
Genomics
 
Genome evolution
Genome evolutionGenome evolution
Genome evolution
 
Comparative genomics @ sid 2003 format
Comparative genomics @ sid 2003 formatComparative genomics @ sid 2003 format
Comparative genomics @ sid 2003 format
 
Comparative Genomics with GMOD and BioPerl
Comparative Genomics with GMOD and BioPerlComparative Genomics with GMOD and BioPerl
Comparative Genomics with GMOD and BioPerl
 
Genome evolution - tales of scales DNA to crops,months to billions of years, ...
Genome evolution - tales of scales DNA to crops,months to billions of years, ...Genome evolution - tales of scales DNA to crops,months to billions of years, ...
Genome evolution - tales of scales DNA to crops,months to billions of years, ...
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Gene order
Gene orderGene order
Gene order
 
Genome origin
Genome originGenome origin
Genome origin
 
Association mapping
Association mapping Association mapping
Association mapping
 
SNp mining in crops
SNp mining in cropsSNp mining in crops
SNp mining in crops
 
Bioinformatics and functional genomics
Bioinformatics and functional genomicsBioinformatics and functional genomics
Bioinformatics and functional genomics
 
Genotyping by Sequencing
Genotyping by SequencingGenotyping by Sequencing
Genotyping by Sequencing
 

Semelhante a Comparative genomics

Bioinformatics, comparative genemics and proteomics
Bioinformatics, comparative genemics and proteomicsBioinformatics, comparative genemics and proteomics
Bioinformatics, comparative genemics and proteomics
juancarlosrise
 

Semelhante a Comparative genomics (20)

Comparative and functional genomics
Comparative and functional genomicsComparative and functional genomics
Comparative and functional genomics
 
Comparative genomics 2
Comparative genomics 2Comparative genomics 2
Comparative genomics 2
 
Bioinformatics, comparative genemics and proteomics
Bioinformatics, comparative genemics and proteomicsBioinformatics, comparative genemics and proteomics
Bioinformatics, comparative genemics and proteomics
 
Data mining ppt
Data mining pptData mining ppt
Data mining ppt
 
Role of genomics and proteomics
Role of genomics and proteomicsRole of genomics and proteomics
Role of genomics and proteomics
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
Data Mining
Data Mining Data Mining
Data Mining
 
Functional genomics,Pharmaco genomics, and Meta genomics.
Functional genomics,Pharmaco genomics, and Meta genomics.Functional genomics,Pharmaco genomics, and Meta genomics.
Functional genomics,Pharmaco genomics, and Meta genomics.
 
Comparative genomics.pdf
Comparative genomics.pdfComparative genomics.pdf
Comparative genomics.pdf
 
Genomics(functional genomics)
Genomics(functional genomics)Genomics(functional genomics)
Genomics(functional genomics)
 
Molecular basis of evolution and softwares used in phylogenetic tree contruction
Molecular basis of evolution and softwares used in phylogenetic tree contructionMolecular basis of evolution and softwares used in phylogenetic tree contruction
Molecular basis of evolution and softwares used in phylogenetic tree contruction
 
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
 
Pharmacogenomics
PharmacogenomicsPharmacogenomics
Pharmacogenomics
 
Genomics
GenomicsGenomics
Genomics
 
Genomics
GenomicsGenomics
Genomics
 
Genomics types
Genomics typesGenomics types
Genomics types
 
Pathway analysis for genomics data
Pathway analysis for genomics dataPathway analysis for genomics data
Pathway analysis for genomics data
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
genomeannotation-160822182432.pdf
genomeannotation-160822182432.pdfgenomeannotation-160822182432.pdf
genomeannotation-160822182432.pdf
 
Genomics and proteomics by shreeman
Genomics and proteomics by shreemanGenomics and proteomics by shreeman
Genomics and proteomics by shreeman
 

Último

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 

Último (20)

ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Third Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxThird Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptx
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 

Comparative genomics

  • 1. 1
  • 2. Introduction • Comparative genomics is a large-scale, holistic approach that compares two or more genomes to discover the similarities and differences between the genomes and to study the biology of the individual genomes • The subject of comparative genomics impinges on – Evolutionary biology and phylogenetic reconstructions of the tree of life, – Drug discovery programs, – Function predictions of hypothetical proteins – Identification of genes, regulatory motifs and other non-coding DNA motifs – Genome flux and dynamics 2
  • 3. Computational tool for genome-scale sequence alignment • The first step in comparative genomics analysis is often the alignment of two genome sequences • It is a technically challenging problem Algorithms/tools URL BLASTN and MEGABLAST http://www.ncbi.nlm.nih.gov/BLAST/ GLASS http://crossspecies.lcs.mit.edu/ MUMmer http://www.tigr.org/software/mummer/ PatternHunter http://www.bioinformaticssolutions.com/ products/ph.php PipMaker http://bio.cse.psu.edu/pipmaker/ VISTA http://www-gsd.lbl.gov/vista WABA http://www.cse.ucsc.edu/kent/xenoAli/ 3 L. Wei et. al. 2002
  • 4. Related terms.. • Homology • Homologous • Orthologous • Paralogous • Xenologous • Analogoues • Horizontal gene transfer 4
  • 5. • Homology is the relationship of any two characters (such as two proteins that have similar sequences) that have descended, usually through divergence, from a common ancestral character • Homologues are thus components or characters (such as genes/proteins with similar sequences) that can be attributed to a common ancestor of the two organisms during evolution. Homologues can either be orthologues, paralogues, or xenologues • Orthologues are homologues that have evolved from a common ancestral gene by speciation. They usually have similar functions • Paralogues are homologues that are related or produced by duplication within a genome. They often have evolved to perform different functions • Xenologues are homologues that are related by an interspecies (horizontal transfer) of the genetic material for one of the homologues • Horizontal (Lateral) Gene Transfer is the movement of genetic material between species (or genus) other than by vertical descent. 5
  • 6. 6
  • 7. 7
  • 8. Methods for comparative genomics • Comparative analysis of genome structure • Comparative analysis of coding regions • Comparative analysis of non-coding regions 8
  • 9. Comparative analysis of genome structure • Analysis of the global structure of genomes, such as nucleotide composition, syntenic relationships, and gene ordering offer insight into the similarities and differences between genomes. • This provide information on the organization and evolution of the genomes, and highlight the unique features of individual genomes • The structure of different genomes can be compared at three levels: – Overall nucleotide statistics, – Genome structure at DNA level, and – Genome structure at gene level. 9
  • 10. Comparison of overall nucleotide statistics • Overall nucleotide statistics, such as – Genome size, – Overall (G+C) content, – Regions of different (G+C) content, – Genome signature such as codon usage biases, – Amino acid usage biases, and the ratio of observed di- nucleotide frequency and – The expected frequency given random nucleotide distribution • These all present a global view of the similarities and differences of the genomes. 10
  • 11. Comparison of genome structure at DNA level • Chromosomal breakage and exchange of chromosomal fragments are common mode of gene evolution. They can be studied by comparing genome structures at DNA level. – Identification of conserved synteny and genome rearrangement events – Analysis of breakpoints – Analysis of content and distribution of DNA repeats 11
  • 12. Comparison of genome structure at gene level • Chromosomal breakage and exchange of chromosomal fragments cause disruption of gene order • Therefore gene order correlates with evolutionary distance between genomes 12
  • 13. Comparative analysis of coding regions 13 Number of algorithms that have been use in comparative genomics to aid function prediction of genes. Identification of gene-coding regions comparison of gene content comparison of protein content Comparative genome based function prediction
  • 14. Identification of gene-coding regions • The analysis and comparison of the coding regions starts with the gene identification algorithm that is used to infer what portions of the genomic sequence actively code for genes. • There are four basic approaches for gene identification L. Wei et. al. 2002 14
  • 15. Comparison of gene content • After the predicted gene set is generated, it is very interesting and important to compare the content of genes across genomes • The first statistics to compare is the estimated total number of genes in a genome, elucidate the similarities and differences between the genomes include percentage of the genome that code for genes, distribution of coding regions across the genome (a.k.a. gene density), average gene length, codon usage • This is often done using a pairwise sequence comparison tool such as BLASTN or TBLASTX 15
  • 16. Comparison of protein content • A second level of analysis that can be performed is to compare the set of gene products (protein) between the genomes, which has been termed ‘‘comparative proteomics” • It is important to compare the protein contents in critical pathways and important functional categories across genomes • Two widely used resources for pathways and functional categories are the KEGG pathway database and the Gene Ontology (GO) hierarchy L. Wei et. al. 2002 16
  • 17. cntd… • Interesting statistics to compare include – Level of sequence identity between orthologous pairs across genome – Paralogous pairs within genome, – Number of replicated copies in corresponding paralog families – Functions of the paralogs – Locations of members of paralog families across the genome 17
  • 18. Comparative genomics-based function prediction • functional assignment of genes in a non similarity-based manner • This rely on the basic premise that genes; that are functionally related, are genes that are closely associated across genomes in some form • This include three methods: – Co-conservation across genomes – Conservation of gene clusters and genomic context across species – Physical fusion of functionally linked genes across species (Domain fusion analysis) 18
  • 19. Comparative analysis of noncoding regions • Noncoding regions of the genome gained a lot of attention in recent years because of its predicted role in regulation of transcription, DNA replication, and other biological functions • This approach is based on the presumption that selective pressure causes regulatory elements to evolve at a slower rate than that of non regulatory sequences in the non coding regions 19
  • 20. Insights into Genome Fluxes and the Processes of Evolution • From an evolutionary biology perspective, whole genome comparisons provide molecular insights into the processes of evolution that include the molecular events responsible for the variations and fluxes that occur through a genome. These include processes like, inversions, translocations, deletions, duplications and insertions. 20
  • 21. The Impact of Comparative Genomics in Phylogenetic Analysis Schematic depiction of Microsporidia's phylogenetic position based on Small Subunit RNA (SSU rRNA) as an early branching eukaryote that evolved prior to the acquisiton of mitochondria, and it's subsequent placement based on a composite gene phylogeny where it was placed closer to fungi. The latter placement has been confirmed by the complete sequenceof the micro-sporidia, Encephalitozoon cuniculi, where despite the absence of mitochondria, the presence of several mitochondrial genes could be observed. 21
  • 22. Comparative Genomics in Drug Discovery • Comparative genomic studies throw important light on the pathogenesis of organisms, throwing up opportunities for therapeutic intervention as well as help in understanding and identifying disease genes • One of the most important fallouts of comparative analyses at a genome-wide scale is in the ability to identify and develop novel drug targets • If one is looking for antibacterial, antifungal, or antiprotozoal proteins to be used as targets, comparative genome analysis can reveal virulence genes, uncharacterized essential genes, species- specific genes, organism-specific genes, while ensuring that the chosen genes have no homologues in humans 22
  • 23. Comparative genomics in drug discovery programs. A flow chart diagram explaining how comparative genomics can facilitate drug discovery programs for the discovery of new antimicrobials 23
  • 24. Looking beyond… • As comparative genomics moves from between kingdoms to between genus to between species analysis, the next step is to carry out comparisions between individuals or strains that are members of a particular species • This would allow us to investigate variations at the individual level and to enable one to determine the propensity of an individual to respond to a drug or to come down with a disease or infection 24
  • 25. 25