SlideShare uma empresa Scribd logo
1 de 35
Bioinformatics – A Brief overview
What is bioinformatics? ,[object Object],[object Object]
Publically available genomes (April 1998) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],COMPLETE/PENDING PUBLICATION Rickettsia prowazekii  Pseudomonas aeruginosa Pyrococcus abyssii Bacillus sp. C-125 Ureaplasma urealyticum Pyrobaculum aerophilum ALMOST/PUBLIC Pyrococcus furiosus Mycobacterium tuberculosis H37Rv Mycobacterium tuberculosis CSU93 Neisseria gonorrhea Neisseria meningiditis Streptococcus pyogenes
Promises of genomics and bioinformatics  ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
The need for bioinformaticists.   The number of entries in data bases of gene sequences is increasing exponentially. Bioinformaticians are needed to understand and use this information . 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 GenBank growth
What Can be done using bioinformatics? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
NCBI (National centre for Biotechnology information ) www.ncbi.nlm.nih.gov ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
What can be discovered about a gene by a database search? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Databases ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Biological databanks and databases ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Bioinformatics
PubMed
 
Sequence analysis: overview Nucleotide sequence file Search databases for similar sequences Sequence comparison Multiple sequence analysis ,[object Object],[object Object],[object Object],Translate into protein Search for known motifs RNA structure prediction non-coding coding Protein sequence analysis Search for protein coding regions Manual sequence entry Sequence database browsing Sequencing project management  Protein sequence file Search databases for similar sequences Sequence comparison Search for known motifs Predict secondary structure Predict tertiary structure Create a multiple sequence alignment Edit the alignment Format the alignment for publication Molecular phylogeny Protein family analysis Nucleotide sequence analysis Sequence entry
Sequence comparison ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Click on:
Database Search
 
Multiple Sequence Alignment: Approaches ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
CLUSTALW MSA
Phylogeny inference:  Analysis of sequences allows evolutionary relationships to be determined E.coli C.botulinum C.cadavers C.butyricum B.subtilis B.cereus Phylogenetic tree constructed using the Phylip package
gene prediction software ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
PCR Primer Design: ,[object Object],OPTIMAL primer length  --> 20 MINIMUM primer length  --> 18 MAXIMUM primer length  --> 22  OPTIMAL primer melting temperature  --> 60.000 MINIMUM acceptable melting temp  --> 57.000 MAXIMUM acceptable melting temp  --> 63.000 MINIMUM acceptable primer GC%  --> 20.000 MAXIMUM acceptable primer GC%  --> 80.000 Salt concentration (mM)  --> 50.000  DNA concentration (nM)  --> 50.000 MAX no. unknown bases (Ns) allowed  --> 0  MAX acceptable self-complementarity --> 12  MAXIMUM 3' end self-complementarity --> 8  GC clamp how many 3' bases  --> 0
Restriction mapping:  Genes can be analysed to detect gene sequences that can be cleaved with restriction enzymes AceIII  1 CAGCTCnnnnnnn’nnn... AluI  2 AG’CT AlwI  1 GGATCnnnn’n_ ApoI  2 r’AATT_y BanII  1 G_rGCy’C BfaI  2 C’TA_G BfiI  1 ACTGGG BsaXI  1 ACnnnnnCTCC BsgI  1 GTGCAGnnnnnnnnnnn... BsiHKAI  1 G_wGCw’C Bsp1286I  1 G_dGCh’C BsrI  2 ACTG_Gn’ BsrFI  1 r’CCGG_y CjeI  2 CCAnnnnnnGTnnnnnn... CviJI  4 rG’Cy CviRI  1 TG’CA DdeI  2 C’TnA_G DpnI  2 GA’TC EcoRI  1 G’AATT_C HinfI  2 G’AnT_C MaeIII  1 ’GTnAC_ MnlI  1 CCTCnnnnnn_n’ MseI  2 T’TA_A MspI  1 C’CG_G NdeI  1 CA’TA_TG Sau3AI  2 ’GATC_ SstI  1 G_AGCT’C TfiI  2 G’AwT_C Tsp45I  1 ’GTsAC_ Tsp509I  3 ’AATT_ TspRI  1 CAGTGnn’ 50 100 150 200 250
RNA structure prediction:  Structural features of RNA can be predicted G G A C A G G A G G A U A C C G C G G U C C U G C C G G U C C U C A C U U G G A C U U A G U A U C A U C A G U C U G C G C A A U A G G U A A C G C G U
Protein Structure  : the 3-D structure of proteins is used to understand protein function and design new drugs
Gene Sequencing:  Automated chemcial sequencing methods allow rapid generation of large data banks of gene sequences
Structural Bioinformatics
Structural Bioinformatics ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
 
Bioinformatics key areas organisation of knowledge (sequences, structures, functional data) ,[object Object]
Molecular modeling ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Molecular visualization ,[object Object],[object Object],[object Object],[object Object],[object Object]
SECONDARY STRUCTURE PREDICTION Jpred,Gor,Sopma
Tertiary Structure prediction CPHmodel
Active Site Prediction

Mais conteúdo relacionado

Mais procurados

Article presentation on Small RNA
Article presentation on Small RNAArticle presentation on Small RNA
Article presentation on Small RNA
Varsha Gayatonde
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
hemantbreeder
 

Mais procurados (20)

Proteomics
ProteomicsProteomics
Proteomics
 
Gene mapping methods
Gene mapping methodsGene mapping methods
Gene mapping methods
 
DNA-DNA Hybridisation
DNA-DNA HybridisationDNA-DNA Hybridisation
DNA-DNA Hybridisation
 
Gene silencing and rnai
Gene silencing and rnaiGene silencing and rnai
Gene silencing and rnai
 
2 whole genome sequencing and analysis
2 whole genome sequencing and analysis2 whole genome sequencing and analysis
2 whole genome sequencing and analysis
 
mi RNA and siRNA
mi RNA and siRNAmi RNA and siRNA
mi RNA and siRNA
 
15 molecular markers techniques
15 molecular markers techniques15 molecular markers techniques
15 molecular markers techniques
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Transcriptome analysis
Transcriptome analysisTranscriptome analysis
Transcriptome analysis
 
High-Throughput Sequencing
High-Throughput SequencingHigh-Throughput Sequencing
High-Throughput Sequencing
 
Structural genomics
Structural genomicsStructural genomics
Structural genomics
 
Ion torrent and SOLiD Sequencing Techniques
Ion torrent and SOLiD Sequencing Techniques Ion torrent and SOLiD Sequencing Techniques
Ion torrent and SOLiD Sequencing Techniques
 
Genome editing
Genome editingGenome editing
Genome editing
 
Gene network and pathways
Gene network and pathwaysGene network and pathways
Gene network and pathways
 
Express sequence tags
Express sequence tagsExpress sequence tags
Express sequence tags
 
Article presentation on Small RNA
Article presentation on Small RNAArticle presentation on Small RNA
Article presentation on Small RNA
 
Analysis of gene expression
Analysis of gene expressionAnalysis of gene expression
Analysis of gene expression
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Gene targeting and sequence tags
Gene targeting and sequence tagsGene targeting and sequence tags
Gene targeting and sequence tags
 

Destaque

Basics of bioinformatics
Basics of bioinformaticsBasics of bioinformatics
Basics of bioinformatics
Abhishek Vatsa
 

Destaque (12)

Bioinformatics A Biased Overview
Bioinformatics A Biased OverviewBioinformatics A Biased Overview
Bioinformatics A Biased Overview
 
Molecular Markers: Major Applications in Insects
Molecular Markers: Major Applications in InsectsMolecular Markers: Major Applications in Insects
Molecular Markers: Major Applications in Insects
 
Formal languages to map Genotype to Phenotype in Natural Genomes
Formal languages to map Genotype to Phenotype in Natural GenomesFormal languages to map Genotype to Phenotype in Natural Genomes
Formal languages to map Genotype to Phenotype in Natural Genomes
 
DNA Markers Techniques for Plant Varietal Identification
DNA Markers Techniques for Plant Varietal Identification DNA Markers Techniques for Plant Varietal Identification
DNA Markers Techniques for Plant Varietal Identification
 
Ap Chapter 21
Ap Chapter 21Ap Chapter 21
Ap Chapter 21
 
Flow Cytometry Training : Introduction day 1 session 1
Flow Cytometry Training : Introduction day 1 session 1Flow Cytometry Training : Introduction day 1 session 1
Flow Cytometry Training : Introduction day 1 session 1
 
De novo genome assembly - T.Seemann - IMB winter school 2016 - brisbane, au ...
De novo genome assembly  - T.Seemann - IMB winter school 2016 - brisbane, au ...De novo genome assembly  - T.Seemann - IMB winter school 2016 - brisbane, au ...
De novo genome assembly - T.Seemann - IMB winter school 2016 - brisbane, au ...
 
Bioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big DataBioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big Data
 
Mapping Genotype to Phenotype using Attribute Grammar, Laura Adam
Mapping Genotype to Phenotype using Attribute Grammar, Laura AdamMapping Genotype to Phenotype using Attribute Grammar, Laura Adam
Mapping Genotype to Phenotype using Attribute Grammar, Laura Adam
 
How to be a bioinformatician
How to be a bioinformaticianHow to be a bioinformatician
How to be a bioinformatician
 
Gene concept
Gene conceptGene concept
Gene concept
 
Basics of bioinformatics
Basics of bioinformaticsBasics of bioinformatics
Basics of bioinformatics
 

Semelhante a Project report-on-bio-informatics

Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformatics
Atai Rabby
 
bioinformatics simple
bioinformatics simple bioinformatics simple
bioinformatics simple
nadeem akhter
 
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptxBTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
ChijiokeNsofor
 
Whole genome sequencing of bacteria & analysis
Whole genome sequencing of bacteria & analysisWhole genome sequencing of bacteria & analysis
Whole genome sequencing of bacteria & analysis
drelamuruganvet
 

Semelhante a Project report-on-bio-informatics (20)

Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformatics
 
bioinformatic.pptx
bioinformatic.pptxbioinformatic.pptx
bioinformatic.pptx
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
STRING - Prediction of a functional association network for the yeast mitocho...
STRING - Prediction of a functional association network for the yeast mitocho...STRING - Prediction of a functional association network for the yeast mitocho...
STRING - Prediction of a functional association network for the yeast mitocho...
 
bioinformatics simple
bioinformatics simple bioinformatics simple
bioinformatics simple
 
Bioinformatics seminar
Bioinformatics seminarBioinformatics seminar
Bioinformatics seminar
 
Pcmd bioinformatics-lecture i
Pcmd bioinformatics-lecture iPcmd bioinformatics-lecture i
Pcmd bioinformatics-lecture i
 
Introduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdfIntroduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdf
 
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptxBTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
 
NCBI
NCBINCBI
NCBI
 
Bioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sirBioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sir
 
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICS
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICSPROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICS
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICS
 
M Sc Project
M Sc ProjectM Sc Project
M Sc Project
 
Bioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuBioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahu
 
Bioinformatics MiRON
Bioinformatics MiRONBioinformatics MiRON
Bioinformatics MiRON
 
Whole genome sequencing of bacteria & analysis
Whole genome sequencing of bacteria & analysisWhole genome sequencing of bacteria & analysis
Whole genome sequencing of bacteria & analysis
 
overview on Next generation sequencing in breast csncer
overview on Next generation sequencing in breast csnceroverview on Next generation sequencing in breast csncer
overview on Next generation sequencing in breast csncer
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Prediction of protein function
Prediction of protein functionPrediction of protein function
Prediction of protein function
 
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICSSTRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
 

Último

Último (20)

Independent Call Girls In Jaipur { 8445551418 } ✔ ANIKA MEHTA ✔ Get High Prof...
Independent Call Girls In Jaipur { 8445551418 } ✔ ANIKA MEHTA ✔ Get High Prof...Independent Call Girls In Jaipur { 8445551418 } ✔ ANIKA MEHTA ✔ Get High Prof...
Independent Call Girls In Jaipur { 8445551418 } ✔ ANIKA MEHTA ✔ Get High Prof...
 
Trichy Call Girls Book Now 9630942363 Top Class Trichy Escort Service Available
Trichy Call Girls Book Now 9630942363 Top Class Trichy Escort Service AvailableTrichy Call Girls Book Now 9630942363 Top Class Trichy Escort Service Available
Trichy Call Girls Book Now 9630942363 Top Class Trichy Escort Service Available
 
Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...
Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...
Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...
 
Call Girls Rishikesh Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Rishikesh Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Rishikesh Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Rishikesh Just Call 8250077686 Top Class Call Girl Service Available
 
Top Rated Bangalore Call Girls Majestic ⟟ 9332606886 ⟟ Call Me For Genuine S...
Top Rated Bangalore Call Girls Majestic ⟟  9332606886 ⟟ Call Me For Genuine S...Top Rated Bangalore Call Girls Majestic ⟟  9332606886 ⟟ Call Me For Genuine S...
Top Rated Bangalore Call Girls Majestic ⟟ 9332606886 ⟟ Call Me For Genuine S...
 
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
 
Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟ 9332606886 ⟟ Call Me For G...
Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟  9332606886 ⟟ Call Me For G...Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟  9332606886 ⟟ Call Me For G...
Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟ 9332606886 ⟟ Call Me For G...
 
💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...
💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...
💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...
 
Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...
Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...
Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...
 
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
 
Premium Call Girls In Jaipur {8445551418} ❤️VVIP SEEMA Call Girl in Jaipur Ra...
Premium Call Girls In Jaipur {8445551418} ❤️VVIP SEEMA Call Girl in Jaipur Ra...Premium Call Girls In Jaipur {8445551418} ❤️VVIP SEEMA Call Girl in Jaipur Ra...
Premium Call Girls In Jaipur {8445551418} ❤️VVIP SEEMA Call Girl in Jaipur Ra...
 
Call Girls Raipur Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Raipur Just Call 9630942363 Top Class Call Girl Service AvailableCall Girls Raipur Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Raipur Just Call 9630942363 Top Class Call Girl Service Available
 
Call Girls Visakhapatnam Just Call 8250077686 Top Class Call Girl Service Ava...
Call Girls Visakhapatnam Just Call 8250077686 Top Class Call Girl Service Ava...Call Girls Visakhapatnam Just Call 8250077686 Top Class Call Girl Service Ava...
Call Girls Visakhapatnam Just Call 8250077686 Top Class Call Girl Service Ava...
 
Call Girls Gwalior Just Call 8617370543 Top Class Call Girl Service Available
Call Girls Gwalior Just Call 8617370543 Top Class Call Girl Service AvailableCall Girls Gwalior Just Call 8617370543 Top Class Call Girl Service Available
Call Girls Gwalior Just Call 8617370543 Top Class Call Girl Service Available
 
Call Girls Vadodara Just Call 8617370543 Top Class Call Girl Service Available
Call Girls Vadodara Just Call 8617370543 Top Class Call Girl Service AvailableCall Girls Vadodara Just Call 8617370543 Top Class Call Girl Service Available
Call Girls Vadodara Just Call 8617370543 Top Class Call Girl Service Available
 
Top Rated Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...
Top Rated  Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...Top Rated  Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...
Top Rated Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...
 
Top Rated Bangalore Call Girls Richmond Circle ⟟ 9332606886 ⟟ Call Me For Ge...
Top Rated Bangalore Call Girls Richmond Circle ⟟  9332606886 ⟟ Call Me For Ge...Top Rated Bangalore Call Girls Richmond Circle ⟟  9332606886 ⟟ Call Me For Ge...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 9332606886 ⟟ Call Me For Ge...
 
Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...
Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...
Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...
 
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...
 
Top Rated Bangalore Call Girls Mg Road ⟟ 9332606886 ⟟ Call Me For Genuine S...
Top Rated Bangalore Call Girls Mg Road ⟟   9332606886 ⟟ Call Me For Genuine S...Top Rated Bangalore Call Girls Mg Road ⟟   9332606886 ⟟ Call Me For Genuine S...
Top Rated Bangalore Call Girls Mg Road ⟟ 9332606886 ⟟ Call Me For Genuine S...
 

Project report-on-bio-informatics

  • 1. Bioinformatics – A Brief overview
  • 2.
  • 3.
  • 4.
  • 5. The need for bioinformaticists. The number of entries in data bases of gene sequences is increasing exponentially. Bioinformaticians are needed to understand and use this information . 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 GenBank growth
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 12.  
  • 13.
  • 14.
  • 17.  
  • 18.
  • 20. Phylogeny inference: Analysis of sequences allows evolutionary relationships to be determined E.coli C.botulinum C.cadavers C.butyricum B.subtilis B.cereus Phylogenetic tree constructed using the Phylip package
  • 21.
  • 22.
  • 23. Restriction mapping: Genes can be analysed to detect gene sequences that can be cleaved with restriction enzymes AceIII 1 CAGCTCnnnnnnn’nnn... AluI 2 AG’CT AlwI 1 GGATCnnnn’n_ ApoI 2 r’AATT_y BanII 1 G_rGCy’C BfaI 2 C’TA_G BfiI 1 ACTGGG BsaXI 1 ACnnnnnCTCC BsgI 1 GTGCAGnnnnnnnnnnn... BsiHKAI 1 G_wGCw’C Bsp1286I 1 G_dGCh’C BsrI 2 ACTG_Gn’ BsrFI 1 r’CCGG_y CjeI 2 CCAnnnnnnGTnnnnnn... CviJI 4 rG’Cy CviRI 1 TG’CA DdeI 2 C’TnA_G DpnI 2 GA’TC EcoRI 1 G’AATT_C HinfI 2 G’AnT_C MaeIII 1 ’GTnAC_ MnlI 1 CCTCnnnnnn_n’ MseI 2 T’TA_A MspI 1 C’CG_G NdeI 1 CA’TA_TG Sau3AI 2 ’GATC_ SstI 1 G_AGCT’C TfiI 2 G’AwT_C Tsp45I 1 ’GTsAC_ Tsp509I 3 ’AATT_ TspRI 1 CAGTGnn’ 50 100 150 200 250
  • 24. RNA structure prediction: Structural features of RNA can be predicted G G A C A G G A G G A U A C C G C G G U C C U G C C G G U C C U C A C U U G G A C U U A G U A U C A U C A G U C U G C G C A A U A G G U A A C G C G U
  • 25. Protein Structure : the 3-D structure of proteins is used to understand protein function and design new drugs
  • 26. Gene Sequencing: Automated chemcial sequencing methods allow rapid generation of large data banks of gene sequences
  • 28.
  • 29.  
  • 30.
  • 31.
  • 32.

Notas do Editor

  1. As a result, the last few years have seen an explosion in the field of bioinformatics, a new field of study which combines methods from computer science and information technology to analyze biological information. In its purest definition, bioinformatics is the application of information technology to biology.
  2. Rather than sequencing isolated genes, more and more research groups and companies are now focussing on sequencing whole genomes from organisms of medical, commercial or scientific importance. The first complete bacterium to be completely sequenced was Haemophilus influenzae in 1995. In 1996, the first complete eukaryotic genome, that of baker’s yeast ( Saccharomyces cerevisiae ) was published. New complete genomes are now being published every month, and human genome projects, both publicly and privately funded, are well on the way to completion.
  3. The new genome technologies coupled with bioinformatics promise a revolution in almost all fields of life sciences and in society. For example, just in the medical sciences: In the pharmaceutical industry, these methods have been embraced as a shortcut to the discovery of better drugs. For example, knowledge of a protein’s structure can shorten considerably the time taken to develop specific inhibitors of this protein for therapeutic use. The study of how genome variation affects drug effectiveness (pharmacogenomics) is still in its infancy, but promises to deliver more effective and specific therapeutic drugs which are tailored to the individual’s genetic make-up. A knowledge of the genome also facilitates the targeting of genetic diseases by drug or gene therapy. Genome analysis also provides the framework for the study of gene and protein expression using DNA microarray technology or 2-dimensional gene electrophoresis, with broad-ranging applications. And these techniques can be applied not only in the medical sciences, but also in agriculture, biotechnology etc…
  4. The last 10 years have seen recombinant DNA techniques pervade the whole of biology and biology-related fields. The use of plasmids, restriction enzymes, DNA sequencing methods and, more recently, PCR, have allowed the cloning and characterization of many genes and of their protein products. The growth in DNA sequence data available to researchers is phenomenal. For example, GenBank, a major database where molecular biologists store the DNA sequences they obtain and make them available, doubles in size approximately every 14 months. At the beginning of 1999, Genbank contained over 3 million sequence records, and grew at a rate in excess of a million nucleotides deposited per day! Genbank is shown here as an example, but other sequence databases would grow at similar rates. Source: genbank release notes, National Center for Biotechnology Information (http://ncbi.nlm.nih.gov/)
  5. As the application of information technology to biology, bioinformatics pervades the whole of biology, including genetics, biochemistry, ecology and medicine. However, much of the publicity and emphasis which bioinformatics has received in the last few years has been on DNA and protein sequence analysis. Given the large amount of sequence data available and the rate at which it is growing, this is where the need for computer analysis has been felt the most. DNA and protein sequences are particularly amenable to computer analysis, since they can be represented by strings of letters, which computers are very apt to deal with. A DNA sequence is a string of 4 letters (A, C, G and T), and a protein sequence can also be represented by a string of 20 letters, each of which represents an amino acid
  6. The next part of the lecture uses flowcharts to outline a range of procedures commonly used in computer-assisted biomolecular sequence analysis. This rather complicated flowchart summarizes this whole section of the lecture. The flowchart will be divided into four sections: Sequence entry: getting the sequence into the computer Nucleotide sequence analysis Protein sequence analysis Multiple sequence analysis (working with multiple sequence alignments) Each step of the flowchart will be examined in turn
  7. 1 caagtcttct ttctccaagg aggatatgaa gcgttttcgg cttcctgccc tgagctgtgc 61 agcaaacagt ccacccccat ggggctcagc ctcccgctga gtactagtgt gcctgacagt 121 gcagaatccg gatgcagctc ctgtagcacc cctctctacg accagggggg cccagtggag 181 atcctgtcct tcctgtacct gggcagtgct taccatgctt cccggaaaga tatgctcgac 241 gccttgggta tcactgcttt gatcaacgtc tcggccaatt gtcctaacaa ctttgagggt 301 cactaccagt acaagagcat ccctgtggag gacaaccaca aggcagacat cagctcctgg 361 ttcaacgagg cgattgactt tatagactcc atcaaggatg ctggaggaag ggtgtttgtg 421 cactgccagg ccggcatctc caggtcagcc accatctgcc ttgcttacct catgaggact 481 aaccgagtga agctggacga ggcctttgag tttgtgaagc a
  8. Multiple sequence alignments can therefore be used as input to create phylogenetic trees representing possible evolutionary relationships. The principle is that the more closely related two species, the more similar their homologous sequences will be (in general - there are many exceptions) For example, according to the above tree, B. subtilis and B. cereus are more closely related to each other than to C. botulinum, C. cadavers, C. butyricum or E. coli. This tree was created from an alignment of the 16s ribosomal RNA sequences from the various bacteria. Further reading: molecular phylogeny is a very large field in itself, with a lot of associated literature. A good introduction to the field can be found in: Swofford, Olsen, Waddell and Hillis (1996) “Phylogenetic inference” in Molecular Systematics (2nd ed), DM Hillis, C Moritz and BK Mable eds.Sinauer Associates, Inc. Sunderland MA, USA
  9. PCR planning programs let the user specify criteria such as primer length, melting temperature, GC content etc...
  10. This type of display is produced by the program mapplot , part of the GCG package. It lists the restriction enzymes which cut a particular sequence (together with their recognition sequence) and creates a graphical representation of the sequence with the cutting sites marked along a line representing the sequence. This type of image is useful for finding suitable restriction enzymes for subcloning a particular sequence fragment, or for producing a distinctive restriction pattern for in vitro diagnostic procedures. Enzyme name Recognition sequence cutting sites
  11. This transfer RNA cloverleaf structure was predicted for a tRNA sequence using Michael Zuker’s program mfold , which has been incorporated in the GCG package. Further reading: M. Zuker, D.H. Mathews & D.H. Turner (1999) “Algorithms and thermodynamics for RNA secondary structure prediction: a practical guide” In RNA Biochemistry and Biotechnology , J. Barciszewski & B.F.C. Clark, eds., NATO ASI Series, Kluwer Academic Publishers Also available online at http://www.ibc.wustl.edu/~zuker/seqanal/
  12. There are several approaches to building a 3 dimensional model for a protein: Homology modeling uses sequence similarity to map a sequence onto the known structure of a similar sequence (for example, using BLAST to search the PDB database) Profiling involves converting known structures into 3D profiles where the residue preference for each position is classified according to secondary structure (helix, strand, coil) and hydrophobicity/accessibility (exposed, partially exposed, buried). The query sequence can then be mapped onto a library of 3D profiles and the best matching profiles are selected. Threading also involves mapping a sequence onto a library of structures, but only structural information is used. Instead, pseudo-potential energy functions are used to evaluate residue-residue interactions. The query sequence is “threaded” through the various potential structures in the library and the folds yielding the lowest interaction energy when the sequence is mapped onto them are selected. For example, a fold which bring two residues of opposite charge close together will be considered a better fit than a fold which brings together two residues of the same charge or two large residues which would cause a steric clash. (Slide and notes courtesy of Dr Shoba Ranganathan, Australian Genomic Information Centre)
  13. Large scale sequencing projects make use of automated sequencing machines connected to a computer. Because the sequencing machines are typically limited to 300-600 nucleotides, it is often necessary to break down large sequences into fragments, sequence these fragments, then reconstruct the original complete sequence by searching for regions in common between the gel readings, using specialized software. This picture shows some windows from gap4 , a sequencing project management program which is part of the Staden package. This type of software helps in the management of sequencing projects not only by assembling gel readings but also by searching and removing vector sequences, repeat sequences and poor quality sequence regions which can cause problems when assembling the fragments Further reading: Staden, R., Beal, K.F. and Bonfield, J.K. (1998) The Staden Package, Computer Methods in Molecular Biology Eds Stephen Misener and Steve Krawetz. The Humana Press Inc., Totowa, NJ 07512 Also available at: http://www.mrc-lmb.cam.ac.uk/pubseq/methods_in_mol_biol/index.html