SlideShare uma empresa Scribd logo
1 de 36
Introduction to bioinformatics 
Sylvia B. Nagl
What is bioinformatics? 
• an emerging interdisciplinary research area 
• deals with the computational management 
and analysis of biological information: genes, 
genomes, proteins, cells, ecological systems, 
medical information, robots, artificial 
intelligence...
The Core of Bioinformatics to date 
•Relationships between 
TDQAAFDTNIVTLTRFVM 
EQGRKARGTGEMTQLLNS 
LCTAVKAISTAVRKAGIA 
HLYGIAGSTNVTGDQVKK 
LDVLSNDLVINVLKSSFA 
TCVLVTEEDKNAIIVEPE 
KRGKYVVCFDPLDGSSNI 
DCLVSIGTIFGIYRKNST 
DEPSEKDALQPGRNLVAA 
GYALYGSATMLV 
sequence 3D structure protein functions 
•Properties and evolution of genes, genomes, 
proteins, metabolic pathways in cells 
•Use of this knowledge for prediction, modelling, and 
design
“The holy grail of bioinformatics” 
GCTCCTCACTGTCTGTGTTTATTC 
TTTTAGCTTCTTCAGATCTTTTAG 
TCTGAGGAAGCCTGGCATGTGCA 
AATGAAGTTAACCTAA... 
> 500, 000 genes 
sequenced to date 
Expected number of 
unique protein 
structures: 
~ 700-1, 000
Basic concepts 
• conceptual foundations of bioinformatics: 
evolution 
protein folding 
protein function 
• bioinformatics builds mathematical models 
of these processes - 
to infer relationships between components 
of complex biological systems
Information processing in cells 
coding regions 
regulatory 
sites 
nucleic acids 
transcripts 
proteins 
One-to-many mappings! 
Context-dependence!
Global approaches: Toward a new Systems Biology 
Global cell state 
Genome 
Genome activation 
patterns: transcriptomics 
Protein population: 
proteomics 
Organisation: 
tissue imaging EM X-ray, NMR 
cells 
molecular complexes 
•How does the spatial and 
temporal organisation of 
living matter give rise to 
biological processes?
Global approaches: Toward a new Systems Biology 
Perturbation Living cell 
Dynamic response 
“Virtual cell” 
Biological knowledge 
(computerised) 
Sequence information 
Structural information 
•Basic principles 
•Practical 
applications 
Bioinformatics 
Mathematical 
modelling 
Simulation
We do not know yet whether the information in the genome is sufficient 
to reconstruct an entire biological system. Information on building 
blocks not enough, information on their interactions is essential. 
External environment 
Internal environment 
Metabolic net 
Genetic networks 
DNA hRNA mRNAs proteins
Bioinformatics in context 
Genomics 
Molecular Biophysics 
biology 
Molecular 
evolution 
Ethical, legal, 
and social 
implications 
Bioinformatics 
Mathematics/ 
computer 
science
Current challenges to users 
• Potential hurdles: 
Methods are in flux and not fully developed-scattered 
and heterogeneous resources 
• Remedies: Web resources 
navigation guides 
integration of tools and databanks 
http://www.biochem.ucl.ac.uk/~nagl/bioinformatics.html
Example 1 
Sequence homology search of the 
genome of Plasmodium falciparum 
Target identification for antimalerial 
drugs
The search for new antimalarial 
drugs 
• Malaria is one of the leading causes of morbidity 
and mortality in the tropics. 
• 300 to 500 million estimated clinical cases and 1.5 
million to 2.7 million deaths per year. 
• Nearly all fatal cases are caused by Plasmodium 
falciparum. 
• The parasite's resistance to conventional 
antimalarial drugs such as chloroquine is growing 
at an alarming rate.
•P. falciparum has a plastidlike organelle, called the 
apicoplast, acquired by endosymbiosis of an alga. 
Jomaa et al. (1999) 
•Self-replicating, maternally inherited (35kb, circular DNA). 
•Comparative genome analysis: Search for orthologs. 
Apicoplast contains enzymes found in plant and bacterial, 
but not animal metabolic pathways. 
•Potential target for antimalerial drugs: 
DOXP reductoisomerase
Jomaa et al. (1999) Science 285: 1573-1576:
Biological databases
(Boguski, 1999) 
The challenge 
In 1995, the number of genes in the database started to exceed 
the number of papers on molecular biology and genetics in the 
literature!
Data types 
primary data 
secondary data 
tertiary data 
sequence 
DNA 
amino acid 
AATGCGTATAGGC 
DMPVERILEALAVE 
primary database 
secondary 
“motifs”: regular protein structure 
expressions, blocks, 
profiles, fingerprints e. g., alpha-helices, beta-strands 
secondary db 
tertiary protein 
structure 
domains, folding units 
tertiary db 
atomic co-ordinates
Primary biological databases 
• Nucleic acid 
EMBL 
GenBank 
DDBJ (DNA Data 
Bank of Japan) 
• Protein 
PIR 
MIPS 
SWISS-PROT 
TrEMBL 
NRL-3D
International nucleotide data banks 
EMBL 
Europe 
EMBL 
EBI 
GenBank 
USA 
NLM 
NCBI 
International 
Advisory Meeting 
Collaborative Meeting 
DDBJ 
Japan 
NIG 
CIB 
TrEMBL NRDB
GenBank file format
GenBank file format
Swiss-Prot
SWISS-PROT file format
SWISS-PROT file format
SWISS-PROT file format
SWISS-PROT file format
Other primary protein databases 
• TrEMBL (translated EMBL) in SWISS-PROT format 
rapid access to sequence data from genome projects 
computer-annotated supplement to SWISS-PROT 
translations of all coding sequences (CDS) in EMBL 
• SP-TrEMBL 
• REM-TrEMBL: immunoglobulins, T-cell receptors, short 
fragments, synthetic and patented sequences
Other primary protein databases 
The Protein Information Resource (PIR) 
• integrated system of protein sequence databases 
and derived related databases, e. g., alignment 
databases 
• rapid searching, comparison, and pattern matching of 
protein sequences 
• retrieval of descriptive, bibliographic, feature, and 
concurrent cross-reference information 
• aims to be comprehensive and consistently 
annotated
PIR: related databases 
NRL-3D Sequence-Structure Database 
• produced by PIR from sequence and annotation 
information extracted from three-dimensional 
structures in the Protein Databank (PDB) 
• allows keyword and similarity searches
PIR: related databases 
PATCHX integrated with PIR 
• a non-redundant database of protein sequences 
produced by MIPS, the European branch of PIR-International 
The PIR Protein Sequence Database and PATCHX 
together provide the most complete collection of 
protein sequence data currently available in the 
public domain.
Composite protein sequence dbs 
NRDB OWL MIPSX(PIR+PATCHX) SP+TrEMBL 
PIR PIR PIR TrEMBL 
SP SP SP SP 
PDB GenBank MIPSOwn 
GenPept NRL-3D NRL-3D 
MIPSH 
PIRMOD 
MIPSTrn 
EMTrans 
GBTrans 
Kabat 
PseqIP
OWL composite database 
OWL only released every 6-8 
weeks 
By accession number 
• By database code 
• By text 
• By sequence 
• By title 
• By author 
• By query language 
• By regular expression 
Direct OWL access: 
OWL Blast server
Two other useful sites 
INFOBIOGEN-The Public Catalog of Databases 
http://www.infobiogen.fr/services/dbcat/ 
KEGG-Kyoto Encyclopedia of Genes and Genomes 
http://www.genome.ad.jp/kegg/ 
Kyoto Encyclopedia of Genes and Genomes (KEGG) is an effort to 
computerize current knowledge of molecular and cellular biology in 
terms of the information pathways that consist of interacting molecules 
or genes and to provide links from the gene catalogs produced by 
genome sequencing projects.
Sequence Retrieval System (SRS) 
Database browser that allows 
users to 
•retrieve 
•link 
•access 
entries from all interconnected 
resources. 
Users can formulate queries 
across a range of different 
database types.
Guide to Protein Databases: 
http://www.biochem.ucl.ac.uk/~robert/bioinf 
/lecture1/index.html 
http://www.biochem.ucl.ac.uk/~robert/bioinf/lecture2/index 
.html 
With thanks to Dr Roman Laskowski.

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

Biological Databases
Biological DatabasesBiological Databases
Biological Databases
 
Databases
DatabasesDatabases
Databases
 
Gen bank (genetic sequence databank)
Gen bank (genetic sequence databank)Gen bank (genetic sequence databank)
Gen bank (genetic sequence databank)
 
Composite and Specialized databases
Composite and Specialized databasesComposite and Specialized databases
Composite and Specialized databases
 
Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASE
 
protein databases
 protein databases protein databases
protein databases
 
Biodatabases 101220022654-phpapp02
Biodatabases 101220022654-phpapp02Biodatabases 101220022654-phpapp02
Biodatabases 101220022654-phpapp02
 
100505 koenig biological_databases
100505 koenig biological_databases100505 koenig biological_databases
100505 koenig biological_databases
 
BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Biological Database
Biological DatabaseBiological Database
Biological Database
 
Data base in detail
Data base in detailData base in detail
Data base in detail
 
Biological data base
Biological data baseBiological data base
Biological data base
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentation
 
PIR & MINT
PIR & MINTPIR & MINT
PIR & MINT
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...
 
biological detabase
biological detabasebiological detabase
biological detabase
 
The uni prot knowledgebase
The uni prot knowledgebaseThe uni prot knowledgebase
The uni prot knowledgebase
 

Destaque (20)

Data retrieval
Data retrievalData retrieval
Data retrieval
 
Application of Bioinformatics in different fields of sciences
Application of Bioinformatics in different fields of sciencesApplication of Bioinformatics in different fields of sciences
Application of Bioinformatics in different fields of sciences
 
Pham yang embl-ebi
Pham yang embl-ebiPham yang embl-ebi
Pham yang embl-ebi
 
SEQUENCE ANALYSIS
SEQUENCE ANALYSISSEQUENCE ANALYSIS
SEQUENCE ANALYSIS
 
An Introduction to "Bioinformatics & Internet"
An Introduction to "Bioinformatics & Internet"An Introduction to "Bioinformatics & Internet"
An Introduction to "Bioinformatics & Internet"
 
BLAST
BLASTBLAST
BLAST
 
Msa
MsaMsa
Msa
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Structural Bioinformatics - Homology modeling & its Scope
Structural Bioinformatics - Homology modeling & its ScopeStructural Bioinformatics - Homology modeling & its Scope
Structural Bioinformatics - Homology modeling & its Scope
 
Alignments
AlignmentsAlignments
Alignments
 
Parwati sihag
Parwati sihagParwati sihag
Parwati sihag
 
Sequence alignment belgaum
Sequence alignment belgaumSequence alignment belgaum
Sequence alignment belgaum
 
BLAST and sequence alignment
BLAST and sequence alignmentBLAST and sequence alignment
BLAST and sequence alignment
 
Major databases in bioinformatics
Major databases in bioinformaticsMajor databases in bioinformatics
Major databases in bioinformatics
 
Laboratory Information Management System (LIMS)
Laboratory Information Management System (LIMS)Laboratory Information Management System (LIMS)
Laboratory Information Management System (LIMS)
 
LIMS
LIMSLIMS
LIMS
 
Homology modeling
Homology modelingHomology modeling
Homology modeling
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Introduction to sequence alignment
Introduction to sequence alignmentIntroduction to sequence alignment
Introduction to sequence alignment
 
16S Ribosomal DNA Sequence Analysis
16S Ribosomal DNA Sequence Analysis16S Ribosomal DNA Sequence Analysis
16S Ribosomal DNA Sequence Analysis
 

Semelhante a bioinfomatics

Bioinformatics مي.pdf
Bioinformatics  مي.pdfBioinformatics  مي.pdf
Bioinformatics مي.pdfnedalalazzwy
 
Introduction to Biological database ppt(1).pptx
Introduction to Biological database ppt(1).pptxIntroduction to Biological database ppt(1).pptx
Introduction to Biological database ppt(1).pptxRAJESHKUMAR428748
 
Biological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdfBiological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdfBioinformaticsCentre
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformaticsphilmaweb
 
Introduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdfIntroduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdfkigaruantony
 
Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformaticsAtai Rabby
 
Biological databases
Biological databasesBiological databases
Biological databasesAshfaq Ahmad
 
Introduction to bioinformatics.pptx
Introduction to bioinformatics.pptxIntroduction to bioinformatics.pptx
Introduction to bioinformatics.pptxMortezaGhandadi1
 
Bioinformatics - Discovering the Bio Logic Of Nature
Bioinformatics - Discovering the Bio Logic Of NatureBioinformatics - Discovering the Bio Logic Of Nature
Bioinformatics - Discovering the Bio Logic Of NatureRobert Cormia
 
INFORMATICS 2.pptx
INFORMATICS 2.pptxINFORMATICS 2.pptx
INFORMATICS 2.pptxOramadevi1
 
BioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsBioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsAyeshaYousaf20
 
Genomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel WeitschekGenomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel WeitschekData Driven Innovation
 
DisGeNET: A discovery platform for the dynamical exploration of human disease...
DisGeNET: A discovery platform for the dynamical exploration of human disease...DisGeNET: A discovery platform for the dynamical exploration of human disease...
DisGeNET: A discovery platform for the dynamical exploration of human disease...Núria Queralt Rosinach
 
GENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSGENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSsandeshGM
 
Databases_CSS2.pptx
Databases_CSS2.pptxDatabases_CSS2.pptx
Databases_CSS2.pptxSilpa87
 
Bioinformatics_1_ChenS.pptx
Bioinformatics_1_ChenS.pptxBioinformatics_1_ChenS.pptx
Bioinformatics_1_ChenS.pptxxRowlet
 

Semelhante a bioinfomatics (20)

Bioinformatics مي.pdf
Bioinformatics  مي.pdfBioinformatics  مي.pdf
Bioinformatics مي.pdf
 
Introduction to Biological database ppt(1).pptx
Introduction to Biological database ppt(1).pptxIntroduction to Biological database ppt(1).pptx
Introduction to Biological database ppt(1).pptx
 
Introduction to databases.pptx
Introduction to databases.pptxIntroduction to databases.pptx
Introduction to databases.pptx
 
Biological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdfBiological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdf
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
Introduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdfIntroduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdf
 
Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformatics
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Introduction to bioinformatics.pptx
Introduction to bioinformatics.pptxIntroduction to bioinformatics.pptx
Introduction to bioinformatics.pptx
 
Bioinformatics - Discovering the Bio Logic Of Nature
Bioinformatics - Discovering the Bio Logic Of NatureBioinformatics - Discovering the Bio Logic Of Nature
Bioinformatics - Discovering the Bio Logic Of Nature
 
INFORMATICS 2.pptx
INFORMATICS 2.pptxINFORMATICS 2.pptx
INFORMATICS 2.pptx
 
INFORMATICS 2.pptx
INFORMATICS 2.pptxINFORMATICS 2.pptx
INFORMATICS 2.pptx
 
BioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsBioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomics
 
Genomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel WeitschekGenomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel Weitschek
 
Open data genomics_palermo_2017_ver03
Open data genomics_palermo_2017_ver03Open data genomics_palermo_2017_ver03
Open data genomics_palermo_2017_ver03
 
Bioinformatics seminar
Bioinformatics seminarBioinformatics seminar
Bioinformatics seminar
 
DisGeNET: A discovery platform for the dynamical exploration of human disease...
DisGeNET: A discovery platform for the dynamical exploration of human disease...DisGeNET: A discovery platform for the dynamical exploration of human disease...
DisGeNET: A discovery platform for the dynamical exploration of human disease...
 
GENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSGENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICS
 
Databases_CSS2.pptx
Databases_CSS2.pptxDatabases_CSS2.pptx
Databases_CSS2.pptx
 
Bioinformatics_1_ChenS.pptx
Bioinformatics_1_ChenS.pptxBioinformatics_1_ChenS.pptx
Bioinformatics_1_ChenS.pptx
 

Último

Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingTeacherCyreneCayanan
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Shubhangi Sonawane
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docxPoojaSen20
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.MateoGardella
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterMateoGardella
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxVishalSingh1417
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfSanaAli374401
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 

Último (20)

Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch Letter
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdf
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 

bioinfomatics

  • 2. What is bioinformatics? • an emerging interdisciplinary research area • deals with the computational management and analysis of biological information: genes, genomes, proteins, cells, ecological systems, medical information, robots, artificial intelligence...
  • 3. The Core of Bioinformatics to date •Relationships between TDQAAFDTNIVTLTRFVM EQGRKARGTGEMTQLLNS LCTAVKAISTAVRKAGIA HLYGIAGSTNVTGDQVKK LDVLSNDLVINVLKSSFA TCVLVTEEDKNAIIVEPE KRGKYVVCFDPLDGSSNI DCLVSIGTIFGIYRKNST DEPSEKDALQPGRNLVAA GYALYGSATMLV sequence 3D structure protein functions •Properties and evolution of genes, genomes, proteins, metabolic pathways in cells •Use of this knowledge for prediction, modelling, and design
  • 4. “The holy grail of bioinformatics” GCTCCTCACTGTCTGTGTTTATTC TTTTAGCTTCTTCAGATCTTTTAG TCTGAGGAAGCCTGGCATGTGCA AATGAAGTTAACCTAA... > 500, 000 genes sequenced to date Expected number of unique protein structures: ~ 700-1, 000
  • 5. Basic concepts • conceptual foundations of bioinformatics: evolution protein folding protein function • bioinformatics builds mathematical models of these processes - to infer relationships between components of complex biological systems
  • 6. Information processing in cells coding regions regulatory sites nucleic acids transcripts proteins One-to-many mappings! Context-dependence!
  • 7. Global approaches: Toward a new Systems Biology Global cell state Genome Genome activation patterns: transcriptomics Protein population: proteomics Organisation: tissue imaging EM X-ray, NMR cells molecular complexes •How does the spatial and temporal organisation of living matter give rise to biological processes?
  • 8. Global approaches: Toward a new Systems Biology Perturbation Living cell Dynamic response “Virtual cell” Biological knowledge (computerised) Sequence information Structural information •Basic principles •Practical applications Bioinformatics Mathematical modelling Simulation
  • 9. We do not know yet whether the information in the genome is sufficient to reconstruct an entire biological system. Information on building blocks not enough, information on their interactions is essential. External environment Internal environment Metabolic net Genetic networks DNA hRNA mRNAs proteins
  • 10. Bioinformatics in context Genomics Molecular Biophysics biology Molecular evolution Ethical, legal, and social implications Bioinformatics Mathematics/ computer science
  • 11. Current challenges to users • Potential hurdles: Methods are in flux and not fully developed-scattered and heterogeneous resources • Remedies: Web resources navigation guides integration of tools and databanks http://www.biochem.ucl.ac.uk/~nagl/bioinformatics.html
  • 12. Example 1 Sequence homology search of the genome of Plasmodium falciparum Target identification for antimalerial drugs
  • 13. The search for new antimalarial drugs • Malaria is one of the leading causes of morbidity and mortality in the tropics. • 300 to 500 million estimated clinical cases and 1.5 million to 2.7 million deaths per year. • Nearly all fatal cases are caused by Plasmodium falciparum. • The parasite's resistance to conventional antimalarial drugs such as chloroquine is growing at an alarming rate.
  • 14. •P. falciparum has a plastidlike organelle, called the apicoplast, acquired by endosymbiosis of an alga. Jomaa et al. (1999) •Self-replicating, maternally inherited (35kb, circular DNA). •Comparative genome analysis: Search for orthologs. Apicoplast contains enzymes found in plant and bacterial, but not animal metabolic pathways. •Potential target for antimalerial drugs: DOXP reductoisomerase
  • 15. Jomaa et al. (1999) Science 285: 1573-1576:
  • 17. (Boguski, 1999) The challenge In 1995, the number of genes in the database started to exceed the number of papers on molecular biology and genetics in the literature!
  • 18. Data types primary data secondary data tertiary data sequence DNA amino acid AATGCGTATAGGC DMPVERILEALAVE primary database secondary “motifs”: regular protein structure expressions, blocks, profiles, fingerprints e. g., alpha-helices, beta-strands secondary db tertiary protein structure domains, folding units tertiary db atomic co-ordinates
  • 19. Primary biological databases • Nucleic acid EMBL GenBank DDBJ (DNA Data Bank of Japan) • Protein PIR MIPS SWISS-PROT TrEMBL NRL-3D
  • 20. International nucleotide data banks EMBL Europe EMBL EBI GenBank USA NLM NCBI International Advisory Meeting Collaborative Meeting DDBJ Japan NIG CIB TrEMBL NRDB
  • 28. Other primary protein databases • TrEMBL (translated EMBL) in SWISS-PROT format rapid access to sequence data from genome projects computer-annotated supplement to SWISS-PROT translations of all coding sequences (CDS) in EMBL • SP-TrEMBL • REM-TrEMBL: immunoglobulins, T-cell receptors, short fragments, synthetic and patented sequences
  • 29. Other primary protein databases The Protein Information Resource (PIR) • integrated system of protein sequence databases and derived related databases, e. g., alignment databases • rapid searching, comparison, and pattern matching of protein sequences • retrieval of descriptive, bibliographic, feature, and concurrent cross-reference information • aims to be comprehensive and consistently annotated
  • 30. PIR: related databases NRL-3D Sequence-Structure Database • produced by PIR from sequence and annotation information extracted from three-dimensional structures in the Protein Databank (PDB) • allows keyword and similarity searches
  • 31. PIR: related databases PATCHX integrated with PIR • a non-redundant database of protein sequences produced by MIPS, the European branch of PIR-International The PIR Protein Sequence Database and PATCHX together provide the most complete collection of protein sequence data currently available in the public domain.
  • 32. Composite protein sequence dbs NRDB OWL MIPSX(PIR+PATCHX) SP+TrEMBL PIR PIR PIR TrEMBL SP SP SP SP PDB GenBank MIPSOwn GenPept NRL-3D NRL-3D MIPSH PIRMOD MIPSTrn EMTrans GBTrans Kabat PseqIP
  • 33. OWL composite database OWL only released every 6-8 weeks By accession number • By database code • By text • By sequence • By title • By author • By query language • By regular expression Direct OWL access: OWL Blast server
  • 34. Two other useful sites INFOBIOGEN-The Public Catalog of Databases http://www.infobiogen.fr/services/dbcat/ KEGG-Kyoto Encyclopedia of Genes and Genomes http://www.genome.ad.jp/kegg/ Kyoto Encyclopedia of Genes and Genomes (KEGG) is an effort to computerize current knowledge of molecular and cellular biology in terms of the information pathways that consist of interacting molecules or genes and to provide links from the gene catalogs produced by genome sequencing projects.
  • 35. Sequence Retrieval System (SRS) Database browser that allows users to •retrieve •link •access entries from all interconnected resources. Users can formulate queries across a range of different database types.
  • 36. Guide to Protein Databases: http://www.biochem.ucl.ac.uk/~robert/bioinf /lecture1/index.html http://www.biochem.ucl.ac.uk/~robert/bioinf/lecture2/index .html With thanks to Dr Roman Laskowski.