Free Powerpoint Templates
Page 1
Protein Database
By
KAUSHAL KUMAR SAHU
Assistant Professor (Ad Hoc)
Department of Biotechnology
Govt. Digvijay Autonomous P. G. College
Raj-Nandgaon ( C. G. )
Introduction
• Bioinformatics is the application of information technology to store, organize
and analyze the vast amount of biological data available in the form of
sequences and structures of proteins and nucleic acids. Nucleic acid data are
available as sequences, while protein data are available as both sequences and
structures.
• A biological database is a collection of data organized so that its
contents can easily be accessed, managed, and updated. Preparing a database
involves two activities:
– Collecting the data in a form that can be easily accessed
– Making it available to a multi-user system (always available to the user)
The network for production, construction
and accession of a database
EXPERIMENTS --> ORGANIZATION OF DATA --> HOST/SERVER DATABASES
(EDS, Electronic Data Storage) --> NETWORK --> USERS
USERS --> ONLINE ACCESS
USERS --> COPY --> PERSONAL DATABASE
Protein databases
• Protein databases are more specialized than primary sequence
databases. They contain information derived from the primary
sequence databases. Some contain protein translations of the
nucleic acid sequences. Some contain sets of patterns and motifs
derived from sequence homologs.
History
• The first database was created shortly after the insulin protein
sequence became available in 1956. Insulin was the first protein to be
sequenced; its sequence consists of just 51 residues.
• In 1959, V.M. Ingram made the first attempt to compare sickle cell
haemoglobin with normal haemoglobin and demonstrated their homology.
This prompted more protein sequencing and a rapid accumulation of sequence
information, and it became clear that a database was needed so that proteins
could be compared quickly using computational software.
• In 1965, Margaret Dayhoff established the first database of protein
sequences, published annually as a series of volumes
entitled “Atlas of Protein Sequence and Structure”.
• In 1971, the Protein Data Bank was established as the first protein structure
database.
Classification of biological database
Primary database:
Protein Data Bank (PDB)
• Three-dimensional structures are stored in the Protein Data Bank (PDB).
It is the single worldwide archive of structural data derived by X-ray
crystallography, nuclear magnetic resonance spectroscopy, and other
techniques, as well as structural models.
• The database is maintained by the Research Collaboratory for Structural
Bioinformatics (RCSB) at Rutgers University.
• Data in the PDB are of very high quality and are extensively curated.
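To make the idea of stored three-dimensional structural data concrete, the following is a minimal sketch of reading one atom record in the legacy fixed-column PDB text format. The record shown is hypothetical (its coordinates are invented for illustration), and the names `Atom` and `parse_atom_line` are assumptions of this sketch, not part of any PDB software.

```python
from typing import NamedTuple


class Atom(NamedTuple):
    serial: int
    name: str
    res_name: str
    chain_id: str
    res_seq: int
    x: float
    y: float
    z: float


def parse_atom_line(line: str) -> Atom:
    """Parse one ATOM/HETATM record using the legacy PDB column positions."""
    if not line.startswith(("ATOM", "HETATM")):
        raise ValueError("not an ATOM/HETATM record")
    return Atom(
        serial=int(line[6:11]),        # columns 7-11
        name=line[12:16].strip(),      # columns 13-16
        res_name=line[17:20].strip(),  # columns 18-20
        chain_id=line[21],             # column 22
        res_seq=int(line[22:26]),      # columns 23-26
        x=float(line[30:38]),          # columns 31-38
        y=float(line[38:46]),          # columns 39-46
        z=float(line[46:54]),          # columns 47-54
    )


# Hypothetical record assembled field by field; the values are invented.
example = (
    "ATOM  "     # columns 1-6: record name
    "    1"      # columns 7-11: atom serial number
    " "          # column 12: blank
    " N  "       # columns 13-16: atom name
    " "          # column 17: alternate location indicator
    "MET"        # columns 18-20: residue name
    " "          # column 21: blank
    "A"          # column 22: chain identifier
    "   1"       # columns 23-26: residue sequence number
    "    "       # columns 27-30: insertion code and blanks
    "  27.340"   # columns 31-38: x coordinate in angstroms
    "  24.430"   # columns 39-46: y coordinate
    "   2.614"   # columns 47-54: z coordinate
)

atom = parse_atom_line(example)
print(atom.res_name, atom.chain_id, atom.x, atom.y, atom.z)
```

Slicing by column rather than splitting on whitespace matters here, because adjacent fields in this format can touch without a separating space.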
Homepage
Sequence database:
SWISS-PROT protein sequence database
• SWISS-PROT was created at the Department of Medical Biochemistry
(University of Geneva) in 1986.
• Since 1987, the European Molecular Biology Laboratory (EMBL) and the Swiss
Institute of Bioinformatics (SIB) have worked in collaboration, as equal
partners, to develop and maintain this highly annotated repository of protein
sequences.
• It provides high-quality annotation with minimal redundancy.
Translated EMBL (TrEMBL)
• It was created in 1996 to bridge the gap between the flow of
genomic data and the availability of annotated protein sequences.
• TrEMBL contains computer-annotated records generated by translating the
coding sequences (CDS) available in the EMBL nucleotide sequence database.
• It has two main sections:
– SP-TrEMBL
– REM-TrEMBL
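The translation step behind TrEMBL records can be illustrated with a short sketch. It assumes the standard genetic code and a CDS that starts in frame; the function name `translate_cds` and the input sequence are hypothetical, and this is not the pipeline TrEMBL itself uses.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")  # accept RNA input as well
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an ambiguous codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Hypothetical four-codon CDS: ATG GCC AAG TAA
print(translate_cds("ATGGCCAAGTAA"))
```

A real annotation pipeline would also need to handle alternative genetic codes, partial codons and frame information taken from the nucleotide entry.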
Protein information resource (PIR)
• PIR was established in 1984 by the National Biomedical Research
Foundation (NBRF) as a resource to assist researchers in the identification
and interpretation of protein sequence information.
• The database is split into four sections, PIR1 to PIR4:
– PIR1 contains fully classified and annotated entries.
– PIR2 includes preliminary entries.
– PIR3 contains unverified entries.
– PIR4 entries fall into the following categories:
• Conceptually translated sequences
• Protein sequences
• Conceptual translations of artifactual sequences
• Sequences that are not genetically encoded and not produced on ribosomes
Homepage
Secondary databases:
Structural classification of proteins (SCOP)
• It was created in 1995 by Murzin et al. and is maintained at Cambridge, with
the aim of gathering information about the structural similarities of proteins
to increase our understanding of protein evolution and development.
• SCOP provides comprehensive information on the structural and evolutionary
relationships of proteins of known structure, including structures available in
the Protein Data Bank.
• The manually constructed SCOP classifies proteins in a hierarchy of
class, fold, superfamily, family, protein and species.
Class Architecture Topology Homology (CATH)
• The CATH database, established in 1993, is a protein structure classification
based on four levels: Class, Architecture, Topology and Homology.
• CATH contains a hierarchical domain classification of the protein structures
present in the Protein Data Bank and is maintained at University College
London.
• The classification is produced by a combination of automated and manual
methods.
Sequence database:
1. PROSITE:
• It is a method for determining the function of uncharacterized
proteins translated from genomic or cDNA sequences.
• It consists of a database of biologically significant sites, patterns and
profiles that help to reliably identify which known protein family (if any)
a new sequence belongs to.
• It includes protein pattern motifs indicative of a protein's function, which
are widely used for function prediction studies, cellular localization
annotation, and sequence classification.
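As an illustration of how such patterns can be matched against a new sequence, here is a minimal sketch that converts a pattern written in PROSITE syntax into a Python regular expression. It covers only the basic syntax elements (x, [ ], { }, repetition counts and the < and > anchors). The function name `prosite_to_regex` and the test sequence are hypothetical; the pattern N-{P}-[ST]-{P} is the N-glycosylation site pattern.

```python
import re

# One pattern element: a core followed by an optional (n) or (n,m) repetition.
_ELEMENT = re.compile(r"(.+?)(?:\((\d+)(?:,(\d+))?\))?")


def prosite_to_regex(pattern: str) -> str:
    """Convert a PROSITE-style pattern into an equivalent regular expression."""
    parts = []
    for element in pattern.strip().rstrip(".").split("-"):
        prefix = suffix = ""
        if element.startswith("<"):   # anchored at the N-terminus
            prefix, element = "^", element[1:]
        if element.endswith(">"):     # anchored at the C-terminus
            element, suffix = element[:-1], "$"
        match = _ELEMENT.fullmatch(element)
        if match is None:
            raise ValueError(f"cannot parse element {element!r}")
        core, low, high = match.groups()
        if core == "x":                                   # any residue
            core = "."
        elif core.startswith("{") and core.endswith("}"):  # forbidden residues
            core = "[^" + core[1:-1] + "]"
        repeat = ""
        if low is not None:
            repeat = "{" + low + ("," + high if high else "") + "}"
        parts.append(prefix + core + repeat + suffix)
    return "".join(parts)


regex = prosite_to_regex("N-{P}-[ST]-{P}")
hit = re.search(regex, "AANGSAA")  # hypothetical sequence fragment
print(regex, hit.group() if hit else None)
```

Patterns are only one of the descriptor types mentioned above; profiles are scored position by position and cannot be expressed as a regular expression.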
Homepage
3. BLOCKS
• Blocks are multiply aligned ungapped segments corresponding to the most
highly conserved regions of proteins.
• The Blocks database itself contains more than 4000 entries.
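The notion of an ungapped segment in a multiple alignment can be sketched as follows. This simplified example only locates runs of gap-free columns; it does not score conservation and is not the procedure used to build the Blocks database. The function name `ungapped_blocks`, the `min_width` parameter and the toy alignment are all hypothetical.

```python
from typing import List, Sequence, Tuple


def ungapped_blocks(alignment: Sequence[str], min_width: int = 3) -> List[Tuple[int, int]]:
    """Return (start, end) column ranges, end exclusive, that contain no gaps."""
    if not alignment:
        return []
    if len({len(seq) for seq in alignment}) != 1:
        raise ValueError("aligned sequences must have equal length")
    length = len(alignment[0])
    blocks = []
    start = None
    # Iterate one column past the end so a block at the right edge is closed.
    for col in range(length + 1):
        gap_free = col < length and all(seq[col] != "-" for seq in alignment)
        if gap_free and start is None:
            start = col
        elif not gap_free and start is not None:
            if col - start >= min_width:
                blocks.append((start, col))
            start = None
    return blocks


# Toy alignment with gaps in columns 3 and 7.
toy = ["MKV-LAGG", "MKVALAG-", "MRV-LSGG"]
for begin, end in ungapped_blocks(toy):
    print(begin, end, [seq[begin:end] for seq in toy])
```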
4. Pfam
• Pfam uses Hidden Markov Models (HMMs) to create protein family or domain
signatures.
• These signatures are thus particularly useful when analysing multidomain
proteins.
• The biggest drawback of Pfam is its lack of biological information
(annotation) for the protein families.
Important database search tools:
SEARCH TOOL                                 FUNCTION PROVIDED
BLAST (Basic Local Alignment Search Tool)   Analyzes sequence information and detects homologous sequences.
ENTREZ                                      Accesses literature, sequence and structure databases.
DNAPLOT                                     Sequence alignment tool.
LOCUS LINK                                  Accesses information on homologous genes.
STRUCTURE                                   Supports the Molecular Modeling Database (MMDB) and software tools for structure analysis.
TAXONOMY BROWSER                            Taxonomic classification of various species, as well as genetic information.
FASTA                                       Provides an algorithm to speed up sequence comparison.
Example: protein sequence of the hepatitis B virus surface antigen,
retrieved in FASTA format from NCBI
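A record retrieved in FASTA format is plain text: a header line starting with ">" followed by one or more lines of sequence. The sketch below shows one way to read such text. The function name `parse_fasta` is an assumption, and the records are invented placeholders, not the hepatitis B surface antigen sequence.

```python
from typing import Dict, Iterable, List


def parse_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Map each FASTA header (without the leading '>') to its full sequence."""
    chunks: Dict[str, List[str]] = {}
    header = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            header = line[1:]
            chunks[header] = []
        elif header is None:
            raise ValueError("sequence data found before the first header")
        else:
            chunks[header].append(line)
    return {name: "".join(parts) for name, parts in chunks.items()}


# Invented placeholder records for illustration only.
text = """>example_1 hypothetical protein
MKTAYIAK
QRQISFVK
>example_2 another hypothetical protein
MSDNE
"""

for name, sequence in parse_fasta(text.splitlines()).items():
    print(name, len(sequence), sequence)
```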
Application of protein database
• Protein sequence
• Determination of macromolecular structure
• Molecular evolution
• Drug development
Conclusion
• The aim of most protein structure databases is to organize and annotate
protein structures, giving the biological community access to the
experimental data in a useful form, whereas sequence databases focus on
sequence information and contain no structural information for the majority
of entries.
• Bioinformatics tools for efficient research will therefore have a
significant impact on the biological sciences and on the betterment of human
lives.
Mais conteúdo relacionado

Mais procurados (20)

Protein database
Protein databaseProtein database
Protein database
 
Prosite
PrositeProsite
Prosite
 
Bioinformatics data mining
Bioinformatics data miningBioinformatics data mining
Bioinformatics data mining
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Kegg databse
Kegg databseKegg databse
Kegg databse
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
MEGA (Molecular Evolutionary Genetics Analysis)
MEGA (Molecular Evolutionary Genetics Analysis)MEGA (Molecular Evolutionary Genetics Analysis)
MEGA (Molecular Evolutionary Genetics Analysis)
 
Entrez databases
Entrez databasesEntrez databases
Entrez databases
 
Introduction to NCBI
Introduction to NCBIIntroduction to NCBI
Introduction to NCBI
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Data Retrieval Systems
Data Retrieval SystemsData Retrieval Systems
Data Retrieval Systems
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databases
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
Ddbj
DdbjDdbj
Ddbj
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
Structural databases
Structural databases Structural databases
Structural databases
 
blast bioinformatics
blast bioinformaticsblast bioinformatics
blast bioinformatics
 
Genomic databases
Genomic databasesGenomic databases
Genomic databases
 

Semelhante a Protein database

Primary, secondary, tertiary biological database
Primary, secondary, tertiary biological databasePrimary, secondary, tertiary biological database
Primary, secondary, tertiary biological databaseKAUSHAL SAHU
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...SBituila
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...BibiQuinah
 
Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEPrashantSharma807
 
biological databases.pptx
biological databases.pptxbiological databases.pptx
biological databases.pptxscience lover
 
Primary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptxPrimary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptxVandana Yadav03
 
Bioinformatics مي.pdf
Bioinformatics  مي.pdfBioinformatics  مي.pdf
Bioinformatics مي.pdfnedalalazzwy
 
Primary and secondary database
Primary and secondary databasePrimary and secondary database
Primary and secondary databaseKAUSHAL SAHU
 
Introduction to Biological database ppt(1).pptx
Introduction to Biological database ppt(1).pptxIntroduction to Biological database ppt(1).pptx
Introduction to Biological database ppt(1).pptxRAJESHKUMAR428748
 
Proteomics resources at the EBI & ExPASy
Proteomics resources at the EBI & ExPASyProteomics resources at the EBI & ExPASy
Proteomics resources at the EBI & ExPASyChrist College, Rajkot
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...Elufer Akram
 

Semelhante a Protein database (20)

Biological databases
Biological databasesBiological databases
Biological databases
 
Primary, secondary, tertiary biological database
Primary, secondary, tertiary biological databasePrimary, secondary, tertiary biological database
Primary, secondary, tertiary biological database
 
Important protein databases and proteomics softwares
Important protein databases and proteomics softwaresImportant protein databases and proteomics softwares
Important protein databases and proteomics softwares
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...
 
Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASE
 
Protein Database
Protein DatabaseProtein Database
Protein Database
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
Introduction to databases.pptx
Introduction to databases.pptxIntroduction to databases.pptx
Introduction to databases.pptx
 
biological databases.pptx
biological databases.pptxbiological databases.pptx
biological databases.pptx
 
Primary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptxPrimary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptx
 
Bioinformatics مي.pdf
Bioinformatics  مي.pdfBioinformatics  مي.pdf
Bioinformatics مي.pdf
 
Biological databases
Biological databases Biological databases
Biological databases
 
Primary and secondary database
Primary and secondary databasePrimary and secondary database
Primary and secondary database
 
Introduction to Biological database ppt(1).pptx
Introduction to Biological database ppt(1).pptxIntroduction to Biological database ppt(1).pptx
Introduction to Biological database ppt(1).pptx
 
Proteomics resources at the EBI & ExPASy
Proteomics resources at the EBI & ExPASyProteomics resources at the EBI & ExPASy
Proteomics resources at the EBI & ExPASy
 
Biological database
Biological databaseBiological database
Biological database
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Biological data base
Biological data baseBiological data base
Biological data base
 

Mais de KAUSHAL SAHU

tumor suppressor gene, prb, p53 gene
tumor suppressor gene, prb, p53 genetumor suppressor gene, prb, p53 gene
tumor suppressor gene, prb, p53 geneKAUSHAL SAHU
 
tumor suppressor gene by
tumor suppressor gene bytumor suppressor gene by
tumor suppressor gene byKAUSHAL SAHU
 
tumor suppresor genes
tumor suppresor genestumor suppresor genes
tumor suppresor genesKAUSHAL SAHU
 
tumor suppressor gene, prb, p53
tumor suppressor gene, prb, p53tumor suppressor gene, prb, p53
tumor suppressor gene, prb, p53KAUSHAL SAHU
 
transcription factor by kk sahu
transcription factor by kk sahutranscription factor by kk sahu
transcription factor by kk sahuKAUSHAL SAHU
 
DNA repair by kk sahu
DNA repair by kk sahuDNA repair by kk sahu
DNA repair by kk sahuKAUSHAL SAHU
 
membrane protein, synthesis by
membrane protein, synthesis bymembrane protein, synthesis by
membrane protein, synthesis byKAUSHAL SAHU
 
prokaryotic translation mechinry
prokaryotic translation mechinryprokaryotic translation mechinry
prokaryotic translation mechinryKAUSHAL SAHU
 
translation mechinary
translation mechinarytranslation mechinary
translation mechinaryKAUSHAL SAHU
 
translation cycle, protein synnthesis
translation cycle, protein synnthesistranslation cycle, protein synnthesis
translation cycle, protein synnthesisKAUSHAL SAHU
 
co and post translation modification, by
co and post translation modification, byco and post translation modification, by
co and post translation modification, byKAUSHAL SAHU
 
co and post translation modification
co and post translation modificationco and post translation modification
co and post translation modificationKAUSHAL SAHU
 
Prokaryotic transcription by kk
Prokaryotic transcription by kk Prokaryotic transcription by kk
Prokaryotic transcription by kk KAUSHAL SAHU
 
Enzyme Kinetics and thermodynamic analysis
Enzyme Kinetics and thermodynamic analysisEnzyme Kinetics and thermodynamic analysis
Enzyme Kinetics and thermodynamic analysisKAUSHAL SAHU
 
Chromatin, Organization macromolecule complex
Chromatin, Organization macromolecule complexChromatin, Organization macromolecule complex
Chromatin, Organization macromolecule complexKAUSHAL SAHU
 
Receptor mediated endocytosis by kk
Receptor mediated endocytosis by kkReceptor mediated endocytosis by kk
Receptor mediated endocytosis by kkKAUSHAL SAHU
 
Recepter mediated endocytosis by kk ashu
Recepter mediated endocytosis by kk ashuRecepter mediated endocytosis by kk ashu
Recepter mediated endocytosis by kk ashuKAUSHAL SAHU
 
Protein sorting and targeting
Protein sorting and targetingProtein sorting and targeting
Protein sorting and targetingKAUSHAL SAHU
 
Prokaryotic translation machinery by kk
Prokaryotic translation machinery by kk Prokaryotic translation machinery by kk
Prokaryotic translation machinery by kk KAUSHAL SAHU
 
eukaryotic translation machinery by kk sahu
eukaryotic translation machinery by kk sahueukaryotic translation machinery by kk sahu
eukaryotic translation machinery by kk sahuKAUSHAL SAHU
 

Mais de KAUSHAL SAHU (20)

tumor suppressor gene, prb, p53 gene
tumor suppressor gene, prb, p53 genetumor suppressor gene, prb, p53 gene
tumor suppressor gene, prb, p53 gene
 
tumor suppressor gene by
tumor suppressor gene bytumor suppressor gene by
tumor suppressor gene by
 
tumor suppresor genes
tumor suppresor genestumor suppresor genes
tumor suppresor genes
 
tumor suppressor gene, prb, p53
tumor suppressor gene, prb, p53tumor suppressor gene, prb, p53
tumor suppressor gene, prb, p53
 
transcription factor by kk sahu
transcription factor by kk sahutranscription factor by kk sahu
transcription factor by kk sahu
 
DNA repair by kk sahu
DNA repair by kk sahuDNA repair by kk sahu
DNA repair by kk sahu
 
membrane protein, synthesis by
membrane protein, synthesis bymembrane protein, synthesis by
membrane protein, synthesis by
 
prokaryotic translation mechinry
prokaryotic translation mechinryprokaryotic translation mechinry
prokaryotic translation mechinry
 
translation mechinary
translation mechinarytranslation mechinary
translation mechinary
 
translation cycle, protein synnthesis
translation cycle, protein synnthesistranslation cycle, protein synnthesis
translation cycle, protein synnthesis
 
co and post translation modification, by
co and post translation modification, byco and post translation modification, by
co and post translation modification, by
 
co and post translation modification
co and post translation modificationco and post translation modification
co and post translation modification
 
Prokaryotic transcription by kk
Prokaryotic transcription by kk Prokaryotic transcription by kk
Prokaryotic transcription by kk
 
Enzyme Kinetics and thermodynamic analysis
Enzyme Kinetics and thermodynamic analysisEnzyme Kinetics and thermodynamic analysis
Enzyme Kinetics and thermodynamic analysis
 
Chromatin, Organization macromolecule complex
Chromatin, Organization macromolecule complexChromatin, Organization macromolecule complex
Chromatin, Organization macromolecule complex
 
Receptor mediated endocytosis by kk
Receptor mediated endocytosis by kkReceptor mediated endocytosis by kk
Receptor mediated endocytosis by kk
 
Recepter mediated endocytosis by kk ashu
Recepter mediated endocytosis by kk ashuRecepter mediated endocytosis by kk ashu
Recepter mediated endocytosis by kk ashu
 
Protein sorting and targeting
Protein sorting and targetingProtein sorting and targeting
Protein sorting and targeting
 
Prokaryotic translation machinery by kk
Prokaryotic translation machinery by kk Prokaryotic translation machinery by kk
Prokaryotic translation machinery by kk
 
eukaryotic translation machinery by kk sahu
eukaryotic translation machinery by kk sahueukaryotic translation machinery by kk sahu
eukaryotic translation machinery by kk sahu
 

Último

GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLkantirani197
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedDelhi Call girls
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑Damini Dixit
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...ssuser79fe74
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsSérgio Sacani
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxRizalinePalanog2
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryAlex Henderson
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Joonhun Lee
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptxAlMamun560346
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfrohankumarsinghrore1
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Silpa
 

Último (20)

GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 

Protein database

  • 1. Free Powerpoint Templates Page 1 Protein Database By KAUSHAL KUMAR SAHU Assistant Professor (Ad Hoc) Department of Biotechnology Govt. Digvijay Autonomous P. G. College Raj-Nandgaon ( C. G. )
  • 2. Free Powerpoint Templates Page 2 Introduction • Bioinformatics is the application of Information technology to store, organize and analyze the vast amount of biological data which is available in the form of sequences and structures of proteins and nucleic acids. The biological information of nucleic acids is available as sequences while the data of proteins is available as sequences and structures. • A biological database is a collection of data that is organized so that its contents can easily be accessed, managed, and updated. The activity of preparing a database can be divided in to: • Collection of data in a form which can be easily accessed • Making it available to a multi-user system (always available for the user)
  • 3. The network for production, construction and accession of a database
      [Diagram: experiments --> organization of data --> host/server holding the databases and electronic data storage (EDS) --> network --> users --> online access --> copy --> personal database]
  • 4. Protein databases
      • Protein databases are more specialized than primary sequence databases. They contain information derived from the primary sequence databases. Some contain protein translations of the nucleic acid sequences. Some contain sets of patterns and motifs derived from sequence homologs.
  • 5. History
      • The first database was created shortly after the insulin protein sequence became available in 1956. Insulin was the first protein to be sequenced, and its sequence consists of just 51 residues.
      • In 1959, V.M. Ingram made the first attempt to compare sickle cell haemoglobin with normal haemoglobin and demonstrated their homology. This led to more protein sequencing and the accumulation of a vast amount of information. It then became clear that a database was needed so that proteins could be compared quickly using computational software.
      • In 1965, Margaret Dayhoff established the first database of protein sequences, published annually as a series of volumes entitled "Atlas of Protein Sequence and Structure".
      • In 1971, the Protein Data Bank was established as the first protein structure database.
  • 6. Classification of biological databases
  • 7. Primary database: Protein Data Bank (PDB)
      • Three-dimensional structures are stored in the Protein Data Bank (PDB). This is the single worldwide archive of structural data derived by X-ray crystallography, nuclear magnetic resonance spectroscopy, and other techniques, as well as structural models.
      • The database is maintained by the Research Collaboratory for Structural Bioinformatics (RCSB) at Rutgers University.
      • Data in the PDB are of very high quality and are extensively curated.
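To make the idea of a structure archive more concrete, the following Python sketch reads atomic coordinates from text in the legacy fixed-width PDB file format. It assumes the standard column layout of ATOM/HETATM records (serial in columns 7-11, atom name in 13-16, residue name in 18-20, chain in 22, residue number in 23-26, x/y/z in 31-54). The `Atom` type, the function names, and the two sample records are illustrative only; the coordinates are made up and not taken from a real PDB entry.

```python
from typing import NamedTuple


class Atom(NamedTuple):
    serial: int
    name: str
    res_name: str
    chain: str
    res_seq: int
    x: float
    y: float
    z: float


def parse_atom_line(line: str) -> Atom:
    """Parse one fixed-width ATOM/HETATM record (slices are 0-based)."""
    return Atom(
        serial=int(line[6:11]),
        name=line[12:16].strip(),
        res_name=line[17:20].strip(),
        chain=line[21],
        res_seq=int(line[22:26]),
        x=float(line[30:38]),
        y=float(line[38:46]),
        z=float(line[46:54]),
    )


def parse_atoms(text: str) -> list[Atom]:
    """Collect every ATOM/HETATM record and ignore all other record types."""
    return [
        parse_atom_line(line)
        for line in text.splitlines()
        if line.startswith(("ATOM", "HETATM"))
    ]


# Hypothetical sample records (illustrative values only).
SAMPLE = """\
HEADER    ILLUSTRATIVE EXAMPLE
ATOM      1  N   MET A   1      27.340  24.430   2.614
ATOM      2  CA  MET A   1      26.266  25.413   2.842
END
"""

atoms = parse_atoms(SAMPLE)
for atom in atoms:
    print(atom.name, atom.res_name, atom.chain, atom.res_seq, atom.x, atom.y, atom.z)
```

For real work, an established parser (for example the one in Biopython) is a safer choice than hand-written slicing, since it also handles alternate locations, multiple models, and the newer mmCIF format.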
  • 11. Sequence database: SWISS-PROT protein sequence database
      • SWISS-PROT was created in 1986 at the Department of Medical Biochemistry, University of Geneva.
      • Since 1987, the European Molecular Biology Laboratory (EMBL) and the Swiss Institute of Bioinformatics (SIB) have worked in collaboration, as equal partners, to develop and maintain this highly annotated repository of protein sequences.
      • It provides high-quality annotation with minimal redundancy.
  • 12. Translated EMBL (TrEMBL)
      • It was created in 1996 to close the gap between the flow of genomic data and the supply of annotated protein sequences.
      • TrEMBL contains computer-annotated records generated by translating the coding sequences (CDS) available in the EMBL nucleotide sequence database.
      • It has two main sections:
          • SP-TrEMBL
          • REM-TrEMBL
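The translation step behind TrEMBL-style records can be sketched in a few lines of Python. This is only a minimal illustration using the standard genetic code; it is not TrEMBL's actual pipeline, and the function name `translate_cds` and the example sequence are assumptions made for the example.

```python
from itertools import product

# Standard genetic code, listed in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, reading frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")  # accept RNA input as well
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Hypothetical toy CDS: ATG GCC AAA TGA
print(translate_cds("ATGGCCAAATGA"))  # MAK
```

A production pipeline would additionally respect the organism-specific genetic code and the frame and location annotated for each CDS feature.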
  • 13. Protein Information Resource (PIR)
      • PIR was established in 1984 by the National Biomedical Research Foundation (NBRF) as a resource to assist researchers in the identification and interpretation of protein sequence information.
      • The database is split into four sections, PIR1 to PIR4:
          • PIR1 contains fully classified and annotated entries.
          • PIR2 includes preliminary entries.
          • PIR3 contains unverified entries.
          • PIR4 entries fall into the following categories:
              • Conceptual translation sequences
              • Protein sequences
              • Conceptual translations of artifactual sequences
              • Sequences that are not genetically encoded and not produced on ribosomes
  • 15. Secondary databases: Structural Classification of Proteins (SCOP)
      • It was created in 1995 by Murzin et al. and is maintained at Cambridge. Its aim is to gather information about structural similarities between proteins in order to increase our understanding of protein evolution and development.
      • SCOP provides comprehensive information on the structural and evolutionary relationships of proteins with known structure, including the structures available in the Protein Data Bank.
      • SCOP is constructed manually and classifies proteins in a hierarchy of class, fold, superfamily, family, protein, and species.
  • 16. Class Architecture Topology Homology (CATH)
      • The CATH database, established in 1993, is a protein structure classification based on four levels: Class, Architecture, Topology, and Homology.
      • CATH contains a hierarchical domain classification of the protein structures present in the Protein Data Bank and is maintained at University College London.
      • The classification is produced by a combination of automated and manual methods.
  • 17. Sequence database: 1. PROSITE
      • It is a method of determining the function of uncharacterized proteins translated from genomic or cDNA sequences.
      • It consists of a database of biologically significant sites, patterns, and profiles that help to reliably identify the known protein family (if any) to which a new sequence belongs.
      • It includes pattern motifs indicative of a protein's function, which are widely used for function prediction studies, cellular localization annotation, and sequence classification.
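As an illustration of how a PROSITE-style pattern can be used to scan a new sequence, the Python sketch below converts the pattern syntax into a regular expression. It covers only the common syntax elements (`x` for any residue, `[...]` for allowed residues, `{...}` for excluded residues, `(n)` or `(n,m)` for repeats, `<` and `>` as terminal anchors) and is not PROSITE's own scanning software. The pattern shown is the well-known N-glycosylation site pattern; the function names and the test sequence are made up for the example.

```python
import re


def prosite_to_regex(pattern: str) -> str:
    """Convert a simple PROSITE pattern into Python regex syntax."""
    parts = []
    for element in pattern.rstrip(".").split("-"):
        element = element.replace("<", "^").replace(">", "$")   # terminal anchors
        element = element.replace("{", "[^").replace("}", "]")  # excluded residues
        element = element.replace("(", "{").replace(")", "}")   # repeat counts
        element = element.replace("x", ".")                     # any residue
        parts.append(element)
    return "".join(parts)


def find_sites(pattern: str, sequence: str) -> list[int]:
    """Return 0-based start positions of all matches, including overlapping ones."""
    regex = re.compile(f"(?=({prosite_to_regex(pattern)}))")
    return [match.start() for match in regex.finditer(sequence)]


N_GLYCOSYLATION = "N-{P}-[ST]-{P}"  # PROSITE N-glycosylation site pattern
toy_sequence = "MKNGSAANPTQ"        # hypothetical sequence for illustration

print(prosite_to_regex(N_GLYCOSYLATION))        # N[^P][ST][^P]
print(find_sites(N_GLYCOSYLATION, toy_sequence))  # [2]
```

In the toy sequence, the site starting at position 2 (`NGSA`) matches, while `NPTQ` is rejected because proline follows the asparagine. Short patterns like this one match often by chance, which is one reason PROSITE also provides profiles.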
  • 19. 3. BLOCKS
      • Blocks are multiply aligned, ungapped segments corresponding to the most highly conserved regions of proteins.
      • The BLOCKS database itself contains more than 4000 entries.
    4. Pfam
      • Pfam uses Hidden Markov Models (HMMs) to create protein family or domain signatures.
      • These signatures are therefore particularly useful when analysing multidomain proteins.
      • The biggest drawback of Pfam is its lack of biological information (annotation) about the protein families.
  • 20. Important database search tools
      Search tool: function provided
      • BLAST (Basic Local Alignment Search Tool): analyzes sequence information and detects homologous sequences.
      • ENTREZ: provides access to literature, sequence, and structure databases.
      • DNAPLOT: sequence alignment tool.
      • LOCUS LINK: provides access to information on homologous genes.
      • STRUCTURE: supports the Molecular Modeling Database (MMDB) and software tools for structure analysis.
      • TAXONOMY BROWSER: taxonomic classification of various species, together with genetic information.
      • FASTA: provides an algorithm that speeds up sequence comparison.
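The speed-up mentioned for FASTA comes from first looking for short exact word matches (k-tuples) instead of aligning every residue against every other. The Python sketch below shows only that first step under simplifying assumptions: it indexes the words of the query, scans the subject, and counts hits per diagonal. The later FASTA stages (rescoring with a substitution matrix, joining regions, banded alignment) are left out, and the function name and toy sequences are illustrative.

```python
from collections import Counter, defaultdict


def ktup_diagonals(query: str, subject: str, ktup: int = 2) -> Counter:
    """Count shared k-tuples per diagonal (query offset minus subject offset)."""
    # Index every word of length ktup in the query by its start position.
    index = defaultdict(list)
    for i in range(len(query) - ktup + 1):
        index[query[i:i + ktup]].append(i)

    # Scan the subject once; each shared word votes for one diagonal.
    diagonals = Counter()
    for j in range(len(subject) - ktup + 1):
        for i in index.get(subject[j:j + ktup], ()):
            diagonals[i - j] += 1
    return diagonals


# Hypothetical toy sequences: the subject contains the query shifted by two.
hits = ktup_diagonals("ACDEFG", "XXACDEFG", ktup=2)
print(hits.most_common(1))  # [(-2, 5)]
```

A diagonal that collects many hits marks a region where the two sequences probably align without gaps, so only those regions need to be examined in detail.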
  • 21. Example: protein sequence of the hepatitis B virus surface antigen, FASTA output from NCBI
  • 25. Applications of protein databases
      • Protein sequence
      • Determination of macromolecular structure
      • Molecular evolution
      • Drug development
  • 26. Conclusion
      • The aim of most protein structure databases is to organize and annotate protein structures, giving the biological community useful access to the experimental data. Sequence databases, by contrast, focus on sequence information and contain no structural information for the majority of entries.
      • Bioinformatics tools that make research more efficient will therefore have a significant impact on the biological sciences and on the betterment of human lives.