SlideShare uma empresa Scribd logo
1 de 28
Baixar para ler offline
STRUCTURAL
DATABASES
PDB , CSD , CATH
INTRODUCTION:
• Structural databases are the essential tools for all
crystallographic works.
• They are used in the process of producing, solving
,refining and publishing the structure of a new material.
THE COMMON INFORMATION FOUND IN THE
STRUCTURAL DATABASE INCLUDE:
• Bibliographic information- author name, journal reference.
• The chemical compound name, formula and oxidation states
of the element present.
• Number of formula units per unit cell(contents)
• Dimension and symmetry of the unit cell.
• symmetry of the structure.
• Atomic coordinates, occupancies and thermal parameters.
• Any special features of the experiment to collect the
diffraction data.
• The structures in the database have been solved using X-ray,
neutron and electron diffraction techniques on sample,
computational modelling or by using NMR.
PDB:(PROTEIN DATABASES)
• Protein database contains the information about 3D structures of
the proteins.
• The structural information of the protein can be determined by
X-ray crystallography or Nuclear magnetic resonance(NMR)
spectroscopy methods.
• The PDB is overseen by an organisation called World Wide
Protein Data Bank,wwPDB.
• It is available at
• www.wwpdb.org
• www.pdbe.org
• www.pdbj.org
• Each entry in the PDB is provided with a unique identification
number called PDB ID.It is a 4 letter identification number which
consists of both alpha numeric characters.
PDB FILE FORMAT:
The PDB file format is the standard file format for protein
structure file. It describes how molecules are held together in
3-D Structure of a protein.
• The file contain hundreds or thousands of lines called
records. Each record provides a different set of information
like
• HEADER: This reocord contains file name, date of submission
and the PDB ID of the molecule.
• TITLE: This record contains the title of the PDB entry.
• COMPND: This record includes the protein name.
• SOURCE: This record contains the name of the organism in
which the particular protein is obtained.
• KEYWDS: This record contains the keywords that describes
about the protein.
PDB FILE FORMAT:
• EXPDTA: This record contains the method used for the
protein structure experiment.
• AUTHOR: This record contains the name of the
contributors who put the data into the database.
• REVDATA: This record contains the revision date of the
data related to protein.(Date of modification)
• JRNL: This record contains the journal details of the
literature about the protein
• REMARK: This record contains the remarks about the
protein structure.
• DBREF: This record contains the reference to the protein
in the sequence databases.
PDB FILE FORMAT:
• SEQRES: This record contains information about the
amino acid sequence of protein.
• HET: This record contains details about the non protein
substances in the protein.
• HETNAM: This record contain the compound name of
the non protein substances.
• HETSYN: This record contains the identical compound
name for the non protein substances.
• FORMUL: This record contain the chemical formula of
the non protein substances.
• HELIX: This record holds the recognition of helical
substructures.
PDB FILE FORMAT:
• LINK: This record holds the recognition of inter-residue bonds.
• ATOM: This record contains the atomic coordinates for the
structure.
• HEATM: This record contains the atomic coordinate record for
non protein substances.
• CONECT: This record contains the details about the bonds
involved in non protein atoms.
• MASTER: This record contains the details about the number of
REMARK records, HET records, HELIX records, CONECT records
and SEQRES records, etc.
• END: This record represent the end of the file.
•
THE PDB FORMAT
• 123456789+123456789+123456789+123456789+123456789+123456789+123456789+123456789+
• HEADER RETINOIC-ACID TRANSPORT 28-SEP-94 1CBS 1CBS 2
• COMPND CELLULAR RETINOIC-ACID-BINDING PROTEIN TYPE II COMPLEXED 1CBS 3
• COMPND 2 WITH ALL-TRANS-RETINOIC ACID (THE PRESUMED PHYSIOLOGICAL 1CBS 4
• COMPND 3 LIGAND) 1CBS 5
• SOURCE HUMAN (HOMO SAPIENS) 1CBS 6
• SOURCE 2 EXPRESSION SYSTEM: (ESCHERICHIA COLI) BL21 (DE3) 1CBS 7
• SOURCE 3 PLASMID: PET-3A 1CBS 8
• SOURCE 4 GENE: HUMAN CRABP-II 1CBS 9
• AUTHOR G.J.KLEYWEGT,T.BERGFORS,T.A.JONES 1CBS 10
• REVDAT 1 26-JAN-95 1CBS 0 1CBS 11
• -------------------------------------------------------------------------------------------------------------------------------------------
CATH:
• The CATH means Class, Architecture,Topology and
homologouus super family database for proteins
• It was created by Janet Thornton and colleagues at the
university college London.
• It is available at
http://www.biochem.ucl.ac.uk/bsm/cath
• http://www.cathdb.info
• It is a protein classification tool
IT CONSISTS OF FOUR LEVELS
• Class: It includes structural conformations of proteins
and their contents(alpha, beta, alpha/beta, etc.)
• Architecture: It describes the gross orientation of
secondary structures. It also gives information about
folding of polypeptide chains.
• Topology: It deals with the structures formed due to
different topological arrangement of secondary
structures. It explains the super families of the proteins.
• Homologous super family: It compares the sequence
and structure of various proteins. It helps to trace the
evolutionary relationship among the proteins.
CATH
• The CATH aims to provide official releases of protein
structures every 12 months
• It is a free publicly available online resource.
• The latest version of CATH contains 1,14,215
domains,2178 homologous superfamilies,1110 fold
groups.
THE CATH SERVER
• The CATH have recently set up a server which allows
the user to submit the co-ordinates of the newly
determined structure for automatic classification in
CATH.
• DOMAIN BOUNDARIES AND SEQUENCE COMPARISON
• CATH contains a detective program which is good for
identifying multidomain proteins.
• The results from the detective are returned to the user in
less than a minutes.
• Identified domains are scanned against non identical
representatives from CATH using a global sequence
alignment method
CATH SERVER
• If a sequence match 95% then the domain is identical
to one in CATH.
• If a sequence match less than 30% then the structures
are compared with all the sequence families (s-level).
• ASSESING STRUCTURAL SIMILARITY:
• TOPSCAN compares the secondary strucutres in each
fold family to identify the possible fold families to which
the new structures belong.
• Subsequently the fast version of structure comparison
SSAP scans represetatives from all the families
• Structural pairs having a ssap score more than 80 are
possible homologues while the score with 70-80 don’t
have no sequence or functional similiarity.
• Finally the SSAP structural alignment is displayed using a
graphical display package.
CSD
• The cambridge structural Database is both a repository
and a validated resource for 3-D structural data of
molecules containing carbon and hydrogen.
• It is used to know about the structures of organic,
metal-organic and organometallic molecules
• The specific entries in the CSD are complementary to
PDB and Inorganic crystal structure database.
• The data in the CSD is typically obtained by X-ray
crystallography and less frequently by neutron
diffraction
CSD
• The data in the CSD is submitted by crystallographers and
chemists from all over the world.
• The CSD is maintained by an incorporated company called
Cambridge Crystallographic Data centre, CCDC
• The CCDC are publicly available for download at the point of
publication.
• The CSD is updated with about 50,000 new structures each
year and are freely available to support teaching and other
activities
• The CSD is available at
• www.ccdc.cam.ac.uk
• webcsd.ccdc.cam.ac.uk
Structural
Database
Applications
Prediction
Analysis
Mining
Compariso
n
Classificatio
n
Structure
Refinement
Databases
Annotation
Structural databases

Mais conteúdo relacionado

Mais procurados (20)

Primary and secondary database
Primary and secondary databasePrimary and secondary database
Primary and secondary database
 
BLAST
BLASTBLAST
BLAST
 
Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment   Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment
 
Est database
Est databaseEst database
Est database
 
sequence of file formats in bioinformatics
sequence of file formats in bioinformaticssequence of file formats in bioinformatics
sequence of file formats in bioinformatics
 
Primary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyanaPrimary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyana
 
Ddbj
DdbjDdbj
Ddbj
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
Scoring matrices
Scoring matricesScoring matrices
Scoring matrices
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Fasta
FastaFasta
Fasta
 
Gen bank databases
Gen bank databasesGen bank databases
Gen bank databases
 
Introduction to NCBI
Introduction to NCBIIntroduction to NCBI
Introduction to NCBI
 
Protein database
Protein databaseProtein database
Protein database
 
Protein structure visualization tools-RASMOL
Protein structure visualization tools-RASMOLProtein structure visualization tools-RASMOL
Protein structure visualization tools-RASMOL
 
blast bioinformatics
blast bioinformaticsblast bioinformatics
blast bioinformatics
 
Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentation
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
 
Clustal
ClustalClustal
Clustal
 

Semelhante a Structural databases

Databases_CSS2.pptx
Databases_CSS2.pptxDatabases_CSS2.pptx
Databases_CSS2.pptxSilpa87
 
Bioinformatic_Databases_2.ppt Bioinformatics
Bioinformatic_Databases_2.ppt BioinformaticsBioinformatic_Databases_2.ppt Bioinformatics
Bioinformatic_Databases_2.ppt BioinformaticsMohamedHasan816582
 
Bioinformatic databases 2
Bioinformatic databases 2Bioinformatic databases 2
Bioinformatic databases 2Razzaqe
 
Bioinformatic_Databases_2.ppt
Bioinformatic_Databases_2.pptBioinformatic_Databases_2.ppt
Bioinformatic_Databases_2.pptNaglaaFathy42
 
Bioinformatic_Databases_2xcxzczxcxzxcxzc
Bioinformatic_Databases_2xcxzczxcxzxcxzcBioinformatic_Databases_2xcxzczxcxzxcxzc
Bioinformatic_Databases_2xcxzczxcxzxcxzcAdiM27
 
Bioinformatic databases 2
Bioinformatic databases 2Bioinformatic databases 2
Bioinformatic databases 2Razzaqe
 
Bioinformatics lecture xxiii
Bioinformatics lecture xxiiiBioinformatics lecture xxiii
Bioinformatics lecture xxiiiMuhammad Younis
 
Lecture 9 molecular descriptors
Lecture 9  molecular descriptorsLecture 9  molecular descriptors
Lecture 9 molecular descriptorsRAJAN ROLTA
 
Nucleic acid database
Nucleic acid database Nucleic acid database
Nucleic acid database bhargvi sharma
 
ECCB 2014: Extracting patterns of database and software usage from the bioinf...
ECCB 2014: Extracting patterns of database and software usage from the bioinf...ECCB 2014: Extracting patterns of database and software usage from the bioinf...
ECCB 2014: Extracting patterns of database and software usage from the bioinf...geraintduck
 

Semelhante a Structural databases (20)

Major databases in bioinformatics
Major databases in bioinformaticsMajor databases in bioinformatics
Major databases in bioinformatics
 
Databases_CSS2.pptx
Databases_CSS2.pptxDatabases_CSS2.pptx
Databases_CSS2.pptx
 
Bioinformatic_Databases_2.ppt Bioinformatics
Bioinformatic_Databases_2.ppt BioinformaticsBioinformatic_Databases_2.ppt Bioinformatics
Bioinformatic_Databases_2.ppt Bioinformatics
 
PDF文档.pdf
PDF文档.pdfPDF文档.pdf
PDF文档.pdf
 
Bioinformatic databases 2
Bioinformatic databases 2Bioinformatic databases 2
Bioinformatic databases 2
 
Bioinformatic_Databases_2.ppt
Bioinformatic_Databases_2.pptBioinformatic_Databases_2.ppt
Bioinformatic_Databases_2.ppt
 
Bioinformatic_Databases_2xcxzczxcxzxcxzc
Bioinformatic_Databases_2xcxzczxcxzxcxzcBioinformatic_Databases_2xcxzczxcxzxcxzc
Bioinformatic_Databases_2xcxzczxcxzxcxzc
 
Bioinformatic databases 2
Bioinformatic databases 2Bioinformatic databases 2
Bioinformatic databases 2
 
Biological databases
Biological databases Biological databases
Biological databases
 
Analisis 16S dan 18S rRNA.ppt
Analisis 16S dan 18S rRNA.pptAnalisis 16S dan 18S rRNA.ppt
Analisis 16S dan 18S rRNA.ppt
 
Bioinformatics lecture xxiii
Bioinformatics lecture xxiiiBioinformatics lecture xxiii
Bioinformatics lecture xxiii
 
Lecture 9 molecular descriptors
Lecture 9  molecular descriptorsLecture 9  molecular descriptors
Lecture 9 molecular descriptors
 
Introduction to pdb
Introduction to pdbIntroduction to pdb
Introduction to pdb
 
Nucleic acid database
Nucleic acid database Nucleic acid database
Nucleic acid database
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
 
Enfin, DAS and BioMart
Enfin, DAS and BioMartEnfin, DAS and BioMart
Enfin, DAS and BioMart
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Intro to databases
Intro to databasesIntro to databases
Intro to databases
 
ECCB 2014: Extracting patterns of database and software usage from the bioinf...
ECCB 2014: Extracting patterns of database and software usage from the bioinf...ECCB 2014: Extracting patterns of database and software usage from the bioinf...
ECCB 2014: Extracting patterns of database and software usage from the bioinf...
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 

Mais de Priyadharshana

Advantages of herbal medicine.
Advantages of herbal medicine.Advantages of herbal medicine.
Advantages of herbal medicine.Priyadharshana
 
National research laboratories in medicinal plant research (autosaved).edited
National research laboratories in medicinal plant research (autosaved).editedNational research laboratories in medicinal plant research (autosaved).edited
National research laboratories in medicinal plant research (autosaved).editedPriyadharshana
 
History of herbal medicine.
History of herbal medicine.History of herbal medicine.
History of herbal medicine.Priyadharshana
 
Cultivation of Asparagus racemosus.
Cultivation of Asparagus racemosus.Cultivation of Asparagus racemosus.
Cultivation of Asparagus racemosus.Priyadharshana
 
Climate change causes, effects and prevention
Climate change  causes, effects and preventionClimate change  causes, effects and prevention
Climate change causes, effects and preventionPriyadharshana
 
Radio immuno assay (priya)
Radio immuno assay (priya)Radio immuno assay (priya)
Radio immuno assay (priya)Priyadharshana
 

Mais de Priyadharshana (12)

Advantages of herbal medicine.
Advantages of herbal medicine.Advantages of herbal medicine.
Advantages of herbal medicine.
 
National research laboratories in medicinal plant research (autosaved).edited
National research laboratories in medicinal plant research (autosaved).editedNational research laboratories in medicinal plant research (autosaved).edited
National research laboratories in medicinal plant research (autosaved).edited
 
History of herbal medicine.
History of herbal medicine.History of herbal medicine.
History of herbal medicine.
 
Cultivation of Asparagus racemosus.
Cultivation of Asparagus racemosus.Cultivation of Asparagus racemosus.
Cultivation of Asparagus racemosus.
 
Pickling
PicklingPickling
Pickling
 
Canning
CanningCanning
Canning
 
Chromatography
ChromatographyChromatography
Chromatography
 
Climate change causes, effects and prevention
Climate change  causes, effects and preventionClimate change  causes, effects and prevention
Climate change causes, effects and prevention
 
Radio immuno assay (priya)
Radio immuno assay (priya)Radio immuno assay (priya)
Radio immuno assay (priya)
 
Vaccines
Vaccines Vaccines
Vaccines
 
Anorexia
Anorexia Anorexia
Anorexia
 
Landslides
LandslidesLandslides
Landslides
 

Último

Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfrohankumarsinghrore1
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)Areesha Ahmad
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Servicemonikaservice1
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedDelhi Call girls
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Silpa
 
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Monika Rani
 
IDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicineIDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicinesherlingomez2
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformationAreesha Ahmad
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxBhagirath Gogikar
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxSuji236384
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .Poonam Aher Patil
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...ssuser79fe74
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsSérgio Sacani
 
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)AkefAfaneh2
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...chandars293
 

Último (20)

Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
IDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicineIDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicine
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptx
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 

Structural databases

  • 2. INTRODUCTION: • Structural databases are the essential tools for all crystallographic works. • They are used in the process of producing, solving ,refining and publishing the structure of a new material.
  • 3. THE COMMON INFORMATION FOUND IN THE STRUCTURAL DATABASE INCLUDE: • Bibliographic information- author name, journal reference. • The chemical compound name, formula and oxidation states of the element present. • Number of formula units per unit cell(contents) • Dimension and symmetry of the unit cell. • symmetry of the structure. • Atomic coordinates, occupancies and thermal parameters. • Any special features of the experiment to collect the diffraction data. • The structures in the database have been solved using X-ray, neutron and electron diffraction techniques on sample, computational modelling or by using NMR.
  • 4. PDB:(PROTEIN DATABASES) • Protein database contains the information about 3D structures of the proteins. • The structural information of the protein can be determined by X-ray crystallography or Nuclear magnetic resonance(NMR) spectroscopy methods. • The PDB is overseen by an organisation called World Wide Protein Data Bank,wwPDB. • It is available at • www.wwpdb.org • www.pdbe.org • www.pdbj.org • Each entry in the PDB is provided with a unique identification number called PDB ID.It is a 4 letter identification number which consists of both alpha numeric characters.
  • 5.
  • 6.
  • 7. PDB FILE FORMAT: The PDB file format is the standard file format for protein structure file. It describes how molecules are held together in 3-D Structure of a protein. • The file contain hundreds or thousands of lines called records. Each record provides a different set of information like • HEADER: This reocord contains file name, date of submission and the PDB ID of the molecule. • TITLE: This record contains the title of the PDB entry. • COMPND: This record includes the protein name. • SOURCE: This record contains the name of the organism in which the particular protein is obtained. • KEYWDS: This record contains the keywords that describes about the protein.
  • 8. PDB FILE FORMAT: • EXPDTA: This record contains the method used for the protein structure experiment. • AUTHOR: This record contains the name of the contributors who put the data into the database. • REVDATA: This record contains the revision date of the data related to protein.(Date of modification) • JRNL: This record contains the journal details of the literature about the protein • REMARK: This record contains the remarks about the protein structure. • DBREF: This record contains the reference to the protein in the sequence databases.
  • 9. PDB FILE FORMAT: • SEQRES: This record contains information about the amino acid sequence of protein. • HET: This record contains details about the non protein substances in the protein. • HETNAM: This record contain the compound name of the non protein substances. • HETSYN: This record contains the identical compound name for the non protein substances. • FORMUL: This record contain the chemical formula of the non protein substances. • HELIX: This record holds the recognition of helical substructures.
  • 10. PDB FILE FORMAT: • LINK: This record holds the recognition of inter-residue bonds. • ATOM: This record contains the atomic coordinates for the structure. • HEATM: This record contains the atomic coordinate record for non protein substances. • CONECT: This record contains the details about the bonds involved in non protein atoms. • MASTER: This record contains the details about the number of REMARK records, HET records, HELIX records, CONECT records and SEQRES records, etc. • END: This record represent the end of the file. •
  • 11.
  • 12. THE PDB FORMAT • 123456789+123456789+123456789+123456789+123456789+123456789+123456789+123456789+ • HEADER RETINOIC-ACID TRANSPORT 28-SEP-94 1CBS 1CBS 2 • COMPND CELLULAR RETINOIC-ACID-BINDING PROTEIN TYPE II COMPLEXED 1CBS 3 • COMPND 2 WITH ALL-TRANS-RETINOIC ACID (THE PRESUMED PHYSIOLOGICAL 1CBS 4 • COMPND 3 LIGAND) 1CBS 5 • SOURCE HUMAN (HOMO SAPIENS) 1CBS 6 • SOURCE 2 EXPRESSION SYSTEM: (ESCHERICHIA COLI) BL21 (DE3) 1CBS 7 • SOURCE 3 PLASMID: PET-3A 1CBS 8 • SOURCE 4 GENE: HUMAN CRABP-II 1CBS 9 • AUTHOR G.J.KLEYWEGT,T.BERGFORS,T.A.JONES 1CBS 10 • REVDAT 1 26-JAN-95 1CBS 0 1CBS 11 • -------------------------------------------------------------------------------------------------------------------------------------------
  • 13. CATH: • The CATH means Class, Architecture,Topology and homologouus super family database for proteins • It was created by Janet Thornton and colleagues at the university college London. • It is available at http://www.biochem.ucl.ac.uk/bsm/cath • http://www.cathdb.info • It is a protein classification tool
  • 14. IT CONSISTS OF FOUR LEVELS • Class: It includes structural conformations of proteins and their contents(alpha, beta, alpha/beta, etc.) • Architecture: It describes the gross orientation of secondary structures. It also gives information about folding of polypeptide chains. • Topology: It deals with the structures formed due to different topological arrangement of secondary structures. It explains the super families of the proteins. • Homologous super family: It compares the sequence and structure of various proteins. It helps to trace the evolutionary relationship among the proteins.
  • 15.
  • 16. CATH • The CATH aims to provide official releases of protein structures every 12 months • It is a free publicly available online resource. • The latest version of CATH contains 1,14,215 domains,2178 homologous superfamilies,1110 fold groups.
  • 17.
  • 18. THE CATH SERVER • The CATH have recently set up a server which allows the user to submit the co-ordinates of the newly determined structure for automatic classification in CATH. • DOMAIN BOUNDARIES AND SEQUENCE COMPARISON • CATH contains a detective program which is good for identifying multidomain proteins. • The results from the detective are returned to the user in less than a minutes. • Identified domains are scanned against non identical representatives from CATH using a global sequence alignment method
  • 19. CATH SERVER • If a sequence match 95% then the domain is identical to one in CATH. • If a sequence match less than 30% then the structures are compared with all the sequence families (s-level). • ASSESING STRUCTURAL SIMILARITY: • TOPSCAN compares the secondary strucutres in each fold family to identify the possible fold families to which the new structures belong. • Subsequently the fast version of structure comparison SSAP scans represetatives from all the families • Structural pairs having a ssap score more than 80 are possible homologues while the score with 70-80 don’t have no sequence or functional similiarity. • Finally the SSAP structural alignment is displayed using a graphical display package.
  • 20.
  • 21.
  • 22. CSD • The cambridge structural Database is both a repository and a validated resource for 3-D structural data of molecules containing carbon and hydrogen. • It is used to know about the structures of organic, metal-organic and organometallic molecules • The specific entries in the CSD are complementary to PDB and Inorganic crystal structure database. • The data in the CSD is typically obtained by X-ray crystallography and less frequently by neutron diffraction
  • 23. CSD • The data in the CSD is submitted by crystallographers and chemists from all over the world. • The CSD is maintained by an incorporated company called Cambridge Crystallographic Data centre, CCDC • The CCDC are publicly available for download at the point of publication. • The CSD is updated with about 50,000 new structures each year and are freely available to support teaching and other activities • The CSD is available at • www.ccdc.cam.ac.uk • webcsd.ccdc.cam.ac.uk
  • 24.
  • 25.
  • 26.