Databases in Bioinformatics
What is a Database/Resource?
• Collection of related data in a consistent format
– structured
– searchable (index) -> table of contents
– updated periodically (release) -> new edition
– cross-referenced (hyperlinks) -> links with other db
• Also includes the associated tools (software) needed to access the db and to update, insert and delete db information
• Based on type and content of data
– Sequence or structure
– Nucleic acid or protein
– Other important biological information, such as enzymes and their metabolic pathways, mutations, diseases, drugs, images, etc.
• Based on source of data
– Primary database
– Secondary database
– Knowledge bases
– Integrated Database
Categories of databases for Life Sciences
• Sequences (DNA, protein) -> Primary db
• Genomics
• Protein domain/family -> Secondary db
• Mutation/polymorphism
• Proteomics (2D gel, MS)
• 3D structure -> Structure db
• Metabolism
• Bibliography
• Others
Primary biological databases
• Nucleic acid
EMBL
GenBank
DDBJ (DNA Data Bank of Japan)
• Protein
PIR
MIPS
SWISS-PROT
TrEMBL
NRL-3D
Nucleotide Databases
• EMBL: Nucleotide sequence database
• Ensembl: Automatic annotation of eukaryotic genomes
• Genome Server: Overview of completed genomes at EBI
• Genome-MOT: Genome monitoring table
• EMBL-Align: Multiple sequence alignment database
• Parasites: Parasite genome databases
• Mutations: Sequence variation database project
• IMGT: Immunogenetics database, comprising IMGT/LIGM (immunoglobulins and T-cell receptors), IMGT/HLA (the human MHC complex) and IMGT/MHC (the MHC complex of non-human species)
Reference site: www.ebi.ac.uk/Databases/nucleotide.html
EMBL/GenBank/DDBJ
• These three databases contain essentially the same information (with a few differences in format and syntax)
• Serve as archives containing all sequences (single genes, ESTs, complete genomes, etc.) derived from:
– Genome projects and sequencing centers
– Individual scientists
– Patent offices (e.g. USPTO, EPO)
• Non-confidential data are exchanged daily
• Currently: 2.5 × 10^7 sequences, over 3.2 × 10^10 bp
• Sequences from > 50,000 different species
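As a rough worked example, the two totals quoted on this slide imply an average entry length. The sketch below only divides the slide's figures; because the base-pair count is given as "over" 3.2 × 10^10, the result is best read as a lower bound on the mean, not an official statistic.

```python
# Figures taken from the slide (orders of magnitude, not exact release counts).
total_sequences = 25_000_000          # 2.5 x 10^7 sequences
total_base_pairs = 32_000_000_000     # "over" 3.2 x 10^10 bp, so a lower bound

# Mean length of an archived entry implied by these two totals.
mean_length_bp = total_base_pairs / total_sequences

print(f"Implied mean entry length: at least {mean_length_bp:.0f} bp")
```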
Databases related to Genomics
• Contain information on genes, gene location (mapping), gene nomenclature and links to sequence databases
• Exist for most organisms important for life science research
• Examples: MIM, GDB (human), MGD (mouse), FlyBase (Drosophila), SGD (yeast), MaizeDB (maize), SubtiList (B. subtilis), etc.
• Format: generally relational (Oracle, Sybase) or object-based (AceDB)
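A minimal sketch of how such a relational genomics database might be laid out, using Python's built-in sqlite3 module instead of Oracle or Sybase. The table names, column names and every row value (gene symbol, map position, accessions) are made-up placeholders for illustration and do not reflect the schema of any real database listed above.

```python
import sqlite3

def build_demo_db():
    """Create an in-memory database with a gene table and a cross-reference table."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE gene (
            gene_id      INTEGER PRIMARY KEY,
            symbol       TEXT NOT NULL,   -- gene nomenclature
            organism     TEXT NOT NULL,
            chromosome   TEXT,            -- gene location (mapping)
            map_position TEXT
        );
        CREATE TABLE xref (
            gene_id   INTEGER NOT NULL REFERENCES gene(gene_id),
            target_db TEXT NOT NULL,      -- linked sequence database
            accession TEXT NOT NULL
        );
        CREATE INDEX idx_gene_symbol ON gene(symbol);
    """)
    # Placeholder rows: none of these identifiers are real records.
    conn.execute(
        "INSERT INTO gene VALUES (1, 'DEMO1', 'Example organism', '1', 'demo-band')"
    )
    conn.executemany(
        "INSERT INTO xref VALUES (?, ?, ?)",
        [(1, "EMBL", "EMBL_DEMO_1"), (1, "SWISS-PROT", "SP_DEMO_1")],
    )
    return conn

def sequence_links(conn, symbol):
    """Return (database, accession) pairs linked to a gene symbol."""
    rows = conn.execute(
        """
        SELECT x.target_db, x.accession
        FROM gene AS g JOIN xref AS x ON x.gene_id = g.gene_id
        WHERE g.symbol = ?
        ORDER BY x.target_db
        """,
        (symbol,),
    )
    return rows.fetchall()

conn = build_demo_db()
links = sequence_links(conn, "DEMO1")
print(links)
```

Keeping cross-references in their own table lets one gene link to any number of sequence databases without changing the gene table.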
Ensembl
• Contains all the human genome DNA
sequences currently available in the public
domain.
• Automated annotation: by using different
software tools, features are identified in the
DNA sequences:
– Genes (known or predicted)
– Single nucleotide polymorphisms (SNPs)
– Repeats
– Homologies
• Created and maintained by the EBI and the Sanger Centre (UK)
• www.ensembl.org
Protein Databases
• SWISS-PROT: Annotated sequence database
• TrEMBL: Database of translated EMBL nucleotide sequences
• InterPro: Integrated resource for protein families, domains and functional sites
• CluSTr: Automatic classification of SWISS-PROT and TrEMBL entries
• IPI: A non-redundant human proteome set constructed from SWISS-PROT, TrEMBL, Ensembl and RefSeq
• GOA: Assignments of gene products to the Gene Ontology (GO) resource
• Proteome Analysis: Statistical and comparative analysis of the predicted proteomes of fully sequenced organisms
• Protein Profiles: Tables of SWISS-PROT and TrEMBL entries and alignments for the protein families of the Protein Profile
• IntEnz: The Integrated relational Enzyme database, which will contain enzyme data approved by the Nomenclature Committee
Reference site: www.ebi.ac.uk/Databases/protein.html
Swiss-Prot
• Annotated protein sequence database established in 1986 and maintained collaboratively since 1987 by the Department of Medical Biochemistry of the University of Geneva and the EBI
• Complete, curated, non-redundant and highly cross-referenced (links to 34 other databases)
• Available from a variety of servers and through sequence analysis software tools
• More than 8,000 different species
• The first 20 species represent about 42% of all sequences in the database
• More than 129,000 entries with 4.7 × 10^7 amino acids
• More than 622,000 entries in TrEMBL
TrEMBL (Translation of EMBL)
• Computer-annotated supplement to SWISS-PROT, created because manual curation cannot keep up with the flow of data
• Well-structured, SWISS-PROT-like resource
• Derived from automated translation of EMBL coding sequences (CDS); maintained at the EBI, UK
• Automatically generated and annotated using software tools (so not comparable to SWISS-PROT in quality)
• Contains everything that is not yet in SWISS-PROT
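The core step behind a resource of this kind, translating a coding sequence (CDS) into a protein sequence, can be sketched as follows. This is only an illustration using the standard genetic code; the function name `translate_cds` is hypothetical, and the real TrEMBL pipeline (feature parsing, alternative genetic codes, annotation rules) is not described in these slides and is not reproduced here.

```python
# Standard genetic code, written in the conventional T, C, A, G ordering
# of first, second and third codon positions ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate_cds(cds):
    """Translate a coding sequence into one-letter amino acids, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")  # accept RNA input as well
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        # Codons containing ambiguous bases (e.g. N) are reported as 'X'.
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

print(translate_cds("ATGGCCAAATGA"))
```

For the toy input above, the codons ATG, GCC and AAA translate to M, A and K, and TGA terminates translation.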
Structure Databases
• MSD: The Macromolecular Structure Database, a relational database representation of the clean Protein Data Bank (PDB)
• 3DSeq: 3D sequence alignment server; annotation of the alignments between sequence databases and the PDB
• FSSP: Based on exhaustive all-against-all 3D structure comparison of the protein structures currently in the PDB
• DALI: Fold classification based on structure-structure assignments
• 3Dee: Database of protein domain definitions in which the domains have been clustered on sequence and structural similarity
• NDB: Nucleic Acid Structure Database
Protein Data Bank (PDB)
• Important in solving real problems in molecular biology
• Protein Data Bank
– Established in 1971 at Brookhaven National Laboratory (BNL)
– Sole international repository of macromolecular structure data
– Moved to the Research Collaboratory for Structural Bioinformatics (RCSB): http://www.rcsb.org/
MEDLINE
• MEDLINE covers the fields of medicine, nursing, dentistry, veterinary medicine, the health care system, and the preclinical sciences
• Indexes more than 4,000 biomedical journals published in the United States and 70 other countries
• Contains over 10 million citations from 1966 to the present
• Contains links to biological databases and to some journals
• New records are added to PreMEDLINE daily
• Limitations:
– Many papers not dealing with humans are not in MEDLINE
– Before 1970, only the first 10 authors are kept
– Not all journals have citations going back to 1966
MEDLINE/PubMed
• PubMed is developed by the National Center for Biotechnology Information (NCBI)
• PubMed provides access to bibliographic databases such as MEDLINE, PreMEDLINE and HealthSTAR, and to integrated molecular biology databases (composite db)
• Record identifiers, e.g. PMID: 10923642 (PubMed ID), UI: 20378145 (MEDLINE ID)
Entrez at NCBI
Entrez is a retrieval system for searching several linked databases, such as:
• PubMed: The biomedical literature
• Nucleotide: Nucleotide sequence database (GenBank)
• Protein: Protein sequence database
• Structure: Three-dimensional macromolecular structures
• Genome: Complete genome assemblies
• PopSet: Population study data sets
• OMIM: Online Mendelian Inheritance in Man
• Taxonomy: Organisms in GenBank
• Books: Online books
• ProbeSet: Gene expression and microarray datasets
• 3D Domains: Domains from Entrez Structure
• UniSTS: Markers and mapping data
• SNP: Single nucleotide polymorphisms
• CDD: Conserved domains
Entrez: Search fields
• Keyword: searches a set of indexed terms
• Accession: searches accession numbers
• Author Name
• Affiliation of authors
• Journal Title
• E.C. Number
• Feature Key: searches for a particular DNA feature
• SeqId: a string identifier
• Title Words
• Text Words
• Organism
• PubMed ID
• Publication and modification dates
• Protein Name
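Search fields like these are typically combined by appending a bracketed field tag to each term. The sketch below builds such a query and the matching URL for NCBI's E-utilities `esearch` endpoint without sending any request. The helper names are hypothetical, and the tags `[Organism]` and `[Title]` plus the choice of the `protein` database are assumptions made for illustration; the exact tags available differ between Entrez databases.

```python
from urllib.parse import urlencode

# Public E-utilities search endpoint (no request is made in this sketch).
ESEARCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"

def fielded_term(criteria):
    """Join (value, field) pairs into a single query using bracketed field tags."""
    return " AND ".join(f"{value}[{field}]" for value, field in criteria)

def esearch_url(db, criteria):
    """Build an esearch URL for the given database and fielded criteria."""
    params = {"db": db, "term": fielded_term(criteria)}
    return ESEARCH_URL + "?" + urlencode(params)

criteria = [("Drosophila melanogaster", "Organism"), ("kinase", "Title")]
query = fielded_term(criteria)
url = esearch_url("protein", criteria)

print(query)
print(url)
```

Restricting each term to a field narrows the search: the organism name is matched only against the Organism field rather than against every indexed word in a record.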
