BIOLOGICAL SEQUENCE DATABASES

NCBI
What is NCBI?
National Center for Biotechnology Information
Established in 1988
Part of the National Library of Medicine (NLM) at the National Institutes of Health (NIH)
Major aim: to provide public databases
Develops software tools for sequence analysis and disseminates biomedical information

Roles of NCBI
1) Maintains biological databases, both primary and secondary, including GenBank
2) Provides data retrieval systems such as Entrez
3) Provides computational resources for the analysis of GenBank data and other biological data

Kinds of databases
Primary databases
Original submissions by the experimentalists who generated the data
Content is controlled by the submitters
Examples include GenBank, dbSNP, and GEO
Secondary databases
Built from the primary data held in the primary databases
Content is controlled by a third party (NCBI)
Examples include RefSeq, RefSNP, NCBI Structure, Protein, etc.

NCBI homepage

NCBI
TOOLS
BLAST: standard BLAST, MegaBLAST, PSI-BLAST, PHI-BLAST, RPS-BLAST, BLAST 2 Sequences
Database retrieval tool
Specialized tools: ORF Finder, e-PCR, Spidey
Sequence submission tool: BankIt
DATABASES
Nucleotide database
Literature database
Protein database
Expression database
Structure database

Retrieval tool: Entrez
Integrated database search and retrieval system
Provides extensive links between and within database records
Cross-references records across different databases

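Entrez can also be queried programmatically. The following is a minimal sketch, assuming NCBI's public E-utilities ESearch endpoint (not mentioned on the slide); the helper name build_esearch_url and the example search term are illustrative only, and the code builds the URL without sending a request.

```python
from urllib.parse import urlencode

# Base address of the E-utilities ESearch service. This endpoint is an
# assumption of the sketch; the slide only names Entrez itself.
ESEARCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def build_esearch_url(db, term, retmax=20):
    """Return an ESearch URL for `term` in Entrez database `db`.

    Hypothetical helper: it only assembles the query string.
    """
    params = {"db": db, "term": term, "retmax": retmax}
    return ESEARCH_URL + "?" + urlencode(params)


if __name__ == "__main__":
    # Example: nucleotide records matching a gene name and an organism.
    print(build_esearch_url("nucleotide", "BRCA1[Gene] AND human[Organism]"))
```

Changing the db value (for example "protein" or "pubmed") points the same query at a different Entrez database, which is the cross-database access the slide describes.
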
Sequence submission to NCBI
Databases are constantly updated with new sequence submissions made through sequence submission tools such as:
BankIt
Sequin

BankIt
Web-based sequence submission tool
Reached from the NCBI home page through the GenBank link in the left sidebar
Tool of choice for simple submissions
Can also be used to update previously submitted information

Sequin
Stand-alone sequence submission and updating tool
Handles multiple sequence submissions
Provides increased capacity for long sequence submissions
Supports multiple annotations
Handles phylogenetic and population study sets

BLAST
Basic Local Alignment Search Tool
Generates alignments between a nucleotide or protein sequence, referred to as the “query”, and nucleotide or protein sequences within a database, referred to as “subject” sequences
Runs sequence similarity searches against a variety of sequence databases
UniGene, Gene, MMDB, GEO

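To make "local alignment" concrete, here is a minimal sketch of Smith-Waterman scoring, the exact dynamic-programming method for local alignment. This is not how BLAST itself works (BLAST is a faster heuristic); the scoring values (match 2, mismatch -1, gap -2) are arbitrary choices for illustration.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Return the best local alignment score between strings a and b.

    Uses a linear gap penalty and keeps only two rows of the score matrix.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            # Score for aligning a[i-1] with b[j-1].
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A cell never drops below 0, which is what makes the
            # alignment local rather than global.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


if __name__ == "__main__":
    print(smith_waterman_score("XXACGXX", "YYACGYY"))
```

The shared region "ACG" drives the score in the example; the unrelated flanking characters do not reduce it, which is why a short query can find a matching region inside a much longer subject sequence.
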
Kinds of BLAST
blastn, blastp, blastx, tblastn, tblastx

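The five programs differ in what kind of query and database they accept. The lookup table below is a small sketch of that relationship; the dictionary layout and the helper candidate_programs are illustrative and not part of any BLAST distribution.

```python
# Each entry: (query input, database input, level at which the comparison
# is made). Programs with an "x" or leading "t" translate nucleotide
# sequences in six frames and compare them at the protein level.
BLAST_PROGRAMS = {
    "blastn":  ("nucleotide", "nucleotide", "nucleotide"),
    "blastp":  ("protein", "protein", "protein"),
    "blastx":  ("nucleotide", "protein", "protein"),
    "tblastn": ("protein", "nucleotide", "protein"),
    "tblastx": ("nucleotide", "nucleotide", "protein"),
}


def candidate_programs(query_type, db_type):
    """List programs that accept the given query and database types.

    Both arguments are "nucleotide" or "protein". Hypothetical helper.
    """
    return [
        name
        for name, (query, database, _) in BLAST_PROGRAMS.items()
        if query == query_type and database == db_type
    ]


if __name__ == "__main__":
    print(candidate_programs("nucleotide", "protein"))
```
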
SPECIALIZED TOOLS
NCBI offers many sequence analysis tools; three are covered here:
1) ORF Finder
2) e-PCR
3) Spidey

ORF FINDER
Open Reading Frame Finder
Graphical analysis tool
Finds all open reading frames in a user-supplied sequence or in a sequence already in the databases
Uses standard and alternative genetic codes to analyze reading frames
Packaged with Sequin

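The idea of scanning reading frames can be sketched as follows. This is a simplified illustration, not NCBI's implementation: it assumes the standard genetic code only, treats ATG as the sole start codon, and expects plain A/C/G/T input. The function names are illustrative.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of the standard genetic code


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_orfs(seq, min_codons=2):
    """Find ATG...stop open reading frames in all six frames.

    Returns (strand, frame, start, end, orf) tuples. start and end are
    0-based, end-exclusive offsets on the strand that was scanned, and
    min_codons counts the stop codon.
    """
    seq = seq.upper()
    orfs = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i  # first start codon opens the ORF
                elif start is not None and codon in STOP_CODONS:
                    if (i + 3 - start) // 3 >= min_codons:
                        orfs.append((strand, frame, start, i + 3, s[start:i + 3]))
                    start = None  # look for the next ORF in this frame
    return orfs


if __name__ == "__main__":
    for orf in find_orfs("CCATGAAATGACC"):
        print(orf)
```

Supporting an alternative genetic code would mean swapping the start and stop codon sets, which is the option the slide refers to.
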
e-PCR
Electronic polymerase chain reaction
Searches for sequence-tagged sites (STSs)
The whole template DNA is searched for STSs
A newer search mode runs a query sequence against a sequence database

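An STS is typically defined by a primer pair and an expected product size, so searching a template for it can be sketched as below. This is an assumption-laden illustration rather than the e-PCR program itself: it scans only the given strand, requires exact primer matches, and the function name find_sts and its parameters are hypothetical.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_sts(template, forward, reverse, expected_size, margin=0):
    """Locate an STS on the template.

    A hit is a forward-primer match followed by a match to the reverse
    complement of the reverse primer, such that the product length is
    within `margin` of `expected_size`. Returns (start, end) pairs,
    0-based and end-exclusive.
    """
    template = template.upper()
    fwd = forward.upper()
    rev_site = reverse_complement(reverse.upper())
    hits = []
    start = template.find(fwd)
    while start != -1:
        end_site = template.find(rev_site, start + len(fwd))
        while end_site != -1:
            end = end_site + len(rev_site)
            if abs((end - start) - expected_size) <= margin:
                hits.append((start, end))
            end_site = template.find(rev_site, end_site + 1)
        start = template.find(fwd, start + 1)
    return hits


if __name__ == "__main__":
    print(find_sts("TTACGTAAAAAAGGCATT", "ACGT", "TGCC", expected_size=14))
```
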
Spidey
Another mRNA-to-genome alignment tool
Searches via BLAST
Takes as input a single genomic sequence and a set of mRNA sequences in FASTA format
Pseudogenes and paralogs are eliminated in this search and the true gene is selected

Databases of NCBI
Nucleotide
Literature
Protein
Gene expression
Structure
Chemical

Nucleotide database: GenBank
NCBI's primary sequence database
Comprehensive public database of nucleotide sequences
Includes bibliographic support
Built from author submissions to GenBank, including ESTs
GenBank, EMBL, and DDBJ together form the International Nucleotide Sequence Database Collaboration (INSDC)
Collaborative approach to share data daily

HomoloGene
Automated detection of homologs
Covers genes from completely sequenced eukaryotic genomes
Analyzes the proteins of the input organisms
Protein sequences are compared with blastp
Taxonomic trees are built
A statistical analysis of each match is done, and orthologs and paralogs are identified

dbSNP
Database of single nucleotide polymorphisms
Also covers short deletion and insertion polymorphisms
SNPs can be mapped onto 3D structures via Cn3D and MMDB
Functional variants can be matched with OMIM

Literature database: PMC
PubMed Central
Digital archive of peer-reviewed life sciences journals
Holds a very large collection of full-text journal articles
Full text is available immediately or within 12 months of publication

Protein database
Entrez Protein: NCBI's protein sequence database
Databases are cross-searched, e.g. PDB and Swiss-Prot
Taxonomic relations
CDD: Conserved Domain Database

Gene expression database
Distribution and regulation of transcriptional products
Covers normal and abnormal cell types
Many techniques have been developed to survey genome-wide transcript expression

SAGEmap
Serial Analysis of Gene Expression map
Gene expression data analysis
Tag-to-gene mapping
Maps SAGE tags to gene clusters or to a single gene
A reciprocal gene-to-tag SAGEmap is also available
Updated weekly

Structural database: MMDB
Molecular Modeling Database (MMDB)
3D macromolecular structures
Structures are determined experimentally by X-ray diffraction (XRD) and NMR
Evolutionary history of function
Relationships between macromolecules

DATABASES

Chemical database: PubChem
Database of chemical molecules
Freely accessible through a web user interface
Chemical structures
Diagnostic and therapeutic agents
Molecular mass below 2000 u
Bridge between macromolecular genomics and the small organic molecules of cellular metabolism

Display settings
Aspirin
Thanks
