SlideShare uma empresa Scribd logo
1 de 15
Motif and Pattern Databases And
some practical approaches
Sucheta Tripathy
10/2/2016
Motifs
• Defined as a nucleotide or amino acid
sequence pattern that is widespread and is
associated with a biological function.
– A sequence motif = A structural Motif.
– A sequence motif residing in the coding region
may encode a structural motif.
– Non-coding nucleotide motifs may have regulatory
role. May have recognition sites for DNA binding
proteins.
Motifs, profiles and patterns
• Conserved region of a DNA or protein – Motif
• Qualitative expression of a motif – Pattern
– Regular Expression
– C[TA]TTG{X}
• Quantitative expression of a motif – Profile
– Position Specific Scoring Matrices (PSSMs)
– Weight matrices
Motifs/Patterns
N{P}[ST]{P}
[FILV]Qxxx[RK]Gxxx[RK]xx[FILVWY]
[] -> or (Probability information is lost)
{} -> Not
() -> repeated
^ -> Beginning
Profiles
• Quantitative representation.
• More useful for training dataset.
TCTAGAAGATGGCAGTGGCGAAGA
TCTAGAAAATGACAGTGGCGAAGA
TCTAGAAAATGGCAGTAGCGAAGA
TCTACTAAATGA TAGTAGCGAAGA
A 0,0,0,100 ,0, 75,100, 75 ATG
T 100,0,100,0,0, 25, 0, 0 ATG
G 0, 0, 0, 0, 75 ,0, 0, 25 ATG
C 0,100,0,0, 25 ,0, 0, 0 ATG
De novo prediction of Motifs
• MEME; EXTREME; AlignAce, Amadeus,
CisModule, FIRE, Gibbs Motif Sampler,
PhyloGibbs, SeSiMCMC, ChIPMunk and
Weeder. SCOPE, MotifVoter, and Mprofiler
MEME (Multiple Expectation Maximization for
Motif Elicitation)
Figure 3. Resources
MacIsaac KD, Fraenkel E (2006) Practical Strategies for Discovering Regulatory DNA Sequence Motifs. PLoS Comput Biol 2(4): e36.
doi:10.1371/journal.pcbi.0020036
http://journals.plos.org/ploscompbiol/article?id=info:doi/10.1371/journal.pcbi.0020036
MRLSFVPLLQLSRLVVSTQHSTKMSTVYRTCKMNEIALSLLAPTQPLDADQGVMSPMASSDQ
TTSIGDFRFLRTHHDKEERGLLVTSLTKGLAETSFPYR YTSMCATICSITHSRADAAPAKQAH
Prosite
ATGCGTCTCTCCTTCGTTCCACTACTGCAGCTCTCTCGTCTGGTCGTTAGCACACAACATAGTACGAAAATGA
GCACAGTATACCGTACCTGCAAAATGAATGAAATAGCTCTCTCGTTGCTGGCGCCAACGCAGCCATTGGACG
CTGACCAGGGTGTTATGTCACCGATGGCCTCATCAGACCAGACAACCTCAATTGG TGACTTTCGGTTCCTGA
GAACCCACCACGATAAAGAAGAGCGGGGCTTGCTGGTTACCAGCCTCACAAAAGGTTTGGCTGAAACATCAT
TTCCGTATCGATACACTTCGATGTGCGCAACTATTTGTTCAATTACGCATTCTCGGGCAGATGCTGCGCCTGC
GAAGCAGGCGCACTA
Scan this sequence and get me the motif
OR Build a PSSM
ATGCGTCTCTC
ATGCCTCTGTC
ATGCGTCTCTC
ATGCGTCTCTC
ATGCGTCTATC

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

Dynamic programming
Dynamic programming Dynamic programming
Dynamic programming
 
Gene prediction method
Gene prediction method Gene prediction method
Gene prediction method
 
MULTIPLE SEQUENCE ALIGNMENT
MULTIPLE  SEQUENCE  ALIGNMENTMULTIPLE  SEQUENCE  ALIGNMENT
MULTIPLE SEQUENCE ALIGNMENT
 
Sequence alignment global vs. local
Sequence alignment  global vs. localSequence alignment  global vs. local
Sequence alignment global vs. local
 
Pyrosequencing
PyrosequencingPyrosequencing
Pyrosequencing
 
Basics of bioinformatics
Basics of bioinformaticsBasics of bioinformatics
Basics of bioinformatics
 
Microarray Data Analysis
Microarray Data AnalysisMicroarray Data Analysis
Microarray Data Analysis
 
Ddbj
DdbjDdbj
Ddbj
 
Pairwise sequence alignment
Pairwise sequence alignmentPairwise sequence alignment
Pairwise sequence alignment
 
Secondary Structure Prediction of proteins
Secondary Structure Prediction of proteins Secondary Structure Prediction of proteins
Secondary Structure Prediction of proteins
 
Fasta
FastaFasta
Fasta
 
Peptide Mass Fingerprinting
Peptide Mass FingerprintingPeptide Mass Fingerprinting
Peptide Mass Fingerprinting
 
Gene prediction methods vijay
Gene prediction methods  vijayGene prediction methods  vijay
Gene prediction methods vijay
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Prosite
PrositeProsite
Prosite
 
Major databases in bioinformatics
Major databases in bioinformaticsMajor databases in bioinformatics
Major databases in bioinformatics
 
Introduction to NCBI
Introduction to NCBIIntroduction to NCBI
Introduction to NCBI
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Sequence file formats
Sequence file formatsSequence file formats
Sequence file formats
 

Destaque

Motif presentation
Motif presentationMotif presentation
Motif presentationAmir Razmjou
 
XPRIME: A Novel Motif Searching Method
XPRIME: A Novel Motif Searching MethodXPRIME: A Novel Motif Searching Method
XPRIME: A Novel Motif Searching Methodrlpoulsen
 
Protein threading using context specific alignment potential ismb-2013
Protein threading using context specific alignment potential ismb-2013Protein threading using context specific alignment potential ismb-2013
Protein threading using context specific alignment potential ismb-2013Sheng Wang
 
Drablos Composite Motifs Bosc2009
Drablos Composite Motifs Bosc2009Drablos Composite Motifs Bosc2009
Drablos Composite Motifs Bosc2009bosc
 
Benchmarking 16S rRNA gene sequencing and bioinformatics tools for identifica...
Benchmarking 16S rRNA gene sequencing and bioinformatics tools for identifica...Benchmarking 16S rRNA gene sequencing and bioinformatics tools for identifica...
Benchmarking 16S rRNA gene sequencing and bioinformatics tools for identifica...Luca Cozzuto
 
MEMEs in the Classroom
MEMEs in the ClassroomMEMEs in the Classroom
MEMEs in the ClassroomMichael A.
 
Dow fllormate dow pavimentos
Dow fllormate dow pavimentosDow fllormate dow pavimentos
Dow fllormate dow pavimentosCarla Alves
 
Night motif
Night motifNight motif
Night motifhmfowler
 
Optimum insulation thickness for building envelope a review
Optimum insulation thickness for building envelope  a reviewOptimum insulation thickness for building envelope  a review
Optimum insulation thickness for building envelope a revieweSAT Journals
 
Analysis of ChIP-Seq Data
Analysis of ChIP-Seq DataAnalysis of ChIP-Seq Data
Analysis of ChIP-Seq DataPhil Ewels
 
Protein Evolution: Structure, Function, and Human Health
Protein Evolution: Structure, Function, and Human HealthProtein Evolution: Structure, Function, and Human Health
Protein Evolution: Structure, Function, and Human HealthDan Gaston
 
Content – knowing when and how to use it
Content – knowing when and how to use itContent – knowing when and how to use it
Content – knowing when and how to use itBANNER
 
Theme,Symbols and Motifs
Theme,Symbols and MotifsTheme,Symbols and Motifs
Theme,Symbols and MotifsGuerillateacher
 
B A N N E R S
B A N N E R SB A N N E R S
B A N N E R Svaveloz
 
Patter lattice as a model of human's language processing
Patter lattice as a model of human's language processingPatter lattice as a model of human's language processing
Patter lattice as a model of human's language processingKow Kuroda
 
Methods for characterization of animal genomes(snp,str,qtl,rflp,rapd)
Methods for characterization of animal genomes(snp,str,qtl,rflp,rapd)Methods for characterization of animal genomes(snp,str,qtl,rflp,rapd)
Methods for characterization of animal genomes(snp,str,qtl,rflp,rapd)Dr Vijayata choudhary
 

Destaque (20)

Motif presentation
Motif presentationMotif presentation
Motif presentation
 
XPRIME: A Novel Motif Searching Method
XPRIME: A Novel Motif Searching MethodXPRIME: A Novel Motif Searching Method
XPRIME: A Novel Motif Searching Method
 
Protein threading using context specific alignment potential ismb-2013
Protein threading using context specific alignment potential ismb-2013Protein threading using context specific alignment potential ismb-2013
Protein threading using context specific alignment potential ismb-2013
 
Drablos Composite Motifs Bosc2009
Drablos Composite Motifs Bosc2009Drablos Composite Motifs Bosc2009
Drablos Composite Motifs Bosc2009
 
Benchmarking 16S rRNA gene sequencing and bioinformatics tools for identifica...
Benchmarking 16S rRNA gene sequencing and bioinformatics tools for identifica...Benchmarking 16S rRNA gene sequencing and bioinformatics tools for identifica...
Benchmarking 16S rRNA gene sequencing and bioinformatics tools for identifica...
 
6 motif and pattern
6   motif and pattern6   motif and pattern
6 motif and pattern
 
MEMEs in the Classroom
MEMEs in the ClassroomMEMEs in the Classroom
MEMEs in the Classroom
 
Macs course
Macs courseMacs course
Macs course
 
Apresentacao NG6
Apresentacao NG6Apresentacao NG6
Apresentacao NG6
 
Dow fllormate dow pavimentos
Dow fllormate dow pavimentosDow fllormate dow pavimentos
Dow fllormate dow pavimentos
 
Night motif
Night motifNight motif
Night motif
 
Optimum insulation thickness for building envelope a review
Optimum insulation thickness for building envelope  a reviewOptimum insulation thickness for building envelope  a review
Optimum insulation thickness for building envelope a review
 
DNA Motif Finding 2010
DNA Motif Finding 2010DNA Motif Finding 2010
DNA Motif Finding 2010
 
Analysis of ChIP-Seq Data
Analysis of ChIP-Seq DataAnalysis of ChIP-Seq Data
Analysis of ChIP-Seq Data
 
Protein Evolution: Structure, Function, and Human Health
Protein Evolution: Structure, Function, and Human HealthProtein Evolution: Structure, Function, and Human Health
Protein Evolution: Structure, Function, and Human Health
 
Content – knowing when and how to use it
Content – knowing when and how to use itContent – knowing when and how to use it
Content – knowing when and how to use it
 
Theme,Symbols and Motifs
Theme,Symbols and MotifsTheme,Symbols and Motifs
Theme,Symbols and Motifs
 
B A N N E R S
B A N N E R SB A N N E R S
B A N N E R S
 
Patter lattice as a model of human's language processing
Patter lattice as a model of human's language processingPatter lattice as a model of human's language processing
Patter lattice as a model of human's language processing
 
Methods for characterization of animal genomes(snp,str,qtl,rflp,rapd)
Methods for characterization of animal genomes(snp,str,qtl,rflp,rapd)Methods for characterization of animal genomes(snp,str,qtl,rflp,rapd)
Methods for characterization of animal genomes(snp,str,qtl,rflp,rapd)
 

Mais de Sucheta Tripathy (20)

Gal
GalGal
Gal
 
Ramorum2016 final
Ramorum2016 finalRamorum2016 final
Ramorum2016 final
 
Primer designgeneprediction
Primer designgenepredictionPrimer designgeneprediction
Primer designgeneprediction
 
Databases ii
Databases iiDatabases ii
Databases ii
 
Snps and microarray
Snps and microarraySnps and microarray
Snps and microarray
 
Stat2013
Stat2013Stat2013
Stat2013
 
26 nov2013seminar
26 nov2013seminar26 nov2013seminar
26 nov2013seminar
 
Stat2013
Stat2013Stat2013
Stat2013
 
Presentation2013
Presentation2013Presentation2013
Presentation2013
 
Lecture7,8
Lecture7,8Lecture7,8
Lecture7,8
 
Lecture5,6
Lecture5,6Lecture5,6
Lecture5,6
 
Primer designgeneprediction
Primer designgenepredictionPrimer designgeneprediction
Primer designgeneprediction
 
Lecture 3,4
Lecture 3,4Lecture 3,4
Lecture 3,4
 
Lecture 1,2
Lecture 1,2Lecture 1,2
Lecture 1,2
 
Sequence Alignment,Blast, Fasta, MSA
Sequence Alignment,Blast, Fasta, MSASequence Alignment,Blast, Fasta, MSA
Sequence Alignment,Blast, Fasta, MSA
 
Databases Part II
Databases Part IIDatabases Part II
Databases Part II
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Genome sequencingprojects
Genome sequencingprojectsGenome sequencingprojects
Genome sequencingprojects
 
Human encodeproject
Human encodeprojectHuman encodeproject
Human encodeproject
 
Tyler presentation
Tyler presentationTyler presentation
Tyler presentation
 

Último

Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...PsychoTech Services
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 

Último (20)

Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 

Motif andpatterndatabase