SlideShare uma empresa Scribd logo
1 de 37
Baixar para ler offline
Protein-Protein Interactions Prediction
Sergey Knyazev
November 21, 2012
Outline
1) Introduction
2) Protein-Protein interaction
3) Protein-Protein interaction databases
4) Protein-Protein interaction prediction
Introduction
1) Introduction
2) Protein-Protein interaction
3) Protein-Protein interaction databases
4) Protein-Protein interaction prediction
There is Huge Ammount of
Interactions in a Cell
Example: Possible molecular interactions
in a spreading cell.
There is Many Ways Biomolecules
Interacts in a Cell.
Protein-Protein Interaction
1) Introduction
2) Protein-Protein interaction
3) Protein-Protein interaction databases
4) Protein-Protein interaction prediction
Protein-Protein Interaction
●
Physical contacts with molecular docking between
proteins that occur in a cell or in a living organism.
●
Not just a ‘‘functional contact’’: The existence of
many other types of functional links between
biomolecular entities (genes, proteins, metabolites,
etc.) in living organisms should not be confused
with protein physical interactions.
●
‘‘Specific contact’’, not just all proteins that bump
into each other by chance.
●
Should be excluded interactions that a protein
experiences when it is being made, folded, quality
checked, or degraded.
Protein-Protein Interaction (PPI)
detection
PPIs Detection Methods
Protein-Protein Interaction
Databases
1) Introduction
2) Protein-Protein interaction
3) Protein-Protein interaction
databases
4) Protein-Protein interaction prediction
Protein-Protein Interaction
Databases
●
BIND - Biomolecular Interaction Network Database;
●
BioGRID - Biological General Repository for
Interaction Datasets;
●
DIP - Database of Interacting Proteins;
●
IntAct - IntAct Molecular Interaction Database;
●
HPRD - Human Protein Reference Database
●
MINT - Molecular INTeraction database;
●
PIPs - Human PPI Prediction database;
●
STRING - Known and Predicted Protein-Protein
Interactions.
PPI Network Derived from
Databases
PIPs human PPIs database
●
Contains predictions of 37 000 high probability
interactions of which 34 000 are not reported in the
interaction databases HPRD, BIND, DIP or OPHID.
●
Interactions predicted by a naive Bayesian model.
The method combines information from gene co-
expression, orthology, co-occurrence of domains,
post-translational modifications, co-localization of
the proteins within the cell and analysis of the local
topology of the predicted PPI network.
●
Based on a prediction algorithm described bellow...
Protein-Protein interaction
prediction
1) Introduction
2) Protein-Protein interaction
3) Protein-Protein interaction databases
4) Protein-Protein interaction
prediction
Protein-Protein Interaction
Prediction
●
The prediction of human protein-protein
interactions was investigated in a Bayesian
framework by considering combinations of
individual protein features known to be indicative
of interaction.
●
The seven individual features are used.
●
The features are grouped into five distinct
modules: Expression (E), Ortology(O),
Combined(C), Disorder(D), Transitive(T).
Expression Module
●
Data Source:
–
GDS596 from the Gene Expression Omnibus
●
Description:
–
Gene co-expression profiles from 79 physiologically normal
tissues obtained from various sources
●
Scoring function:
–
Pearson correlation of coexpression over all conditions
●
Bins:
–
20 of equal size covering the correlation value
range (-1 to +1)
Orthology Module
●
Data Source:
–
InParanoid, BIND, DIP and GRID databases
●
Description:
–
Interactions of homologous protein pairs from yeast, fly, worm and human
●
Scoring function:
–
Organism-based using InParanoid score
●
13 Bins:
–
High, medium and low confidence bins were defined for human protein pairs that have
interacting orthologs in either yeast, fly or worm (for a total of 9 bins)
–
two bin for human pairs that have interacting paralogs in human (a medium and a low
confidence)
–
one bin for human pairs that have interacting homologs in more than one organism
–
one bin for human pairs that have only noninteracting orthologs
Combined Module
●
This module incorporates three distinct features in a nonnaïve
Bayesian framework: subcellular localization, domain co-
occurrence and post-translational modification co-occurrence.
●
Localization:
–
Data source:
●
PSLT predictions
–
Description:
●
PSLT is a human subcellular localization predictor that considers nine different
compartments (ER, Golgi, cytosol, nucleus, peroxisome, plasma membrane,
lysosome, mitochondria and extracellular)
–
Scoring function:
●
Qualitative score: proximity of compartments
–
4 bins:
●
same, neighboring, different compartments, or not localized
Combined module
●
Domain co-occurrence
–
Data source:
●
InterPro and Pfam
–
Description:
●
Protein domains and motifs
–
Scoring function:
●
The chi-square test was used as a measure of the likelihood of co-
occurrence of specific InterPro domains and motifs in protein pairs
●
Chi-square scores were calculated for all pairs of domains/motifs
that occurred in the training data
–
Bins:
●
5 covering range of Chi-square scores
Combined module
●
PTM co-occurrence
–
Data source:
●
HPRD and UniProt
–
Description:
●
Post-translational modifications
–
Scoring function:
–
Bins:
●
4 covering range of PTM scores
Disorder Module
●
Data source:
–
VLS2 predictions
●
Description:
–
Prediction of protein intrinsic disorder
●
Scoring function:
–
Sum of the percent disorder for each protein in a pair
●
Bins:
–
6 covering range of scoring function (0 to 200%)
Transitive Module
●
Description:
–
Module that considers local
topology of underlying network
predicted using combinations of
above features
●
Scoring function:
●
Bins:
–
5 covering range of scoring
function
Independence of the Modules
●
The final likelihood ratio output by the predictor is only
representative of the true likelihood of interaction of a protein pair if
the modules considered are independent. If the modules were not
independent, some likelihood ratios would likely be overestimated.
●
Previous studies have demonstrated that some of the features
considered here are indeed independent.
●
Independence of all modules used in our predictor was verified by
calculating Pearson correlation coefficients for all pairs of modules.
Architecture of the Predictor and
Likelihoods of the Modules
Posterior Odds Ratio Estimation
●
f1, … , fn — features
●
I — interaction
●
~I — non-interaction
Accuracy of the Predictors
●
In order to analyze the predictions, five-fold cross validation
experiments were performed and the area under partial ROC
(receiver operator characteristic) curves (partial AUCs) measured.
●
T is the total number of positives in the test set
●
Ti is the number of positives that score higher than the ith highest
scoring negative
Prediction Accuracy of Different Combinations of Modules
PPI Prediction by Single Module
PPI Prediction by Combination of
Modules
Receiver Operator Characteristic
(ROC)
Comparison with Other Interaction
Datasets
●
Estimated datasets:
–
Rhodes probabilistic dataset
–
LR400 (derived from our predictors)
–
Lehner orthology-derived dataset
●
The false positive rates:
●
Reference datasets:
–
Literature-mined Ramani dataset
–
Human Protein Reference Database
(HPRD)
Comparison with Other Interaction
Datasets
Independent Validation
Conclusion
●
Predicted over 37000 human protein
interactions
●
Explored a subspace of the human
interactome that has not been
investigated by previous large
interaction datasets.
References
●
Protein–Protein Interactions Essentials: Key Concepts to
Building and Analyzing Interactome Networks 2010
Javier De Las Rivas, Celia Fontanillo
●
PIPs: human protein–protein interaction prediction
database 2008
Mark D. McDowall, Michelle S. Scott and Geoffrey J. Barton
●
Probabilistic prediction and ranking of human protein-
protein interactions 2007
Michelle S Scott and Geoffrey J Barton
Thank you!

Mais conteúdo relacionado

Mais procurados

co immunoprecipitation
co immunoprecipitationco immunoprecipitation
co immunoprecipitationssuser60e34a
 
Cytoscape: Gene coexppression and PPI networks
Cytoscape: Gene coexppression and PPI networksCytoscape: Gene coexppression and PPI networks
Cytoscape: Gene coexppression and PPI networksBITS
 
Proteomics and protein-protein interaction
Proteomics  and protein-protein interactionProteomics  and protein-protein interaction
Proteomics and protein-protein interactionSenthilkumarV25
 
Protein-protein interaction (PPI)
Protein-protein interaction (PPI)Protein-protein interaction (PPI)
Protein-protein interaction (PPI)N Poorin
 
Protein protein interaction basic
Protein protein interaction basicProtein protein interaction basic
Protein protein interaction basicAyesha Aftab
 
Protein interaction Creative Biomart
Protein interaction Creative BiomartProtein interaction Creative Biomart
Protein interaction Creative BiomartCreative BioMart
 
Protein-Protein Interactions (PPIs)
Protein-Protein Interactions (PPIs)Protein-Protein Interactions (PPIs)
Protein-Protein Interactions (PPIs)Sai Ram
 
Protein protein interaction
Protein protein interactionProtein protein interaction
Protein protein interactionAashish Patel
 
The yeast two hybrid system and ChIP
The yeast two hybrid system and ChIPThe yeast two hybrid system and ChIP
The yeast two hybrid system and ChIPAbhishek M
 
yeast two hybrid system
yeast two hybrid systemyeast two hybrid system
yeast two hybrid systemSheetal Mehla
 
Yeast two hybrid system for Protein Protein Interaction Studies
Yeast two hybrid system for Protein Protein Interaction StudiesYeast two hybrid system for Protein Protein Interaction Studies
Yeast two hybrid system for Protein Protein Interaction Studiesajithnandanam
 
Protein protein interactions
Protein protein interactionsProtein protein interactions
Protein protein interactionsTasuduq Yaqoob
 
Biotech 2012 spring-6_protein_interactions_0
Biotech 2012 spring-6_protein_interactions_0Biotech 2012 spring-6_protein_interactions_0
Biotech 2012 spring-6_protein_interactions_0BioinformaticsInstitute
 
Protein-protein interaction
Protein-protein interactionProtein-protein interaction
Protein-protein interactionsigma-tau
 
Gene regulatory networks
Gene regulatory networksGene regulatory networks
Gene regulatory networksMadiheh
 
Yeast two hybrid
Yeast two hybridYeast two hybrid
Yeast two hybridhina ojha
 

Mais procurados (20)

co immunoprecipitation
co immunoprecipitationco immunoprecipitation
co immunoprecipitation
 
Cytoscape: Gene coexppression and PPI networks
Cytoscape: Gene coexppression and PPI networksCytoscape: Gene coexppression and PPI networks
Cytoscape: Gene coexppression and PPI networks
 
Ppi
PpiPpi
Ppi
 
Proteomics and protein-protein interaction
Proteomics  and protein-protein interactionProteomics  and protein-protein interaction
Proteomics and protein-protein interaction
 
Protein-protein interaction (PPI)
Protein-protein interaction (PPI)Protein-protein interaction (PPI)
Protein-protein interaction (PPI)
 
Protein protein interaction basic
Protein protein interaction basicProtein protein interaction basic
Protein protein interaction basic
 
Protein protein interactions
Protein protein interactionsProtein protein interactions
Protein protein interactions
 
Protein interaction Creative Biomart
Protein interaction Creative BiomartProtein interaction Creative Biomart
Protein interaction Creative Biomart
 
Protein-Protein Interactions (PPIs)
Protein-Protein Interactions (PPIs)Protein-Protein Interactions (PPIs)
Protein-Protein Interactions (PPIs)
 
Protein protein interaction
Protein protein interactionProtein protein interaction
Protein protein interaction
 
The yeast two hybrid system and ChIP
The yeast two hybrid system and ChIPThe yeast two hybrid system and ChIP
The yeast two hybrid system and ChIP
 
yeast two hybrid system
yeast two hybrid systemyeast two hybrid system
yeast two hybrid system
 
Yeast two hybrid system for Protein Protein Interaction Studies
Yeast two hybrid system for Protein Protein Interaction StudiesYeast two hybrid system for Protein Protein Interaction Studies
Yeast two hybrid system for Protein Protein Interaction Studies
 
Protein protein interactions
Protein protein interactionsProtein protein interactions
Protein protein interactions
 
Biotech 2012 spring-6_protein_interactions_0
Biotech 2012 spring-6_protein_interactions_0Biotech 2012 spring-6_protein_interactions_0
Biotech 2012 spring-6_protein_interactions_0
 
Protein-protein interaction
Protein-protein interactionProtein-protein interaction
Protein-protein interaction
 
Proteomics
ProteomicsProteomics
Proteomics
 
Gene regulatory networks
Gene regulatory networksGene regulatory networks
Gene regulatory networks
 
Yeast two hybrid
Yeast two hybridYeast two hybrid
Yeast two hybrid
 
Yeast two hybrid
Yeast two hybrid Yeast two hybrid
Yeast two hybrid
 

Destaque

Bioinformatics and functional genomics
Bioinformatics and functional genomicsBioinformatics and functional genomics
Bioinformatics and functional genomicsAisha Kalsoom
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomicsPawan Kumar
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomicsajay301
 

Destaque (7)

Bioinformatics and functional genomics
Bioinformatics and functional genomicsBioinformatics and functional genomics
Bioinformatics and functional genomics
 
Structural genomics
Structural genomicsStructural genomics
Structural genomics
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Genomics
GenomicsGenomics
Genomics
 
Types of genomics ppt
Types of genomics pptTypes of genomics ppt
Types of genomics ppt
 
Genomics
GenomicsGenomics
Genomics
 

Semelhante a Slides 0

Protein protein interaction, functional proteomics
Protein protein interaction, functional proteomicsProtein protein interaction, functional proteomics
Protein protein interaction, functional proteomicsKAUSHAL SAHU
 
A Systems Biology Approach to Natural Products Research
A Systems Biology Approach to Natural Products ResearchA Systems Biology Approach to Natural Products Research
A Systems Biology Approach to Natural Products ResearchHuda Nazeer
 
Yeast Two Hybrid System
Yeast Two Hybrid SystemYeast Two Hybrid System
Yeast Two Hybrid SystemSuby Mon Benny
 
Role of genomics and proteomics
Role of genomics and proteomicsRole of genomics and proteomics
Role of genomics and proteomicsPavana K A
 
University of Texas at Austin
University of Texas at AustinUniversity of Texas at Austin
University of Texas at Austinbutest
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicshemantbreeder
 
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICSSTRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICSSHEETHUMOLKS
 
Proteomics resources at the EBI & ExPASy
Proteomics resources at the EBI & ExPASyProteomics resources at the EBI & ExPASy
Proteomics resources at the EBI & ExPASyChrist College, Rajkot
 
Systems Biology Approaches to Cancer
Systems Biology Approaches to CancerSystems Biology Approaches to Cancer
Systems Biology Approaches to CancerRaunak Shrestha
 
Systems biology in polypharmacology: explaining and predicting drug secondary...
Systems biology in polypharmacology: explaining and predicting drug secondary...Systems biology in polypharmacology: explaining and predicting drug secondary...
Systems biology in polypharmacology: explaining and predicting drug secondary...Andrei KUCHARAVY
 
Proteomics - Analysis and integration of large-scale data sets
Proteomics - Analysis and integration of large-scale data setsProteomics - Analysis and integration of large-scale data sets
Proteomics - Analysis and integration of large-scale data setsLars Juhl Jensen
 
STRING - Modeling of pathways through cross-species integration of large-scal...
STRING - Modeling of pathways through cross-species integration of large-scal...STRING - Modeling of pathways through cross-species integration of large-scal...
STRING - Modeling of pathways through cross-species integration of large-scal...Lars Juhl Jensen
 
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...KarthigaRavichandran3
 
Proteomics in VSC for crop improvement programme
Proteomics in VSC for crop improvement programmeProteomics in VSC for crop improvement programme
Proteomics in VSC for crop improvement programmeSumanthBT1
 
Functional proteomics, and tools
Functional proteomics, and toolsFunctional proteomics, and tools
Functional proteomics, and toolsKAUSHAL SAHU
 

Semelhante a Slides 0 (20)

Protein protein interaction, functional proteomics
Protein protein interaction, functional proteomicsProtein protein interaction, functional proteomics
Protein protein interaction, functional proteomics
 
Systems biology
Systems biologySystems biology
Systems biology
 
A Systems Biology Approach to Natural Products Research
A Systems Biology Approach to Natural Products ResearchA Systems Biology Approach to Natural Products Research
A Systems Biology Approach to Natural Products Research
 
Applied Bioinformatics Assignment 5docx
Applied Bioinformatics Assignment  5docxApplied Bioinformatics Assignment  5docx
Applied Bioinformatics Assignment 5docx
 
Yeast Two Hybrid System
Yeast Two Hybrid SystemYeast Two Hybrid System
Yeast Two Hybrid System
 
Role of genomics and proteomics
Role of genomics and proteomicsRole of genomics and proteomics
Role of genomics and proteomics
 
University of Texas at Austin
University of Texas at AustinUniversity of Texas at Austin
University of Texas at Austin
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICSSTRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
 
presentation
presentationpresentation
presentation
 
Proteomics resources at the EBI & ExPASy
Proteomics resources at the EBI & ExPASyProteomics resources at the EBI & ExPASy
Proteomics resources at the EBI & ExPASy
 
Systems Biology Approaches to Cancer
Systems Biology Approaches to CancerSystems Biology Approaches to Cancer
Systems Biology Approaches to Cancer
 
Systems biology in polypharmacology: explaining and predicting drug secondary...
Systems biology in polypharmacology: explaining and predicting drug secondary...Systems biology in polypharmacology: explaining and predicting drug secondary...
Systems biology in polypharmacology: explaining and predicting drug secondary...
 
Proteomics
ProteomicsProteomics
Proteomics
 
Proteomics - Analysis and integration of large-scale data sets
Proteomics - Analysis and integration of large-scale data setsProteomics - Analysis and integration of large-scale data sets
Proteomics - Analysis and integration of large-scale data sets
 
STRING - Modeling of pathways through cross-species integration of large-scal...
STRING - Modeling of pathways through cross-species integration of large-scal...STRING - Modeling of pathways through cross-species integration of large-scal...
STRING - Modeling of pathways through cross-species integration of large-scal...
 
proteomics
 proteomics proteomics
proteomics
 
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
 
Proteomics in VSC for crop improvement programme
Proteomics in VSC for crop improvement programmeProteomics in VSC for crop improvement programme
Proteomics in VSC for crop improvement programme
 
Functional proteomics, and tools
Functional proteomics, and toolsFunctional proteomics, and tools
Functional proteomics, and tools
 

Mais de BioinformaticsInstitute

Comparative Genomics and de Bruijn graphs
Comparative Genomics and de Bruijn graphsComparative Genomics and de Bruijn graphs
Comparative Genomics and de Bruijn graphsBioinformaticsInstitute
 
Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес...
 Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес... Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес...
Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес...BioinformaticsInstitute
 
Вперед в прошлое. Методы генетической диагностики древней днк
Вперед в прошлое. Методы генетической диагностики древней днкВперед в прошлое. Методы генетической диагностики древней днк
Вперед в прошлое. Методы генетической диагностики древней днкBioinformaticsInstitute
 
"Зачем биологам суперкомпьютеры", Александр Предеус
"Зачем биологам суперкомпьютеры", Александр Предеус"Зачем биологам суперкомпьютеры", Александр Предеус
"Зачем биологам суперкомпьютеры", Александр ПредеусBioinformaticsInstitute
 
Иммунотерапия раковых опухолей: взгляд со стороны системной биологии. Максим ...
Иммунотерапия раковых опухолей: взгляд со стороны системной биологии. Максим ...Иммунотерапия раковых опухолей: взгляд со стороны системной биологии. Максим ...
Иммунотерапия раковых опухолей: взгляд со стороны системной биологии. Максим ...BioinformaticsInstitute
 
Рак 101 (Мария Шутова, ИоГЕН РАН)
Рак 101 (Мария Шутова, ИоГЕН РАН)Рак 101 (Мария Шутова, ИоГЕН РАН)
Рак 101 (Мария Шутова, ИоГЕН РАН)BioinformaticsInstitute
 
Секвенирование как инструмент исследования сложных фенотипов человека: от ген...
Секвенирование как инструмент исследования сложных фенотипов человека: от ген...Секвенирование как инструмент исследования сложных фенотипов человека: от ген...
Секвенирование как инструмент исследования сложных фенотипов человека: от ген...BioinformaticsInstitute
 
Инвестиции в биоинформатику и биотех (Андрей Афанасьев)
Инвестиции в биоинформатику и биотех (Андрей Афанасьев)Инвестиции в биоинформатику и биотех (Андрей Афанасьев)
Инвестиции в биоинформатику и биотех (Андрей Афанасьев)BioinformaticsInstitute
 

Mais de BioinformaticsInstitute (20)

Graph genome
Graph genome Graph genome
Graph genome
 
Nanopores sequencing
Nanopores sequencingNanopores sequencing
Nanopores sequencing
 
A superglue for string comparison
A superglue for string comparisonA superglue for string comparison
A superglue for string comparison
 
Comparative Genomics and de Bruijn graphs
Comparative Genomics and de Bruijn graphsComparative Genomics and de Bruijn graphs
Comparative Genomics and de Bruijn graphs
 
Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес...
 Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес... Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес...
Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес...
 
Вперед в прошлое. Методы генетической диагностики древней днк
Вперед в прошлое. Методы генетической диагностики древней днкВперед в прошлое. Методы генетической диагностики древней днк
Вперед в прошлое. Методы генетической диагностики древней днк
 
Knime & bioinformatics
Knime & bioinformaticsKnime & bioinformatics
Knime & bioinformatics
 
"Зачем биологам суперкомпьютеры", Александр Предеус
"Зачем биологам суперкомпьютеры", Александр Предеус"Зачем биологам суперкомпьютеры", Александр Предеус
"Зачем биологам суперкомпьютеры", Александр Предеус
 
Иммунотерапия раковых опухолей: взгляд со стороны системной биологии. Максим ...
Иммунотерапия раковых опухолей: взгляд со стороны системной биологии. Максим ...Иммунотерапия раковых опухолей: взгляд со стороны системной биологии. Максим ...
Иммунотерапия раковых опухолей: взгляд со стороны системной биологии. Максим ...
 
Рак 101 (Мария Шутова, ИоГЕН РАН)
Рак 101 (Мария Шутова, ИоГЕН РАН)Рак 101 (Мария Шутова, ИоГЕН РАН)
Рак 101 (Мария Шутова, ИоГЕН РАН)
 
Плюрипотентность 101
Плюрипотентность 101Плюрипотентность 101
Плюрипотентность 101
 
Секвенирование как инструмент исследования сложных фенотипов человека: от ген...
Секвенирование как инструмент исследования сложных фенотипов человека: от ген...Секвенирование как инструмент исследования сложных фенотипов человека: от ген...
Секвенирование как инструмент исследования сложных фенотипов человека: от ген...
 
Инвестиции в биоинформатику и биотех (Андрей Афанасьев)
Инвестиции в биоинформатику и биотех (Андрей Афанасьев)Инвестиции в биоинформатику и биотех (Андрей Афанасьев)
Инвестиции в биоинформатику и биотех (Андрей Афанасьев)
 
Biodb 2011-everything
Biodb 2011-everythingBiodb 2011-everything
Biodb 2011-everything
 
Biodb 2011-05
Biodb 2011-05Biodb 2011-05
Biodb 2011-05
 
Biodb 2011-04
Biodb 2011-04Biodb 2011-04
Biodb 2011-04
 
Biodb 2011-03
Biodb 2011-03Biodb 2011-03
Biodb 2011-03
 
Biodb 2011-01
Biodb 2011-01Biodb 2011-01
Biodb 2011-01
 
Biodb 2011-02
Biodb 2011-02Biodb 2011-02
Biodb 2011-02
 
Ngs 3 1
Ngs 3 1Ngs 3 1
Ngs 3 1
 

Último

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 

Último (20)

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 

Slides 0

  • 2. Outline 1) Introduction 2) Protein-Protein interaction 3) Protein-Protein interaction databases 4) Protein-Protein interaction prediction
  • 3. Introduction 1) Introduction 2) Protein-Protein interaction 3) Protein-Protein interaction databases 4) Protein-Protein interaction prediction
  • 4. There is Huge Ammount of Interactions in a Cell Example: Possible molecular interactions in a spreading cell.
  • 5. There is Many Ways Biomolecules Interacts in a Cell.
  • 6. Protein-Protein Interaction 1) Introduction 2) Protein-Protein interaction 3) Protein-Protein interaction databases 4) Protein-Protein interaction prediction
  • 7. Protein-Protein Interaction ● Physical contacts with molecular docking between proteins that occur in a cell or in a living organism. ● Not just a ‘‘functional contact’’: The existence of many other types of functional links between biomolecular entities (genes, proteins, metabolites, etc.) in living organisms should not be confused with protein physical interactions. ● ‘‘Specific contact’’, not just all proteins that bump into each other by chance. ● Should be excluded interactions that a protein experiences when it is being made, folded, quality checked, or degraded.
  • 10. Protein-Protein Interaction Databases 1) Introduction 2) Protein-Protein interaction 3) Protein-Protein interaction databases 4) Protein-Protein interaction prediction
  • 11. Protein-Protein Interaction Databases ● BIND - Biomolecular Interaction Network Database; ● BioGRID - Biological General Repository for Interaction Datasets; ● DIP - Database of Interacting Proteins; ● IntAct - IntAct Molecular Interaction Database; ● HPRD - Human Protein Reference Database ● MINT - Molecular INTeraction database; ● PIPs - Human PPI Prediction database; ● STRING - Known and Predicted Protein-Protein Interactions.
  • 12.
  • 13. PPI Network Derived from Databases
  • 14. PIPs human PPIs database ● Contains predictions of 37 000 high probability interactions of which 34 000 are not reported in the interaction databases HPRD, BIND, DIP or OPHID. ● Interactions predicted by a naive Bayesian model. The method combines information from gene co- expression, orthology, co-occurrence of domains, post-translational modifications, co-localization of the proteins within the cell and analysis of the local topology of the predicted PPI network. ● Based on a prediction algorithm described bellow...
  • 15. Protein-Protein interaction prediction 1) Introduction 2) Protein-Protein interaction 3) Protein-Protein interaction databases 4) Protein-Protein interaction prediction
  • 16. Protein-Protein Interaction Prediction ● The prediction of human protein-protein interactions was investigated in a Bayesian framework by considering combinations of individual protein features known to be indicative of interaction. ● The seven individual features are used. ● The features are grouped into five distinct modules: Expression (E), Ortology(O), Combined(C), Disorder(D), Transitive(T).
  • 17. Expression Module ● Data Source: – GDS596 from the Gene Expression Omnibus ● Description: – Gene co-expression profiles from 79 physiologically normal tissues obtained from various sources ● Scoring function: – Pearson correlation of coexpression over all conditions ● Bins: – 20 of equal size covering the correlation value range (-1 to +1)
  • 18. Orthology Module ● Data Source: – InParanoid, BIND, DIP and GRID databases ● Description: – Interactions of homologous protein pairs from yeast, fly, worm and human ● Scoring function: – Organism-based using InParanoid score ● 13 Bins: – High, medium and low confidence bins were defined for human protein pairs that have interacting orthologs in either yeast, fly or worm (for a total of 9 bins) – two bin for human pairs that have interacting paralogs in human (a medium and a low confidence) – one bin for human pairs that have interacting homologs in more than one organism – one bin for human pairs that have only noninteracting orthologs
  • 19. Combined Module ● This module incorporates three distinct features in a nonnaïve Bayesian framework: subcellular localization, domain co- occurrence and post-translational modification co-occurrence. ● Localization: – Data source: ● PSLT predictions – Description: ● PSLT is a human subcellular localization predictor that considers nine different compartments (ER, Golgi, cytosol, nucleus, peroxisome, plasma membrane, lysosome, mitochondria and extracellular) – Scoring function: ● Qualitative score: proximity of compartments – 4 bins: ● same, neighboring, different compartments, or not localized
  • 20. Combined module ● Domain co-occurrence – Data source: ● InterPro and Pfam – Description: ● Protein domains and motifs – Scoring function: ● The chi-square test was used as a measure of the likelihood of co- occurrence of specific InterPro domains and motifs in protein pairs ● Chi-square scores were calculated for all pairs of domains/motifs that occurred in the training data – Bins: ● 5 covering range of Chi-square scores
  • 21. Combined module ● PTM co-occurrence – Data source: ● HPRD and UniProt – Description: ● Post-translational modifications – Scoring function: – Bins: ● 4 covering range of PTM scores
  • 22. Disorder Module ● Data source: – VLS2 predictions ● Description: – Prediction of protein intrinsic disorder ● Scoring function: – Sum of the percent disorder for each protein in a pair ● Bins: – 6 covering range of scoring function (0 to 200%)
  • 23. Transitive Module ● Description: – Module that considers local topology of underlying network predicted using combinations of above features ● Scoring function: ● Bins: – 5 covering range of scoring function
  • 24. Independence of the Modules ● The final likelihood ratio output by the predictor is only representative of the true likelihood of interaction of a protein pair if the modules considered are independent. If the modules were not independent, some likelihood ratios would likely be overestimated. ● Previous studies have demonstrated that some of the features considered here are indeed independent. ● Independence of all modules used in our predictor was verified by calculating Pearson correlation coefficients for all pairs of modules.
  • 25. Architecture of the Predictor and Likelihoods of the Modules
  • 26. Posterior Odds Ratio Estimation ● f1, … , fn — features ● I — interaction ● ~I — non-interaction
  • 27. Accuracy of the Predictors ● In order to analyze the predictions, five-fold cross validation experiments were performed and the area under partial ROC (receiver operator characteristic) curves (partial AUCs) measured. ● T is the total number of positives in the test set ● Ti is the number of positives that score higher than the ith highest scoring negative
  • 28. Prediction Accuracy of Different Combinations of Modules
  • 29. PPI Prediction by Single Module
  • 30. PPI Prediction by Combination of Modules
  • 32. Comparison with Other Interaction Datasets ● Estimated datasets: – Rhodes probabilistic dataset – LR400 (derived from our predictors) – Lehner orthology-derived dataset ● The false positive rates: ● Reference datasets: – Literature-mined Ramani dataset – Human Protein Reference Database (HPRD)
  • 33. Comparison with Other Interaction Datasets
  • 35. Conclusion ● Predicted over 37000 human protein interactions ● Explored a subspace of the human interactome that has not been investigated by previous large interaction datasets.
  • 36. References ● Protein–Protein Interactions Essentials: Key Concepts to Building and Analyzing Interactome Networks 2010 Javier De Las Rivas, Celia Fontanillo ● PIPs: human protein–protein interaction prediction database 2008 Mark D. McDowall, Michelle S. Scott and Geoffrey J. Barton ● Probabilistic prediction and ranking of human protein- protein interactions 2007 Michelle S Scott and Geoffrey J Barton