Interaction networks
Prediction, data integration and text mining
Lars Juhl Jensen
the cell cycle
essential process
grow and divide
one cell
two cells
four phases
G1 phase
growth
S phase
DNA replication
G2 phase
growth
M phase
cell division
regulation
gene expression
phosphorylation
targeted degradation
protein interactions
exercise 1
http://string-db.org
Szklarczyk, Franceschini et al., Nucleic Acids Research, 2011
association networks
guilt by association
STRING
>1100 genomes
genomic context
gene fusion
Korbel et al., Nature Biotechnology, 2004
conserved neighborhood
Korbel et al., Nature Biotechnology, 2004
phylogenetic profiles
Korbel et al., Nature Biotechnology, 2004
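A minimal sketch of the phylogenetic-profile idea: genes that are present and absent in the same genomes are candidates for functional association. The gene names, the five-genome profiles, and the choice of Jaccard similarity are assumptions made for illustration, not the scoring scheme STRING uses.

```python
# Toy presence/absence profiles across five hypothetical genomes (1 = gene present).
profiles = {
    "geneA": (1, 0, 1, 1, 0),
    "geneB": (1, 0, 1, 1, 0),
    "geneC": (0, 1, 0, 1, 1),
}

def jaccard(p, q):
    """Jaccard similarity of two presence/absence profiles."""
    both = sum(1 for a, b in zip(p, q) if a and b)
    either = sum(1 for a, b in zip(p, q) if a or b)
    return both / either if either else 0.0

def rank_pairs(profiles):
    """Return (similarity, gene, gene) tuples, most similar pair first."""
    names = sorted(profiles)
    pairs = [
        (jaccard(profiles[a], profiles[b]), a, b)
        for i, a in enumerate(names)
        for b in names[i + 1:]
    ]
    return sorted(pairs, reverse=True)

best = rank_pairs(profiles)[0]  # geneA and geneB share an identical profile
```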
protein interactions
Jensen & Bork, Science, 2008
genetic interactions
Beyer et al., Nature Reviews Genetics, 2007
gene coexpression
curated knowledge
Letunic & Bork, Trends in Biochemical Sciences, 2008
>10 km
text mining
Pafilis, O’Donoghue, Jensen et al., Nature Biotechnology, 2009
co-mentioning
NLP
Natural Language Processing
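Co-mentioning, the simpler of the two text-mining approaches above, can be sketched as counting how often two names from a dictionary appear in the same abstract. The three abstracts, the name dictionary, and the naive substring matching are all hypothetical; a real tagger handles synonyms, word boundaries, and ambiguity far more carefully.

```python
from collections import Counter
from itertools import combinations

# Hypothetical mini-corpus and name dictionary, for illustration only.
abstracts = [
    "CDK1 binds cyclin B to drive entry into M phase.",
    "Cyclin B levels rise before CDK1 is activated.",
    "TYMS is required for DNA replication.",
]
dictionary = {"cdk1": "CDK1", "cyclin b": "CCNB1", "tyms": "TYMS"}

def comention_counts(texts, names):
    """Count abstracts in which each pair of dictionary entities co-occurs."""
    counts = Counter()
    for text in texts:
        lowered = text.lower()
        # Naive substring match; each entity is counted once per abstract.
        found = sorted({ident for name, ident in names.items() if name in lowered})
        counts.update(combinations(found, 2))
    return counts

counts = comention_counts(abstracts, dictionary)
```

In this toy corpus only the CDK1 and cyclin B pair is co-mentioned, in two abstracts; raw counts like these would still need to be turned into confidence scores.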
different sources
Ensembl
RefSeq
BIND
Biomolecular Interaction Network Database
BioGRID
General Repository for Interaction Datasets
DIP
Database of Interacting Proteins
IntAct
MINT
Molecular Interactions Database
HPRD
Human Protein Reference Database
PDB
Protein Data Bank
GEO
Gene Expression Omnibus
MIPS
Munich Information Center for Protein Sequences
Gene Ontology
BioCyc
KEGG
Kyoto Encyclopedia of Genes and Genomes
PID
NCI-Nature Pathway Interaction Database
Reactome
different formats
different names
CDC2
CDK1
P06493
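CDC2, CDK1 and P06493 all refer to the same protein, so edges from different databases only line up after the names are mapped to one identifier. A minimal sketch, assuming a hand-written alias table (hypothetical; in practice it would be built from database cross-references):

```python
# Hypothetical alias table mapping symbols and accessions to one canonical name.
ALIASES = {
    "CDC2": "CDK1",
    "CDK1": "CDK1",
    "P06493": "CDK1",
}

def normalize(name, aliases=ALIASES):
    """Map a gene symbol or accession to one canonical identifier."""
    return aliases.get(name.strip().upper(), name)

def merge_edges(edges):
    """Collapse edges from different sources onto canonical, order-independent pairs."""
    merged = set()
    for a, b in edges:
        merged.add(tuple(sorted((normalize(a), normalize(b)))))
    return merged

# Three sources describing the same interaction under different names.
edges = [("CDC2", "CCNB1"), ("CCNB1", "P06493"), ("cdk1", "CCNB1")]
merged = merge_edges(edges)
```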
not comparable
variable quality
confidence scores
calibrate to gold standard
von Mering et al., Nucleic Acids Research, 2005
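One way to picture calibration against a gold standard: rank the pairs by raw score, cut the ranking into bins, and measure in each bin the fraction of pairs that the gold standard confirms. The sketch below assumes toy pairs and a toy gold-standard set; the bin size and the data are illustrative and this is not the published STRING procedure.

```python
def calibrate(scored_pairs, gold, bin_size):
    """Return (mean raw score, fraction of pairs in the gold standard) per bin."""
    ranked = sorted(scored_pairs, key=lambda item: item[1], reverse=True)
    curve = []
    for start in range(0, len(ranked), bin_size):
        chunk = ranked[start:start + bin_size]
        mean_raw = sum(score for _, score in chunk) / len(chunk)
        precision = sum(1 for pair, _ in chunk if pair in gold) / len(chunk)
        curve.append((mean_raw, precision))
    return curve

# Toy raw scores and a toy gold standard (hypothetical values).
scored_pairs = [(("A", "B"), 0.9), (("A", "C"), 0.8), (("B", "C"), 0.3), (("C", "D"), 0.1)]
gold = {("A", "B"), ("A", "C"), ("C", "D")}
curve = calibrate(scored_pairs, gold, bin_size=2)
```

Fitting a smooth curve through these points would then give a function that converts any raw score into a probability-like confidence score, which makes scores from different evidence types comparable.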
transfer by orthology
von Mering et al., Nucleic Acids Research, 2005
combine scores
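Once every evidence channel yields a probability-like score, the channels can be combined. A common choice, sketched here under the assumption that the channels are independent, is to multiply the probabilities of being wrong; the production pipeline may apply further corrections that are not shown.

```python
from math import prod

def combine(scores):
    """Combine independent confidence scores: 1 minus the product of (1 - s)."""
    return 1.0 - prod(1.0 - s for s in scores)

# Two moderately confident channels reinforce each other.
combined = combine([0.5, 0.5])  # 0.75
```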
exercise 2
changing parameters
high confidence only
experiments only
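The two parameter changes above amount to filtering edges by score and by evidence channel. A minimal sketch, assuming hypothetical per-channel scores, a cutoff of 0.7 for "high confidence", and the independence-based combination of channels:

```python
from math import prod

# Hypothetical edges: (protein A, protein B, {evidence channel: confidence score}).
edges = [
    ("CDK1", "CCNB1", {"experiments": 0.95, "textmining": 0.90}),
    ("CDK1", "TYMS", {"textmining": 0.45}),
    ("TYMS", "DHFR", {"experiments": 0.60, "textmining": 0.80}),
]

def filter_edges(edges, min_score=0.7, channels=None):
    """Keep edges whose score, combined over the selected channels, passes the cutoff."""
    kept = []
    for a, b, evidence in edges:
        used = [s for ch, s in evidence.items() if channels is None or ch in channels]
        if used and 1.0 - prod(1.0 - s for s in used) >= min_score:
            kept.append((a, b))
    return kept

high_confidence = filter_edges(edges)
experiments_only = filter_edges(edges, channels={"experiments"})
```

With these toy numbers, restricting to experiments drops the TYMS and DHFR edge, because its experimental score alone falls below the cutoff.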
evidence viewers
cell cycle analysis
gene expression
cell cultures
synchronization
microarrays
time courses
Gauthier et al., Nucleic Acids Research, 2007
cycling genes
time of peak expression
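For a cycling gene, the time of peak expression can be estimated from a synchronized time course. The sketch below uses the phase of the first Fourier harmonic and reports the peak as a percentage of the cycle; both choices, and the synthetic data, are assumptions for illustration rather than the method used in the cited work.

```python
import cmath
import math

def peak_time(times, values, period):
    """Estimate the peak, as a percentage of the cycle, from the first Fourier harmonic."""
    mean = sum(values) / len(values)
    component = sum(
        (v - mean) * cmath.exp(-2j * math.pi * t / period)
        for t, v in zip(times, values)
    )
    phase = -cmath.phase(component)  # radians; where the fitted cosine peaks
    return (phase / (2 * math.pi) * 100.0) % 100.0

# Synthetic profile sampled over one cycle of length 12, peaking at t = 3.
times = list(range(12))
values = [math.cos(2 * math.pi * (t - 3) / 12) for t in times]
peak = peak_time(times, values, period=12)
```

For this synthetic profile the estimate is 25% of the cycle, matching the peak at t = 3 out of 12.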
protein interactions
temporal network
de Lichtenberg, Jensen et al., Science, 2005
just-in-time assembly
de Lichtenberg, Jensen et al., Cell Cycle, 2007
evolutionary flexibility
orthologs and paralogs
protein complexes
exercise 3
http://string-db.org
network expansion
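Network expansion can be thought of as adding the best-supported neighbors of the current proteins. The ranking rule below (sum of edge scores to the seed set) and the edge scores are hypothetical, chosen only to make the idea concrete; they are not the rule the STRING web interface applies.

```python
def expand(seeds, scored_edges, n_new):
    """Add the n_new non-seed proteins with the highest total score to the seeds."""
    totals = {}
    for a, b, score in scored_edges:
        for inside, outside in ((a, b), (b, a)):
            if inside in seeds and outside not in seeds:
                totals[outside] = totals.get(outside, 0.0) + score
    # Highest total first; ties broken alphabetically for reproducibility.
    ranked = sorted(totals, key=lambda p: (-totals[p], p))
    return set(seeds) | set(ranked[:n_new])

# Hypothetical scored edges around CDK1.
scored_edges = [
    ("CDK1", "CCNB1", 0.99),
    ("CDK1", "CKS1B", 0.90),
    ("CCNB1", "CDC20", 0.80),
    ("CDK1", "TYMS", 0.40),
]
expanded = expand({"CDK1"}, scored_edges, n_new=2)
```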
what is known
external data
save network
open in Cytoscape
layout
clustering
project data onto network
de Lichtenberg, Jensen et al., Science, 2005
very flexible
lose the STRING interface
payload mechanism
show external data
nodes
edges
hosted on your server
exercise 4
http://cyclebase-string.jensenlab.org
network expansion
CDK–cyclin complexes
chemical networks
STITCH
STRING + chemicals
PubChem compounds
>74,000 small molecules
experimental data
BindingDB
ChEMBL
PDSP Ki
Psychoactive Drug Screening Program
PDB
Protein Data Bank
drug targets
CTD
Comparative Toxicogenomics Database
DrugBank
GLIDA
GPCR-Ligand Database
Matador
TTD
Therapeutic Target Database
metabolic pathways
BioCyc
KEGG
Kyoto Encyclopedia of Genes and Genomes
Reactome
text mining
co-mentioning
NLP
Natural Language Processing
same issues as for proteins
only worse
exercise 5
http://stitch-db.org
chemical network for TYMS
Kuhn et al., Nucleic Acids Research, 2012
network expansion
interpretation
disease networks
human proteins
>8,000 disease terms
text mining
co-mentioning
exercise 6
http://diseases.jensenlab.org
TYMS disease associations
inspect the evidence
colorectal cancer network
conclusions
know your question
know what is possible
know the tools
shameless self-promotion
CONFIRMED SPEAKERS:
Ivan Dikic
Steve Jackson
Jiri Lukas
Andre Nussenzweig
Philippe Bastiens
Tony Pawson
Forest White
Eric Verdin
Tim Hunt
Brenda Schulman
Michael Yaffe
Matthias Mann
Gerand Hart
Søren Brunak
Henrik Semb
Juleen Zierath
CHAIRS:
Jesper Velgaard Olsen
Chuna Choudhary
Niels Mailand
Lars Juhl Jensen
REGISTRATION FEE, ACCOMMODATION AND LOCAL COSTS FOR ALL ATTENDEES ARE COVERED BY THE NOVO NORDISK FOUNDATION.
APPLICATION DEADLINE SEPTEMBER 14,
larsjuhljensen
thank you!