SlideShare a Scribd company logo
1 of 128
Network biology
A basis for large-scale biomedical data mining
Lars Juhl Jensen
sequence analysis
Jensen, Gupta et al., Journal of Molecular Biology, 2002
data mining
de Lichtenberg, Jensen et al., Science, 2005
data mining
text mining
Pafilis, O’Donoghue, Jensen et al., Nature Biotechnology, 2009
signaling networks
phosphoproteomics
in vivo phosphosites
kinases are unknown
sequence motifs
Miller, Jensen et al., Science Signaling, 2008
NetPhorest
data organization
Miller, Jensen et al., Science Signaling, 2008
automated pipeline
Miller, Jensen et al., Science Signaling, 2008
compilation of datasets
training and evaluation
motif atlas
179 kinases
89 SH2 domains
8 PTB domains
BRCT domains
WW domains
14-3-3 proteins
phosphatases
sequence specificity
in vitro
network context
Linding, Jensen, Ostheimer et al., Cell, 2007
STRING
Jensen, Kuhn et al., Nucleic Acids Research, 2009
630 genomes
2.5 million proteins
genomic context
gene fusion
Korbel et al., Nature Biotechnology, 2004
phylogenetic profiles
Korbel et al., Nature Biotechnology, 2004
primary experimental data
physical interactions
Jensen & Bork, Science, 2008
gene coexpression
curated knowledge
Letunic & Bork, Trends in Biochemical Sciences, 2008
literature mining
not comparable
confidence scores
von Mering et al., Nucleic Acids Research, 2005
cross-species integration
Linding, Jensen, Ostheimer et al., Cell, 2007
putting it all together
NetworKIN
Linding, Jensen, Ostheimer et al., Cell, 2007
>2x better accuracy
use case
DNA damage response
Linding, Jensen, Ostheimer et al., Cell, 2007
experimental validation
ATM phosphorylates Rad50
Linding, Jensen, Ostheimer et al., Cell, 2007
drug repositioning
new uses for old drugs
drug–drug network
shared target(s)
chemical similarity
Tanimoto coefficients
Campillos & Kuhn et al., Science, 2008
Campillos & Kuhn et al., Science, 2008
similar drugs share targets
only trivial predictions
phenotypic similarity
chemical perturbations
phenotypic readouts
drug treatment
side effects
no database
package inserts
Campillos & Kuhn et al., Science, 2008
text mining
side-effect ontology
backtracking
Campillos & Kuhn et al., Science, 2008
side-effect correlations
Campillos & Kuhn et al., Science, 2008
GSC weighting
side-effect frequencies
Campillos & Kuhn et al., Science, 2008
raw similarity score
Campillos & Kuhn et al., Science, 2008
p-values
Campillos & Kuhn et al., Science, 2008
side-effect similarity
chemical similarity
Campillos & Kuhn et al., Science, 2008
confidence scores
drug–drug network
Campillos & Kuhn et al., Science, 2008
categorization
Campillos & Kuhn et al., Science, 2008
experimental validation
20 drug–drug pairs
in vitro binding assays
Ki<10 µM for 11 of 20
cell assays
9 of 9 showed activity
work in progress
link side-effects to targets
direct target prediction
STITCH
Kuhn et al., Nucleic Acids Research, 2010
thank you!
Acknowledgments
NetPhorest.info
– Rune Linding
– Martin Lee Miller
– Francesca Diella
– Claus Jørgensen
– Michele Tinti
– Lei Li
– Marilyn Hsiung
– Sirlester A. Parker
– Jennifer Bordeaux
– Thomas Sicheritz-Pontén
– Marina Olhovsky
– Adrian Pasculescu
– Jes Alexander
– Stefan Knapp
– Nikolaj Blom
– Peer Bork
– Shawn Li
– Gianni Cesareni
– Tony Pawson
– Benjamin E. Turk
– Michael B. Yaffe
– Søren Brunak
STRING-DB.org
– Christian von Mering
– Damian Szklarczyk
– Michael Kuhn
– Manuel Stark
– Samuel Chaffron
– Chris Creevey
– Jean Muller
– Tobias Doerks
– Philippe Julien
– Alexander Roth
– Milan Simonovic
– Jan Korbel
– Berend Snel
– Martijn Huynen
– Peer Bork
Side effect
– Monica Campillos
– Michael Kuhn
– Christian von Mering
– Anne-Claude Gavin
– Peer Bork
NetworKIN.info
– Rune Linding
– Gerard Ostheimer
– Heiko Horn
– Martin Lee Miller
– Francesca Diella
– Karen Colwill
– Jing Jin
– Pavel Metalnikov
– Vivian Nguyen
– Adrian Pasculescu
– Jin Gyoon Park
– Leona D. Samson
– Rob Russell
– Peer Bork
– Michael Yaffe
– Tony Pawson
larsjuhljensen

More Related Content

What's hot

Evolutionary plasticity of cell-cycle regulation
Evolutionary plasticity of cell-cycle regulationEvolutionary plasticity of cell-cycle regulation
Evolutionary plasticity of cell-cycle regulation
Lars Juhl Jensen
 
Network integration of data and text
Network integration of data and textNetwork integration of data and text
Network integration of data and text
Lars Juhl Jensen
 
Large-scale integration of data and text
Large-scale integration of data and textLarge-scale integration of data and text
Large-scale integration of data and text
Lars Juhl Jensen
 
Systems biology: Large-scale biomedical data mining
Systems biology: Large-scale biomedical data miningSystems biology: Large-scale biomedical data mining
Systems biology: Large-scale biomedical data mining
Lars Juhl Jensen
 

What's hot (19)

Data Integration and Systems Biology
Data Integration and Systems BiologyData Integration and Systems Biology
Data Integration and Systems Biology
 
Bms 2010
Bms 2010Bms 2010
Bms 2010
 
Unraveling signal transduction networks through data integration
Unraveling signal transduction networks through data integrationUnraveling signal transduction networks through data integration
Unraveling signal transduction networks through data integration
 
Evolutionary plasticity of cell-cycle regulation
Evolutionary plasticity of cell-cycle regulationEvolutionary plasticity of cell-cycle regulation
Evolutionary plasticity of cell-cycle regulation
 
Cellular Network Biology
Cellular Network BiologyCellular Network Biology
Cellular Network Biology
 
Network integration of data and text
Network integration of data and textNetwork integration of data and text
Network integration of data and text
 
Just-in-time assembly: Transcriptional and post-translational cell-cycle regu...
Just-in-time assembly: Transcriptional and post-translational cell-cycle regu...Just-in-time assembly: Transcriptional and post-translational cell-cycle regu...
Just-in-time assembly: Transcriptional and post-translational cell-cycle regu...
 
Unraveling signaling networks by large-scale data integration
Unraveling signaling networks by large-scale data integrationUnraveling signaling networks by large-scale data integration
Unraveling signaling networks by large-scale data integration
 
STRING&nbsp;- Modeling of biological systems through cross-species data integ...
STRING&nbsp;- Modeling of biological systems through cross-species data integ...STRING&nbsp;- Modeling of biological systems through cross-species data integ...
STRING&nbsp;- Modeling of biological systems through cross-species data integ...
 
From phosphoproteomics to signaling networks
From phosphoproteomics to signaling networksFrom phosphoproteomics to signaling networks
From phosphoproteomics to signaling networks
 
Cellular network biology: Proteome-wide analysis of heterogeneous data
Cellular network biology: Proteome-wide analysis of heterogeneous dataCellular network biology: Proteome-wide analysis of heterogeneous data
Cellular network biology: Proteome-wide analysis of heterogeneous data
 
Large-scale integration of data and text
Large-scale integration of data and textLarge-scale integration of data and text
Large-scale integration of data and text
 
One tagger, many uses - Illustrating the power of ontologies in named entity ...
One tagger, many uses - Illustrating the power of ontologies in named entity ...One tagger, many uses - Illustrating the power of ontologies in named entity ...
One tagger, many uses - Illustrating the power of ontologies in named entity ...
 
Using networks to derive function
Using networks to derive functionUsing networks to derive function
Using networks to derive function
 
Gene association networks - Large-scale integration of data and text
Gene association networks - Large-scale integration of data and textGene association networks - Large-scale integration of data and text
Gene association networks - Large-scale integration of data and text
 
Data and Text Mining
Data and Text MiningData and Text Mining
Data and Text Mining
 
Protein–protein interaction networks
Protein–protein interaction networksProtein–protein interaction networks
Protein–protein interaction networks
 
Large-scale data and text mining
Large-scale data and text miningLarge-scale data and text mining
Large-scale data and text mining
 
Systems biology: Large-scale biomedical data mining
Systems biology: Large-scale biomedical data miningSystems biology: Large-scale biomedical data mining
Systems biology: Large-scale biomedical data mining
 

Viewers also liked (6)

Network biology: A basis for large-scale biomedical data mining
Network biology: A basis for large-scale biomedical data miningNetwork biology: A basis for large-scale biomedical data mining
Network biology: A basis for large-scale biomedical data mining
 
A biologists view on Second Life
A biologists view on Second LifeA biologists view on Second Life
A biologists view on Second Life
 
Gene association networks - Large-scale integration of data and text
Gene association networks - Large-scale integration of data and textGene association networks - Large-scale integration of data and text
Gene association networks - Large-scale integration of data and text
 
Protein networks: A basis for large-scale data mining
Protein networks: A basis for large-scale data miningProtein networks: A basis for large-scale data mining
Protein networks: A basis for large-scale data mining
 
Protein networks: A basis for large-scale data mining
Protein networks: A basis for large-scale data miningProtein networks: A basis for large-scale data mining
Protein networks: A basis for large-scale data mining
 
Predicting novel targets for existing drugs using side effect information
Predicting novel targets for existing drugs using side effect informationPredicting novel targets for existing drugs using side effect information
Predicting novel targets for existing drugs using side effect information
 

Similar to Network biology: A basis for large-scale biomedical data mining

Advanced bioinformatics methods for proteomics
Advanced bioinformatics methods for proteomicsAdvanced bioinformatics methods for proteomics
Advanced bioinformatics methods for proteomics
Lars Juhl Jensen
 
Networks of proteins and diseases
Networks of proteins and diseasesNetworks of proteins and diseases
Networks of proteins and diseases
Lars Juhl Jensen
 
Exploring proteins, chemicals and their interactions with STRING and STITCH
Exploring proteins, chemicals and their interactions with STRING and STITCHExploring proteins, chemicals and their interactions with STRING and STITCH
Exploring proteins, chemicals and their interactions with STRING and STITCH
biocs
 

Similar to Network biology: A basis for large-scale biomedical data mining (18)

Computational Biology - Signaling networks and drug repositioning
Computational Biology - Signaling networks and drug repositioningComputational Biology - Signaling networks and drug repositioning
Computational Biology - Signaling networks and drug repositioning
 
Integration of heterogeneous data
Integration of heterogeneous dataIntegration of heterogeneous data
Integration of heterogeneous data
 
Text and data mining
Text and data miningText and data mining
Text and data mining
 
Data integration: The STITCH database of protein–small molecule interactions
Data integration: The STITCH database of protein–small molecule interactionsData integration: The STITCH database of protein–small molecule interactions
Data integration: The STITCH database of protein–small molecule interactions
 
Unraveling cellular phosphorylation networks using computational biology
Unraveling cellular phosphorylation networks using computational biologyUnraveling cellular phosphorylation networks using computational biology
Unraveling cellular phosphorylation networks using computational biology
 
Network biology
Network biologyNetwork biology
Network biology
 
Advanced bioinformatics methods for proteomics
Advanced bioinformatics methods for proteomicsAdvanced bioinformatics methods for proteomics
Advanced bioinformatics methods for proteomics
 
Prediction of protein-small molecule networks through large-scale data integr...
Prediction of protein-small molecule networks through large-scale data integr...Prediction of protein-small molecule networks through large-scale data integr...
Prediction of protein-small molecule networks through large-scale data integr...
 
Unraveling signaling networks by large-scale data integration
Unraveling signaling networks by large-scale data integrationUnraveling signaling networks by large-scale data integration
Unraveling signaling networks by large-scale data integration
 
Network medicine - Integrating drugs, targets, diseases and side-effects
Network medicine - Integrating drugs, targets, diseases and side-effectsNetwork medicine - Integrating drugs, targets, diseases and side-effects
Network medicine - Integrating drugs, targets, diseases and side-effects
 
Large-scale integration of data and text
Large-scale integration of data and textLarge-scale integration of data and text
Large-scale integration of data and text
 
Large-scale integration of data and text
Large-scale integration of data and textLarge-scale integration of data and text
Large-scale integration of data and text
 
Combining sequence motifs and protein interactions to unravel complex phospho...
Combining sequence motifs and protein interactions to unravel complex phospho...Combining sequence motifs and protein interactions to unravel complex phospho...
Combining sequence motifs and protein interactions to unravel complex phospho...
 
Networks of proteins and diseases
Networks of proteins and diseasesNetworks of proteins and diseases
Networks of proteins and diseases
 
STRING & STITCH : Network integration of heterogeneous data
STRING & STITCH: Network integration of heterogeneous dataSTRING & STITCH: Network integration of heterogeneous data
STRING & STITCH : Network integration of heterogeneous data
 
Exploring proteins, chemicals and their interactions with STRING and STITCH
Exploring proteins, chemicals and their interactions with STRING and STITCHExploring proteins, chemicals and their interactions with STRING and STITCH
Exploring proteins, chemicals and their interactions with STRING and STITCH
 
Network biology
Network biologyNetwork biology
Network biology
 
Large-scale integration of data and text
Large-scale integration of data and textLarge-scale integration of data and text
Large-scale integration of data and text
 

More from Lars Juhl Jensen

More from Lars Juhl Jensen (20)

One tagger, many uses: Illustrating the power of dictionary-based named entit...
One tagger, many uses: Illustrating the power of dictionary-based named entit...One tagger, many uses: Illustrating the power of dictionary-based named entit...
One tagger, many uses: Illustrating the power of dictionary-based named entit...
 
One tagger, many uses: Simple text-mining strategies for biomedicine
One tagger, many uses: Simple text-mining strategies for biomedicineOne tagger, many uses: Simple text-mining strategies for biomedicine
One tagger, many uses: Simple text-mining strategies for biomedicine
 
Extract 2.0: Text-mining-assisted interactive annotation
Extract 2.0: Text-mining-assisted interactive annotationExtract 2.0: Text-mining-assisted interactive annotation
Extract 2.0: Text-mining-assisted interactive annotation
 
Network visualization: A crash course on using Cytoscape
Network visualization: A crash course on using CytoscapeNetwork visualization: A crash course on using Cytoscape
Network visualization: A crash course on using Cytoscape
 
Biomedical text mining: Automatic processing of unstructured text
Biomedical text mining: Automatic processing of unstructured textBiomedical text mining: Automatic processing of unstructured text
Biomedical text mining: Automatic processing of unstructured text
 
Medical network analysis: Linking diseases and genes through data and text mi...
Medical network analysis: Linking diseases and genes through data and text mi...Medical network analysis: Linking diseases and genes through data and text mi...
Medical network analysis: Linking diseases and genes through data and text mi...
 
Network Biology: A crash course on STRING and Cytoscape
Network Biology: A crash course on STRING and CytoscapeNetwork Biology: A crash course on STRING and Cytoscape
Network Biology: A crash course on STRING and Cytoscape
 
Cellular networks
Cellular networksCellular networks
Cellular networks
 
Cellular Network Biology: Large-scale integration of data and text
Cellular Network Biology: Large-scale integration of data and textCellular Network Biology: Large-scale integration of data and text
Cellular Network Biology: Large-scale integration of data and text
 
Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...
Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...
Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...
 
STRING & related databases: Large-scale integration of heterogeneous data
STRING & related databases: Large-scale integration of heterogeneous dataSTRING & related databases: Large-scale integration of heterogeneous data
STRING & related databases: Large-scale integration of heterogeneous data
 
Tagger: Rapid dictionary-based named entity recognition
Tagger: Rapid dictionary-based named entity recognitionTagger: Rapid dictionary-based named entity recognition
Tagger: Rapid dictionary-based named entity recognition
 
Medical text mining: Linking diseases, drugs, and adverse reactions
Medical text mining: Linking diseases, drugs, and adverse reactionsMedical text mining: Linking diseases, drugs, and adverse reactions
Medical text mining: Linking diseases, drugs, and adverse reactions
 
Network biology: Large-scale integration of data and text
Network biology: Large-scale integration of data and textNetwork biology: Large-scale integration of data and text
Network biology: Large-scale integration of data and text
 
Medical data and text mining: Linking diseases, drugs, and adverse reactions
Medical data and text mining: Linking diseases, drugs, and adverse reactionsMedical data and text mining: Linking diseases, drugs, and adverse reactions
Medical data and text mining: Linking diseases, drugs, and adverse reactions
 
Network biology: Large-scale integration of data and text
Network biology: Large-scale integration of data and textNetwork biology: Large-scale integration of data and text
Network biology: Large-scale integration of data and text
 
Biomarker bioinformatics: Network-based candidate prioritization
Biomarker bioinformatics: Network-based candidate prioritizationBiomarker bioinformatics: Network-based candidate prioritization
Biomarker bioinformatics: Network-based candidate prioritization
 
The Art of Counting: Scoring and ranking co-occurrences in literature
The Art of Counting: Scoring and ranking co-occurrences in literatureThe Art of Counting: Scoring and ranking co-occurrences in literature
The Art of Counting: Scoring and ranking co-occurrences in literature
 
Text-mining-based retrieval of protein networks
Text-mining-based retrieval of protein networksText-mining-based retrieval of protein networks
Text-mining-based retrieval of protein networks
 
Medical data and text mining: Linking diseases, drugs, and adverse reactions
Medical data and text mining: Linking diseases, drugs, and adverse reactionsMedical data and text mining: Linking diseases, drugs, and adverse reactions
Medical data and text mining: Linking diseases, drugs, and adverse reactions
 

Recently uploaded

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Recently uploaded (20)

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 

Network biology: A basis for large-scale biomedical data mining

Editor's Notes

  1. Integration Automation Collaboration
  2. Atlas of human kinases Atlases for phospho-binding proteins Atlases for model organisms Ubiquitination would be welcome