Pizza club - March 2017 - Gaia

•Transferir como PPTX, PDF•

0 gostou•169 visualizações

This document presents ChainRank, a method to identify context-specific subnetworks from genome-wide interaction networks. ChainRank models information flow using chains of interactions between start and end nodes. It prioritizes chains based on node scores like expression variability and connectivity. Applying ChainRank to a COPD interaction network, the top chains found showed a 50% improvement in precision over random and enriched for known COPD pathways. Combining multiple node scores yielded even better results, demonstrating ChainRank's ability to identify meaningful subnetworks.

Ciências

Background & Aim
• There is more and more (genome-wide) data available that is still not optimally
used
• Genome-wide networks are too big and complex to be interpreted in a
meaningful way
• Knowledge-based networks are in general non specific: e.g. canonical pathways,
PPI networks…
Develop a flexible method to identify context-specific subnetworks

Approach
• Model the flow of information using chains of interactions
• Chains = simple paths: sequence of interactions (e.g. protein modifications) that
connect one start and one ending point.
• Multiple chains can exist between a couple of start and end protein: what is the
best meaningful subnetwork?
• Prioritization of the chains based on many possible scores: gene expression,
functional module identification, …
• Here they present a general tool for combining multiple biological information as
chain scores: ChainRank

Methods
1. Search for all chains among user-defined start and end nodes in the network
2. Annotate the nodes with scores in order to calculate chains score and p-value

Subnetwork
Restrict the network by heuristic breadth-first search from the fixed initial proteins
to the final one with 2 criteria:
1. Maximal length allowed = length of the shortest path between initial and final
node
2. Prefer the integration of highly connected proteins (canonical signaling
interactors)

Scoring scheme
• Chain score =
𝑛𝑜𝑑𝑒 𝑠𝑐𝑜𝑟𝑒𝑠
𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑛𝑜𝑑𝑒𝑠
• Node scores used
1. Localisation: mean expression variability across studied tissue vs. mean
expression variability across all others -> gene expression
2. Relevance: occurrence of each protein among the significant ones across
studies -> gene expression, protein modifications, metabolism…
3. Connectivity: degree centrality -> topology
• Combination of scores
1. Weighted product of normalized scores
2. Filtering: pre-filter chains by score S1 and rank them by score S2
3. Intersection: keep only chains that pass filter on all scores

Results
• Application to chronic obstructive pulmonary disease (COPD)
• Network used: experimental interactions from different public databases + COPD
knowledge base (10k nodes, 62k interactions)
• Significance: comparison to chains in random networks
• Evaluation: enrichment of the top ranked chains in gold standard pathways
proteins
• Improvement metric:
𝑝𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛 𝑜𝑓 𝑟𝑎𝑛𝑘𝑖𝑛𝑔
𝑝𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛 𝑜𝑓 𝑟𝑎𝑛𝑑𝑜𝑚 𝑟𝑎𝑛𝑘𝑖𝑛𝑔

Localisation: expression variability across studied tissue vs. across all others
Relevance: occurrence of each protein among the significant ones across COPD-related studies
Connectivity: degree centrality
Combination by weighted product: no improvement
Filtering: connectivity<0.05, ranked by localization
Intersection: connectivity and localization
Filtering: top quartile localization, ranked by relevance
Intersection: localization and relevance
IGF-Akt proximity subnetwork MAPK proximity subnetwork

Results for the best 50 chains
Other methods:
recall 50-85%
Precision 18-42%
Here (max):
recall 67%,
precision 30%

Conclusions and claims
• 50% improvement in finding gold standard proteins (compared to random), and
combining scores even better (x2.5)
• 11% improvement of the AUC (compared to random)
• Generic tool applicable to different network types (GRN, metabolic networks)
• Importance of selected scores based on scientific question
• Applications
• Causal, mechanistic connection?
• Common mechanisms driving different diseases
• Reduce the computational models
• Synthetic lethality

Mais conteúdo relacionado

Mais procurados

Motif presentationAmir Razmjou

P2P DOMAIN CLASSIFICATION USING DECISION TREE ijp2p

Improved Text Mining for Bulk Data Using Deep Learning Approach IJCSIS Research Publications

A TOPOLOGY POTENTIAL-BASED METHOD FOR IDENTIFYING ESSENTIAL PROTEINS FROM PPI...I3E Technologies

On optimizing overlay topologies for searchIMPULSE_TECHNOLOGY

Impact of location popularity on throughput and delay in mobile ad hoc networksieeeprojectschennai

Data availability and feasibility of validation – A genomics case studyVerena139

NetBioSIG2013-Talk Gang SuAlexander Pico

Network motifs in integrated cellular networks of transcription–regulation an...Samuel Sattath

A generalized flow based method for analysis of implicit relationships on wik...JPINFOTECH JAYAPRAKASH

A system to filter unwanted messagesNinad Samel

Sentence versus Paragraph Processing: Linear and relational knowledge structu...Roy Clariana

A study and survey on various progressive duplicate detection mechanismseSAT Journals

News Reliability Evaluation using Latent Semantic AnalysisTELKOMNIKA JOURNAL

Correlation Coefficient Based Average Textual Similarity Model for Informatio...IOSR Journals

NetBioSIG2013-KEYNOTE Benno SchwikowskiAlexander Pico

Recent trends in bioinformaticsZeeshan Hanjra

Curveball Algorithm for Random Sampling of Protein NetworksAkua Biaa Adu

SimulatorEduardo

Simulator922010

Mais procurados (20)

Motif presentation

P2P DOMAIN CLASSIFICATION USING DECISION TREE

Improved Text Mining for Bulk Data Using Deep Learning Approach

A TOPOLOGY POTENTIAL-BASED METHOD FOR IDENTIFYING ESSENTIAL PROTEINS FROM PPI...

On optimizing overlay topologies for search

Impact of location popularity on throughput and delay in mobile ad hoc networks

Data availability and feasibility of validation – A genomics case study

NetBioSIG2013-Talk Gang Su

Network motifs in integrated cellular networks of transcription–regulation an...

A generalized flow based method for analysis of implicit relationships on wik...

A system to filter unwanted messages

Sentence versus Paragraph Processing: Linear and relational knowledge structu...

A study and survey on various progressive duplicate detection mechanisms

News Reliability Evaluation using Latent Semantic Analysis

Correlation Coefficient Based Average Textual Similarity Model for Informatio...

NetBioSIG2013-KEYNOTE Benno Schwikowski

Recent trends in bioinformatics

Curveball Algorithm for Random Sampling of Protein Networks

Simulator

Destaque

Pizza club - March 2017 - UrsulaRSG Luxembourg

B'RAIN company presentationRSG Luxembourg

A modeling workflow in systems biology: An overviewAnna Zhukova

Model management for systems biology projectsUniversity Medicine Greifswald

SilicoLife presentationRSG Luxembourg

Community Modeling WorkshopRSG Luxembourg

Pizza club - February 2017 - FedericoRSG Luxembourg

Pizza club - February 2017 - GemmaRSG Luxembourg

Destaque (8)

Pizza club - March 2017 - Ursula

B'RAIN company presentation

A modeling workflow in systems biology: An overview

Model management for systems biology projects

SilicoLife presentation

Community Modeling Workshop

Pizza club - February 2017 - Federico

Pizza club - February 2017 - Gemma

Semelhante a Pizza club - March 2017 - Gaia

Thesis PresentationDimitrios Apostolos Chalepakis Ntellis

System Biology and Pathway Network.pptxssuserecbdb6

Java tutorial: Programmatic Access to Molecular InteractionsRafael C. Jimenez

Identification of novel potential anti cancer agents using network pharmacolo...Cresset

KnetMiner Overview Oct 2017Keywan Hassani-Pak

presentationPeter Langfelder

Systems Biology Approaches to CancerRaunak Shrestha

Applied Bioinformatics Assignment 5docxUniversity of Allahabad

Pathway and network analysisManar Al-Eslam Mattar

Pathway analysis for genomics dataSakshiJha40

KnetMiner - EBI Workshop 2017Keywan Hassani-Pak

Machine reading for cancer biologyLaura Berry

NS-CUK Seminar: H.B.Kim, Review on "metapath2vec: Scalable representation le...ssuser4b1f48

A New Method for Reducing Energy Consumption in Wireless Sensor Networks usin...Editor IJCATR

molecular docking screnning. pptxPraveen kumar S

Computational Biology Methods for Drug Discovery_Phase 1-5_November 2015Mathew Varghese

EnrichNet: Graph-based statistic and web-application for gene/protein set enr...Enrico Glaab

Overall Vision for NRNB: 2015-2020Alexander Pico

network mining and representation learningsun peiyuan

dreamPriyata Kalra

Semelhante a Pizza club - March 2017 - Gaia (20)

Thesis Presentation

System Biology and Pathway Network.pptx

Java tutorial: Programmatic Access to Molecular Interactions

Identification of novel potential anti cancer agents using network pharmacolo...

KnetMiner Overview Oct 2017

presentation

Systems Biology Approaches to Cancer

Applied Bioinformatics Assignment 5docx

Pathway and network analysis

Pathway analysis for genomics data

KnetMiner - EBI Workshop 2017

Machine reading for cancer biology

NS-CUK Seminar: H.B.Kim, Review on "metapath2vec: Scalable representation le...

A New Method for Reducing Energy Consumption in Wireless Sensor Networks usin...

molecular docking screnning. pptx

Computational Biology Methods for Drug Discovery_Phase 1-5_November 2015

EnrichNet: Graph-based statistic and web-application for gene/protein set enr...

Overall Vision for NRNB: 2015-2020

network mining and representation learning

dream

Mais de RSG Luxembourg

Pizza club - January 2017 - KamilRSG Luxembourg

Pizza club - January 2017 - AlbertoRSG Luxembourg

Pizza club - October 2016 - EugenRSG Luxembourg

Pizza club - October 2016 - LisaRSG Luxembourg

Pizza club Zoé 28.09.16RSG Luxembourg

September Journal Club -AishwaryaRSG Luxembourg

Nicola Bonzanni - Following Your PassionRSG Luxembourg

Magali Michaut - ROCK YOUR SCIENCE!RSG Luxembourg

June 2016 - ZuogongRSG Luxembourg

2016.06 Pizza club - MarcoRSG Luxembourg

Pizza club - May 2016 - ShamanRSG Luxembourg

Pizza Club - May 2016 - AnneRSG Luxembourg

20042016_pizzaclub_part1RSG Luxembourg

20042016_pizzaclub_part2RSG Luxembourg

Resource sharing sessionRSG Luxembourg

160316_pizzaclub_part2RSG Luxembourg

160316_pizzaclub_part1RSG Luxembourg

Mais de RSG Luxembourg (17)

Pizza club - January 2017 - Kamil

Pizza club - January 2017 - Alberto

Pizza club - October 2016 - Eugen

Pizza club - October 2016 - Lisa

Pizza club Zoé 28.09.16

September Journal Club -Aishwarya

Nicola Bonzanni - Following Your Passion

Magali Michaut - ROCK YOUR SCIENCE!

June 2016 - Zuogong

2016.06 Pizza club - Marco

Pizza club - May 2016 - Shaman

Pizza Club - May 2016 - Anne

20042016_pizzaclub_part1

20042016_pizzaclub_part2

Resource sharing session

160316_pizzaclub_part2

160316_pizzaclub_part1

Último

Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubaikojalkojal131

FREE NURSING BUNDLE FOR NURSES.PDF by naJASISJULIANOELYNV

Ai in communication electronicss[1].pptxsubscribeus100

《Queensland毕业文凭-昆士兰大学毕业证成绩单》rnrncn29

User Guide: Pulsar™ Weather Station (Columbia Weather Systems)Columbia Weather Systems

Let’s Say Someone Did Drop the Bomb. Then What?LUMINATIVE MEDIA/PROJECT COUNSEL MEDIA GROUP

Thermodynamics ,types of system,formulae ,gibbs free energy .pptxuniversity

Microteaching on terms used in filtration .Pharmaceutical EngineeringPrajakta Shinde

PROJECTILE MOTION-Horizontal and VerticalMAESTRELLAMesa2

bonjourmadame.tumblr.com bhaskar's girlshansessene

Citronella presentation SlideShare mani upadhyayupadhyaymani499

Organic farming with special reference to vermicultureTakeleZike1

GenAI talk for Young at Wageningen University & Research (WUR) March 2024Jene van der Heide

Observational constraints on mergers creating magnetism in massive starsSérgio Sacani

REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...Universidade Federal de Sergipe - UFS

GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests GlycosidesNandakishor Bhaurao Deshmukh

Manassas R - Parkside Middle School 🌎🏫qfactory1

Servosystem Theory / Cybernetic Theory by PetrovicAditi Jain

well logging & petrophysical analysis.pptxzaydmeerab121

User Guide: Capricorn FLX™ Weather StationColumbia Weather Systems

Pizza club - March 2017 - Gaia

1. 22 March 2017

2. Background & Aim • There is more and more (genome-wide) data available that is still not optimally used • Genome-wide networks are too big and complex to be interpreted in a meaningful way • Knowledge-based networks are in general non specific: e.g. canonical pathways, PPI networks… Develop a flexible method to identify context-specific subnetworks

3. Approach • Model the flow of information using chains of interactions • Chains = simple paths: sequence of interactions (e.g. protein modifications) that connect one start and one ending point. • Multiple chains can exist between a couple of start and end protein: what is the best meaningful subnetwork? • Prioritization of the chains based on many possible scores: gene expression, functional module identification, … • Here they present a general tool for combining multiple biological information as chain scores: ChainRank

4. Methods 1. Search for all chains among user-defined start and end nodes in the network 2. Annotate the nodes with scores in order to calculate chains score and p-value

5. Subnetwork Restrict the network by heuristic breadth-first search from the fixed initial proteins to the final one with 2 criteria: 1. Maximal length allowed = length of the shortest path between initial and final node 2. Prefer the integration of highly connected proteins (canonical signaling interactors)

6. Scoring scheme • Chain score = 𝑛𝑜𝑑𝑒 𝑠𝑐𝑜𝑟𝑒𝑠 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑛𝑜𝑑𝑒𝑠 • Node scores used 1. Localisation: mean expression variability across studied tissue vs. mean expression variability across all others -> gene expression 2. Relevance: occurrence of each protein among the significant ones across studies -> gene expression, protein modifications, metabolism… 3. Connectivity: degree centrality -> topology • Combination of scores 1. Weighted product of normalized scores 2. Filtering: pre-filter chains by score S1 and rank them by score S2 3. Intersection: keep only chains that pass filter on all scores

7. Results • Application to chronic obstructive pulmonary disease (COPD) • Network used: experimental interactions from different public databases + COPD knowledge base (10k nodes, 62k interactions) • Significance: comparison to chains in random networks • Evaluation: enrichment of the top ranked chains in gold standard pathways proteins • Improvement metric: 𝑝𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛 𝑜𝑓 𝑟𝑎𝑛𝑘𝑖𝑛𝑔 𝑝𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛 𝑜𝑓 𝑟𝑎𝑛𝑑𝑜𝑚 𝑟𝑎𝑛𝑘𝑖𝑛𝑔

8. Localisation: expression variability across studied tissue vs. across all others Relevance: occurrence of each protein among the significant ones across COPD-related studies Connectivity: degree centrality Combination by weighted product: no improvement Filtering: connectivity<0.05, ranked by localization Intersection: connectivity and localization Filtering: top quartile localization, ranked by relevance Intersection: localization and relevance IGF-Akt proximity subnetwork MAPK proximity subnetwork

9. Results for the best 50 chains Other methods: recall 50-85% Precision 18-42% Here (max): recall 67%, precision 30%

10. Conclusions and claims • 50% improvement in finding gold standard proteins (compared to random), and combining scores even better (x2.5) • 11% improvement of the AUC (compared to random) • Generic tool applicable to different network types (GRN, metabolic networks) • Importance of selected scores based on scientific question • Applications • Causal, mechanistic connection? • Common mechanisms driving different diseases • Reduce the computational models • Synthetic lethality

Notas do Editor

Note that from pathway to Ppi notation the structure of the pathway is heavily changing: they cannot aim at recovering the canonical pathway, so they go for improvement
Connectivity score = |dc – max(dc)| + 1
COPD case doesn’t have p-values because Relevance is far from normal distribution They don’t justify the use of different scores in different scenarios, why connectivity in the first one?
Random score is different from diagonal because it takes into account the topology Red lines: # gold standard proteins; B2 is the same but with the input network (chains scored but not selected)

Pizza club - March 2017 - Gaia

Recomendados

Recomendados

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

Destaque

Destaque (8)

Semelhante a Pizza club - March 2017 - Gaia

Semelhante a Pizza club - March 2017 - Gaia (20)

Mais de RSG Luxembourg

Mais de RSG Luxembourg (17)

Último

Último (20)

Pizza club - March 2017 - Gaia

Notas do Editor