Integrated omics analysis pipeline for model organism with Cytoscape
Kozo Nishida
RIKEN, Quantitative Biology Center (QBiC)
Kozo Nishida @ RECOMB2014, Nov 11, 2014

Goal
Reproducible and modifiable omics analysis pipeline in a single environment
Pipeline stages: Omics experiment, Omics data analysis, Pathway data integration, Network analysis
(www.genome.jp) (Rohn, 2012)

Why?
• Each process is separate, so the whole analysis pipeline is NOT easy to reproduce.
• This is especially hard when a process must be modified and the results aggregated.
• Cytoscape is good software for pathway data integration and network analysis, but…
• it is NOT the best fit for the whole analysis pipeline: a Java app is NOT easy to modify.
• R is common for omics data preprocessing and analysis.
• Python is good for data aggregation.
• Both can be used for data integration and network analysis.
Pipeline stages: Omics experiment, Omics data analysis, Pathway data integration, Network analysis
(www.genome.jp) (Rohn, 2012)

How?

Seamless, reproducible, and modifiable IPython notebook environment
• Cytoscape is controlled from an IPython notebook
• Low-level access to Cytoscape with the cyREST app
• Omics analysis with Bioconductor R packages
• Pathway data integration with Python and a graph database
• KEGG-based pathway data integration with the KEGGscape app
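
A minimal sketch of what controlling Cytoscape from a notebook can look like. It assumes cyREST is running on its default local port (1234) and accepts a Cytoscape.js-style JSON document at POST /v1/networks. The helper names build_network and post_network and the toy gene names are hypothetical and do not come from the talk. Only the JSON-building step runs without a live Cytoscape session.

```python
import json
import urllib.request

# Assumption: cyREST is listening on its default local port.
BASE_URL = "http://localhost:1234/v1"


def build_network(name, edges):
    """Build a Cytoscape.js-style JSON document from (source, target) pairs."""
    node_ids = sorted({n for edge in edges for n in edge})
    return {
        "data": {"name": name},
        "elements": {
            "nodes": [{"data": {"id": n, "name": n}} for n in node_ids],
            "edges": [{"data": {"source": s, "target": t}} for s, t in edges],
        },
    }


def post_network(network, base_url=BASE_URL):
    """POST the network to a running Cytoscape and return the parsed JSON reply."""
    request = urllib.request.Request(
        base_url + "/networks",
        data=json.dumps(network).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


# Hypothetical toy network; building the JSON needs no running Cytoscape.
toy = build_network("toy pathway", [("geneA", "geneB"), ("geneB", "geneC")])
print(json.dumps(toy, indent=2))
```

With Cytoscape and cyREST running, calling post_network(toy) from a notebook cell should create the network in the desktop application. The reply format is best checked against the cyREST documentation for the installed version.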

cyREST and KEGGscape app
• cyREST provides a scripting-language interface to Cytoscape
• cyREST is useful and suitable for KEGG-based pathway data integration
• KEGGscape supports importing KEGG pathway XML (KGML) into Cytoscape
• Difference from CytoKEGG and CyKEGGparser
  • CytoKEGG and CyKEGGparser have several additional features, but they are too specialized for their own purposes and leave some pathways unsupported.
  • KEGGscape simply imports and reconstructs each KEGG pathway as it is, for as many pathways as KEGG provides. (Currently supports all KEGG pathways.)
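
To make "KEGG pathway XML (KGML)" concrete, here is a small sketch that reads the node information a KGML file carries, using only the Python standard library. It is not how KEGGscape itself is implemented. The embedded fragment is a hypothetical, heavily trimmed example with placeholder identifiers. It follows the general KGML layout (pathway, entry, graphics) but is not copied from KEGG.

```python
import xml.etree.ElementTree as ET

# Hypothetical KGML fragment with placeholder IDs, for illustration only.
KGML = """\
<pathway name="path:eco00000" org="eco" number="00000" title="Toy pathway">
  <entry id="1" name="eco:b0001 eco:b0002" type="gene">
    <graphics name="geneA, geneB" type="rectangle" x="100" y="200" width="46" height="17"/>
  </entry>
  <entry id="2" name="cpd:C00000" type="compound">
    <graphics name="C00000" type="circle" x="150" y="200" width="8" height="8"/>
  </entry>
</pathway>
"""


def parse_kgml(text):
    """Return the pathway title and one dict per KGML entry."""
    root = ET.fromstring(text)
    nodes = []
    for entry in root.findall("entry"):
        graphics = entry.find("graphics")
        nodes.append({
            "id": entry.get("id"),
            "type": entry.get("type"),
            # One entry can list several KEGG IDs separated by spaces.
            "kegg_ids": entry.get("name").split(),
            "x": float(graphics.get("x")),
            "y": float(graphics.get("y")),
        })
    return root.get("title"), nodes


title, nodes = parse_kgml(KGML)
print(title)
for node in nodes:
    print(node)
```

Keeping the x and y coordinates matters because reconstructing a pathway "as it is" depends on the layout stored in the file.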

Demo for E. coli 1
Mapping differentially expressed genes (between WT and lrp-) to KEGG
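
The mapping step in this demo can be sketched as a join between pathway nodes and a list of differentially expressed genes. This is an illustrative sketch, not the notebook shown in the talk. The function name flag_de_nodes, the node layout, and all gene IDs are hypothetical placeholders.

```python
def flag_de_nodes(nodes, de_genes):
    """Return one row per pathway node listing the DE genes it contains."""
    de = set(de_genes)
    rows = []
    for node in nodes:
        # A KEGG gene node may bundle several genes, so intersect the sets.
        hits = sorted(de.intersection(node["kegg_ids"]))
        rows.append({"id": node["id"], "de_genes": hits, "is_de": bool(hits)})
    return rows


# Hypothetical inputs: two pathway nodes and a DE gene list.
nodes = [
    {"id": "1", "kegg_ids": ["eco:b0001", "eco:b0002"]},
    {"id": "2", "kegg_ids": ["eco:b0003"]},
]
de_genes = ["eco:b0002", "eco:b9999"]

rows = flag_de_nodes(nodes, de_genes)
for row in rows:
    print(row)
```

A boolean column such as is_de is the kind of node attribute that a visual style could then map to a highlight color.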

Demo for E. coli 2
Mapping E. coli drug targets to KEGG
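
One way to prepare such a mapping is to collect the drugs per pathway node and wrap them in a table-update body. The sketch below assumes the body shape (key, dataKey, data) that the cyREST table-update endpoint is understood to accept; please verify it against the cyREST documentation. The column name "name", the helper names, and the drug and gene identifiers are all hypothetical.

```python
import json
from collections import defaultdict


def drug_rows(nodes, drug_targets):
    """Collect, per pathway node, the drugs that target any of its genes."""
    by_gene = defaultdict(set)
    for drug, gene in drug_targets:
        by_gene[gene].add(drug)
    rows = []
    for node in nodes:
        drugs = set()
        for gene in node["kegg_ids"]:
            drugs |= by_gene.get(gene, set())
        if drugs:
            rows.append({"id": node["id"], "drugs": ", ".join(sorted(drugs))})
    return rows


def table_update(rows, key_column="name"):
    """Wrap rows in an assumed cyREST-style table-update body.

    key_column is a hypothetical node-table column holding the node ID.
    """
    return {"key": key_column, "dataKey": "id", "data": rows}


# Hypothetical pathway nodes and (drug, target gene) pairs.
nodes = [
    {"id": "1", "kegg_ids": ["eco:b0001", "eco:b0002"]},
    {"id": "2", "kegg_ids": ["eco:b0003"]},
]
drug_targets = [
    ("drugB", "eco:b0001"),
    ("drugA", "eco:b0002"),
    ("drugC", "eco:b9999"),
]

payload = table_update(drug_rows(nodes, drug_targets))
print(json.dumps(payload, indent=2))
```

Nodes without any targeting drug are left out of the payload, so their table rows stay untouched.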

Another example: Arabidopsis thaliana
Mapping time-series metabolome profile to KEGG (http://goo.gl/jk01HP)

Conclusions, Future work
• Constructed a reproducible (and flexible) omics analysis pipeline with the cyREST app.
• You can replace KEGG with WikiPathways, Reactome, or other pathway databases.
• Packaging Python and R utility functions
  • py2cytoscape (github.com/idekerlab/py2cytoscape)
• More example IPython notebooks!!
• Contributions are welcome; please see github.com/idekerlab/cy-rest-python

Acknowledgments
• The Cytoscape consortium
• Keiichiro Ono (UCSD)
  • cyREST, KEGGscape
• Atsushi Fukushima (RIKEN CSRS)
  • AtMetExpress Arabidopsis thaliana metabolome database
• Jun Sese (AIST CBRC)
  • Mentoring in the “Tool Prototype for Integrated Database Analysis” project
This project is supported by the National Bioscience Database Center (NBDC), Japan


Editor's Notes

  1. I’m Kozo Nishida, from RIKEN, Japan. I would like to introduce a new omics analysis environment project for Cytoscape.
  2. My project goal is to realize a reproducible and modifiable omics analysis pipeline in a single environment. These processes are the components of the pipeline.
  3. The reason I am doing this project is that each process is separate, so the whole analysis pipeline is NOT easy to reproduce. This is especially hard when you need to modify a process and aggregate the results that connect the processes. Of course, Cytoscape is good for the latter part of the pipeline, but it is NOT the best fit for the whole analysis pipeline, because the pipeline needs flexibility, while a Java app requires compiling and is NOT easy to modify. For the former part of the pipeline, the R language is common for omics data preprocessing and analysis, and the Python language is good for data aggregation and can be used as a general-purpose language. With these languages it is easy to modify the pipeline through trial and error.
  4. So I implemented a pipeline like the one in this image.
  5. Cytoscape users usually control Cytoscape through the GUI. In my case, however, Cytoscape is controlled programmatically from an IPython notebook with the cyREST app. I leave omics analysis to Bioconductor packages, and pathway data integration to Python and a graph database. The main pathway integration target is KEGG.
  6. You need to install the cyREST and KEGGscape apps to reproduce our pipeline. Python requires the cyREST interface to control Cytoscape. cyREST is useful and suitable for KEGG-based pathway data integration. Cytoscape does not support KEGG pathways by default, so you need to install the KEGGscape app. There are also the CytoKEGG and CyKEGGparser apps for KEGG pathway support in Cytoscape 3, but these are specialized for their own workflows, and you may find it difficult to control them from Python. So I recommend KEGGscape: it simply imports and reconstructs each KEGG pathway as it is, and it currently supports all KEGG pathways.
  7. Next I will show you two demos for E. coli. The first is a pipeline for mapping differentially expressed genes (between WT and the lrp mutant strain) to KEGG. First I import a KEGG pathway from the IPython notebook. At this stage the pipeline has not finished, so no data is integrated yet. Next I run the whole pipeline, and the differentially expressed gene table is merged. The nodes highlighted in yellow are enzyme nodes that include differentially expressed genes.
  8. The second is a pipeline for mapping E. coli drug targets in DrugBank to KEGG. Again, I first import a KEGG pathway from the IPython notebook. At this stage the pipeline has not finished yet, so no data is integrated. Next I run the whole pipeline, and the E. coli drug targets in DrugBank are mapped to the KEGG pathway. This column lists the drugs, and the next column lists the target proteins as KEGG gene products. These pipelines are independent, and you can combine them; here, for simplicity, I separated them into two movies.
  9. RIKEN has rich resources for plant metabolomes, so I am also trying to construct a more complicated pipeline for Arabidopsis thaliana. I cannot show you all the metabolome data yet, but you can see a sample metabolome-mapping IPython notebook from here.
  10. I have shown you some reproducible omics analysis pipelines built with the cyREST app. I used KEGG as the example, but you can replace the target pathway database with WikiPathways, Reactome, or other databases. Of course, you can integrate all of these data with Python. I have just started this project, so the Python and R packaging is not finished yet. I hope to contribute to the py2cytoscape project. Increasing the number of example notebooks is also important for this project. If you are interested in contributing notebooks, please see this URL.
  11. I thank the following people. Thank you.