Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Reproducible Workflow with Cytoscape and Jupyter Notebook
1. Reproducible Workflows with
Jupyter Notebook and Cytoscape
Keiichiro Ono
Cytoscape Core Developer Team
UC, San Diego Trey Ideker Lab / National Resource for Network Biology
5/19/2016 Advanced Cytoscape Workshop
6. Review
- Network analysis / visualization is a powerful
method to get biological insights from your
screening result
- Cytoscape is the de-facto standard tool to
perform this type of analysis
7. Review
-Core features of Cytoscape
-Navigation (Pan/Zoom/Select)
-Network / Table Data Import
-Automatic Layout
-Visual Style
12. Creating Visualizations in Cytoscape
Name Type
BRCA1 gene
MAP2K1 gene
C05981 compound
• Mapping from Type to Node Shape
• Mapping from Type to Node Color
C05981
BRCA1
MAP2K1
Creating mappings from data points
to Visual Properties
16. Problems
- Reproducibility of biological research, especially for in vivo/vitro
experiments, is a hard problem
- But this is true even for in silico analysis!
- OS version
- Revision of scripts
- Data analysis software versions
- Version of data files
- Command line parameters written on a paper napkin
- “Black magic” only a grad student knows
- This is something we need to fix, using latest technologies and best
practices
35. Language-Agnostic
- From next version (4.x), Python Notebook will be an
implementation of Jupyter
- You can switch to other language kernels
- In this lecture, we will use Python, but you can use
language of your choice to control Cytoscape
36.
37. Question
• Cytoscape is a desktop application
• Point & click GUI operation
• Easy to use, but how can we
make our workflow
reproducible?
39. What is cyREST?
- Platform-independent, RESTful API module for Cytoscape
- Means you can access basic Cytoscape data objects
programmatically
- Now it’s a Cytoscape Core feature!
REST
40. Interactive Data Analysis
Environments
In-House Databases External Computing Resources
- Graph Layout
- Statistical Analysis
- Data Pre-processing
RStudio
- NumPy
- SciPy
- Pandas
- NetworkX
IPython Notebook
File / Code Hosting ServicesPublic Data Repository
PSICQUIC Services
EBI RDF Platform
Other Bioinformatics Web Applications / Services
- igraph
- rCurl
Command Line Tools
> sed
> awk
> grep
> curl
Web Browsers
Data Repository & Collaboration Service
Data Bus (Internet)
Your Workstation
Cytoscape App Store
Cytoscape Desktop
Apps
Core
REST
52. REST
Lab notebook to record
your workflow
Make Cytoscape
controllable via scripts
Manage
multiple versions of your
notebooks and other
scripts
Missing: Environment to execute your workflow
56. Bare Metal Machine
OS (Linux)
Docker
Frameworks
Application
Frameworks
Application
Frameworks
Application
Frameworks
Application
Frameworks
Application
57.
58. What is Docker?
- Container to run applications in an isolated
environment
- Application = Layer of images
- Sharable Environments
- Environments as code
59. Docker Hub
- Sharing environments as code!
- Dockerfile - Definition of your container
- “GitHub of Images”