Cloud BioLinux: Pre-Configured and On-Demand
High Performance Computing for the Genomics Community



                  Ntino Krampis, PhD
           Next-Gen Sequence Data Management '10
                      Providence, RI
Expensive sequencing, computing and large organizations

●   multi-million, broad-impact sequencing projects

●   large sequencing center, with a dedicated bioinformatics department

●   large-scale computations on SGE cluster, algorithm acceleration hardware
Bench-top, commodity sequencing and small labs


●   small-form-factor sequencer available: GS Junior by 454

●   sequencing as a standard technique in basic biology and genetics research

●   remember microarrays and lengthy assays for protein interactions?

●   RNA-seq and ChIP-seq, and each biologist will be tackling a metagenome
Will small labs become the long tail of sequencing ?



[Figure: long-tail curve; y-axis: amount of sequencing, x-axis: number of labs. Credit: Wikimedia Commons]


●   downstream bioinformatic analysis required for biological discovery

●   basic analysis example: large-scale BLAST to public DBs (try 0.5GB at NCBI)

●   do not have the hardware, expertise, or time to install and run software locally
Cloud Biolinux
              pre-configured and on-demand bioinformatics on the cloud


●   a public virtual machine (VM) on EC2 with 100+ bioinformatics tools

●   how it came to be, what it offers for sequence analysis

●   where and how do I run it, especially if I am not a computer expert

●   modifying and sharing VM configurations and data with your peers

●   openness and community around Cloud Biolinux
Cloud Biolinux
                                        The Biolinux part

●   Biolinux: an Ubuntu Linux desktop for bioinformatics ( tinyurl.com/BioLinux-NEBC )

●   NEBC packages the software and maintains the repository

           +

●   Ubuntu AMI on EC2, pulls packages from the repository

●   additional software of interest to JCVI

           =

●   Cloud Biolinux ( tinyurl.com/CloudBioLinux-JCVI )
Cloud Biolinux
                          what comes in the box

●   glimmer, hmmer, phylip, rasmol, genespring, clustalw, EMBOSS

●   mpiBLAST clusters using EC2 virtual machine instances

●   Celera whole genome shotgun assembler

●   NX remote desktop, easy to use for benchtop scientists
Cloud Biolinux
                                 The Cloud part

●   find our VM on Amazon EC2:

          Biolinux 5.0 packages (32-bit): ami-6953b200
          Biolinux 6.0 packages (64-bit): ami-6011e409 , EBS based

●   17 GB RAM / 6 core instances at $0.50 / hour, see aws.amazon.com/ec2/pricing

●   a small bacterial genome assembly costs a little over $2

●   up to 68 GB RAM / 26 cores, EBS volumes up to 1000 GB in size ($0.10 / GB / month)

●   make a copy of our public Biolinux AMI, add your data, make it private
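
The pricing bullets above lend themselves to a quick back-of-the-envelope estimate. The following is a minimal Python sketch, assuming the 2010 prices quoted on this slide (current AWS pricing will differ). The function names and the 5-hour runtime used in the usage note are illustrative assumptions, not figures from the talk.

```python
from decimal import Decimal

# Prices as quoted on the slide (2010). Treat them as placeholders.
INSTANCE_USD_PER_HOUR = Decimal("0.50")   # 17 GB RAM instance
EBS_USD_PER_GB_MONTH = Decimal("0.10")    # EBS storage


def instance_cost(hours: int) -> Decimal:
    """Cost in USD of running one instance for a whole number of hours."""
    if hours < 0:
        raise ValueError("hours must be non-negative")
    return INSTANCE_USD_PER_HOUR * hours


def ebs_cost(size_gb: int, months: int = 1) -> Decimal:
    """Cost in USD of keeping an EBS volume of size_gb for the given months."""
    if size_gb < 0 or months < 0:
        raise ValueError("size_gb and months must be non-negative")
    return EBS_USD_PER_GB_MONTH * size_gb * months


if __name__ == "__main__":
    # Hypothetical 5-hour assembly run, and the largest EBS volume for a month.
    print("instance:", instance_cost(5), "USD")
    print("1000 GB EBS:", ebs_cost(1000), "USD / month")
```

Under these assumptions, a run of about five hours would land in the "little over $2" range mentioned above, and a full 1000 GB volume would cost about $100 per month.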
Cloud Biolinux
http://tinyurl.com/cloud-biolinux-tutorial (credit to the NEBC team)




●   sign up at aws.amazon.com, then open aws.amazon.com/console
Cloud Biolinux
http://tinyurl.com/cloud-biolinux-tutorial (credit to the NEBC team)




●   find the Cloud Biolinux AMI using its ID

●   enter the desired password for remote desktop login

●   leave all other settings at their defaults
●   get the remote desktop client: nomachine.com/download.php

●   simply enter the VM's IP address and your password
What if I want to share my alignments with a collaborator?

save your data as a new AMI

EBS costs $0.10 / GB / month

at 15 GB, that is $1.50 / month
share your data: public or with another AWS user

users with access can boot the AMI with all the
software + data
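
The two sharing modes above (public, or one specific AWS user) map onto an AMI's launch permissions. The following is a hedged Python sketch that only builds the permission change as a plain dictionary. The helper name `launch_permission` is hypothetical, and the commented-out call assumes the boto3 library, which postdates this talk and is not part of the original material.

```python
from typing import Optional


def launch_permission(account_id: Optional[str] = None) -> dict:
    """Build a launch-permission change for sharing an AMI.

    account_id=None means "make the AMI public"; otherwise it must be
    a 12-digit AWS account ID given as a string.
    """
    if account_id is None:
        return {"Add": [{"Group": "all"}]}
    if not (account_id.isdigit() and len(account_id) == 12):
        raise ValueError("AWS account IDs are 12-digit strings")
    return {"Add": [{"UserId": account_id}]}


# Assumed usage with boto3 (requires credentials; the IDs are placeholders):
#
#   import boto3
#   boto3.client("ec2").modify_image_attribute(
#       ImageId="ami-xxxxxxxx",
#       LaunchPermission=launch_permission("123456789012"),
#   )
```

Keeping the permission logic in a pure function makes it easy to check before anything is sent to AWS.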
Cloud Biolinux
                                  The Cloud part

●   run Cloud Biolinux on your private cloud ?

●   Eucalyptus open source cloud platform

●   API identical to EC2's, without the usage charges

●   easy to set up on your lab's cluster, comes with Ubuntu server (UEC)

●   download VMs from Sourceforge ( tinyurl.com/CloudBiolinux-SF )



                          open.eucalyptus.com
Cloud Biolinux

●   porting VMs across cloud platforms is not trivial

●   moving Cloud Biolinux VMs from EC2 to Eucalyptus: Xen kernel and boot sector must be handled

●   framework to share VM configurations ( tinyurl.com/bootstrap-cloudbiolinux )

●   based on python-fabric automated deployment tool

●   simply edit the software list files and share with collaborators

●   they start with a fresh VM, and python-fabric replicates the VM setup on their cloud



                                tinyurl.com/python-fabric
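
The "edit a software list, replay it on a fresh VM" idea above can be sketched in a few lines of Python. This is an illustration only: the file format (one package per line, `#` comments) and the helper names are assumptions, not the project's actual layout. The resulting command string is what a deployment tool such as python-fabric would run on the remote VM.

```python
import shlex


def parse_package_list(text: str) -> list:
    """Return package names from a list file: one per line, '#' starts a comment."""
    packages = []
    for line in text.splitlines():
        name = line.split("#", 1)[0].strip()
        if name:
            packages.append(name)
    return packages


def install_command(packages: list) -> str:
    """Build a single apt-get command that installs every listed package."""
    if not packages:
        raise ValueError("no packages to install")
    return "apt-get install -y " + " ".join(shlex.quote(p) for p in packages)


if __name__ == "__main__":
    # Hypothetical list file contents shared with a collaborator.
    shared_list = """
    # sequence analysis
    clustalw
    hmmer      # profile HMMs
    """
    print(install_command(parse_package_list(shared_list)))
```

Because the list is plain text, a collaborator can add or remove a line and replay the same step on their own cloud.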
Cloud Biolinux
       Collaboration and open source

high-level configuration describing software groups

   for each group, a list of individual software packages

simply edit the files to change the VM configuration

        tinyurl.com/CloudBioLinux-github
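
The two-level layout described above (software groups, then packages per group) can be illustrated with a small Python sketch. The group names and the dictionary structure are hypothetical, chosen for illustration; the package names come from the "what comes in the box" slide. The real configuration files live in the repository linked above and may look different.

```python
# Hypothetical two-level configuration: group name -> packages in that group.
SOFTWARE_GROUPS = {
    "alignment": ["clustalw", "hmmer"],
    "phylogeny": ["phylip"],
    "visualization": ["rasmol"],
}


def resolve_packages(enabled_groups, groups=SOFTWARE_GROUPS):
    """Expand the enabled groups into a flat, de-duplicated package list.

    Order follows the order of enabled_groups, then the order in each group.
    """
    seen = set()
    resolved = []
    for group in enabled_groups:
        if group not in groups:
            raise KeyError("unknown software group: " + group)
        for package in groups[group]:
            if package not in seen:
                seen.add(package)
                resolved.append(package)
    return resolved


if __name__ == "__main__":
    # Changing the VM configuration means editing only this list of groups.
    print(resolve_packages(["alignment", "phylogeny"]))
```

Separating "which groups" from "which packages" lets a lab turn whole categories on or off without touching the package lists.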




Cloud Biolinux
                                 The community

●   from JCVI and NEBC to an open-source, community-based project

●   community initiated during tele-conference meeting at SC '10, Portland, OR

●   first meeting this past July in Boston, tinyurl.com/openbio-codefest-2010

●   work done: 64-bit AMIs, NX remote desktop, set up the fabric framework

●   next year's meeting at ISMB/BOSC in Vienna, Austria http://metalab.at/

●   cloudbiolinux.com and, most importantly, tinyurl.com/cloudbiolinux-lists
Cloud Biolinux
                                  The future

●   expand community, receive feedback, add more software to the VM

●   genome assemblers, high-memory EC2 instances up to 68GB RAM

●   Hadoop / MapReduce (for those running the VM in private clouds)

●   analysis pipelines that are used by large sequencing centers

●   actively seeking funding to put major effort into development

●   tinyurl.com/cloudbiolinux-lists or community@cloudbiolinux.com
Acknowledgments & Credits
Brad Chapman - development of the fabric scripts and community organizer
Tim Booth, Bela Tiwari - BioLinux 6.0 development and EC2 documentation
Deepak Singh and AWS - education grant supporting codefest workshop
Justin Johnson - community and sponsorship of cloudbiolinux.com
J. Craig Venter Inst. - time allowed to work on an open-source project
D. Gomez, E. Navarro, J. Shao, I. Singh - JCVI technology innovation

Members of the Cloud Biolinux community:
Enis Afgan
Michael Heuer
Richard Holland
Mark Jensen
Dave Messina
Steffen Möller
Roman Valls

Thank you!
