Misha Kapushesky introduces Genestack's contribution to the proof of concept developed in its consortium with Constellation Technologies for Phase 2 of the Pistoia Alliance Sequence Services project. The presentation was delivered at the Pistoia Alliance Conference in Boston, MA, on April 24, 2012.
2. GENESTACK PLATFORM
Objective universal genomics
applications platform
existing tool integration
Reason
& new tool development
provide full set of
Approach
building blocks
3. GENESTACK PLATFORM
Sharing Public data
Security Applications
HPC Private data
4. GENESTACK PLATFORM
Data private & secure
sharing
free public data
format-independent
custom data types
7. Genestack Limited, Salisbury Telephone +447990705531, Registered in England
and Wales Company
House, Station Road, Cambridge, Email: info@genestack.com, No. 7778793
GENESTACK
CB1 2LA, United Kingdom Twitter: @genestackltd
www.genestack.com
GENESTACK
www.genestack.com
GENOMICS Universal genomics data platform. Secure hosting and team sharing of Big
Data genomics experiments. Bioinformatics applications ecosystem in the
OPERATING
GENOMICS Universal genomics data platform. Securedata from public repositories. Data
cloud. Free access to curated genomic hosting and team sharing of Big
Data genomics application development. End-to-end sequencing service.
curation and experiments. Bioinformatics applications ecosystem in the
SYSTEM
OPERATING cloud. Free access to curated genomic data from public repositories. Data
Applications SDK & marketplace. Fixed monthly subscription.
curation and application development. End-to-end sequencing service.
SYSTEM Applications SDK & marketplace. Fixed monthly subscription.
Solutions to Six Problems With Genomic Data and Applications in the Enterprise
Solutions to Six Problems With Genomic Data and Applications in the Enterprise
1. Managing Genomic Data Storage Costs Interesting: NGS produces files hundreds of gigabytes
in size; encrypting/decrypting them is slow and CPU-
Problem: Sequencing gets cheaper per genome,
intensive, while bioinformatics tools can take hours or
1. Managing Genomic Dataper dollar, but data storage
producing more gigabases Storage Costs Interesting: NGS produces files hundreds of gigabytes
days to run. We have thought of ways to maintain
and processing costs are in fact growing. In-house in size; encrypting/decrypting them is slow and CPU-
Problem: Sequencing gets cheaper per genome, security even for such cases.
intensive, while bioinformatics tools can take hours or
producing at cluster World:pertake large capitalCloud Computing Track
storage and solutions dollar, but data
more gigabases 3:40pm Apr 26 instorage
Speaking Bio-IT operating costs.
expenditures and big in fact growing. In-house days Using Public Data with Proprietary Kapushesky, CEO
Misha Data Cost-
3. to run. We have thought of ways to maintain
and processing costs are
security even for such cases.
effectively misha@genestack.com
storage and cluster solutions take large capital your data
Solution: We offer a scalable way part in our
Launch Q3 2012. Want to take to manage early access programme? Data with Proprietary Data Cost-
expenditures processing costs on our cloud-based
Twitter @genestackltd
3. Problem: To use data from 1000 Genomes, GEO, Ensembl
Using Public
storage and and big operating costs.