SlideShare uma empresa Scribd logo
1 de 24
Harnessing cloud computing
for high capacity analysis of
neuroimaging data
Cameron Craddock, PhD
Computational Neuroimaging Lab
Center for Biomedical Imaging and Neuromodulation
Nathan S. Kline Institute for Psychiatric Research
Center for the Developing Brain
Child Mind Institute
Discovery science in Psychiatric Neuroimaging
1. Characterizing inter-individual variation in connectomes (Kelly et al.
2012)
2. Identifying biomarkers of disease state, severity, and prognosis
(Craddock 2009)
3. Re-defining mental health in terms of neurophenotypes, e.g. RDOC
(Castellanos 2013)
Data is often shared only in its raw form – must be preprocessed to remove
nuisance variation and to be made comparable across individuals and sites.
Connectomics is Big Data
Configurable Pipeline for the Analysis of
Connectomes (CPAC)
• Pipeline to automate preprocessing and analysis
of large-scale datasets
• Most cutting edge functional connectivity
preprocessing and analysis algorithms
• Configurable to enable “plurality” – evaluate
different processing parameters and strategies
• Automatically identifies and takes advantage of
parallelism on multi-threaded, multi-core, and
cluster architectures
• “Warm restarts” – only re-compute what has
changed
• Open science – open source
• http://fcp-indi.github.io
Nypipe
Computing in the Amazon Cloud
• No hardware capital cost
• No hardware maintenance
• No software installation or
configuration*
• Resources scale to meet
need for no overhead
• Available everywhere and
to everybody
• Allows access to exotic
architectures, such as GPUs
*If appropriate AMI is available
Amazon EC2 - Instance
• The hardware on which your processing will
run:
Instance Pricing
• On-demand Pricing
– Always available, fixed
price, non-interruptible,
most stable
• Spot instances
– Market to sell otherwise
unused time, variable
price, interruptible
Spot Instances
• Prices fluctuate over
time
• If price exceeds the max
you are willing to pay,
your instances are
terminated
Storage
• S3 – Simple Storage Service
– Secure and stable storage with a web service interface, pay for what you use
– Big and slow, $0.03 per GB/Month
– Can be accessed from anywhere
• EBS – Elastic Block Storage
– Provisioned storage (SSD HD) directly connected to instance, pay for what you provision
– Fast and expensive, $0.10 per GB/Month
– Persistent and transferrable
• Instance Storage
– SSD storage provided with some instances, included in instance price
– Fast and free
– Non-persistent and non-transferrable – good for cache
Amazon EC2 - Instance
• The hardware on which your processing will
run:
Data Transfer
• In general, free in - pay out
– Out to other Amazon service such as S3, EBS, etc
is free
– Out to Internet is $0.09 per GB (becomes slightly
cheaper after 10TB or so)
Amazon Machine Images
• Virtual machines that provide the software
environment for your processing
• You can build your own, or use one
maintained by others
StarCluster
• Star cluster simplifies the process of building a
Sun Grid Engine based cluster in EC2
– Dynamically add and remove compute nodes
– Uses spot instances
– Provides scripts for performing many
administrative tasks
C-PAC Amazon Machine Image
Nypipe
Proof of concept
• Preprocessed 1,112 datasets from
ABIDE with C-PAC
– 4 different preprocessing strategies
(+/- temporal filter, +/- global signal
regression)
– 24 derivatives:
• ReHo, ALFF, fALFF, 10 RSNs, VMHC, binary
degree centrality, weighted degree
centrality, lFCD, time courses for 5 atlases
(AAL, TT, EZ, HO, CC200, CC400)
http://preprocessed-connectomes-project.github.io/abide
• Requires 45 minute to process 1 dataset
• 3 datasets can be processed in parallel
• Processing results in .5GB of data
Model Parameters
Cloud vs. Traditional Computing
0
5000
10000
15000
100
2000
5000
10000
15000
20000
25000
30000
35000
40000
45000
50000
Number of Datasets
Cost($)
Instance Cost Storage Cost Transfer Cost
0
4000
8000
12000
100
2000
5000
10000
15000
20000
25000
30000
35000
40000
45000
50000
Number of Datasets
Time(hours)
No Download Total Processing Time
Impact of Spot Instances
Simulations using past 90 days of spot price history
Other Pipelines
What about HIPAA?
• Amazon AWS meets FedRAMP and NIST 800-53
standards, which are more rigorous than HIPAA
– Access to instances controlled using 256-bit AES
– Default firewalls deny all outside access
– EC2, EBS, and S3 storage are compatible with encryption
• AWS HIPAA whitepaper
–
http://d0.awsstatic.com/whitepapers/compliance/AWS_HI
PAA_Compliance_Whitepaper.pdf
C-PAC Amazon Machine Image
Nypipe
Preprocessed INDI Data in the Cloud
http://preprocessed-connectomes-project.github.io/
• Available through S3
Bucket generously
provided by AWS
• Raw INDI will be available
soon
- HCP Data available in the cloud:
- https://wiki.humanconnectome.org/display/PublicData/Home
- Receive $100 AWS Credits at the HCP workshop in Hawaii
- http://humanconnectome.org/course-registration/2015/exploring-the-human-
connectome.php
Acknowledgements
CPAC Team: Daniel Clark, Steven Giavasis and Michael Milham.
NDAR “Cloud Team”: Christian Haselgrove, Dave Kennedy, and Jack van
Horn.
NDAR Team: Dan Hall, Brian Koser, David Obenshain, Svetlana Novikova,
and Malcom Jackson.
CPAC-NDAR integration was funded by a contract from NDAR.
ABIDE Preprocessed data is hosted in a Public S3 Bucket provided
by AWS.

Mais conteúdo relacionado

Mais procurados

Neuroanatomy of language functions
Neuroanatomy of language functionsNeuroanatomy of language functions
Neuroanatomy of language functionsHemaraja Nayaka S
 
Cerebellopontine Angle Tumors
Cerebellopontine Angle Tumors Cerebellopontine Angle Tumors
Cerebellopontine Angle Tumors Ade Wijaya
 
Approaches to Brainstem
Approaches to BrainstemApproaches to Brainstem
Approaches to Brainstemrushabhshah295
 
Meningoencephalocele
MeningoencephaloceleMeningoencephalocele
MeningoencephaloceleAnneSaputra
 
SPEECH PERCEPTION THEORIES MASLP
SPEECH PERCEPTION THEORIES MASLPSPEECH PERCEPTION THEORIES MASLP
SPEECH PERCEPTION THEORIES MASLPHimaniBansal15
 
Neural correlates of working memory
Neural correlates of working memoryNeural correlates of working memory
Neural correlates of working memoryOzella Brundidge
 
Presentation1.pptx, diffusion tensor imaging of white matter tract in cerebra...
Presentation1.pptx, diffusion tensor imaging of white matter tract in cerebra...Presentation1.pptx, diffusion tensor imaging of white matter tract in cerebra...
Presentation1.pptx, diffusion tensor imaging of white matter tract in cerebra...Abdellah Nazeer
 
Frontal Lobe Syndrome
Frontal Lobe SyndromeFrontal Lobe Syndrome
Frontal Lobe SyndromeShah Parind
 
Neuroimaging in psychiatry
Neuroimaging in psychiatryNeuroimaging in psychiatry
Neuroimaging in psychiatrySantanu Ghosh
 
The cerebello pontine angle
The cerebello pontine angleThe cerebello pontine angle
The cerebello pontine angleDr Himanshu Soni
 
Spontaneous intracranial hypotension (sih)
Spontaneous intracranial hypotension (sih)Spontaneous intracranial hypotension (sih)
Spontaneous intracranial hypotension (sih)hazem youssef
 
Holmes Tremor 0218
Holmes Tremor  0218Holmes Tremor  0218
Holmes Tremor 0218Ade Wijaya
 
Multiple sclerosis: Revised diagnostic criteria
Multiple sclerosis: Revised diagnostic criteria Multiple sclerosis: Revised diagnostic criteria
Multiple sclerosis: Revised diagnostic criteria Sudhir Kumar
 
Neurointerventional Therapy for Brain Aneurysms and Acute Stroke
Neurointerventional Therapy for Brain Aneurysms and Acute Stroke Neurointerventional Therapy for Brain Aneurysms and Acute Stroke
Neurointerventional Therapy for Brain Aneurysms and Acute Stroke Allina Health
 

Mais procurados (20)

Neuroanatomy of language functions
Neuroanatomy of language functionsNeuroanatomy of language functions
Neuroanatomy of language functions
 
Cerebellopontine Angle Tumors
Cerebellopontine Angle Tumors Cerebellopontine Angle Tumors
Cerebellopontine Angle Tumors
 
Stereotaxy Brain
Stereotaxy BrainStereotaxy Brain
Stereotaxy Brain
 
Approaches to Brainstem
Approaches to BrainstemApproaches to Brainstem
Approaches to Brainstem
 
Meningoencephalocele
MeningoencephaloceleMeningoencephalocele
Meningoencephalocele
 
SPEECH PERCEPTION THEORIES MASLP
SPEECH PERCEPTION THEORIES MASLPSPEECH PERCEPTION THEORIES MASLP
SPEECH PERCEPTION THEORIES MASLP
 
Neural correlates of working memory
Neural correlates of working memoryNeural correlates of working memory
Neural correlates of working memory
 
Presentation1.pptx, diffusion tensor imaging of white matter tract in cerebra...
Presentation1.pptx, diffusion tensor imaging of white matter tract in cerebra...Presentation1.pptx, diffusion tensor imaging of white matter tract in cerebra...
Presentation1.pptx, diffusion tensor imaging of white matter tract in cerebra...
 
Frontal Lobe Syndrome
Frontal Lobe SyndromeFrontal Lobe Syndrome
Frontal Lobe Syndrome
 
Forgotten reflex testing
Forgotten reflex testingForgotten reflex testing
Forgotten reflex testing
 
Epilepsy surgery
Epilepsy surgeryEpilepsy surgery
Epilepsy surgery
 
Neuroimaging in psychiatry
Neuroimaging in psychiatryNeuroimaging in psychiatry
Neuroimaging in psychiatry
 
Case record...Postinfectious cerebellitis
Case record...Postinfectious cerebellitisCase record...Postinfectious cerebellitis
Case record...Postinfectious cerebellitis
 
The cerebello pontine angle
The cerebello pontine angleThe cerebello pontine angle
The cerebello pontine angle
 
Spontaneous intracranial hypotension (sih)
Spontaneous intracranial hypotension (sih)Spontaneous intracranial hypotension (sih)
Spontaneous intracranial hypotension (sih)
 
fNIRS data analysis
fNIRS data analysisfNIRS data analysis
fNIRS data analysis
 
Language
LanguageLanguage
Language
 
Holmes Tremor 0218
Holmes Tremor  0218Holmes Tremor  0218
Holmes Tremor 0218
 
Multiple sclerosis: Revised diagnostic criteria
Multiple sclerosis: Revised diagnostic criteria Multiple sclerosis: Revised diagnostic criteria
Multiple sclerosis: Revised diagnostic criteria
 
Neurointerventional Therapy for Brain Aneurysms and Acute Stroke
Neurointerventional Therapy for Brain Aneurysms and Acute Stroke Neurointerventional Therapy for Brain Aneurysms and Acute Stroke
Neurointerventional Therapy for Brain Aneurysms and Acute Stroke
 

Destaque

PAK CHINA ECONOMIC CORRIDOR
PAK CHINA ECONOMIC CORRIDOR PAK CHINA ECONOMIC CORRIDOR
PAK CHINA ECONOMIC CORRIDOR Tanweer Sudhan
 
The Importance of open source in cloud computing
The Importance of open source in cloud computingThe Importance of open source in cloud computing
The Importance of open source in cloud computingKrishnan Subramanian
 
Cloud, Security and opensource 2012-12-28 at SSU
Cloud, Security and opensource 2012-12-28 at SSUCloud, Security and opensource 2012-12-28 at SSU
Cloud, Security and opensource 2012-12-28 at SSULINE株式会社
 
Green house effect By Allah Dad Khan Peshawar
Green house effect  By Allah Dad Khan PeshawarGreen house effect  By Allah Dad Khan Peshawar
Green house effect By Allah Dad Khan PeshawarMr.Allah Dad Khan
 
Drugs Among Youths
Drugs Among YouthsDrugs Among Youths
Drugs Among YouthsJohann Lo
 
Cloud computing
Cloud computingCloud computing
Cloud computingStudent
 
Save earth from pollution
Save earth from pollutionSave earth from pollution
Save earth from pollutionShiggi
 
Global warming
Global warmingGlobal warming
Global warmingna0612
 
Cloud- A Technical or Organisational Challenge? Or Both?
Cloud- A Technical or Organisational Challenge? Or Both?Cloud- A Technical or Organisational Challenge? Or Both?
Cloud- A Technical or Organisational Challenge? Or Both?Justin Pirie
 
Life of Youth and Drug Addiction
Life of Youth and Drug AddictionLife of Youth and Drug Addiction
Life of Youth and Drug Addictionraj kishor
 
Side effects of different drugs
Side effects of different drugsSide effects of different drugs
Side effects of different drugsfareeha Awan
 
Pak china economic corridor
Pak china economic corridorPak china economic corridor
Pak china economic corridorshamroz kakar
 
Green house effect-(a two minutes presentation)
Green house effect-(a two minutes presentation)Green house effect-(a two minutes presentation)
Green house effect-(a two minutes presentation)Shardindu Kant
 
presentation on Greenhouse effect & climate change
presentation on Greenhouse effect & climate changepresentation on Greenhouse effect & climate change
presentation on Greenhouse effect & climate changeTasif Ahmed
 
DRUG ADDICTION
DRUG ADDICTIONDRUG ADDICTION
DRUG ADDICTIONRia Gupta
 
Cpac presentation
Cpac presentationCpac presentation
Cpac presentationAbid Aqeel
 
Prediction Analysis in Clinical and Basic Neuroscience
Prediction Analysis in Clinical and Basic NeurosciencePrediction Analysis in Clinical and Basic Neuroscience
Prediction Analysis in Clinical and Basic NeuroscienceCameron Craddock
 

Destaque (20)

PAK CHINA ECONOMIC CORRIDOR
PAK CHINA ECONOMIC CORRIDOR PAK CHINA ECONOMIC CORRIDOR
PAK CHINA ECONOMIC CORRIDOR
 
The Importance of open source in cloud computing
The Importance of open source in cloud computingThe Importance of open source in cloud computing
The Importance of open source in cloud computing
 
Cloud, Security and opensource 2012-12-28 at SSU
Cloud, Security and opensource 2012-12-28 at SSUCloud, Security and opensource 2012-12-28 at SSU
Cloud, Security and opensource 2012-12-28 at SSU
 
Green house effect By Allah Dad Khan Peshawar
Green house effect  By Allah Dad Khan PeshawarGreen house effect  By Allah Dad Khan Peshawar
Green house effect By Allah Dad Khan Peshawar
 
Drugs Among Youths
Drugs Among YouthsDrugs Among Youths
Drugs Among Youths
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
Save earth from pollution
Save earth from pollutionSave earth from pollution
Save earth from pollution
 
Global warming
Global warmingGlobal warming
Global warming
 
Cloud- A Technical or Organisational Challenge? Or Both?
Cloud- A Technical or Organisational Challenge? Or Both?Cloud- A Technical or Organisational Challenge? Or Both?
Cloud- A Technical or Organisational Challenge? Or Both?
 
A tutorial in Connectome Analysis (1) - Marcus Kaiser
A tutorial in Connectome Analysis (1) - Marcus KaiserA tutorial in Connectome Analysis (1) - Marcus Kaiser
A tutorial in Connectome Analysis (1) - Marcus Kaiser
 
A tutorial in Connectome Analysis (2) - Marcus Kaiser
A tutorial in Connectome Analysis (2) - Marcus KaiserA tutorial in Connectome Analysis (2) - Marcus Kaiser
A tutorial in Connectome Analysis (2) - Marcus Kaiser
 
Life of Youth and Drug Addiction
Life of Youth and Drug AddictionLife of Youth and Drug Addiction
Life of Youth and Drug Addiction
 
Side effects of different drugs
Side effects of different drugsSide effects of different drugs
Side effects of different drugs
 
Cloud ppt
Cloud pptCloud ppt
Cloud ppt
 
Pak china economic corridor
Pak china economic corridorPak china economic corridor
Pak china economic corridor
 
Green house effect-(a two minutes presentation)
Green house effect-(a two minutes presentation)Green house effect-(a two minutes presentation)
Green house effect-(a two minutes presentation)
 
presentation on Greenhouse effect & climate change
presentation on Greenhouse effect & climate changepresentation on Greenhouse effect & climate change
presentation on Greenhouse effect & climate change
 
DRUG ADDICTION
DRUG ADDICTIONDRUG ADDICTION
DRUG ADDICTION
 
Cpac presentation
Cpac presentationCpac presentation
Cpac presentation
 
Prediction Analysis in Clinical and Basic Neuroscience
Prediction Analysis in Clinical and Basic NeurosciencePrediction Analysis in Clinical and Basic Neuroscience
Prediction Analysis in Clinical and Basic Neuroscience
 

Semelhante a CPAC Connectome Analysis in the Cloud

Intro to High Performance Computing in the AWS Cloud
Intro to High Performance Computing in the AWS CloudIntro to High Performance Computing in the AWS Cloud
Intro to High Performance Computing in the AWS CloudAmazon Web Services
 
AWS Webcast - An Introduction to High Performance Computing on AWS
AWS Webcast - An Introduction to High Performance Computing on AWSAWS Webcast - An Introduction to High Performance Computing on AWS
AWS Webcast - An Introduction to High Performance Computing on AWSAmazon Web Services
 
Estimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformEstimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformDATAVERSITY
 
High Performance Computing with AWS
High Performance Computing with AWSHigh Performance Computing with AWS
High Performance Computing with AWSAmazon Web Services
 
Data Automation at Light Sources
Data Automation at Light SourcesData Automation at Light Sources
Data Automation at Light SourcesIan Foster
 
Finding New Sub-Atomic Particles on the AWS Cloud (BDT402) | AWS re:Invent 2013
Finding New Sub-Atomic Particles on the AWS Cloud (BDT402) | AWS re:Invent 2013Finding New Sub-Atomic Particles on the AWS Cloud (BDT402) | AWS re:Invent 2013
Finding New Sub-Atomic Particles on the AWS Cloud (BDT402) | AWS re:Invent 2013Amazon Web Services
 
What's inside the black box? Using ML to tune and manage Kafka. (Matthew Stum...
What's inside the black box? Using ML to tune and manage Kafka. (Matthew Stum...What's inside the black box? Using ML to tune and manage Kafka. (Matthew Stum...
What's inside the black box? Using ML to tune and manage Kafka. (Matthew Stum...confluent
 
CLIMB System Introduction Talk - CLIMB Launch
CLIMB System Introduction Talk - CLIMB LaunchCLIMB System Introduction Talk - CLIMB Launch
CLIMB System Introduction Talk - CLIMB LaunchTom Connor
 
How to run your Hadoop Cluster in 10 minutes
How to run your Hadoop Cluster in 10 minutesHow to run your Hadoop Cluster in 10 minutes
How to run your Hadoop Cluster in 10 minutesVladimir Simek
 
Challenges and Opportunities of Big Data Genomics
Challenges and Opportunities of Big Data GenomicsChallenges and Opportunities of Big Data Genomics
Challenges and Opportunities of Big Data GenomicsYasin Memari
 
2015 04 bio it world
2015 04 bio it world2015 04 bio it world
2015 04 bio it worldChris Dwan
 
Scalable analytics for iaas cloud availability
Scalable analytics for iaas cloud availabilityScalable analytics for iaas cloud availability
Scalable analytics for iaas cloud availabilityPapitha Velumani
 
Simulation of Heterogeneous Cloud Infrastructures
Simulation of Heterogeneous Cloud InfrastructuresSimulation of Heterogeneous Cloud Infrastructures
Simulation of Heterogeneous Cloud InfrastructuresCloudLightning
 
Big data at experimental facilities
Big data at experimental facilitiesBig data at experimental facilities
Big data at experimental facilitiesIan Foster
 
Deliver Best-in-Class HPC Cloud Solutions Without Losing Your Mind
Deliver Best-in-Class HPC Cloud Solutions Without Losing Your MindDeliver Best-in-Class HPC Cloud Solutions Without Losing Your Mind
Deliver Best-in-Class HPC Cloud Solutions Without Losing Your MindAvere Systems
 
Don't Be Scared. Data Don't Bite. Introduction to Big Data.
Don't Be Scared. Data Don't Bite. Introduction to Big Data.Don't Be Scared. Data Don't Bite. Introduction to Big Data.
Don't Be Scared. Data Don't Bite. Introduction to Big Data.KGMGROUP
 
GPU cloud with Job scheduler and Container
GPU cloud with Job scheduler and ContainerGPU cloud with Job scheduler and Container
GPU cloud with Job scheduler and ContainerAndrew Yongjoon Kong
 
Science cloud foster june 2013
Science cloud foster june 2013Science cloud foster june 2013
Science cloud foster june 2013Kirill Osipov
 
Science as a Service: How On-Demand Computing can Accelerate Discovery
Science as a Service: How On-Demand Computing can Accelerate DiscoveryScience as a Service: How On-Demand Computing can Accelerate Discovery
Science as a Service: How On-Demand Computing can Accelerate DiscoveryIan Foster
 

Semelhante a CPAC Connectome Analysis in the Cloud (20)

Intro to High Performance Computing in the AWS Cloud
Intro to High Performance Computing in the AWS CloudIntro to High Performance Computing in the AWS Cloud
Intro to High Performance Computing in the AWS Cloud
 
AWS Webcast - An Introduction to High Performance Computing on AWS
AWS Webcast - An Introduction to High Performance Computing on AWSAWS Webcast - An Introduction to High Performance Computing on AWS
AWS Webcast - An Introduction to High Performance Computing on AWS
 
Estimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformEstimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics Platform
 
High Performance Computing with AWS
High Performance Computing with AWSHigh Performance Computing with AWS
High Performance Computing with AWS
 
Data Automation at Light Sources
Data Automation at Light SourcesData Automation at Light Sources
Data Automation at Light Sources
 
Finding New Sub-Atomic Particles on the AWS Cloud (BDT402) | AWS re:Invent 2013
Finding New Sub-Atomic Particles on the AWS Cloud (BDT402) | AWS re:Invent 2013Finding New Sub-Atomic Particles on the AWS Cloud (BDT402) | AWS re:Invent 2013
Finding New Sub-Atomic Particles on the AWS Cloud (BDT402) | AWS re:Invent 2013
 
What's inside the black box? Using ML to tune and manage Kafka. (Matthew Stum...
What's inside the black box? Using ML to tune and manage Kafka. (Matthew Stum...What's inside the black box? Using ML to tune and manage Kafka. (Matthew Stum...
What's inside the black box? Using ML to tune and manage Kafka. (Matthew Stum...
 
CLIMB System Introduction Talk - CLIMB Launch
CLIMB System Introduction Talk - CLIMB LaunchCLIMB System Introduction Talk - CLIMB Launch
CLIMB System Introduction Talk - CLIMB Launch
 
How to run your Hadoop Cluster in 10 minutes
How to run your Hadoop Cluster in 10 minutesHow to run your Hadoop Cluster in 10 minutes
How to run your Hadoop Cluster in 10 minutes
 
Challenges and Opportunities of Big Data Genomics
Challenges and Opportunities of Big Data GenomicsChallenges and Opportunities of Big Data Genomics
Challenges and Opportunities of Big Data Genomics
 
2015 04 bio it world
2015 04 bio it world2015 04 bio it world
2015 04 bio it world
 
Brad stack - Digital Health and Well-Being Festival
Brad stack - Digital Health and Well-Being Festival Brad stack - Digital Health and Well-Being Festival
Brad stack - Digital Health and Well-Being Festival
 
Scalable analytics for iaas cloud availability
Scalable analytics for iaas cloud availabilityScalable analytics for iaas cloud availability
Scalable analytics for iaas cloud availability
 
Simulation of Heterogeneous Cloud Infrastructures
Simulation of Heterogeneous Cloud InfrastructuresSimulation of Heterogeneous Cloud Infrastructures
Simulation of Heterogeneous Cloud Infrastructures
 
Big data at experimental facilities
Big data at experimental facilitiesBig data at experimental facilities
Big data at experimental facilities
 
Deliver Best-in-Class HPC Cloud Solutions Without Losing Your Mind
Deliver Best-in-Class HPC Cloud Solutions Without Losing Your MindDeliver Best-in-Class HPC Cloud Solutions Without Losing Your Mind
Deliver Best-in-Class HPC Cloud Solutions Without Losing Your Mind
 
Don't Be Scared. Data Don't Bite. Introduction to Big Data.
Don't Be Scared. Data Don't Bite. Introduction to Big Data.Don't Be Scared. Data Don't Bite. Introduction to Big Data.
Don't Be Scared. Data Don't Bite. Introduction to Big Data.
 
GPU cloud with Job scheduler and Container
GPU cloud with Job scheduler and ContainerGPU cloud with Job scheduler and Container
GPU cloud with Job scheduler and Container
 
Science cloud foster june 2013
Science cloud foster june 2013Science cloud foster june 2013
Science cloud foster june 2013
 
Science as a Service: How On-Demand Computing can Accelerate Discovery
Science as a Service: How On-Demand Computing can Accelerate DiscoveryScience as a Service: How On-Demand Computing can Accelerate Discovery
Science as a Service: How On-Demand Computing can Accelerate Discovery
 

Mais de Cameron Craddock

Genetics influence inter-subject Brain State Prediction.
Genetics influence inter-subject Brain State Prediction.Genetics influence inter-subject Brain State Prediction.
Genetics influence inter-subject Brain State Prediction.Cameron Craddock
 
Introduction to resting state fMRI preprocessing and analysis
Introduction to resting state fMRI preprocessing and analysisIntroduction to resting state fMRI preprocessing and analysis
Introduction to resting state fMRI preprocessing and analysisCameron Craddock
 
Open repositories for neuroimaging research
Open repositories for neuroimaging researchOpen repositories for neuroimaging research
Open repositories for neuroimaging researchCameron Craddock
 
Using RealTime fMRI Based Neurofeedback To Probe Default Network Regulation
Using RealTime fMRI Based Neurofeedback To Probe Default Network RegulationUsing RealTime fMRI Based Neurofeedback To Probe Default Network Regulation
Using RealTime fMRI Based Neurofeedback To Probe Default Network RegulationCameron Craddock
 
Using RealTime fMRI Based Neurofeedback to Probe Default Network Regulation
Using RealTime fMRI Based Neurofeedback to Probe Default Network RegulationUsing RealTime fMRI Based Neurofeedback to Probe Default Network Regulation
Using RealTime fMRI Based Neurofeedback to Probe Default Network RegulationCameron Craddock
 
Tracking Dynamic Networks in Real Time
Tracking Dynamic Networks in Real TimeTracking Dynamic Networks in Real Time
Tracking Dynamic Networks in Real TimeCameron Craddock
 
PCP Quality Assessment Protocol
PCP Quality Assessment ProtocolPCP Quality Assessment Protocol
PCP Quality Assessment ProtocolCameron Craddock
 

Mais de Cameron Craddock (8)

Genetics influence inter-subject Brain State Prediction.
Genetics influence inter-subject Brain State Prediction.Genetics influence inter-subject Brain State Prediction.
Genetics influence inter-subject Brain State Prediction.
 
Introduction to resting state fMRI preprocessing and analysis
Introduction to resting state fMRI preprocessing and analysisIntroduction to resting state fMRI preprocessing and analysis
Introduction to resting state fMRI preprocessing and analysis
 
Open repositories for neuroimaging research
Open repositories for neuroimaging researchOpen repositories for neuroimaging research
Open repositories for neuroimaging research
 
Using RealTime fMRI Based Neurofeedback To Probe Default Network Regulation
Using RealTime fMRI Based Neurofeedback To Probe Default Network RegulationUsing RealTime fMRI Based Neurofeedback To Probe Default Network Regulation
Using RealTime fMRI Based Neurofeedback To Probe Default Network Regulation
 
Head Motion in fMRI
Head Motion in fMRIHead Motion in fMRI
Head Motion in fMRI
 
Using RealTime fMRI Based Neurofeedback to Probe Default Network Regulation
Using RealTime fMRI Based Neurofeedback to Probe Default Network RegulationUsing RealTime fMRI Based Neurofeedback to Probe Default Network Regulation
Using RealTime fMRI Based Neurofeedback to Probe Default Network Regulation
 
Tracking Dynamic Networks in Real Time
Tracking Dynamic Networks in Real TimeTracking Dynamic Networks in Real Time
Tracking Dynamic Networks in Real Time
 
PCP Quality Assessment Protocol
PCP Quality Assessment ProtocolPCP Quality Assessment Protocol
PCP Quality Assessment Protocol
 

Último

chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringmulugeta48
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)simmis5
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...tanu pandey
 
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...SUHANI PANDEY
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptDineshKumar4165
 
Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...Christo Ananth
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...ranjana rawat
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VDineshKumar4165
 
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLPVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLManishPatel169454
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01KreezheaRecto
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . pptDineshKumar4165
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueBhangaleSonal
 

Último (20)

chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineering
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
 
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLPVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
 
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
NFPA 5000 2024 standard .
NFPA 5000 2024 standard                                  .NFPA 5000 2024 standard                                  .
NFPA 5000 2024 standard .
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 

CPAC Connectome Analysis in the Cloud

  • 1. Harnessing cloud computing for high capacity analysis of neuroimaging data Cameron Craddock, PhD Computational Neuroimaging Lab Center for Biomedical Imaging and Neuromodulation Nathan S. Kline Institute for Psychiatric Research Center for the Developing Brain Child Mind Institute
  • 2. Discovery science in Psychiatric Neuroimaging 1. Characterizing inter-individual variation in connectomes (Kelly et al. 2012) 2. Identifying biomarkers of disease state, severity, and prognosis (Craddock 2009) 3. Re-defining mental health in terms of neurophenotypes, e.g. RDOC (Castellanos 2013) Data is often shared only in its raw form – must be preprocessed to remove nuisance variation and to be made comparable across individuals and sites.
  • 4. Configurable Pipeline for the Analysis of Connectomes (CPAC) • Pipeline to automate preprocessing and analysis of large-scale datasets • Most cutting edge functional connectivity preprocessing and analysis algorithms • Configurable to enable “plurality” – evaluate different processing parameters and strategies • Automatically identifies and takes advantage of parallelism on multi-threaded, multi-core, and cluster architectures • “Warm restarts” – only re-compute what has changed • Open science – open source • http://fcp-indi.github.io Nypipe
  • 5. Computing in the Amazon Cloud • No hardware capital cost • No hardware maintenance • No software installation or configuration* • Resources scale to meet need for no overhead • Available everywhere and to everybody • Allows access to exotic architectures, such as GPUs *If appropriate AMI is available
  • 6. Amazon EC2 - Instance • The hardware on which your processing will run:
  • 7. Instance Pricing • On-demand Pricing – Always available, fixed price, non-interruptible, most stable • Spot instances – Market to sell otherwise unused time, variable price, interruptible
  • 8. Spot Instances • Prices fluctuate over time • If price exceeds the max you are willing to pay, your instances are terminated
  • 9. Storage • S3 – Simple Storage Service – Secure and stable storage with a web service interface, pay for what you use – Big and slow, $0.03 per GB/Month – Can be accessed from anywhere • EBS – Elastic Block Storage – Provisioned storage (SSD HD) directly connected to instance, pay for what you provision – Fast and expensive, $0.10 per GB/Month – Persistent and transferrable • Instance Storage – SSD storage provided with some instances, included in instance price – Fast and free – Non-persistent and non-transferrable – good for cache
  • 10. Amazon EC2 - Instance • The hardware on which your processing will run:
  • 11. Data Transfer • In general, free in - pay out – Out to other Amazon service such as S3, EBS, etc is free – Out to Internet is $0.09 per GB (becomes slightly cheaper after 10TB or so)
  • 12. Amazon Machine Images • Virtual machines that provide the software environment for your processing • You can build your own, or use one maintained by others
  • 13. StarCluster • Star cluster simplifies the process of building a Sun Grid Engine based cluster in EC2 – Dynamically add and remove compute nodes – Uses spot instances – Provides scripts for performing many administrative tasks
  • 14. C-PAC Amazon Machine Image Nypipe
  • 15. Proof of concept • Preprocessed 1,112 datasets from ABIDE with C-PAC – 4 different preprocessing strategies (+/- temporal filter, +/- global signal regression) – 24 derivatives: • ReHo, ALFF, fALFF, 10 RSNs, VMHC, binary degree centrality, weighted degree centrality, lFCD, time courses for 5 atlases (AAL, TT, EZ, HO, CC200, CC400) http://preprocessed-connectomes-project.github.io/abide
  • 16. • Requires 45 minute to process 1 dataset • 3 datasets can be processed in parallel • Processing results in .5GB of data Model Parameters
  • 17. Cloud vs. Traditional Computing 0 5000 10000 15000 100 2000 5000 10000 15000 20000 25000 30000 35000 40000 45000 50000 Number of Datasets Cost($) Instance Cost Storage Cost Transfer Cost 0 4000 8000 12000 100 2000 5000 10000 15000 20000 25000 30000 35000 40000 45000 50000 Number of Datasets Time(hours) No Download Total Processing Time
  • 18. Impact of Spot Instances Simulations using past 90 days of spot price history
  • 20. What about HIPAA? • Amazon AWS meets FedRAMP and NIST 800-53 standards, which are more rigorous than HIPAA – Access to instances controlled using 256-bit AES – Default firewalls deny all outside access – EC2, EBS, and S3 storage are compatible with encryption • AWS HIPAA whitepaper – http://d0.awsstatic.com/whitepapers/compliance/AWS_HI PAA_Compliance_Whitepaper.pdf
  • 21. C-PAC Amazon Machine Image Nypipe
  • 22. Preprocessed INDI Data in the Cloud http://preprocessed-connectomes-project.github.io/ • Available through S3 Bucket generously provided by AWS • Raw INDI will be available soon
  • 23. - HCP Data available in the cloud: - https://wiki.humanconnectome.org/display/PublicData/Home - Receive $100 AWS Credits at the HCP workshop in Hawaii - http://humanconnectome.org/course-registration/2015/exploring-the-human- connectome.php
  • 24. Acknowledgements CPAC Team: Daniel Clark, Steven Giavasis and Michael Milham. NDAR “Cloud Team”: Christian Haselgrove, Dave Kennedy, and Jack van Horn. NDAR Team: Dan Hall, Brian Koser, David Obenshain, Svetlana Novikova, and Malcom Jackson. CPAC-NDAR integration was funded by a contract from NDAR. ABIDE Preprocessed data is hosted in a Public S3 Bucket provided by AWS.

Notas do Editor

  1. The goal of large-scale analyses of connectomes data are to map inter-individual variation in phenotype, such as sex, age, IQ, etc, to variation in the connectome. For clinical populations, we are particularly concerned with identifying connectome based biomarkers of disease presence, its severity, and prognosis, specifically treatment outcomes. Recently, there has been consternation about the ecological validity of psychiatric disease classifications that are based on syndromes that are described by the presence of symptoms. This provides the opportunity to redefine psychiatric populations based on the similarity of connectomes, i.e. clustering individuals based on their connectivity profiles.
  2. Green boxes indicate initiatives in which data is aggregated after it is acquired, rather than centralized initiatives in which data acquisition was coordinated between sites. Since the data collection is not coordinated for these sites, the data is more heterogeneous, being collected with different parameters.
  3. The ultimate goal of CPAC is to make high-throughput state-of-the-art connectomes analyses accessible to researchers who lack programming and/or neuroimaging expertise. It is currently still in alpha, with the expectation of being beta by mid 2015. It is currently limited to functional connectivity analyses, but we plan to add DTI by the end of 2015.
  4. Several different methods have been proposed for preprocessing connectomes data to remove nuisance variation that obscures biological variation. Some of these methods have been shown to introduce artifacts that bias results. Rather than an single best practice, a pluralistic approach is needed, in which several different procedures are performed and the results are compared to identify those that are robust across strategies.
  5. In addition to a large number of preprocessing strategies, several different methods of analyses have been proposed such as: (left to right) (top row) eigenvector and degree centrality, voxel mirrored homotopic connecitivity, fractional amplitude of low frequency fluctuations, bootstrap analysis of stable clusters, (bottom) regional homogeniety, amplitude of low frequency fluctuations, and multi-dimensional matrix regression.
  6. Preprocessing large datasets using current tools requires a substantial amount of time, even if automated using scripts. Performing multiple preprocessing strategies is a multiplier for execution time, and different analysis adds to computation time. What is needed are tools that can not only automate the preprocessing, but also take advantage of parallelism inherent in the data and algorithms, to achieve high-throughput processing on high performance computing architectures. We are developing CPAC to this aim.
  7. The ultimate goal of CPAC is to make high-throughput state-of-the-art connectomes analyses accessible to researchers who lack programming and/or neuroimaging expertise. It is currently still in alpha, with the expectation of being beta by mid 2015. It is currently limited to functional connectivity analyses, but we plan to add DTI by the end of 2015.
  8. Most expensive on left is US-west, least expensive is us-east, second blue is SA-EAST-1c Longest is EU-east shortest is SA-east
  9. Instead of sharing just raw data, why not share the preprocessed data as well?