SlideShare uma empresa Scribd logo
1 de 20
High-throughput eScience mixing Grids and Clouds: an experience with the Nimrod tool family Presenter: Blair Bethwaite MonasheScience and Grid Engineering Lab
MeSsAGE Lab team: David Abramson Colin Enticott SlavisaGaric and others... Acknowledgements NeCTAR Research Cloud Workshop		     Blair Bethwaite - MeSsAGE Lab, Monash Uni
Agenda NeCTAR Research Cloud Workshop		     Blair Bethwaite - MeSsAGE Lab, Monash Uni
The Nimrod tool family NeCTAR Research Cloud Workshop		     Blair Bethwaite - MeSsAGE Lab, Monash Uni
Parametric computing with the Nimrod tools Vary parameters Execute programs Copy code/data in/out X, Y, Z could be: Basic data types; ints, floats, strings Files Random numbers to drive Monte Carlo modelling X Y Parameter Space Solution Space Z User Job EII Cloud Workshop - AWS Intro		     Blair Bethwaite - MeSsAGE Lab, Monash Uni
Nimrod Applications messagelab.monash.edu.au/EScienceApplications NeCTAR Research Cloud Workshop		     Blair Bethwaite - MeSsAGE Lab, Monash Uni
From Clusters, to Grids, to Clouds Jobs / Nimrod experiment Nimrod Actuator, e.g., SGE, PBS, LSF, Condor Local Batch System NeCTAR Research Cloud Workshop		     Blair Bethwaite - MeSsAGE Lab, Monash Uni
From Clusters, to Grids, to Clouds Jobs / Nimrod experiment Portal Nimrod-O/E/K Nimrod/G Actuator, e.g., Globus Servers Upper middleware Lower middleware Pilot jobs / agents Agents Grid Middleware Grid Middleware Grid Middleware Agents Grid Middleware NeCTAR Research Cloud Workshop		     Blair Bethwaite - MeSsAGE Lab, Monash Uni
From Clusters, to Grids, to Clouds NeCTAR Research Cloud Workshop		     Blair Bethwaite - MeSsAGE Lab, Monash Uni The Grid Global utility computing mk.1-(beta) Somewhere in-between Infrastructure and Platform as-a-Service For Nimrod Increased computational scale – massively parallel New scheduling and data challenges Computational economy proposed Problems Interoperability Barriers to entry
From Clusters, to Grids, to Clouds NeCTAR Research Cloud Workshop		     Blair Bethwaite - MeSsAGE Lab, Monash Uni
From Clusters, to Grids, to Clouds NeCTAR Research Cloud Workshop		     Blair Bethwaite - MeSsAGE Lab, Monash Uni Cloud opportunities for HTC Virtualisation helps interoperability and scalability Cloud bursting Scale-out to supplement locally and nationally available resources Test computational economy and scheduling, in anger Deadline driven Budget driven What’s missing? Grids provide services above IaaS E.g., you can build a grid on EC2 Grids provide job and data handling services, more like PaaS
From Clusters, to Grids, to Clouds  def process_queue(self):         """Prepare allocation of commands/agents to instances.         This might mean requesting new instances from the web service and/or         allocating available slots from existing instances.         ""“         if not self._queued_cmds and not self.proxy:             return False self._update_available_instances() req_slots = len(self._queued_cmds) new_slots = req_slots - self.free_slots num_insts = new_slots / self.slots_per_instance         # if we need the proxy we might have to force         # launching an instance to host it         if self.proxy and num_insts < 1 br />               and len(self.instances) < 1: num_insts = 1 rsv = None         ...         ...         if num_insts > 0:             try: rsv = self.ec2conn.run_instances(self.ami_id, min_count=1, max_count=num_insts, key_name=self.ws_label, security_groups=[self.secgroup.name], instance_type=self.ec2InstanceType)             except EC2ResponseError,e:                 if ec2.parse_response_error(e, 'Code') == br />u'InstanceLimitExceeded': self.at_instance_limit = True                     print "[%s] Instance limit exceeded" % self.label                 else:                     print "[%s] Error running instances:%s" % br />                        (self.label, t5exc.exception())                     raise         if rsv: self._pending_reservations.append(rsv)         ... NeCTAR Research Cloud Workshop		     Blair Bethwaite - MeSsAGE Lab, Monash Uni
Integrating with IaaS Jobs / Nimrod experiment Portal Nimrod-O/E/K Nimrod/G Actuator: Globus,... Services New actuators: EC2, IBM, Azure, OCCI?,...? RESTfulIaaS API Grid Middleware VM Agents Agents VM VM Agents Agents NeCTAR Research Cloud Workshop		     Blair Bethwaite - MeSsAGE Lab, Monash Uni
Integrating with IaaS Advantage: Nimrod is already a meta-scheduler Creates an ad-hoc grid dynamically overlaying the available resource pool Don’t need Grid-like job processing services to stand-up resource pool Requires explicit management of infrastructure Extra level of scheduling – when to initialise infrastructure? NeCTAR Research Cloud Workshop		     Blair Bethwaite - MeSsAGE Lab, Monash Uni
Integrating with IaaS 1 2 3 NeCTAR Research Cloud Workshop		     Blair Bethwaite - MeSsAGE Lab, Monash Uni
Application Examples A lot of existing grid based infrastructure So, mix it together “Mixing Grids and Clouds: High-Throughput Science Using the Nimrod Tool Family,” in Cloud Computing, vol. 0 (Springer London, 2010) Markov Chain Monte Carlo methods for recommender systems For better results, insert coins here... NeCTAR Research Cloud Workshop		     Blair Bethwaite - MeSsAGE Lab, Monash Uni
Application Examples Modelling ash dispersion – NG-TEPHRA IEEE e-Science 2010 Supplement local infrastructure for deadline sensitive analysis NeCTAR Research Cloud Workshop		     Blair Bethwaite - MeSsAGE Lab, Monash Uni
Work-in-progress What’s keeping me awake... Spot-price scheduling Smarter data handling Windows support On EC2 And integrating with Azure Rose NeCTAR Research Cloud Workshop		     Blair Bethwaite - MeSsAGE Lab, Monash Uni
Nimrod utilising NeCTAR RC Host MeSsAGE Lab tools Dev and test environment Excess capacity 		supporting HTC NeCTAR Research Cloud Workshop		     Blair Bethwaite - MeSsAGE Lab, Monash Uni
Thank you! Presentation by: Blair Bethwaite Researcher, Developer, SysAdmin Monash eScience and Grid Engineering Lab Feedback/queries: blair.bethwaite@monash.edu david.abramson@monash.edu NeCTAR Research Cloud Workshop		     Blair Bethwaite - MeSsAGE Lab, Monash Uni

Mais conteúdo relacionado

Mais procurados

Deep Recurrent Neural Networks for Sequence Learning in Spark by Yves Mabiala
Deep Recurrent Neural Networks for Sequence Learning in Spark by Yves MabialaDeep Recurrent Neural Networks for Sequence Learning in Spark by Yves Mabiala
Deep Recurrent Neural Networks for Sequence Learning in Spark by Yves Mabiala
Spark Summit
 
1 sx5g formation-solutions-cloud-avancees-avec-smartcloud-orchestrator-sur-la...
1 sx5g formation-solutions-cloud-avancees-avec-smartcloud-orchestrator-sur-la...1 sx5g formation-solutions-cloud-avancees-avec-smartcloud-orchestrator-sur-la...
1 sx5g formation-solutions-cloud-avancees-avec-smartcloud-orchestrator-sur-la...
CERTyou Formation
 
CloudClustering: Toward an Iterative Data Processing Pattern on the Cloud
CloudClustering: Toward an Iterative Data Processing Pattern on the CloudCloudClustering: Toward an Iterative Data Processing Pattern on the Cloud
CloudClustering: Toward an Iterative Data Processing Pattern on the Cloud
Ankur Dave
 
1 r029g formation-ibm-cognos-icm-building-reports-v-8-1
1 r029g formation-ibm-cognos-icm-building-reports-v-8-11 r029g formation-ibm-cognos-icm-building-reports-v-8-1
1 r029g formation-ibm-cognos-icm-building-reports-v-8-1
CERTyou Formation
 

Mais procurados (18)

Anomaly Detection at Scale
Anomaly Detection at ScaleAnomaly Detection at Scale
Anomaly Detection at Scale
 
Spark Meetup TensorFrames
Spark Meetup TensorFramesSpark Meetup TensorFrames
Spark Meetup TensorFrames
 
Real-time Big Data Processing with Storm
Real-time Big Data Processing with StormReal-time Big Data Processing with Storm
Real-time Big Data Processing with Storm
 
Applying your Convolutional Neural Networks
Applying your Convolutional Neural NetworksApplying your Convolutional Neural Networks
Applying your Convolutional Neural Networks
 
Deep Recurrent Neural Networks for Sequence Learning in Spark by Yves Mabiala
Deep Recurrent Neural Networks for Sequence Learning in Spark by Yves MabialaDeep Recurrent Neural Networks for Sequence Learning in Spark by Yves Mabiala
Deep Recurrent Neural Networks for Sequence Learning in Spark by Yves Mabiala
 
Deep recurrent neutral networks for Sequence Learning in Spark
Deep recurrent neutral networks for Sequence Learning in SparkDeep recurrent neutral networks for Sequence Learning in Spark
Deep recurrent neutral networks for Sequence Learning in Spark
 
53
5353
53
 
CPAC Connectome Analysis in the Cloud
CPAC Connectome Analysis in the CloudCPAC Connectome Analysis in the Cloud
CPAC Connectome Analysis in the Cloud
 
Time Series Analysis for Network Secruity
Time Series Analysis for Network SecruityTime Series Analysis for Network Secruity
Time Series Analysis for Network Secruity
 
1 sx5g formation-solutions-cloud-avancees-avec-smartcloud-orchestrator-sur-la...
1 sx5g formation-solutions-cloud-avancees-avec-smartcloud-orchestrator-sur-la...1 sx5g formation-solutions-cloud-avancees-avec-smartcloud-orchestrator-sur-la...
1 sx5g formation-solutions-cloud-avancees-avec-smartcloud-orchestrator-sur-la...
 
CloudClustering: Toward an Iterative Data Processing Pattern on the Cloud
CloudClustering: Toward an Iterative Data Processing Pattern on the CloudCloudClustering: Toward an Iterative Data Processing Pattern on the Cloud
CloudClustering: Toward an Iterative Data Processing Pattern on the Cloud
 
DIET_BLAST
DIET_BLASTDIET_BLAST
DIET_BLAST
 
Image Classification Done Simply using Keras and TensorFlow
Image Classification Done Simply using Keras and TensorFlow Image Classification Done Simply using Keras and TensorFlow
Image Classification Done Simply using Keras and TensorFlow
 
Computational decision making
Computational decision makingComputational decision making
Computational decision making
 
Master's Thesis - climateprediction.net: A Cloudy Approach
Master's Thesis - climateprediction.net: A Cloudy ApproachMaster's Thesis - climateprediction.net: A Cloudy Approach
Master's Thesis - climateprediction.net: A Cloudy Approach
 
1 r029g formation-ibm-cognos-icm-building-reports-v-8-1
1 r029g formation-ibm-cognos-icm-building-reports-v-8-11 r029g formation-ibm-cognos-icm-building-reports-v-8-1
1 r029g formation-ibm-cognos-icm-building-reports-v-8-1
 
Weather Station Data Publication at Irstea: an implementation Report.
Weather Station Data Publication at Irstea: an implementation Report.  Weather Station Data Publication at Irstea: an implementation Report.
Weather Station Data Publication at Irstea: an implementation Report.
 
Intro to the Distributed Version of TensorFlow
Intro to the Distributed Version of TensorFlowIntro to the Distributed Version of TensorFlow
Intro to the Distributed Version of TensorFlow
 

Semelhante a Nimrod cloud

Research in Cloud Computing
Research in Cloud ComputingResearch in Cloud Computing
Research in Cloud Computing
Rajshri Mohan
 
Task Adaptive Neural Network Search with Meta-Contrastive Learning
Task Adaptive Neural Network Search with Meta-Contrastive LearningTask Adaptive Neural Network Search with Meta-Contrastive Learning
Task Adaptive Neural Network Search with Meta-Contrastive Learning
MLAI2
 
Three Degrees of Mediation: Challenges and Lessons in building Cloud-agnostic...
Three Degrees of Mediation: Challenges and Lessons in building Cloud-agnostic...Three Degrees of Mediation: Challenges and Lessons in building Cloud-agnostic...
Three Degrees of Mediation: Challenges and Lessons in building Cloud-agnostic...
Alex Maclinovsky
 

Semelhante a Nimrod cloud (20)

Research in Cloud Computing
Research in Cloud ComputingResearch in Cloud Computing
Research in Cloud Computing
 
StackWatch: A prototype CloudWatch service for CloudStack
StackWatch: A prototype CloudWatch service for CloudStackStackWatch: A prototype CloudWatch service for CloudStack
StackWatch: A prototype CloudWatch service for CloudStack
 
MBrace: Large-scale cloud computation with F# (CUFP 2014)
MBrace: Large-scale cloud computation with F# (CUFP 2014)MBrace: Large-scale cloud computation with F# (CUFP 2014)
MBrace: Large-scale cloud computation with F# (CUFP 2014)
 
Azure machine learning service
Azure machine learning serviceAzure machine learning service
Azure machine learning service
 
Cloud present, future and trajectory (Amazon Web Services) - JIsc Digifest 2016
Cloud present, future and trajectory (Amazon Web Services) - JIsc Digifest 2016Cloud present, future and trajectory (Amazon Web Services) - JIsc Digifest 2016
Cloud present, future and trajectory (Amazon Web Services) - JIsc Digifest 2016
 
Task Adaptive Neural Network Search with Meta-Contrastive Learning
Task Adaptive Neural Network Search with Meta-Contrastive LearningTask Adaptive Neural Network Search with Meta-Contrastive Learning
Task Adaptive Neural Network Search with Meta-Contrastive Learning
 
Using Grid Technologies in the Cloud for High Scalability
Using Grid Technologies in the Cloud for High ScalabilityUsing Grid Technologies in the Cloud for High Scalability
Using Grid Technologies in the Cloud for High Scalability
 
Concurrent and Distributed CloudSim Simulations
Concurrent and Distributed CloudSim SimulationsConcurrent and Distributed CloudSim Simulations
Concurrent and Distributed CloudSim Simulations
 
Cloudsim_openstack_aws_lastunit_bsccs_cloud computing
Cloudsim_openstack_aws_lastunit_bsccs_cloud computingCloudsim_openstack_aws_lastunit_bsccs_cloud computing
Cloudsim_openstack_aws_lastunit_bsccs_cloud computing
 
FULLTEXT02
FULLTEXT02FULLTEXT02
FULLTEXT02
 
Building a Just-in-Time Application Stack for Analysts
Building a Just-in-Time Application Stack for AnalystsBuilding a Just-in-Time Application Stack for Analysts
Building a Just-in-Time Application Stack for Analysts
 
Openstack Pakistan Workshop (intro)
Openstack Pakistan Workshop (intro)Openstack Pakistan Workshop (intro)
Openstack Pakistan Workshop (intro)
 
Three Degrees of Mediation: Challenges and Lessons in building Cloud-agnostic...
Three Degrees of Mediation: Challenges and Lessons in building Cloud-agnostic...Three Degrees of Mediation: Challenges and Lessons in building Cloud-agnostic...
Three Degrees of Mediation: Challenges and Lessons in building Cloud-agnostic...
 
Multicloud Deployment of Computing Clusters for Loosely Coupled Multi Task C...
Multicloud Deployment of Computing Clusters for Loosely  Coupled Multi Task C...Multicloud Deployment of Computing Clusters for Loosely  Coupled Multi Task C...
Multicloud Deployment of Computing Clusters for Loosely Coupled Multi Task C...
 
Julien Simon "Scaling ML from 0 to millions of users"
Julien Simon "Scaling ML from 0 to millions of users"Julien Simon "Scaling ML from 0 to millions of users"
Julien Simon "Scaling ML from 0 to millions of users"
 
Risk Management and Particle Accelerators: Innovating with New Compute Platfo...
Risk Management and Particle Accelerators: Innovating with New Compute Platfo...Risk Management and Particle Accelerators: Innovating with New Compute Platfo...
Risk Management and Particle Accelerators: Innovating with New Compute Platfo...
 
Essel cloud-tecnical
Essel cloud-tecnicalEssel cloud-tecnical
Essel cloud-tecnical
 
Experiments with Complex Scientific Applications on Hybrid Cloud Infrastructures
Experiments with Complex Scientific Applications on Hybrid Cloud InfrastructuresExperiments with Complex Scientific Applications on Hybrid Cloud Infrastructures
Experiments with Complex Scientific Applications on Hybrid Cloud Infrastructures
 
Time to Science, Time to Results. Accelerating Scientific research in the Cloud
Time to Science, Time to Results. Accelerating Scientific research in the CloudTime to Science, Time to Results. Accelerating Scientific research in the Cloud
Time to Science, Time to Results. Accelerating Scientific research in the Cloud
 
Viktor Tsykunov: Azure Machine Learning Service
Viktor Tsykunov: Azure Machine Learning ServiceViktor Tsykunov: Azure Machine Learning Service
Viktor Tsykunov: Azure Machine Learning Service
 

Último

“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
Muhammad Subhan
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
FIDO Alliance
 
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc
 

Último (20)

ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
 
Top 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development CompaniesTop 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development Companies
 
Overview of Hyperledger Foundation
Overview of Hyperledger FoundationOverview of Hyperledger Foundation
Overview of Hyperledger Foundation
 
Google I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGoogle I/O Extended 2024 Warsaw
Google I/O Extended 2024 Warsaw
 
ADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptxADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptx
 
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
 
ERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage IntacctERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage Intacct
 
2024 May Patch Tuesday
2024 May Patch Tuesday2024 May Patch Tuesday
2024 May Patch Tuesday
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cf
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
 
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
 
Portal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russePortal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russe
 
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
 
Oauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftOauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoft
 
The Metaverse: Are We There Yet?
The  Metaverse:    Are   We  There  Yet?The  Metaverse:    Are   We  There  Yet?
The Metaverse: Are We There Yet?
 
Intro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptxIntro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptx
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
 
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
 
Syngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdfSyngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdf
 
Design Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxDesign Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptx
 

Nimrod cloud

  • 1. High-throughput eScience mixing Grids and Clouds: an experience with the Nimrod tool family Presenter: Blair Bethwaite MonasheScience and Grid Engineering Lab
  • 2. MeSsAGE Lab team: David Abramson Colin Enticott SlavisaGaric and others... Acknowledgements NeCTAR Research Cloud Workshop Blair Bethwaite - MeSsAGE Lab, Monash Uni
  • 3. Agenda NeCTAR Research Cloud Workshop Blair Bethwaite - MeSsAGE Lab, Monash Uni
  • 4. The Nimrod tool family NeCTAR Research Cloud Workshop Blair Bethwaite - MeSsAGE Lab, Monash Uni
  • 5. Parametric computing with the Nimrod tools Vary parameters Execute programs Copy code/data in/out X, Y, Z could be: Basic data types; ints, floats, strings Files Random numbers to drive Monte Carlo modelling X Y Parameter Space Solution Space Z User Job EII Cloud Workshop - AWS Intro Blair Bethwaite - MeSsAGE Lab, Monash Uni
  • 6. Nimrod Applications messagelab.monash.edu.au/EScienceApplications NeCTAR Research Cloud Workshop Blair Bethwaite - MeSsAGE Lab, Monash Uni
  • 7. From Clusters, to Grids, to Clouds Jobs / Nimrod experiment Nimrod Actuator, e.g., SGE, PBS, LSF, Condor Local Batch System NeCTAR Research Cloud Workshop Blair Bethwaite - MeSsAGE Lab, Monash Uni
  • 8. From Clusters, to Grids, to Clouds Jobs / Nimrod experiment Portal Nimrod-O/E/K Nimrod/G Actuator, e.g., Globus Servers Upper middleware Lower middleware Pilot jobs / agents Agents Grid Middleware Grid Middleware Grid Middleware Agents Grid Middleware NeCTAR Research Cloud Workshop Blair Bethwaite - MeSsAGE Lab, Monash Uni
  • 9. From Clusters, to Grids, to Clouds NeCTAR Research Cloud Workshop Blair Bethwaite - MeSsAGE Lab, Monash Uni The Grid Global utility computing mk.1-(beta) Somewhere in-between Infrastructure and Platform as-a-Service For Nimrod Increased computational scale – massively parallel New scheduling and data challenges Computational economy proposed Problems Interoperability Barriers to entry
  • 10. From Clusters, to Grids, to Clouds NeCTAR Research Cloud Workshop Blair Bethwaite - MeSsAGE Lab, Monash Uni
  • 11. From Clusters, to Grids, to Clouds NeCTAR Research Cloud Workshop Blair Bethwaite - MeSsAGE Lab, Monash Uni Cloud opportunities for HTC Virtualisation helps interoperability and scalability Cloud bursting Scale-out to supplement locally and nationally available resources Test computational economy and scheduling, in anger Deadline driven Budget driven What’s missing? Grids provide services above IaaS E.g., you can build a grid on EC2 Grids provide job and data handling services, more like PaaS
  • 12. From Clusters, to Grids, to Clouds def process_queue(self): """Prepare allocation of commands/agents to instances. This might mean requesting new instances from the web service and/or allocating available slots from existing instances. ""“ if not self._queued_cmds and not self.proxy: return False self._update_available_instances() req_slots = len(self._queued_cmds) new_slots = req_slots - self.free_slots num_insts = new_slots / self.slots_per_instance # if we need the proxy we might have to force # launching an instance to host it if self.proxy and num_insts < 1 br /> and len(self.instances) < 1: num_insts = 1 rsv = None ... ... if num_insts > 0: try: rsv = self.ec2conn.run_instances(self.ami_id, min_count=1, max_count=num_insts, key_name=self.ws_label, security_groups=[self.secgroup.name], instance_type=self.ec2InstanceType) except EC2ResponseError,e: if ec2.parse_response_error(e, 'Code') == br />u'InstanceLimitExceeded': self.at_instance_limit = True print "[%s] Instance limit exceeded" % self.label else: print "[%s] Error running instances:%s" % br /> (self.label, t5exc.exception()) raise if rsv: self._pending_reservations.append(rsv) ... NeCTAR Research Cloud Workshop Blair Bethwaite - MeSsAGE Lab, Monash Uni
  • 13. Integrating with IaaS Jobs / Nimrod experiment Portal Nimrod-O/E/K Nimrod/G Actuator: Globus,... Services New actuators: EC2, IBM, Azure, OCCI?,...? RESTfulIaaS API Grid Middleware VM Agents Agents VM VM Agents Agents NeCTAR Research Cloud Workshop Blair Bethwaite - MeSsAGE Lab, Monash Uni
  • 14. Integrating with IaaS Advantage: Nimrod is already a meta-scheduler Creates an ad-hoc grid dynamically overlaying the available resource pool Don’t need Grid-like job processing services to stand-up resource pool Requires explicit management of infrastructure Extra level of scheduling – when to initialise infrastructure? NeCTAR Research Cloud Workshop Blair Bethwaite - MeSsAGE Lab, Monash Uni
  • 15. Integrating with IaaS 1 2 3 NeCTAR Research Cloud Workshop Blair Bethwaite - MeSsAGE Lab, Monash Uni
  • 16. Application Examples A lot of existing grid based infrastructure So, mix it together “Mixing Grids and Clouds: High-Throughput Science Using the Nimrod Tool Family,” in Cloud Computing, vol. 0 (Springer London, 2010) Markov Chain Monte Carlo methods for recommender systems For better results, insert coins here... NeCTAR Research Cloud Workshop Blair Bethwaite - MeSsAGE Lab, Monash Uni
  • 17. Application Examples Modelling ash dispersion – NG-TEPHRA IEEE e-Science 2010 Supplement local infrastructure for deadline sensitive analysis NeCTAR Research Cloud Workshop Blair Bethwaite - MeSsAGE Lab, Monash Uni
  • 18. Work-in-progress What’s keeping me awake... Spot-price scheduling Smarter data handling Windows support On EC2 And integrating with Azure Rose NeCTAR Research Cloud Workshop Blair Bethwaite - MeSsAGE Lab, Monash Uni
  • 19. Nimrod utilising NeCTAR RC Host MeSsAGE Lab tools Dev and test environment Excess capacity supporting HTC NeCTAR Research Cloud Workshop Blair Bethwaite - MeSsAGE Lab, Monash Uni
  • 20. Thank you! Presentation by: Blair Bethwaite Researcher, Developer, SysAdmin Monash eScience and Grid Engineering Lab Feedback/queries: blair.bethwaite@monash.edu david.abramson@monash.edu NeCTAR Research Cloud Workshop Blair Bethwaite - MeSsAGE Lab, Monash Uni

Notas do Editor

  1. Please ask questions during the talk if you have them.
  2. Simple pleasingly-parallel computing for “legacy” (misnomer: just need existing app, Nimrod is the distributed glue that launches and contextualises each job). Onclusters, compute grids, and now clouds.Also support computational economy via economic scheduling.
  3. Molecular docking in drug designEngineering antennae for maximum gainAirfoil optimising LD ratio
  4. Original Nimrod also acted as the cluster management system, commercial spin-off to Enfuzion.
  5. Nimrod/G – “G” originally stood for Globus but now more general supporting other lower level middleware, such as Condor.
  6. Then AWS came along... suddenly public utility computing became a realityOn demand: start and stop machines any time, lead time of minutes.Self service: no lengthy email trail with your data centre admin, just make a web service call.PAYG: pay for what you use, tear it down when not needed.Think of it as a computational vending machine.
  7. Code snippet from Nimrod EC2 actuator – bringing up your first few machines like this is cool! And incredibly easy with these APIs, and great tools like Boto.
  8. Actuator model makes this integration relatively painless compared to an app highly dependent on higher Grid middleware functions.
  9. Clouds provide an interesting infrastructure to supplement the usual resources available for academic computing.You can pay to get your results faster, or make them higher quality.
  10. Probabilistic spatial and density distribution mapping of volcanic tephra, potentially useful in time sensitive scenarios, i.e., immediately preceding or following an eruption event.