SlideShare uma empresa Scribd logo
1 de 51
OGCE Workflow Toolkit for Multi-Scale Science Applications Suresh Marru Pervasive Technology Institute Indiana University ODI Gateway/Pipeline Evaluation  NOAO Jan 21st 2010
TeraGrid
Data Capacitor ,[object Object]
IU Bloomington data center
NSF funded project to create a high throughput 535 TB Lustre storage solution
Short term storage for applications with high bandwidth needs
IU added a wide area version in 2008
Enhanced wide area security,[object Object]
Massive Data Storage System MDSS Architecture
[object Object]
Wide
Open,[object Object]
Gateways Advantages Increase access To instruments Increase capabilities To analyze data Improve workforce development For underserved populations Increase outreach Increase public awareness Public sees value in investments in large facilities
There are approximately 30 gateways using the TeraGrid
Example Gateway Accomplishments LEAD - access to radar data NVO – access to sky surveys OOI – access to sensor data PolarGrid – access to polar ice sheet data SIDGrid – analysis tools GridChem – developing multiscale coupling How would this have been done before gateways? How many details do we want each individual scientist to need to know?
TeraGrid Advantages and Challenges What’s different when the resource doesn’t belong just to me? Resource discovery Accounting Security Proposal-based requests for resources (peer-reviewed access) Code scaling and performance numbers Justification of resources Gateway citations Tremendous benefits at the high end, but even more work for the developers Potential impact on science is huge Small number of developers can impact thousands of scientists But need a way to train and fund those developers and provide them with appropriate tools
Example Gateway: LEAD
Weather is Local, High-Impact, Heterogeneous and Rapidly Evolving…Yet Our Technologies and Thinking are Static Rain and Snow Fog Rain and Snow Snow and Freezing Rain Intense Turbulence Severe Thunderstorms
LEAD Dynamic Adaptive Infrastructure  Storms Forming Forecast Model Streaming Observations Data Mining Instrument Steering Refine forecast grid
NSF Engineering Research Center for Collaborative Adaptive Sensing of the Atmosphere (CASA) UMass/Amherst, OU, CSU, UPRM Concept:  inexpensive, phased array Doppler radars on cell towers and buildings Dynamically adaptive sensing of multiple targets while simultaneously meeting multiple end-user needs
LEAD Workflow Requirements Run jobs on-demand on TeraGrid. Deadline driven workflows (severe weather tracking) Users ranging from 8th grade students to seasoned researchers. Run jobs on Multiple TeraGrid resources to decrease turn-around time. Must be able to integrate to Portal with very user friendly web interface.
Workflow Survey in 2003(http://www.extreme.indiana.edu/swf-survey/)
High Level LEAD Architecture Workflow graph Application  services Compute Engine User Portal Workflow Engine Fault Tolerance & scheduler Event Notification Bus Portal server MyLEAD  Agent service Data Management Service Data Catalog service Providence Collection service MyLEAD User Metadata catalog Data Storage
LEAD Pioneering Technology
LEAD Scientists and Educational Interactions Lowering the barrier for using complex end-to-end weather technologies Democratize Empower Facilitate End Users Developers Researcherss
Layered Architecture
Open Grid Computing Environments Generalize, Harden, Build Test
Flexible Layered Service Oriented Architecture User Interactions Other Clients XBaya GUI Web Portal XBaya Core Event Bus Middleware Services GFac Services Workflow Engine (ODE) XRegistry XMCCat Metadata Catalog Compute & Data Resources Computational Cloud  Local Lab Resources Computational Grids
OGCE Workflow Suite Generic Service Toolkit Tool to wrap command-line applications as web services Handles file staging & job submission and monitoring Extensible runtime for security, resource brokering & urgent computing Generic Factory service for on-demand creation of application services XRegistry Information repository for the OGCE workflow suite Register, search, retrieve & share XML documents User & hierarchical group based authorization XBaya GUI based tool to compose & monitor workflows Extensible support for compiler plug-ins like BPEL, Jython, SCUFL Dynamic Workflow Execution support to start, pause, resume, rewind of workflow executions Apache ODE Scientific Workflow Extensions XBaya GUI integration for BPEL Generation Asynchronous support for long running workflows Instrumented with fine grained monitoring Eventing System Supports both WS-Eventing and WS-Notification Standards Very scalable Persistent Message Box for clients behind firewalls and with intermittent network glitches.
Generic Application Service Factory
Application Wrapper Framework c Application Factory ,[object Object]
The Application Factory generates a web service for each application with I/O interfaces.
Registers WSDL for the service with a registry
Each service generates a stream of notifications that log the service actions back to the XMCCat Metadata Catalog, user monitoring, and provenance tracking toolsApp  Service Run program & publish events
Service Monitoring via Events 1 2 3 4 5 6 ,[object Object]
I am running your request
I have started to move your input files
I have all the files
I am running your application
The application is finished
I am moving the output to you file space
I am done
These are automatically generated by the service using a distributed event system(WS-Eventing / WS-Notification)
Topic based pub-sub system witha well known “channel”Application Service Instance Notification Channel x x publisher listener
Workflow Composer
Interoperable XBaya Workflow Architecture BPEL  1.1 BPEL  2.0 SCUFL Abstract DAG Model Composition and Monitoring Python Dynamic Enactor/Interpreter Jython Based Enactor GPEL  Engine Apache  ODE Engine Taverna Python Runtime Message Bus
WS-BPEL Business Process Execution Language for Web Services (WS-BPEL) De-facto standard for specifying web service based business processes and service compositions Basic activities Invoke, Receive, Assign.. Structured activities Sequence, Flow, ForEach,..
Workflow Composition, Execution & Monitoring     XBaya enables users to construct, share, execute and monitor sequence of tasks executing on their local workstations to high-end compute resources.
GPEL Grid Process Execution Language BPEL4WS based home grown research workflow engine Supports a subset of BPEL4WS 1.1  One of the very early adaptations of BPEL efforts Specifically designed for eScience Usage Long running workflow support Decoupled client
Benefits of Porting to Apache ODE

Mais conteúdo relacionado

Mais procurados

Project DRAC: Creating an applications-aware network
Project DRAC: Creating an applications-aware networkProject DRAC: Creating an applications-aware network
Project DRAC: Creating an applications-aware networkTal Lavian Ph.D.
 
Soa12c launch 5 event processing shmakov eng cr
Soa12c launch 5 event processing shmakov eng crSoa12c launch 5 event processing shmakov eng cr
Soa12c launch 5 event processing shmakov eng crVasily Demin
 
Workload Automation for Cloud Migration and Machine Learning Platform
Workload Automation for Cloud Migration and Machine Learning PlatformWorkload Automation for Cloud Migration and Machine Learning Platform
Workload Automation for Cloud Migration and Machine Learning PlatformActiveeon
 
Comparison of Open-Source Data Stream Processing Engines: Spark Streaming, Fl...
Comparison of Open-Source Data Stream Processing Engines: Spark Streaming, Fl...Comparison of Open-Source Data Stream Processing Engines: Spark Streaming, Fl...
Comparison of Open-Source Data Stream Processing Engines: Spark Streaming, Fl...Darshan Gorasiya
 
Cloud Computing Course
Cloud Computing Course Cloud Computing Course
Cloud Computing Course Arshdeep Bahga
 
Hadoop Summit San Jose 2015: YARN - Past, Present and Future
Hadoop Summit San Jose 2015: YARN - Past, Present and FutureHadoop Summit San Jose 2015: YARN - Past, Present and Future
Hadoop Summit San Jose 2015: YARN - Past, Present and FutureVinod Kumar Vavilapalli
 
Hybrid Cloud Monitoring - Datatdog
Hybrid Cloud Monitoring - DatatdogHybrid Cloud Monitoring - Datatdog
Hybrid Cloud Monitoring - DatatdogChase Thompson
 
An overview of grid monitoring
An overview of grid monitoringAn overview of grid monitoring
An overview of grid monitoringManoj Prabhakar
 
Enterprise Data and Analytics Architecture Overview for Electric Utility
Enterprise Data and Analytics Architecture Overview for Electric UtilityEnterprise Data and Analytics Architecture Overview for Electric Utility
Enterprise Data and Analytics Architecture Overview for Electric UtilityPrajesh Bhattacharya
 
Moving Health Care Analytics to Hadoop to Build a Better Predictive Model
Moving Health Care Analytics to Hadoop to Build a Better Predictive ModelMoving Health Care Analytics to Hadoop to Build a Better Predictive Model
Moving Health Care Analytics to Hadoop to Build a Better Predictive ModelDataWorks Summit
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Databricks
 
How to Architect a Serverless Cloud Data Lake for Enhanced Data Analytics
How to Architect a Serverless Cloud Data Lake for Enhanced Data AnalyticsHow to Architect a Serverless Cloud Data Lake for Enhanced Data Analytics
How to Architect a Serverless Cloud Data Lake for Enhanced Data AnalyticsInformatica
 
DRAFT - Enterprise Data and Analytics Architecture Overview for Electric Utility
DRAFT - Enterprise Data and Analytics Architecture Overview for Electric UtilityDRAFT - Enterprise Data and Analytics Architecture Overview for Electric Utility
DRAFT - Enterprise Data and Analytics Architecture Overview for Electric UtilityPrajesh Bhattacharya
 
IRJET- A Survey on Remote Data Possession Verification Protocol in Cloud Storage
IRJET- A Survey on Remote Data Possession Verification Protocol in Cloud StorageIRJET- A Survey on Remote Data Possession Verification Protocol in Cloud Storage
IRJET- A Survey on Remote Data Possession Verification Protocol in Cloud StorageIRJET Journal
 
Real-Time Robot Predictive Maintenance in Action
Real-Time Robot Predictive Maintenance in ActionReal-Time Robot Predictive Maintenance in Action
Real-Time Robot Predictive Maintenance in ActionDataWorks Summit
 
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run ApproachEvolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run ApproachDataWorks Summit
 

Mais procurados (20)

r1501e
r1501er1501e
r1501e
 
Project DRAC: Creating an applications-aware network
Project DRAC: Creating an applications-aware networkProject DRAC: Creating an applications-aware network
Project DRAC: Creating an applications-aware network
 
Soa12c launch 5 event processing shmakov eng cr
Soa12c launch 5 event processing shmakov eng crSoa12c launch 5 event processing shmakov eng cr
Soa12c launch 5 event processing shmakov eng cr
 
Workload Automation for Cloud Migration and Machine Learning Platform
Workload Automation for Cloud Migration and Machine Learning PlatformWorkload Automation for Cloud Migration and Machine Learning Platform
Workload Automation for Cloud Migration and Machine Learning Platform
 
Cs6703 grid and cloud computing unit 1
Cs6703 grid and cloud computing unit 1Cs6703 grid and cloud computing unit 1
Cs6703 grid and cloud computing unit 1
 
Unit 4
Unit 4Unit 4
Unit 4
 
Comparison of Open-Source Data Stream Processing Engines: Spark Streaming, Fl...
Comparison of Open-Source Data Stream Processing Engines: Spark Streaming, Fl...Comparison of Open-Source Data Stream Processing Engines: Spark Streaming, Fl...
Comparison of Open-Source Data Stream Processing Engines: Spark Streaming, Fl...
 
Cloud Computing Course
Cloud Computing Course Cloud Computing Course
Cloud Computing Course
 
Hadoop Summit San Jose 2015: YARN - Past, Present and Future
Hadoop Summit San Jose 2015: YARN - Past, Present and FutureHadoop Summit San Jose 2015: YARN - Past, Present and Future
Hadoop Summit San Jose 2015: YARN - Past, Present and Future
 
Hybrid Cloud Monitoring - Datatdog
Hybrid Cloud Monitoring - DatatdogHybrid Cloud Monitoring - Datatdog
Hybrid Cloud Monitoring - Datatdog
 
An overview of grid monitoring
An overview of grid monitoringAn overview of grid monitoring
An overview of grid monitoring
 
Enterprise Data and Analytics Architecture Overview for Electric Utility
Enterprise Data and Analytics Architecture Overview for Electric UtilityEnterprise Data and Analytics Architecture Overview for Electric Utility
Enterprise Data and Analytics Architecture Overview for Electric Utility
 
Moving Health Care Analytics to Hadoop to Build a Better Predictive Model
Moving Health Care Analytics to Hadoop to Build a Better Predictive ModelMoving Health Care Analytics to Hadoop to Build a Better Predictive Model
Moving Health Care Analytics to Hadoop to Build a Better Predictive Model
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
How to Architect a Serverless Cloud Data Lake for Enhanced Data Analytics
How to Architect a Serverless Cloud Data Lake for Enhanced Data AnalyticsHow to Architect a Serverless Cloud Data Lake for Enhanced Data Analytics
How to Architect a Serverless Cloud Data Lake for Enhanced Data Analytics
 
DRAFT - Enterprise Data and Analytics Architecture Overview for Electric Utility
DRAFT - Enterprise Data and Analytics Architecture Overview for Electric UtilityDRAFT - Enterprise Data and Analytics Architecture Overview for Electric Utility
DRAFT - Enterprise Data and Analytics Architecture Overview for Electric Utility
 
IRJET- A Survey on Remote Data Possession Verification Protocol in Cloud Storage
IRJET- A Survey on Remote Data Possession Verification Protocol in Cloud StorageIRJET- A Survey on Remote Data Possession Verification Protocol in Cloud Storage
IRJET- A Survey on Remote Data Possession Verification Protocol in Cloud Storage
 
Real-Time Robot Predictive Maintenance in Action
Real-Time Robot Predictive Maintenance in ActionReal-Time Robot Predictive Maintenance in Action
Real-Time Robot Predictive Maintenance in Action
 
V04405122126
V04405122126V04405122126
V04405122126
 
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run ApproachEvolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
 

Semelhante a Ogce Workflow Suite

Scientific
Scientific Scientific
Scientific marpierc
 
grid mining
grid mininggrid mining
grid miningARNOLD
 
XSEDE14 SciGaP-Apache Airavata Tutorial
XSEDE14 SciGaP-Apache Airavata TutorialXSEDE14 SciGaP-Apache Airavata Tutorial
XSEDE14 SciGaP-Apache Airavata Tutorialmarpierc
 
Opportunities and Challenges for Running Scientific Workflows on the Cloud
Opportunities and Challenges for Running Scientific Workflows on the Cloud Opportunities and Challenges for Running Scientific Workflows on the Cloud
Opportunities and Challenges for Running Scientific Workflows on the Cloud lyingcom
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
BDE SC3.3 Workshop - Options for Wind Farm performance assessment and Power f...
BDE SC3.3 Workshop - Options for Wind Farm performance assessment and Power f...BDE SC3.3 Workshop - Options for Wind Farm performance assessment and Power f...
BDE SC3.3 Workshop - Options for Wind Farm performance assessment and Power f...BigData_Europe
 
Indiana University's Advanced Science Gateway Support
Indiana University's Advanced Science Gateway SupportIndiana University's Advanced Science Gateway Support
Indiana University's Advanced Science Gateway Supportmarpierc
 
Privacy preserving public auditing for secured cloud storage
Privacy preserving public auditing for secured cloud storagePrivacy preserving public auditing for secured cloud storage
Privacy preserving public auditing for secured cloud storagedbpublications
 
Educause Annual 2007
Educause Annual 2007Educause Annual 2007
Educause Annual 2007Neil Matatall
 
Atmosphere 2014: Switching from monolithic approach to modular cloud computin...
Atmosphere 2014: Switching from monolithic approach to modular cloud computin...Atmosphere 2014: Switching from monolithic approach to modular cloud computin...
Atmosphere 2014: Switching from monolithic approach to modular cloud computin...PROIDEA
 
Harbour IT & VMware - vForum 2010 Wrap
Harbour IT & VMware - vForum 2010 WrapHarbour IT & VMware - vForum 2010 Wrap
Harbour IT & VMware - vForum 2010 WrapHarbourIT
 
070416 Egu Vienna Husar
070416 Egu Vienna Husar070416 Egu Vienna Husar
070416 Egu Vienna HusarRudolf Husar
 
Stream analytics
Stream analyticsStream analytics
Stream analyticsrebeccatho
 
Conceptualizing And Prototyping A Scalable Genomic Data Analysis Pipeline: Us...
Conceptualizing And Prototyping A Scalable Genomic Data Analysis Pipeline: Us...Conceptualizing And Prototyping A Scalable Genomic Data Analysis Pipeline: Us...
Conceptualizing And Prototyping A Scalable Genomic Data Analysis Pipeline: Us...Shadab Ali Khan
 
A cloud environment for backup and data storage
A cloud environment for backup and data storageA cloud environment for backup and data storage
A cloud environment for backup and data storageIGEEKS TECHNOLOGIES
 
Data Science in the cloud with Microsoft Azure
Data Science in the cloud with Microsoft Azure Data Science in the cloud with Microsoft Azure
Data Science in the cloud with Microsoft Azure TechExeter
 

Semelhante a Ogce Workflow Suite (20)

Scientific
Scientific Scientific
Scientific
 
grid mining
grid mininggrid mining
grid mining
 
XSEDE14 SciGaP-Apache Airavata Tutorial
XSEDE14 SciGaP-Apache Airavata TutorialXSEDE14 SciGaP-Apache Airavata Tutorial
XSEDE14 SciGaP-Apache Airavata Tutorial
 
Grid computing
Grid computingGrid computing
Grid computing
 
SomeSlides
SomeSlidesSomeSlides
SomeSlides
 
Opportunities and Challenges for Running Scientific Workflows on the Cloud
Opportunities and Challenges for Running Scientific Workflows on the Cloud Opportunities and Challenges for Running Scientific Workflows on the Cloud
Opportunities and Challenges for Running Scientific Workflows on the Cloud
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
As34269277
As34269277As34269277
As34269277
 
BDE SC3.3 Workshop - Options for Wind Farm performance assessment and Power f...
BDE SC3.3 Workshop - Options for Wind Farm performance assessment and Power f...BDE SC3.3 Workshop - Options for Wind Farm performance assessment and Power f...
BDE SC3.3 Workshop - Options for Wind Farm performance assessment and Power f...
 
Indiana University's Advanced Science Gateway Support
Indiana University's Advanced Science Gateway SupportIndiana University's Advanced Science Gateway Support
Indiana University's Advanced Science Gateway Support
 
Privacy preserving public auditing for secured cloud storage
Privacy preserving public auditing for secured cloud storagePrivacy preserving public auditing for secured cloud storage
Privacy preserving public auditing for secured cloud storage
 
Web Services in the Real World
Web Services in the Real WorldWeb Services in the Real World
Web Services in the Real World
 
Educause Annual 2007
Educause Annual 2007Educause Annual 2007
Educause Annual 2007
 
Atmosphere 2014: Switching from monolithic approach to modular cloud computin...
Atmosphere 2014: Switching from monolithic approach to modular cloud computin...Atmosphere 2014: Switching from monolithic approach to modular cloud computin...
Atmosphere 2014: Switching from monolithic approach to modular cloud computin...
 
Harbour IT & VMware - vForum 2010 Wrap
Harbour IT & VMware - vForum 2010 WrapHarbour IT & VMware - vForum 2010 Wrap
Harbour IT & VMware - vForum 2010 Wrap
 
070416 Egu Vienna Husar
070416 Egu Vienna Husar070416 Egu Vienna Husar
070416 Egu Vienna Husar
 
Stream analytics
Stream analyticsStream analytics
Stream analytics
 
Conceptualizing And Prototyping A Scalable Genomic Data Analysis Pipeline: Us...
Conceptualizing And Prototyping A Scalable Genomic Data Analysis Pipeline: Us...Conceptualizing And Prototyping A Scalable Genomic Data Analysis Pipeline: Us...
Conceptualizing And Prototyping A Scalable Genomic Data Analysis Pipeline: Us...
 
A cloud environment for backup and data storage
A cloud environment for backup and data storageA cloud environment for backup and data storage
A cloud environment for backup and data storage
 
Data Science in the cloud with Microsoft Azure
Data Science in the cloud with Microsoft Azure Data Science in the cloud with Microsoft Azure
Data Science in the cloud with Microsoft Azure
 

Mais de smarru

Cyberinfrastructure Experiences with Apache Airavata
Cyberinfrastructure Experiences with Apache AiravataCyberinfrastructure Experiences with Apache Airavata
Cyberinfrastructure Experiences with Apache Airavatasmarru
 
Apache Airavata Credential Store
Apache Airavata Credential StoreApache Airavata Credential Store
Apache Airavata Credential Storesmarru
 
RESTLess Design with Apache Thrift: Experiences from Apache Airavata
RESTLess Design with Apache Thrift: Experiences from Apache AiravataRESTLess Design with Apache Thrift: Experiences from Apache Airavata
RESTLess Design with Apache Thrift: Experiences from Apache Airavatasmarru
 
Google Summer of Code at Apache Software Foundation
Google Summer of Code at Apache Software FoundationGoogle Summer of Code at Apache Software Foundation
Google Summer of Code at Apache Software Foundationsmarru
 
Gsoc airavata
Gsoc airavataGsoc airavata
Gsoc airavatasmarru
 
Learning Open Source through GSOC
Learning Open Source through GSOC Learning Open Source through GSOC
Learning Open Source through GSOC smarru
 
Apache Student Induction ApacheCon 2013
Apache Student Induction ApacheCon 2013Apache Student Induction ApacheCon 2013
Apache Student Induction ApacheCon 2013smarru
 
Apache Airavata ApacheCon2013
Apache Airavata ApacheCon2013Apache Airavata ApacheCon2013
Apache Airavata ApacheCon2013smarru
 
Ogce Workflow Suite Tg09
Ogce Workflow Suite Tg09Ogce Workflow Suite Tg09
Ogce Workflow Suite Tg09smarru
 

Mais de smarru (9)

Cyberinfrastructure Experiences with Apache Airavata
Cyberinfrastructure Experiences with Apache AiravataCyberinfrastructure Experiences with Apache Airavata
Cyberinfrastructure Experiences with Apache Airavata
 
Apache Airavata Credential Store
Apache Airavata Credential StoreApache Airavata Credential Store
Apache Airavata Credential Store
 
RESTLess Design with Apache Thrift: Experiences from Apache Airavata
RESTLess Design with Apache Thrift: Experiences from Apache AiravataRESTLess Design with Apache Thrift: Experiences from Apache Airavata
RESTLess Design with Apache Thrift: Experiences from Apache Airavata
 
Google Summer of Code at Apache Software Foundation
Google Summer of Code at Apache Software FoundationGoogle Summer of Code at Apache Software Foundation
Google Summer of Code at Apache Software Foundation
 
Gsoc airavata
Gsoc airavataGsoc airavata
Gsoc airavata
 
Learning Open Source through GSOC
Learning Open Source through GSOC Learning Open Source through GSOC
Learning Open Source through GSOC
 
Apache Student Induction ApacheCon 2013
Apache Student Induction ApacheCon 2013Apache Student Induction ApacheCon 2013
Apache Student Induction ApacheCon 2013
 
Apache Airavata ApacheCon2013
Apache Airavata ApacheCon2013Apache Airavata ApacheCon2013
Apache Airavata ApacheCon2013
 
Ogce Workflow Suite Tg09
Ogce Workflow Suite Tg09Ogce Workflow Suite Tg09
Ogce Workflow Suite Tg09
 

Último

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 

Último (20)

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 

Ogce Workflow Suite

  • 1. OGCE Workflow Toolkit for Multi-Scale Science Applications Suresh Marru Pervasive Technology Institute Indiana University ODI Gateway/Pipeline Evaluation NOAO Jan 21st 2010
  • 3.
  • 5. NSF funded project to create a high throughput 535 TB Lustre storage solution
  • 6. Short term storage for applications with high bandwidth needs
  • 7. IU added a wide area version in 2008
  • 8.
  • 9. Massive Data Storage System MDSS Architecture
  • 10.
  • 11. Wide
  • 12.
  • 13. Gateways Advantages Increase access To instruments Increase capabilities To analyze data Improve workforce development For underserved populations Increase outreach Increase public awareness Public sees value in investments in large facilities
  • 14. There are approximately 30 gateways using the TeraGrid
  • 15. Example Gateway Accomplishments LEAD - access to radar data NVO – access to sky surveys OOI – access to sensor data PolarGrid – access to polar ice sheet data SIDGrid – analysis tools GridChem – developing multiscale coupling How would this have been done before gateways? How many details do we want each individual scientist to need to know?
  • 16. TeraGrid Advantages and Challenges What’s different when the resource doesn’t belong just to me? Resource discovery Accounting Security Proposal-based requests for resources (peer-reviewed access) Code scaling and performance numbers Justification of resources Gateway citations Tremendous benefits at the high end, but even more work for the developers Potential impact on science is huge Small number of developers can impact thousands of scientists But need a way to train and fund those developers and provide them with appropriate tools
  • 18. Weather is Local, High-Impact, Heterogeneous and Rapidly Evolving…Yet Our Technologies and Thinking are Static Rain and Snow Fog Rain and Snow Snow and Freezing Rain Intense Turbulence Severe Thunderstorms
  • 19. LEAD Dynamic Adaptive Infrastructure Storms Forming Forecast Model Streaming Observations Data Mining Instrument Steering Refine forecast grid
  • 20. NSF Engineering Research Center for Collaborative Adaptive Sensing of the Atmosphere (CASA) UMass/Amherst, OU, CSU, UPRM Concept: inexpensive, phased array Doppler radars on cell towers and buildings Dynamically adaptive sensing of multiple targets while simultaneously meeting multiple end-user needs
  • 21. LEAD Workflow Requirements Run jobs on-demand on TeraGrid. Deadline driven workflows (severe weather tracking) Users ranging from 8th grade students to seasoned researchers. Run jobs on Multiple TeraGrid resources to decrease turn-around time. Must be able to integrate to Portal with very user friendly web interface.
  • 22. Workflow Survey in 2003(http://www.extreme.indiana.edu/swf-survey/)
  • 23. High Level LEAD Architecture Workflow graph Application services Compute Engine User Portal Workflow Engine Fault Tolerance & scheduler Event Notification Bus Portal server MyLEAD Agent service Data Management Service Data Catalog service Providence Collection service MyLEAD User Metadata catalog Data Storage
  • 25. LEAD Scientists and Educational Interactions Lowering the barrier for using complex end-to-end weather technologies Democratize Empower Facilitate End Users Developers Researcherss
  • 26.
  • 28. Open Grid Computing Environments Generalize, Harden, Build Test
  • 29. Flexible Layered Service Oriented Architecture User Interactions Other Clients XBaya GUI Web Portal XBaya Core Event Bus Middleware Services GFac Services Workflow Engine (ODE) XRegistry XMCCat Metadata Catalog Compute & Data Resources Computational Cloud Local Lab Resources Computational Grids
  • 30. OGCE Workflow Suite Generic Service Toolkit Tool to wrap command-line applications as web services Handles file staging & job submission and monitoring Extensible runtime for security, resource brokering & urgent computing Generic Factory service for on-demand creation of application services XRegistry Information repository for the OGCE workflow suite Register, search, retrieve & share XML documents User & hierarchical group based authorization XBaya GUI based tool to compose & monitor workflows Extensible support for compiler plug-ins like BPEL, Jython, SCUFL Dynamic Workflow Execution support to start, pause, resume, rewind of workflow executions Apache ODE Scientific Workflow Extensions XBaya GUI integration for BPEL Generation Asynchronous support for long running workflows Instrumented with fine grained monitoring Eventing System Supports both WS-Eventing and WS-Notification Standards Very scalable Persistent Message Box for clients behind firewalls and with intermittent network glitches.
  • 32.
  • 33. The Application Factory generates a web service for each application with I/O interfaces.
  • 34. Registers WSDL for the service with a registry
  • 35. Each service generates a stream of notifications that log the service actions back to the XMCCat Metadata Catalog, user monitoring, and provenance tracking toolsApp Service Run program & publish events
  • 36.
  • 37. I am running your request
  • 38. I have started to move your input files
  • 39. I have all the files
  • 40. I am running your application
  • 42. I am moving the output to you file space
  • 44. These are automatically generated by the service using a distributed event system(WS-Eventing / WS-Notification)
  • 45. Topic based pub-sub system witha well known “channel”Application Service Instance Notification Channel x x publisher listener
  • 47. Interoperable XBaya Workflow Architecture BPEL 1.1 BPEL 2.0 SCUFL Abstract DAG Model Composition and Monitoring Python Dynamic Enactor/Interpreter Jython Based Enactor GPEL Engine Apache ODE Engine Taverna Python Runtime Message Bus
  • 48. WS-BPEL Business Process Execution Language for Web Services (WS-BPEL) De-facto standard for specifying web service based business processes and service compositions Basic activities Invoke, Receive, Assign.. Structured activities Sequence, Flow, ForEach,..
  • 49. Workflow Composition, Execution & Monitoring XBaya enables users to construct, share, execute and monitor sequence of tasks executing on their local workstations to high-end compute resources.
  • 50. GPEL Grid Process Execution Language BPEL4WS based home grown research workflow engine Supports a subset of BPEL4WS 1.1 One of the very early adaptations of BPEL efforts Specifically designed for eScience Usage Long running workflow support Decoupled client
  • 51. Benefits of Porting to Apache ODE
  • 52.
  • 53. Simple Recovery Architecture Portal BPEL Workflow Engine Application Performance Models Fault Tolerance/ Recovery Service Resource Reliability Models Application Service OVP/ RST/ MIG NWS, MDS BQP Notification Service Deadline & Success Probability 36 36
  • 54. OGCE Workflow Usage Flow Scientist/Application provider registers application description with Registry Service. Workflow Author constructs the workflow with multiple wrapped application services. Workflow is compiled and deployed to the ODE workflow Engine. Workflow inputs are captured by XBaya and workflow is launched to ODE. Workflow system and possibly some services publish notifications to the Message bus reporting the progress of the workflow. XBaya monitoring system listens to notifications and color the workflow components to present workflow progress.
  • 55. Packaged, Downloadable Software http://www.collab-ogce.org/ogce/index.php/Main_Page
  • 56. ODI Overview – Image Acquisition WIYN Buffer ODI Data Capacitor Integration Server Data Capacitor Science Gateway End Users MDSS Archive TeraGrid Compute Resources
  • 57. Overview – Image Copy WIYN Buffer ODI Data Capacitor Integration Server Data Capacitor Science Gateway End Users MDSS Archive TeraGrid Compute Resources
  • 58. Overview – Image Transfer to Data Capacitor WIYN Buffer ODI Data Capacitor Integration Server Data Capacitor Science Gateway End Users MDSS Archive TeraGrid Compute Resources
  • 59. Overview – Image Ingestions into the Archive WIYN Buffer ODI Data Capacitor Integration Server Data Capacitor Science Gateway End Users MDSS Archive TeraGrid Compute Resources
  • 60. Overview – Clean Up WIYN Buffer ODI Data Capacitor Integration Server Data Capacitor Science Gateway End Users MDSS Archive TeraGrid Compute Resources
  • 61. Overview – Automated Tier 1 Processing WIYN Buffer ODI Data Capacitor Integration Server Data Capacitor Science Gateway End Users MDSS Archive TeraGrid Compute Resources
  • 62. Overview – User Driven Tier 2 Processing WIYN Buffer ODI Data Capacitor Integration Server Data Capacitor Science Gateway End Users MDSS Archive TeraGrid Compute Resources
  • 63. Overview – User Driven Tier 2 Processing WIYN Buffer ODI Data Capacitor Integration Server Data Capacitor Science Gateway End Users MDSS Archive TeraGrid Compute Resources
  • 64.
  • 65. Some apps have rich Client Gui’s, a challenge with asynchronous long running workflows
  • 67. Parametric sweep scheduling, monitoring iteration steps, graphical composition
  • 68.
  • 69.
  • 70. Input list of amino acid sequences
  • 72.
  • 73. Live Demo & Questions?

Notas do Editor

  1. Scientific workflow requirements challenge WS-BPEL