SlideShare uma empresa Scribd logo
1 de 42
Introduction to Themes and Technologies  Per Öster <per.oster@csc.fi> CSC – IT Center for Science Ltd Finland
CSC at a glance ,[object Object]
Reorganized as a company, CSC - Scientific     Computing Ltd. in 1993
All shares to the Ministry of Education of Finland in 1997
 Operates on a non-profit principle
Facilities in Espoo, close to Otaniemi community (of 15,000 students and    16,000 technologyprofessionals)
Staff 170
Turnover 2008 19,6 millioneuros,[object Object]
Themes of the Second Week
The Acronyms
Principles of service-oriented architecture Principles of high-throughput computing Principles of distributed data management Principles of job submission and execution management Principles of using distributed and high performance systems Higher level APIs: OGSA-DAI, SAGA and metadata management Workflows
Principles of service-oriented architecture Principles of high-throughput computing Principles of distributed data management Principles of job submission and execution management Principles of using distributed and high performance systems Higher level APIs: OGSA-DAI, SAGA and metadata management Workflows
1. Principles of job submission and execution management	 Vision UNiformInterface to COmputingResources seamless, secure, and intuitive History 08/1997 – 12/2002: UNICORE and UNICORE Plus projects Initial development started in two German projects funded by the German ministry of education and research (BMBF) Continuation in different EU projects since 2002 Open Source community development since summer 2004
http://www.unicore.eu UNICORE 6 Guiding Principles, Implementation Strategies Open source under BSD license with software hosted on SourceForge Standards-based: OGSA-conform, WS-RF 1.2 compliant Open, extensible Service-Oriented Architecture (SOA) Interoperable with other Grid technologies Seamless, secure and intuitive following a vertical end-to-end approach Mature Security: X.509, proxy and VO support Workflow support tightly integrated while being extensible for different workflow languages and engines for domain-specific usage Application integration mechanisms on the client, services and resource level Variety of clients: graphical, command-line, API, portal, etc. Quick and simple installation and configuration Support for many operating systems (Windows, MacOS, Linux, UNIX) and batch systems (LoadLeveler, Torque, SLURM, LSF, OpenCCS) Implemented in Java to achieve platform-independence
scientific clientsand applications URCEclipse-based Rich client HiLAProgrammingAPI UCCcommand-line client Portal e.g. GridSphere X.509, Proxies, SOAP, WS-RF, WS-I, JSDL web service stack Gateway central services running in WS-RF hosting environments ServiceRegistry WorkflowEngine OGSA-RUS, UR,GLUE 2.0 ServiceOrchestrator CISInfoService Gateway – Site 1 Gateway – Site 2 authentication UNICOREWS-RFhostingenvironment UNICOREWS-RFhostingenvironment OGSA-ByteIO, OGSA-BES, JSDL, HPC-P, OGSA-RUS, UR UNICORE Atomic Services OGSA-* UNICORE Atomic Services OGSA-* UVOSVO Service Grid services hosting XNJS – Site 1 XNJS – Site 2 IDB IDB job incarnation X.509, XACML, SAML, Proxies XACML entity XACML entity XUUDB XUUDB authorization Target System Interface – Site 1 Target System Interface – Site 2 DRMAA ExternalStorage Local RMS (e.g. Torque, LL, LSF, etc.) Local RMS (e.g. Torque, LL, LSF, etc.) GridFTP, Proxies USpace USpace data transfer to external storages http://www.unicore.eu
http://www.unicore.eu Workflows in    Two layer architecture for scalability Workflow engine Based on Shark open-source XPDLengine Pluggable, domain-specific workflow languages Service orchestrator Job execution and monitoring Callback to workflow engine Brokering based on pluggable strategies Clients GUI client based on Eclipse Commandline submission of workflows is also possible
Principles of service-oriented architecture Principles of high-throughput computing Principles of distributed data management Principles of job submission and execution management Principles of using distributed and high performance systems Higher level APIs: OGSA-DAI, SAGA and metadata management Workflows
High-Throughput Computing Large amount of tasks that can be executed independently Parameter Studies Monte Carlo or Stochastic Methods Genome Sequencing (matching) Analysis of LHC data : Starting from this Looking for this (1 in 1013)
2. Principles of high-throughput computing Vision Condor provides high-throughput computing in a variety of environments Local dedicated clusters (machine rooms) Local opportunistic (desktop) computers) Grid environments; Can submit jobs to other systems Can run workflows of jobs Can run parallel jobs Independently parallel (lots of single jobs) Tightly coupled (such as MPI)
2. Principles of high-throughput computing History and Activity  Distributed Computing research performed by a team of ~35 faculty, full time staff and students who Established in 1985 Faces software/middleware engineering challenges in a UNIX/Linux/Windows/OS X environment,  Involved in national and international collaborations, Interacts with users in academia and industry, Maintains and support a distributed production environment (more than 5000 CPUs at UW), Educates and trains students.
Condor Project:Main Threads of Activities Distributed Computing Research – develop and evaluate new concepts, frameworks and technologies  Develop and maintain Condor; support our users  More on next slide The Open Science Grid (OSG) – build and operate a national High Throughput Computing infrastructure The Grid Laboratory Of Wisconsin (GLOW) – build, maintain and operate a distributed computing and storage infrastructure on the UW campus   The NSF Middleware Initiative  (NMI)  - Develop, build and operate a national Build and Test facility powered by Metronome  (ETICS-II)
Principles of service-oriented architecture Principles of high-throughput computing Principles of distributed data management Principles of job submission and execution management Principles of using distributed and high performance systems Higher level APIs: OGSA-DAI, SAGA and metadata management Workflows
Web Services XML DCE RPC DCOM RMI CORBA “Web services has dramatically reduced the programming and management cost of publishing and receiving information” Jim Gray, Microsoft Research EMBRACE – 4yr EU project to establish services for the bioinformatics community
3. Principles of service-oriented architectures Vision Provide the fundamental components to get the grid working History Starting point in I-WAY, a distributed high-performance network demonstrated at the SuperComputing '95 conference and exhibition
…14 Years Later 4 major versions Components to address the original problems Many new fields recent hot topics: service oriented science, virtualization Diverse application areas recently: lots of bioinformatics and medical apps others include: earthquakes, particle physics, earth sciences
21 Globus Software now – many components Globus Projects OGSA-DAI GT4 MPICH- G2 Data Rep Replica Location Java Runtime MyProxy Delegation GridWay GridFTP MDS4 CAS C Runtime GSI- OpenSSH Incubator Mgmt Reliable File Transfer GRAM Python Runtime C Sec GT4 Docs Incubator Projects Cog WF GAARDS VirtWkSp MEDICUS Others... Metrics OGRO GDTE UGP GridShib Dyn Acct Gavia JSC DDM LRMA HOC-SA PURSE Introduce WEEP Gavia MS SGGC ServMark Security Execution Mgmt Info Services Common Runtime Other Data Mgmt
Principles of service-oriented architecture Principles of high-throughput computing Principles of distributed data management Principles of job submission and execution management Principles of using distributed and high performance systems Higher level APIs: OGSA-DAI, SAGA and metadata management Workflows
4. Principles of distributed data management
EGEE Project Overview 17000 users 136000  LCPUs (cores) 25Pb disk 39Pb tape 12 million jobs/month +45% in a year 268 sites +5% in a year 48 countries +10% in a year 162 VOs +29% in a year Technical Status - Steven Newhouse - EGEE-III First Review 24-25 June 2009 24
Middleware Supporting HTC Technical Status - Steven Newhouse - EGEE-III First Review 24-25 June 2009 25 Archeology Astronomy Astrophysics Civil Protection Comp. Chemistry Earth Sciences Finance Fusion Geophysics High Energy Physics Life Sciences Multimedia Material Sciences History of gLite ,[object Object]
Entered production in May 2006
Middleware distribution of EGEESupported End-user Activity ,[object Object]
+44% users in a year
23 core VOs
A core VO has >10% of usage within its science cluster,[object Object]
Principles of service-oriented architecture Principles of high-throughput computing Principles of distributed data management Principles of job submission and execution management Principles of using distributed and high performance systems Higher level APIs: OGSA-DAI, SAGA and metadata management Workflows
The Computing “Eco-system” ,[object Object],TIER 1 Large-scale HPC centers Capability  Computing National/regional centers, Grid-collaboration TIER 2 Capacity  Computing TIER3 Local centers Personal/office computing TIER4
5. Principles of using distributed and high performance systems ARC middleware (Advanced Resource Connector) open source out-of-the-box Grid solution software which enables production quality computational and data Grids (released in May 2002) development is coordinated by NDGF emphasis is put on scalability, stability, reliability and performance builds upon standard OS solutions,OpenLDAP, OpenSSL, SASL and Globus Toolkit adds services not provided by Globus extends or completely replaces some Globus components
NorduGrid collaboration* ,[object Object],national Grids (e.g. M-grid, SweGrid, NorGrid), users also outside the Nordic countries real users, real applications implemented a production Grid system working non stop since May 2002 open for anyone to participate * http://www.nordugrid.org/monitor
M-grid   ̶  the Finnish Material Sciences Grid ,[object Object],partners are laboratories and departments and not university IT centers not limited by the field of research, used for a wide range of physical, chemical and nanoscience applications ,[object Object]
first large initiative to put Grid middleware into production use in Finland
goal:  throughput computing capacity  mainly for the needs of physics and chemistry researchers

Mais conteúdo relacionado

Mais procurados

OCCI - The Open Cloud Computing Interface – flexible, portable, interoperable...
OCCI - The Open Cloud Computing Interface – flexible, portable, interoperable...OCCI - The Open Cloud Computing Interface – flexible, portable, interoperable...
OCCI - The Open Cloud Computing Interface – flexible, portable, interoperable...
Alan Sill
 
Tutorial on Hybrid Data Infrastructures: D4Science as a case study
Tutorial on Hybrid Data Infrastructures: D4Science as a case studyTutorial on Hybrid Data Infrastructures: D4Science as a case study
Tutorial on Hybrid Data Infrastructures: D4Science as a case study
Blue BRIDGE
 
111018 geo sif_aq_interop
111018 geo sif_aq_interop111018 geo sif_aq_interop
111018 geo sif_aq_interop
Rudolf Husar
 
Craig Walker_TERN Eco-informatics: Managing and delivering ecological researc...
Craig Walker_TERN Eco-informatics: Managing and delivering ecological researc...Craig Walker_TERN Eco-informatics: Managing and delivering ecological researc...
Craig Walker_TERN Eco-informatics: Managing and delivering ecological researc...
TERN Australia
 
Azure Brain: 4th paradigm, scientific discovery & (really) big data
Azure Brain: 4th paradigm, scientific discovery & (really) big dataAzure Brain: 4th paradigm, scientific discovery & (really) big data
Azure Brain: 4th paradigm, scientific discovery & (really) big data
Microsoft Technet France
 

Mais procurados (20)

Gridforum Juergen Knobloch Grids For Science 20080402
Gridforum Juergen Knobloch Grids For Science 20080402Gridforum Juergen Knobloch Grids For Science 20080402
Gridforum Juergen Knobloch Grids For Science 20080402
 
Cyberinfrastructure and its Role in Science
Cyberinfrastructure and its Role in ScienceCyberinfrastructure and its Role in Science
Cyberinfrastructure and its Role in Science
 
OCCI - The Open Cloud Computing Interface – flexible, portable, interoperable...
OCCI - The Open Cloud Computing Interface – flexible, portable, interoperable...OCCI - The Open Cloud Computing Interface – flexible, portable, interoperable...
OCCI - The Open Cloud Computing Interface – flexible, portable, interoperable...
 
Cloud Testbeds for Standards Development and Innovation
Cloud Testbeds for Standards Development and InnovationCloud Testbeds for Standards Development and Innovation
Cloud Testbeds for Standards Development and Innovation
 
Calit2-a Persistent UCSD/UCI Framework for Collaboration
Calit2-a Persistent UCSD/UCI Framework for CollaborationCalit2-a Persistent UCSD/UCI Framework for Collaboration
Calit2-a Persistent UCSD/UCI Framework for Collaboration
 
Tutorial on Hybrid Data Infrastructures: D4Science as a case study
Tutorial on Hybrid Data Infrastructures: D4Science as a case studyTutorial on Hybrid Data Infrastructures: D4Science as a case study
Tutorial on Hybrid Data Infrastructures: D4Science as a case study
 
Using e-Infrastructures for Biodiversity Conservation
Using e-Infrastructures for Biodiversity ConservationUsing e-Infrastructures for Biodiversity Conservation
Using e-Infrastructures for Biodiversity Conservation
 
Graphs are at the Heart of the Cloud
Graphs are at the Heart of the CloudGraphs are at the Heart of the Cloud
Graphs are at the Heart of the Cloud
 
Using a Widely Distributed Federated Cloud System to Support Multiple Dispara...
Using a Widely Distributed Federated Cloud System to Support Multiple Dispara...Using a Widely Distributed Federated Cloud System to Support Multiple Dispara...
Using a Widely Distributed Federated Cloud System to Support Multiple Dispara...
 
TeraGrid Communication and Computation
TeraGrid Communication and ComputationTeraGrid Communication and Computation
TeraGrid Communication and Computation
 
SCAPE - Building Digital Preservation Infrastructure
SCAPE - Building Digital Preservation InfrastructureSCAPE - Building Digital Preservation Infrastructure
SCAPE - Building Digital Preservation Infrastructure
 
GlobusWorld 2021: Arecibo Observatory Data Movement
GlobusWorld 2021: Arecibo Observatory Data MovementGlobusWorld 2021: Arecibo Observatory Data Movement
GlobusWorld 2021: Arecibo Observatory Data Movement
 
Big Data, Beyond the Data Center
Big Data, Beyond the Data CenterBig Data, Beyond the Data Center
Big Data, Beyond the Data Center
 
111018 geo sif_aq_interop
111018 geo sif_aq_interop111018 geo sif_aq_interop
111018 geo sif_aq_interop
 
NSF CAC Cloud Interoperability Testbed Projects
NSF CAC Cloud Interoperability Testbed ProjectsNSF CAC Cloud Interoperability Testbed Projects
NSF CAC Cloud Interoperability Testbed Projects
 
Enabling efficient movement of data into & out of a high-performance analysis...
Enabling efficient movement of data into & out of a high-performance analysis...Enabling efficient movement of data into & out of a high-performance analysis...
Enabling efficient movement of data into & out of a high-performance analysis...
 
Mateo Valero - Big data: de la investigación científica a la gestión empresarial
Mateo Valero - Big data: de la investigación científica a la gestión empresarialMateo Valero - Big data: de la investigación científica a la gestión empresarial
Mateo Valero - Big data: de la investigación científica a la gestión empresarial
 
IDB-Cloud Providing Bioinformatics Services on Cloud
IDB-Cloud Providing Bioinformatics Services on CloudIDB-Cloud Providing Bioinformatics Services on Cloud
IDB-Cloud Providing Bioinformatics Services on Cloud
 
Craig Walker_TERN Eco-informatics: Managing and delivering ecological researc...
Craig Walker_TERN Eco-informatics: Managing and delivering ecological researc...Craig Walker_TERN Eco-informatics: Managing and delivering ecological researc...
Craig Walker_TERN Eco-informatics: Managing and delivering ecological researc...
 
Azure Brain: 4th paradigm, scientific discovery & (really) big data
Azure Brain: 4th paradigm, scientific discovery & (really) big dataAzure Brain: 4th paradigm, scientific discovery & (really) big data
Azure Brain: 4th paradigm, scientific discovery & (really) big data
 

Destaque (6)

Session5 T Infr Access Emidio
Session5 T Infr Access EmidioSession5 T Infr Access Emidio
Session5 T Infr Access Emidio
 
Session 40 : SAGA Overview and Introduction
Session 40 : SAGA Overview and Introduction Session 40 : SAGA Overview and Introduction
Session 40 : SAGA Overview and Introduction
 
Application Form
Application FormApplication Form
Application Form
 
Issgc Welcome
Issgc WelcomeIssgc Welcome
Issgc Welcome
 
Session 49 - Semantic metadata management practical
Session 49 - Semantic metadata management practical Session 49 - Semantic metadata management practical
Session 49 - Semantic metadata management practical
 
Departure
DepartureDeparture
Departure
 

Semelhante a General Introduction to technologies that will be seen in the school

OpenDaylight-in-NextGenNetworkServices
OpenDaylight-in-NextGenNetworkServicesOpenDaylight-in-NextGenNetworkServices
OpenDaylight-in-NextGenNetworkServices
Charles Eckel
 
CV_Kelvin_2016
CV_Kelvin_2016CV_Kelvin_2016
CV_Kelvin_2016
Kelvin Tan
 
Hungarian ClusterGrid and its applications
Hungarian ClusterGrid and its applicationsHungarian ClusterGrid and its applications
Hungarian ClusterGrid and its applications
Ferenc Szalai
 
Ogce Workflow Suite
Ogce Workflow SuiteOgce Workflow Suite
Ogce Workflow Suite
smarru
 

Semelhante a General Introduction to technologies that will be seen in the school (20)

The Malaysian Government Interopersbility Framework For Open Source Software ...
The Malaysian Government Interopersbility Framework For Open Source Software ...The Malaysian Government Interopersbility Framework For Open Source Software ...
The Malaysian Government Interopersbility Framework For Open Source Software ...
 
MPLS/SDN 2013 Intercloud Standardization and Testbeds - Sill
MPLS/SDN 2013 Intercloud Standardization and Testbeds - SillMPLS/SDN 2013 Intercloud Standardization and Testbeds - Sill
MPLS/SDN 2013 Intercloud Standardization and Testbeds - Sill
 
OpenDaylight-in-NextGenNetworkServices
OpenDaylight-in-NextGenNetworkServicesOpenDaylight-in-NextGenNetworkServices
OpenDaylight-in-NextGenNetworkServices
 
ION Malta - IETF Update
ION Malta - IETF UpdateION Malta - IETF Update
ION Malta - IETF Update
 
ION Costa Rica - About the IETF and How to Get Involved
ION Costa Rica - About the IETF and How to Get InvolvedION Costa Rica - About the IETF and How to Get Involved
ION Costa Rica - About the IETF and How to Get Involved
 
ION Islamabad - What's Happening at the IETF?
ION Islamabad - What's Happening at the IETF?ION Islamabad - What's Happening at the IETF?
ION Islamabad - What's Happening at the IETF?
 
ION Belgrade - IETF Update
ION Belgrade - IETF UpdateION Belgrade - IETF Update
ION Belgrade - IETF Update
 
Activeeon - Scale Beyond Limits
Activeeon - Scale Beyond LimitsActiveeon - Scale Beyond Limits
Activeeon - Scale Beyond Limits
 
Federated Cloud Computing
Federated Cloud ComputingFederated Cloud Computing
Federated Cloud Computing
 
High-Performance and Scalable Designs of Programming Models for Exascale Systems
High-Performance and Scalable Designs of Programming Models for Exascale SystemsHigh-Performance and Scalable Designs of Programming Models for Exascale Systems
High-Performance and Scalable Designs of Programming Models for Exascale Systems
 
ION Durban - What's Happening at the IETF?
ION Durban - What's Happening at the IETF?ION Durban - What's Happening at the IETF?
ION Durban - What's Happening at the IETF?
 
SomeSlides
SomeSlidesSomeSlides
SomeSlides
 
2017 dagstuhl-nfv-rothenberg
2017 dagstuhl-nfv-rothenberg2017 dagstuhl-nfv-rothenberg
2017 dagstuhl-nfv-rothenberg
 
CV_Kelvin_2016
CV_Kelvin_2016CV_Kelvin_2016
CV_Kelvin_2016
 
Hungarian ClusterGrid and its applications
Hungarian ClusterGrid and its applicationsHungarian ClusterGrid and its applications
Hungarian ClusterGrid and its applications
 
A Library for Emerging High-Performance Computing Clusters
A Library for Emerging High-Performance Computing ClustersA Library for Emerging High-Performance Computing Clusters
A Library for Emerging High-Performance Computing Clusters
 
Red hat's updates on the cloud & infrastructure strategy
Red hat's updates on the cloud & infrastructure strategyRed hat's updates on the cloud & infrastructure strategy
Red hat's updates on the cloud & infrastructure strategy
 
Network Softwarization on KREONET: KREONET-S
Network Softwarization on KREONET: KREONET-SNetwork Softwarization on KREONET: KREONET-S
Network Softwarization on KREONET: KREONET-S
 
Designing HPC & Deep Learning Middleware for Exascale Systems
Designing HPC & Deep Learning Middleware for Exascale SystemsDesigning HPC & Deep Learning Middleware for Exascale Systems
Designing HPC & Deep Learning Middleware for Exascale Systems
 
Ogce Workflow Suite
Ogce Workflow SuiteOgce Workflow Suite
Ogce Workflow Suite
 

Mais de ISSGC Summer School

Session 58 - Cloud computing, virtualisation and the future
Session 58 - Cloud computing, virtualisation and the future Session 58 - Cloud computing, virtualisation and the future
Session 58 - Cloud computing, virtualisation and the future
ISSGC Summer School
 
Session 58 :: Cloud computing, virtualisation and the future Speaker: Ake Edlund
Session 58 :: Cloud computing, virtualisation and the future Speaker: Ake EdlundSession 58 :: Cloud computing, virtualisation and the future Speaker: Ake Edlund
Session 58 :: Cloud computing, virtualisation and the future Speaker: Ake Edlund
ISSGC Summer School
 
Session 50 - High Performance Computing Ecosystem in Europe
Session 50 - High Performance Computing Ecosystem in EuropeSession 50 - High Performance Computing Ecosystem in Europe
Session 50 - High Performance Computing Ecosystem in Europe
ISSGC Summer School
 
Session 49 Practical Semantic Sticky Note
Session 49 Practical Semantic Sticky NoteSession 49 Practical Semantic Sticky Note
Session 49 Practical Semantic Sticky Note
ISSGC Summer School
 
Session 48 - Principles of Semantic metadata management
Session 48 - Principles of Semantic metadata management Session 48 - Principles of Semantic metadata management
Session 48 - Principles of Semantic metadata management
ISSGC Summer School
 
Session 46 - Principles of workflow management and execution
Session 46 - Principles of workflow management and execution Session 46 - Principles of workflow management and execution
Session 46 - Principles of workflow management and execution
ISSGC Summer School
 
Session 37 - Intro to Workflows, API's and semantics
Session 37 - Intro to Workflows, API's and semantics Session 37 - Intro to Workflows, API's and semantics
Session 37 - Intro to Workflows, API's and semantics
ISSGC Summer School
 
Session 24 - Distribute Data and Metadata Management with gLite
Session 24 - Distribute Data and Metadata Management with gLiteSession 24 - Distribute Data and Metadata Management with gLite
Session 24 - Distribute Data and Metadata Management with gLite
ISSGC Summer School
 

Mais de ISSGC Summer School (20)

Session 58 - Cloud computing, virtualisation and the future
Session 58 - Cloud computing, virtualisation and the future Session 58 - Cloud computing, virtualisation and the future
Session 58 - Cloud computing, virtualisation and the future
 
Session 58 :: Cloud computing, virtualisation and the future Speaker: Ake Edlund
Session 58 :: Cloud computing, virtualisation and the future Speaker: Ake EdlundSession 58 :: Cloud computing, virtualisation and the future Speaker: Ake Edlund
Session 58 :: Cloud computing, virtualisation and the future Speaker: Ake Edlund
 
Session 50 - High Performance Computing Ecosystem in Europe
Session 50 - High Performance Computing Ecosystem in EuropeSession 50 - High Performance Computing Ecosystem in Europe
Session 50 - High Performance Computing Ecosystem in Europe
 
Integrating Practical2009
Integrating Practical2009Integrating Practical2009
Integrating Practical2009
 
Session 49 Practical Semantic Sticky Note
Session 49 Practical Semantic Sticky NoteSession 49 Practical Semantic Sticky Note
Session 49 Practical Semantic Sticky Note
 
Session 48 - Principles of Semantic metadata management
Session 48 - Principles of Semantic metadata management Session 48 - Principles of Semantic metadata management
Session 48 - Principles of Semantic metadata management
 
Session 46 - Principles of workflow management and execution
Session 46 - Principles of workflow management and execution Session 46 - Principles of workflow management and execution
Session 46 - Principles of workflow management and execution
 
Session 42 - GridSAM
Session 42 - GridSAMSession 42 - GridSAM
Session 42 - GridSAM
 
Session 37 - Intro to Workflows, API's and semantics
Session 37 - Intro to Workflows, API's and semantics Session 37 - Intro to Workflows, API's and semantics
Session 37 - Intro to Workflows, API's and semantics
 
Session 43 :: Accessing data using a common interface: OGSA-DAI as an example
Session 43 :: Accessing data using a common interface: OGSA-DAI as an exampleSession 43 :: Accessing data using a common interface: OGSA-DAI as an example
Session 43 :: Accessing data using a common interface: OGSA-DAI as an example
 
Session 36 - Engage Results
Session 36 - Engage ResultsSession 36 - Engage Results
Session 36 - Engage Results
 
Social Program
Social ProgramSocial Program
Social Program
 
Session29 Arc
Session29 ArcSession29 Arc
Session29 Arc
 
Session 24 - Distribute Data and Metadata Management with gLite
Session 24 - Distribute Data and Metadata Management with gLiteSession 24 - Distribute Data and Metadata Management with gLite
Session 24 - Distribute Data and Metadata Management with gLite
 
Session 23 - gLite Overview
Session 23 - gLite OverviewSession 23 - gLite Overview
Session 23 - gLite Overview
 
Session 3-Distributed System Principals
Session 3-Distributed System PrincipalsSession 3-Distributed System Principals
Session 3-Distributed System Principals
 
Session10part2 Servers Detailed
Session10part2  Servers DetailedSession10part2  Servers Detailed
Session10part2 Servers Detailed
 
Session18 Madduri
Session18  MadduriSession18  Madduri
Session18 Madduri
 
Session6 Security Emidio
Session6 Security  EmidioSession6 Security  Emidio
Session6 Security Emidio
 
Session9part1
Session9part1Session9part1
Session9part1
 

Último

Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
KarakKing
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 

Último (20)

Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 

General Introduction to technologies that will be seen in the school

  • 1. Introduction to Themes and Technologies Per Öster <per.oster@csc.fi> CSC – IT Center for Science Ltd Finland
  • 2.
  • 3. Reorganized as a company, CSC - Scientific Computing Ltd. in 1993
  • 4. All shares to the Ministry of Education of Finland in 1997
  • 5. Operates on a non-profit principle
  • 6. Facilities in Espoo, close to Otaniemi community (of 15,000 students and 16,000 technologyprofessionals)
  • 8.
  • 9. Themes of the Second Week
  • 11. Principles of service-oriented architecture Principles of high-throughput computing Principles of distributed data management Principles of job submission and execution management Principles of using distributed and high performance systems Higher level APIs: OGSA-DAI, SAGA and metadata management Workflows
  • 12. Principles of service-oriented architecture Principles of high-throughput computing Principles of distributed data management Principles of job submission and execution management Principles of using distributed and high performance systems Higher level APIs: OGSA-DAI, SAGA and metadata management Workflows
  • 13. 1. Principles of job submission and execution management Vision UNiformInterface to COmputingResources seamless, secure, and intuitive History 08/1997 – 12/2002: UNICORE and UNICORE Plus projects Initial development started in two German projects funded by the German ministry of education and research (BMBF) Continuation in different EU projects since 2002 Open Source community development since summer 2004
  • 14. http://www.unicore.eu UNICORE 6 Guiding Principles, Implementation Strategies Open source under BSD license with software hosted on SourceForge Standards-based: OGSA-conform, WS-RF 1.2 compliant Open, extensible Service-Oriented Architecture (SOA) Interoperable with other Grid technologies Seamless, secure and intuitive following a vertical end-to-end approach Mature Security: X.509, proxy and VO support Workflow support tightly integrated while being extensible for different workflow languages and engines for domain-specific usage Application integration mechanisms on the client, services and resource level Variety of clients: graphical, command-line, API, portal, etc. Quick and simple installation and configuration Support for many operating systems (Windows, MacOS, Linux, UNIX) and batch systems (LoadLeveler, Torque, SLURM, LSF, OpenCCS) Implemented in Java to achieve platform-independence
  • 15. scientific clientsand applications URCEclipse-based Rich client HiLAProgrammingAPI UCCcommand-line client Portal e.g. GridSphere X.509, Proxies, SOAP, WS-RF, WS-I, JSDL web service stack Gateway central services running in WS-RF hosting environments ServiceRegistry WorkflowEngine OGSA-RUS, UR,GLUE 2.0 ServiceOrchestrator CISInfoService Gateway – Site 1 Gateway – Site 2 authentication UNICOREWS-RFhostingenvironment UNICOREWS-RFhostingenvironment OGSA-ByteIO, OGSA-BES, JSDL, HPC-P, OGSA-RUS, UR UNICORE Atomic Services OGSA-* UNICORE Atomic Services OGSA-* UVOSVO Service Grid services hosting XNJS – Site 1 XNJS – Site 2 IDB IDB job incarnation X.509, XACML, SAML, Proxies XACML entity XACML entity XUUDB XUUDB authorization Target System Interface – Site 1 Target System Interface – Site 2 DRMAA ExternalStorage Local RMS (e.g. Torque, LL, LSF, etc.) Local RMS (e.g. Torque, LL, LSF, etc.) GridFTP, Proxies USpace USpace data transfer to external storages http://www.unicore.eu
  • 16. http://www.unicore.eu Workflows in Two layer architecture for scalability Workflow engine Based on Shark open-source XPDLengine Pluggable, domain-specific workflow languages Service orchestrator Job execution and monitoring Callback to workflow engine Brokering based on pluggable strategies Clients GUI client based on Eclipse Commandline submission of workflows is also possible
  • 17. Principles of service-oriented architecture Principles of high-throughput computing Principles of distributed data management Principles of job submission and execution management Principles of using distributed and high performance systems Higher level APIs: OGSA-DAI, SAGA and metadata management Workflows
  • 18. High-Throughput Computing Large amount of tasks that can be executed independently Parameter Studies Monte Carlo or Stochastic Methods Genome Sequencing (matching) Analysis of LHC data : Starting from this Looking for this (1 in 1013)
  • 19. 2. Principles of high-throughput computing Vision Condor provides high-throughput computing in a variety of environments Local dedicated clusters (machine rooms) Local opportunistic (desktop) computers) Grid environments; Can submit jobs to other systems Can run workflows of jobs Can run parallel jobs Independently parallel (lots of single jobs) Tightly coupled (such as MPI)
  • 20. 2. Principles of high-throughput computing History and Activity Distributed Computing research performed by a team of ~35 faculty, full time staff and students who Established in 1985 Faces software/middleware engineering challenges in a UNIX/Linux/Windows/OS X environment, Involved in national and international collaborations, Interacts with users in academia and industry, Maintains and support a distributed production environment (more than 5000 CPUs at UW), Educates and trains students.
  • 21. Condor Project:Main Threads of Activities Distributed Computing Research – develop and evaluate new concepts, frameworks and technologies Develop and maintain Condor; support our users More on next slide The Open Science Grid (OSG) – build and operate a national High Throughput Computing infrastructure The Grid Laboratory Of Wisconsin (GLOW) – build, maintain and operate a distributed computing and storage infrastructure on the UW campus The NSF Middleware Initiative (NMI) - Develop, build and operate a national Build and Test facility powered by Metronome (ETICS-II)
  • 22. Principles of service-oriented architecture Principles of high-throughput computing Principles of distributed data management Principles of job submission and execution management Principles of using distributed and high performance systems Higher level APIs: OGSA-DAI, SAGA and metadata management Workflows
  • 23. Web Services XML DCE RPC DCOM RMI CORBA “Web services has dramatically reduced the programming and management cost of publishing and receiving information” Jim Gray, Microsoft Research EMBRACE – 4yr EU project to establish services for the bioinformatics community
  • 24. 3. Principles of service-oriented architectures Vision Provide the fundamental components to get the grid working History Starting point in I-WAY, a distributed high-performance network demonstrated at the SuperComputing '95 conference and exhibition
  • 25. …14 Years Later 4 major versions Components to address the original problems Many new fields recent hot topics: service oriented science, virtualization Diverse application areas recently: lots of bioinformatics and medical apps others include: earthquakes, particle physics, earth sciences
  • 26. 21 Globus Software now – many components Globus Projects OGSA-DAI GT4 MPICH- G2 Data Rep Replica Location Java Runtime MyProxy Delegation GridWay GridFTP MDS4 CAS C Runtime GSI- OpenSSH Incubator Mgmt Reliable File Transfer GRAM Python Runtime C Sec GT4 Docs Incubator Projects Cog WF GAARDS VirtWkSp MEDICUS Others... Metrics OGRO GDTE UGP GridShib Dyn Acct Gavia JSC DDM LRMA HOC-SA PURSE Introduce WEEP Gavia MS SGGC ServMark Security Execution Mgmt Info Services Common Runtime Other Data Mgmt
  • 27. Principles of service-oriented architecture Principles of high-throughput computing Principles of distributed data management Principles of job submission and execution management Principles of using distributed and high performance systems Higher level APIs: OGSA-DAI, SAGA and metadata management Workflows
  • 28. 4. Principles of distributed data management
  • 29. EGEE Project Overview 17000 users 136000 LCPUs (cores) 25Pb disk 39Pb tape 12 million jobs/month +45% in a year 268 sites +5% in a year 48 countries +10% in a year 162 VOs +29% in a year Technical Status - Steven Newhouse - EGEE-III First Review 24-25 June 2009 24
  • 30.
  • 32.
  • 33. +44% users in a year
  • 35.
  • 36. Principles of service-oriented architecture Principles of high-throughput computing Principles of distributed data management Principles of job submission and execution management Principles of using distributed and high performance systems Higher level APIs: OGSA-DAI, SAGA and metadata management Workflows
  • 37.
  • 38. 5. Principles of using distributed and high performance systems ARC middleware (Advanced Resource Connector) open source out-of-the-box Grid solution software which enables production quality computational and data Grids (released in May 2002) development is coordinated by NDGF emphasis is put on scalability, stability, reliability and performance builds upon standard OS solutions,OpenLDAP, OpenSSL, SASL and Globus Toolkit adds services not provided by Globus extends or completely replaces some Globus components
  • 39.
  • 40.
  • 41. first large initiative to put Grid middleware into production use in Finland
  • 42. goal: throughput computing capacity mainly for the needs of physics and chemistry researchers
  • 43.
  • 46. 11 TF peak performance
  • 47.
  • 52. Principles of service-oriented architecture Principles of high-throughput computing Principles of distributed data management Principles of job submission and execution management Principles of using distributed and high performance systems Higher level APIs: OGSA-DAI, SAGA and metadata management Workflows
  • 53. 6. Higher level APIs: OGSA-DAI, SAGA and metadata management (S-OGSA) OGSA-DAI Vision is to enable the sharing of data resources to enable collaboration, to support: Data access - access to structured data in distributed heterogeneous data resources. Data transformation e.g. expose data in schema X to users as data in schema Y. Data integration e.g. expose multiple databases to users as a single virtual database Data delivery - delivering data to where it's needed by the most appropriate means e.g. web service, e-mail, HTTP, FTP, GridFTP
  • 54. 6. Higher level APIs: OGSA-DAI, SAGA and metadata management (S-OGSA) OGSA-DAI History The OGSA-DAI project started in February 2002 as part of the UK e-Science Grid Core Program Is today part of OMII-UK, a partnership between: OMII, The University of Southampton myGrid, The University of Manchester OGSA-DAI, The University of Edinburgh
  • 55. 6. Higher level APIs: OGSA-DAI, SAGA and metadata management (S-OGSA) Vision of a Simple API for Grid Application - SAGA Provide simple programmatic interface that is widely-adopted, usable and available for enabling applications for the grid Simplicity: easy to use, install, administer and maintain Uniformity: provides support for different application programming languages as well as consistent semantics and style for different Grid functionality Scalability: Contains mechanisms for the same application (source) code to run on a variety of systems ranging from laptops to HPC resources Genericity: adds support for different grid middleware, even concurrent ones Modularity: provides a framework that is easily extendable
  • 56. 6. Higher level APIs: OGSA-DAI, SAGA and metadata management (S-OGSA) Metadata management: Make metadata Princess in the kingdom of Semantic Web
  • 57. Principles of service-oriented architecture Principles of high-throughput computing Principles of distributed data management Principles of job submission and execution management Principles of using distributed and high performance systems Higher level APIs: OGSA-DAI, SAGA and metadata management Workflows
  • 58. 7. Workflows Organize your work e.g: Gather initial data Pre-processing of data Define computing job(s) Initiate job(s) Gather results Post-processing of results : Repeat During the school you will understand how you can do this in different ways with the systems studied. But, this can also be done with specific workflow systems: Taverna, P-Grade Portal,…
  • 59. Motivations for developing P-GRADE portal P-GRADE portal should Give an answer for all the questions of an e-scientist Hide the complexity of the underlying grid middlewares Provide a high-level graphical user interface that is easy-to-use for e-scientists Support many different grid programming approaches (see Morris Riedel’s talk): Simple Scripts & Control (sequential and MPI job execution) Scientific Application Plug-ins (based on GEMLCA) Complex Workflows Parameter sweep applications: both on job and workflow level Interoperability: transparent access to grids based on different middleware technology Support three levels of parallelism
  • 60. Short History of P-GRADE portal Parallel Grid Application and Development Environment Initial development started in the Hungarian SuperComputing Grid project in 2003 It has been continuously developed since 2003 Detailed information: http://portal.p-grade.hu/ Open Source community development since January 2008: https://sourceforge.net/projects/pgportal/
  • 61. Integrating Practical Principles of service-oriented architecture Principles of high-throughput computing Principles of distributed data management Principles of job submission and execution management Principles of using distributed and high performance systems Higher level APIs: OGSA-DAI, SAGA and metadata management Workflows

Notas do Editor

  1. Yellow – gLite, Green – externally supported components, gLite consortium