I presented this keynote talk at the WorldComp conference in Las Vegas, on July 13, 2009. In it, I summarize what grid is about, focusing on the "integration" function rather than the "outsourcing" function (what people call "cloud" today), using biomedical examples in particular.
Grid Computing July 2009
1. Grid computing Ian Foster Computation Institute Argonne National Lab & University of Chicago
2. “When the network is as fast as the computer’s internal links, the machine disintegrates across the net into a set of special purpose appliances” (George Gilder, 2001)
4. “Computation may someday be organized as a public utility … The computing utility could become the basis for a new and important industry.” John McCarthy (1961)
9. We need to function in the zone of complexity (Ralph Stacey, Complexity and Creativity in Organizations, 1996). Axes: agreement about outcomes vs. certainty about outcomes, each running from low to high. Regions: plan and control, chaos, and the zone of complexity between them.
17. The Grid paradigm and information integration. Data sources: radiology, medical records, pathology, genomics, labs, RHIO. Platform services: name resources and move data around; make resources accessible over the network; make resources usable and useful; manage who can do what.
18. The Grid paradigm and information integration. Data sources: radiology, medical records, pathology, genomics, labs, RHIO. Platform services: management, integration, publication, security and policy. Value services: transform data into knowledge; enhance user cognitive processes; incorporate into business processes.
19. The Grid paradigm and information integration. Data sources: radiology, medical records, pathology, genomics, labs, RHIO. Platform services: management, integration, publication, security and policy. Value services: analysis, cognitive support, applications.
22. Policy language abstraction level and expressiveness, from simplest to most expressive:
Identity-based authZ: most simple, not scalable.
Unix access control lists (discretionary access control, DAC): groups, directories, simple administration.
POSIX ACLs / MS ACLs: finer-grained admin policy.
Role-based access control (RBAC): separation of role/group administration from rule administration.
Mandatory access control (MAC): clearance, classification, compartmentalization.
Attribute-based access control (ABAC): generalization to arbitrary attributes.
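To make the ABAC idea at the top of this progression concrete, here is a minimal sketch of attribute-based policy evaluation. It is illustrative only; the roles, attribute names, and rules are invented for this example and are not from the talk.

```python
# Minimal attribute-based access control (ABAC) sketch.
# A policy is a predicate over subject and resource attributes plus an action.
# All role/attribute names below are hypothetical.

def can_access(subject, resource, action):
    """Grant access when the subject's attributes satisfy the policy."""
    if action == "read":
        # Researchers may read de-identified data from any site.
        if subject.get("role") == "researcher" and resource.get("deidentified"):
            return True
        # Clinicians may read records from their own institution.
        if (subject.get("role") == "clinician"
                and subject.get("org") == resource.get("org")):
            return True
    if action == "write":
        # Only the owning institution's data managers may write.
        return (subject.get("role") == "data-manager"
                and subject.get("org") == resource.get("org"))
    return False

alice = {"role": "researcher", "org": "uchicago"}
scan = {"org": "anl", "deidentified": True}
print(can_access(alice, scan, "read"))   # True
print(can_access(alice, scan, "write"))  # False
```

The point of the progression on the slide is that each step generalizes the last: identity lists are a degenerate ABAC policy over one attribute (identity), RBAC a policy over a role attribute, and full ABAC a policy over arbitrary attribute sets.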
29. Children’s Oncology Group Enterprise/Grid Interface service: DICOM protocols on one side, Grid protocols (Web services) on the other; plug-in adapters for DICOM, XDS, HL7, and vendor-specific interfaces; a wide-area service actor.
31. As of Oct 19, 2008: 122 participants; 105 services (70 data, 35 analytical).
36. Health Object Identifier (HOI) naming system. Example: uri:hdl://888.us.npi.1234567890.dicom/8A648C33-A5…4939EBE. The identifier body is a random string: PHI-free and guaranteed unique. 888 is CHI’s top-level naming authority; the National Provider ID is used in the hierarchical identifier namespace; the application context’s namespace is governed by the provider naming authority. The HOI URI schema identifier is based on Handle.
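The structure on the slide can be sketched in a few lines. This is an illustration of the idea (random, PHI-free suffix under a hierarchical Handle-style prefix), not the actual HOI implementation; the function name and use of a UUID for the identifier body are my assumptions.

```python
import uuid

def make_hoi(naming_authority, npi, context):
    """Construct a Handle-style Health Object Identifier (illustrative).
    The prefix encodes the naming authority, the National Provider ID, and
    the application context; the suffix is random, so it carries no
    protected health information (PHI) and is unique with overwhelming
    probability."""
    suffix = str(uuid.uuid4()).upper()  # random, PHI-free identifier body
    return f"uri:hdl://{naming_authority}.us.npi.{npi}.{context}/{suffix}"

hoi = make_hoi("888", "1234567890", "dicom")
print(hoi)
```

The key design point is that the name itself reveals nothing about the patient: ownership and context live in the administered prefix, while the object-specific part is opaque, which directly addresses the PHI-tainted-filename problem discussed later in the notes.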
39. Integration: making information useful. Axes: degree of prior syntactic and semantic agreement (0% to 100%) vs. degree of communication (0% to 100%). Three strategies plotted against them: a rigid standards-based approach, a loosely coupled approach, and an adaptive approach.
41. ECOG 5202 integrated sample management ECOG CC ECOG PCO MD Anderson Web portal OGSA-DQP OGSA-DAI OGSA-DAI OGSA-DAI Mediator
44. Many, many tasks: identifying potential drug targets. 2M+ ligands x protein target(s). (Mike Kubal, Benoit Roux, and others)
45. The docking funnel. PDB protein descriptions (1 protein, ~1 MB) are manually prepped into DOCK6 and FRED receptor files (1 per protein; each defines the pocket to bind to); ZINC supplies 3-D structures for 2M ligands (~6 GB). DOCK6 and FRED dock ligands into the receptor (~4M tasks x 60 s x 1 CPU, ~60K CPU-hours); the best ~5K complexes are selected from each. Amber scoring follows: 1. AmberizeLigand, 2. AmberizeReceptor, 3. AmberizeComplex, 4. perl: generate NAB script (BuildNABScript, from a NAB script template plus parameters defining flexible residues and number of MD steps), 5. RunNABScript (~10K tasks x 20 min x 1 CPU, ~3K CPU-hours); the best ~500 are selected. GCMC then runs (~500 tasks x 10 hr x 100 CPUs, ~500K CPU-hours) and a report ends the workflow. For one target: 4 million tasks, 500,000 CPU-hours (50 CPU-years).
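The CPU-hour figures quoted on the slide can be checked with simple arithmetic. This sketch tallies the three funnel stages using only the task counts and durations quoted above:

```python
# Back-of-the-envelope tally of the docking funnel's cost,
# using the per-stage task counts and durations quoted on the slide.

stages = [
    # (name, tasks, seconds per task, CPUs per task)
    ("DOCK6/FRED docking", 4_000_000, 60, 1),
    ("Amber scoring", 10_000, 20 * 60, 1),
    ("GCMC refinement", 500, 10 * 3600, 100),
]

total = 0.0
for name, tasks, secs, cpus in stages:
    cpu_hours = tasks * secs * cpus / 3600
    total += cpu_hours
    print(f"{name}: ~{cpu_hours:,.0f} CPU-hours")

print(f"Total: ~{total:,.0f} CPU-hours (~{total / (24 * 365):.0f} CPU-years)")
```

Note that the final GCMC stage, though it runs on only ~500 selected complexes, dominates the total: the funnel spends cheap docking cycles broadly and expensive free-energy cycles narrowly.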
47. Scaling POSIX to petascale. A large dataset sits on a global file system; data is staged over torus and tree interconnects to a CN-striped intermediate file system (Chirp for multicast, MosaStore for striping) and to local LFS on compute nodes (local datasets).
48. Efficiency for 4-second tasks and varying data sizes (1KB to 1MB) for CIO and GPFS, up to 32K processors.
53. Functioning in the zone of complexity (Ralph Stacey, Complexity and Creativity in Organizations, 1996). Axes: agreement about outcomes vs. certainty about outcomes, each running from low to high. Regions: plan and control, chaos.
54. The Grid paradigm and information integration. Data sources: radiology, medical records, pathology, genomics, labs, RHIO. Platform services: management, integration, publication, security and policy. Value services: analysis, cognitive support, applications.
55. “The computer revolution hasn’t happened yet.” Alan Kay, 1997
56. Connectivity (on a log scale) vs. time, with curves for science, enterprise, and consumer adoption: Grid, then Cloud, then ???? “When the network is as fast as the computer's internal links, the machine disintegrates across the net into a set of special purpose appliances” (George Gilder, 2001)
With high-speed networks, the Internet becomes more than a communications device: it becomes a computing device. We can disintegrate the computer, outsourcing computing and storage, for example. And we can aggregate capabilities (data and software; computing and storage) from many places. The outsourcing/on-demand part is what people have called grid, utility computing, and more recently infrastructure as a service or cloud. It seems to be going mainstream, which is very exciting (and about time!). It’s worth remembering that these ideas are old.
What I want to focus on today is the aggregation part, and in particular on the “virtual organization” concept. Let me remind us of another comment made back in 2001.
Early on, people realized that it didn’t make sense for people to travel to computers: we should be able to compute outside the box. AI pioneer John McCarthy, for example, spoke in these terms in 1961, at the launch of Project MAC (?). Here he is a couple of years ago, as such an industry is just emerging. It takes a while.
We cite [Rouse, Health Care as a CAS: Implications for Design…, NAE 2008] for the right-hand side part. Must support: dynamic composition for a specific purpose; an evolving community, function, and environment; messy data, failure, and incomplete knowledge. Nice, but insufficient: data standards, platform standards, federal policies.
Another perspective on the problem. A few words of explanation: if we are deploying a hospital IT system, we have … Add other regions of agreement. You can’t achieve success via central planning. Quoted in Crossing the Quality Chasm, p. 312.
We could show these things as moving if we wanted to be really clever. Over time, things change and these groups evolve; if we are successful, they merge.
Foster, Kesselman, and Tuecke claimed that grids were all about “virtual organizations.” The way one should interpret that claim, I would assert, is in the context of Gilder’s comments. Things are distributed, for one reason or another: via a deliberate disintegration process, via outsourcing, or because they just started out distributed. Now we need to reassemble them, in a controlled manner. We gave some examples.
The first encompasses what people are tending to call “cloud” today. The fourth, of course, we are quite familiar with! Today, I would use some additional examples, taken from healthcare, a field that I believe will be the “killer app” for VO technologies.
In particular, the organizational behavior and management community, who have studied virtual organizations for many years. Our VOs have a lot in common with theirs, but also differences: we’re not just about people, and maybe not even particularly about people. Fortunately, we were able to speak to a lot of these people a couple of years ago, via some NSF workshops we organized.
The results are online: “a blueprint for advancing the design, development, and evaluation of virtual organizations.” One interesting anecdote: I found that just as computer scientists can resent being brought into collaborative projects to “write code,” so organizational people can resent being brought in to “fix organizations.” One thing I learned was that …
Technology that has been under development for some years. Include Globus logo. caGrid, BIRN, LHC.
Sharing relationships form and devolve dynamically, e.g., temporally. Picture on left?
“Make data usable and useful”: initially, I had “Address syntactic, semantic differences.”
Talk about API vs. protocol. Add “ilities” and function benefits to the stack.
[Create an image here.] For example, DICOM and HL7 combine messaging and data model in the same interoperability standard. People are contextualizing this problem at the data-interoperability level; systems interoperability is often neglected. An area of differentiation: bringing best practice from industry and science into the health care space. Open-source platform. Experience with systems-interoperability standards: IETF, OASIS, W3C.
Attribute authorities emerge as an important system component. Bridge between local and global: an honest broker is an example. Not sure what “policy in the network” means.
List services from
DO SOMETHING INTERESTING ON THE RIGHT. Scaling via automating data adapters. Representations of those things, and semantics of those representations. Talk about how services are published, data modeling, etc. Publish databases; publish services; name published objects.
Why childhood cancer? Rare. Five-year survival rates for all childhood cancers combined increased from 58.1 percent in 1975-77 to 79.6 percent in 1996-2003.
Built using the same mechanisms used to build SOI: PKI, delegation, attribute-based authorization; registries, monitoring. Operating a service is a pain! It would be nice to outsource, but services need to be near the data, which also has privacy concerns. So things become complicated.
Objects are published and need to be named; then they can be moved around without losing track of them. Bulk data movement; fine-grained access for data integration.
GridFTP = high-performance data movement, multiple protocols, credential delegation, restart. RLS = P2P system, soft state, Bloom filters. BUT: the services themselves are operated by the LIGO community, and running persistent, reliable, scalable services is expensive and difficult.
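An aside on why a replica location service would use Bloom filters: each catalog can publish a compact, lossy summary of the logical file names it holds, so peers can cheaply test "might this site have the file?" before querying it. A minimal sketch of the data structure (illustrative only, not the RLS implementation; the example file names are invented):

```python
import hashlib

class BloomFilter:
    """Tiny Bloom filter: a fixed bit array plus k hash positions per item.
    Membership tests can yield false positives but never false negatives,
    which is acceptable for summarizing a replica catalog."""

    def __init__(self, size_bits=1024, num_hashes=3):
        self.size = size_bits
        self.num_hashes = num_hashes
        self.bits = [False] * size_bits

    def _positions(self, item):
        # Derive k positions from salted SHA-256 digests of the item.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode()).hexdigest()
            yield int(digest, 16) % self.size

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos] = True

    def might_contain(self, item):
        return all(self.bits[pos] for pos in self._positions(item))

bf = BloomFilter()
bf.add("lfn://ligo/strain-2009-07-13.gwf")
print(bf.might_contain("lfn://ligo/strain-2009-07-13.gwf"))  # True
print(bf.might_contain("lfn://ligo/absent-file.gwf"))  # almost certainly False
```

The "soft state" mentioned above fits the same design: summaries are periodically republished and simply expire, so a failed catalog's stale filter ages out without any explicit cleanup protocol.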
Clinical, administrative, research. Issues are often hidden and escalate. Uniqueness: no guaranteed global uniqueness. Name ownership: no ability to prove that a certain entity issued a name. PHI-tainted names: filenames for some images have the patient ID embedded, so sharing the name alone may constitute a HIPAA violation.
Talk about handle….
TO PUT IN A SLIDE? Loose coupling and encapsulation. Interoperability through integration based on data mediation. Evolutionary in nature. A set of scalable systems and methods. Explicit in the architecture: a data-integration layer. Demonstrated in GSI, GridFTP, MDS, ECOG.
This would be a good place for a graphic, perhaps showing top down vs. bottom up.
No coordinated data systems: an Excel spreadsheet here, a Web service bolted to an application there, an Oracle database elsewhere.
Workflows are becoming a widespread mechanism for coordinating the execution of scientific services and linking scientific resources: analytical and data-processing pipelines. Is this stuff real? EBI saw 3 million+ web service API submissions in 2007. A lot? We want to publish workflows as services. Think of caBIG services as service providers that then invoke grid services to execute services (e.g., via TeraGrid gateways).
"Docking" is the identification of the low-energy binding modes of a small molecule (ligand) within the active site of a macromolecule (receptor) whose structure is known. A compound that interacts strongly with (i.e., binds) a receptor associated with a disease may inhibit its function and thus act as a drug. Typical workload: application size 7MB (static binary); static input data 35MB (binary and ASCII text); dynamic input data 10KB (ASCII text); output data 10KB (ASCII text); expected execution time 5~5000 seconds; parameter space 1 billion tasks.
More precisely, step 3 is “GCMC + hydration.” Mike Kubal says: “This task is a Free Energy Perturbation computation using the Grand Canonical Monte Carlo algorithm for modeling the transition of the ligand (compound) between different potential states and the General Solvent Boundary Partition to explicitly model the water molecules in the volume around the ligand and pocket of the protein. The result is a binding energy just like the task at the top of the funnel; it is just a more rigorous attempt to model the actual interaction of protein and compound. To refer to the task in short hand, you can use "GCMC + hydration". This is a method that Benoit has pioneered.”
Application efficiency was computed between the 16-rack and 32-rack runs. Sustained utilization is the utilization achieved during the part of the experiment when there was enough work to do, 0 to 5300 sec. Overall utilization is the number of CPU-hours used divided by the total number of CPU-hours allocated. The experiment included the caching of the 36 MB (52 MB uncompressed) archive on each node at first access. We use “dd” to move data to and from GPFS. The application itself had some bad I/O patterns in the write, which prevented it from scaling well, so we decided to write to RAM and then dd back to GPFS. For this particular run, we had 464 Falkon services running on 464 I/O nodes, 118K workers (256 per Falkon service), and one client on a login node. The 32-rack job took 15 minutes to start. It took the client 6 minutes to establish a connection and set up the corresponding state with all 464 Falkon services. It took the client 40 seconds to dispatch 118K tasks to 118K CPUs. The rest can be seen from the graph and slide text.
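The two utilization metrics described above differ only in their denominators. A small sketch makes the distinction concrete; the busy-hours and wall-clock figures here are hypothetical, chosen only to illustrate the calculation, though the worker count and 5300-second window come from the notes.

```python
def utilization(busy_cpu_hours, cpus, wall_hours):
    """Fraction of allocated CPU-hours actually spent on tasks."""
    return busy_cpu_hours / (cpus * wall_hours)

cpus = 118_000       # workers in the run described above
busy = 150_000.0     # hypothetical CPU-hours of useful work

# Sustained utilization: denominator covers only the window with
# enough work to keep all workers busy (0 to 5300 s).
sustained = utilization(busy, cpus, wall_hours=5300 / 3600)

# Overall utilization: denominator covers the whole allocation,
# including startup, connection setup, and ramp-down.
overall = utilization(busy, cpus, wall_hours=2.0)  # hypothetical 2-hour allocation

print(f"sustained={sustained:.2f} overall={overall:.2f}")
```

Given the 15-minute job start and 6-minute connection setup reported above, a large gap between the two metrics is expected: overhead that idles 118K workers shows up only in the overall number.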
Because we are still mostly computing inside the box
Why now? The law of unexpected consequences. Like the Web: not just Tim Berners-Lee’s genius, but also disk drive capacity. What will happen when ubiquitous high-speed wireless means we can all reach any service anytime, and powerful tools mean we can author our own services? A fascinating set of challenges: What sort of services? What applications? What does openness mean in this context? How do we address interoperability, portability, composition? Accounting, security, audit?