SlideShare uma empresa Scribd logo
1 de 32
Baixar para ler offline
June 3, 2015
Overview of XSEDE Systems Engineering
John Towns
PI and Project Director, XSEDE
Executive Director, Science & Technology, NCSA
jtowns@ncsa.illinois.edu
License terms
• Please cite as: Towns, John, Overview of XSEDE Systems Engineering, June
2015,
[http://www.slideshare.net/jtownsil/overview-of-xsede-systems-
engineering]
• ORCID ID: http://orcid.org/0000-0001-7961-2277
• Except where otherwise noted, by inclusion of a source URL or some other
note, the contents of this presentation are © by the Board of Trustees of
University of Illinois. This content is released under the Creative Commons
Attribution 3.0 Unported license
(http://creativecommons.org/licenses/by/3.0/). This license includes the
following terms: You are free to share – copy and redistribute the material
in any medium or format; and to adapt – remix, transform, and build upon
the material for any purpose, even commercially.
• This can be done under the following conditions: attribution – You must
give appropriate credit, provide a link to the license, and indicate if
changes were made. You may do so in any reasonable manner, but not in
any way that suggests the licensor endorses you or your use.
2
XSEDE in Context
• XSEDE is an award made under the eXtreme
Digital solicitation
– TeraGrid Phase III: eXtreme Digital Resources for
Science and Engineering (XD), NSF 08-571
• Consistent with NSF’s vision and strategy
statements
3
XSEDE – accelerating scientific discovery
XSEDE’s Vision:
a world of digitally-enabled scholars, researchers, and
engineers participating in multidisciplinary collaborations
while seamlessly accessing computing resources and
sharing data to tackle society’s grand challenges.
XSEDE’s Mission:
to substantially enhance the productivity of a growing
community of scholars, researchers, and engineers
through access to advanced digital services that support
open research;
and to coordinate and add significant value to the
leading cyberinfrastructure resources funded by the NSF
and other agencies.
4
XSEDE Factoids: high-order bits
• 5 year, US$121M project
– plus US$9M, 5 year Technology Investigation Service
• separate award from NSF
– option for additional 5 years of funding upon major review after
PY3
• No funding for major hardware
– coordinate, support and create a national/international
cyberinfrastructure
– coordinate allocations, support, training and documentation for
>$100M of concurrent project awards from NSF
• ~112 FTE /~250 individuals funded across 20 partner
institutions
– this requires solid partnering!
5
Total Research Funding Supported by XSEDE
in Program Year 3
6
$767 million in research
supported by XSEDE
in PY3
(July 2013-June 2014)
Research funding only. XSEDE leverages and
integrates additional infrastructure, some
funded by NSF (e.g. Track 2 systems) and some
not (e.g. Internet2).
What is XSEDE?
• An ecosystem of advanced digital services accelerating
scientific discovery
– support a growing portfolio of resources and services
• advanced computing, high-end visualization, data analysis, and other
resources and services
• interoperability with other infrastructures
• A virtual organization (partnership!) providing
– dynamic distributed infrastructure
– support services and technical expertise to enable researchers
engineers and scholars
• addressing the most important and challenging problems facing the
nation and world
• More than just a project funded by the National Science
Foundation
– XSEDE is a path-finding experiment in how to develop, deploy
and support e-science infrastructure
7
XSEDEnet – Using Internet2’s AL2S Service
XSEDE Computational User Census
9
What do you mean by “Advanced Digital
Services?”
• Often use the terms “resources” and “services”
– these should be interpreted very broadly
– most are likely not operated by XSEDE
• Examples of resources
– compute engines: HPC, HTC (high throughput computing), campus,
departmental, research group, project, …
– data: simulation output, input files, instrument data, repositories, public
databases, private databases, …
– instruments: telescopes, beam lines, sensor nets, shake tables, microscopes, …
– infrastructure: local networks, wide-area networks, …
• Examples of services
– collaboration: wikis, forums, telepresence, …
– data: data transport, data management, sharing, curation, provenance, …
– access/used: authentication, authorization, accounting, …
– coordination: meta-queuing, …
– support: helpdesk, consulting, ECSS, training, …
– And many more: education, outreach, community building, …
10
XSEDE – What does it do?
• Each individual XSEDE resource is designed to
satisfy its user needs as fully as possible.
• XSEDE’s system-wide functions support a more
basic user need: the need to have a relationship
with the collection of resources as a whole and to
maintain that relationship over time as their
specific data and computing needs change and as
system components change.
• In addition to staff and programs, XSEDE provides
and operates a system that supports this mission.
11
The XSEDE system
12
Six system functions
Three levels
of
interaction
How did we get this system?
1. We operated the legacy system: Legacy TeraGrid system + early XSEDE
additions/replacements
2. We talked to users and solicited specific use cases: Engaged a broad set
of users and user representatives to document (~70) stakeholder use
cases detailing what users need to do using XSEDE digital ecosystem.
3. We looked for the “80%” commonality: Identified ten “canonical” use
cases that provide most of what XSEDE users and communities need and
offer clear strategies for the rest.
4. We defined a foundational architecture: Organized the architecture
around canonicals (six basic system functions) and applied our expertise
to define three levels of interaction: user access interfaces (Web, CLI,
API), service interfaces (protocols), and resource-specific interfaces.
5. We redesigned the allocations and identity management functions:
Agreement between all impacted XSEDE working groups on updates to
the federated allocations system and the federated security mechanisms.
13
From requirements to use cases
• PY0/Proposal: User needs documented as formal Requirement Statements.
• PY1 Q1-Q3: Pivoted to a stakeholder use case-driven process developed by Felix
Bachmann from CMU’s Software Engineering Institute. Now have much clearer
way to describe user needs and to trace the architecture to those user needs.
• PY1 Q4–PY2 Q3: Engaged stakeholders to produce ~70 use cases in their project
areas:
– Data: Data Analytics, Data Management, and Visualization
– Campus Bridging
– Computing: HPC, HTC, and Scientific Workflows
– Connecting Instrumentation
– Federation and Interoperability
– Science Gateways
• PY2 Q3–PY4: Distilled a new set of 10 canonical use cases containing the common
user needs that define the foundational architecture.
14
What is the problem?
• Campus Bridging use case 5
15
16
17
From use cases to architecture
18
Six system functions Ten “canonical” use cases
Identity Management Canonical 6.0 – authenticate to one or more SP
resources, SP services, and XSEDE central
services
Canonical 9.0 – user & group management
Submitting and Managing
Computations
Canonical 1.0
Transferring Datasets Canonical 2.0
Accessing Remote Files Canonical 3.0
Interactive Login Canonical 4.0
Discovering and Providing
Resource Information
Canonical 7.0 – subscribe for resource
Information
Canonical 8.0 – search for resource Information
Canonical 11.0 – publish resource Information
Canonical 12.0 – update resource Information
XSEDE architecture documents
•Six functional areas
•Three levels of interaction
•Key user-access interfaces for each functional area
XSEDE
Architecture
Overview
•Guide for Web/Cloud interactions
•Guide for Open Standards-based Web Service interactions
•Service interfaces (protocols) for each functional area
•Descriptions of all software components
•Deployment guides
XSEDE
Architecture
Level-3
Decomposition
•Identity Management
•Submitting and Managing Computations
•Interactive Login
•Remote File Access
•Transferring Datasets
•Discovering and Providing Resource Information
Canonical
Use Cases and Responses
•O(100) unique use cases (and growing)
•For each, we know:
•What the user wants to happen
•How the system makes it happen
•What is still missing, if anything
Stakeholder-supplied
Use Cases and Responses
19
140pp
14pp
150pp
(est.)
300pp
(est.)
From architecture to components
1. Use case-driven process: let the users drive the requirements
2. Prioritization: UREP-prioritized use cases
3. Traceability: traceability from use cases to operational capabilities
4. Documentation: SWOT analysis identified deficient documentation
and led to a new L1-2 architecture description
5. Metrics: established KPIs to measure and monitor performance
6. Tuned reviews: streamlined Active Design Reviews to focus on
sufficient implementation details, leaving detailed design for later
engineering steps
7. Tuned testing: joint SD&I and Operations testing where
appropriate to accelerate delivery
8. Improved user documentation: formalized user documentation
preparation in the engineering process
20
Redesigned identity management
Goals:
• Support modern standards
• Sustainability
• Federate identities
• Science gateways
• Campus bridging
Globus Auth:
• Links identities from many
sources
• Supports existing OAuth2,
OpenID Connect services
• Robust funding model
• Large existing user base
• Already part of XSEDE’s data
transfer services
21
Work-in-progress
through early 2016…
From components to a system
1. Operator coordination: Established Campus Bridging and Service Provider
coordination. Conducting annual operations checklist reviews.
2. Engineering process: Established an end-to-end engineering process with
S&SE, A&D, SD&I, and Operations collaboratively advancing XSEDE’s
digital ecosystem. This process includes planning and prioritizing,
architecture and design, development/integration, significant testing
phases, deployment, and retirement; and engage stakeholders in
reviewing engineering products as they transition through phases.
3. Engineering process adoption: Selected elements of the engineering
process used developing the XSEDE Resource Allocation Service (XRAS).
4. Engineering tools: Deployed a set of engineering software and services to
manage use cases and engineering activities, distribute software, and
publish (engineering) documents to the community.
22
23
Architecture Design, Documentation
Use Cases, Qualities
Design, Development, Integration
Acceptance Test
Production Deployment
Active Design Review
Integration Test
Area(s): A&D, S&SE
Overview: An iterative engineering process
Operations Readiness Review
Area(s): SD&I
Area(s): Operations
Users XSEDE Service Providers
User
experience
Operational
changes
Bug fixes, routine
enhancements
New Use Cases
Implementation-ready
architecture
Deployment-ready
components
Operational
capabilities
User benefits from the engineering process
 Traceability from users (use cases) to operational capabilities
 Enhanced usability derived from architectural interoperability
 Better user documentation and developer documentation
 Service quality, scalability, and reliability
Sustainability benefits
 Written docs on what the system is supposed to do
 Written docs on how the system is supposed to work
 Record of key decisions (security choices, component evaluations,
priorities)
 A solid basis for future work (by us and by others)
24
Ongoing challenges
• Managing engineering documents
• Stakeholder visibility into engineering process
• Pace of getting components through the engineering process
• Timely stakeholder participation of engineering reviews
• Resources to address big data, clouds, advanced SaaS, and
other emerging capabilities
25
Looking ahead (years 6-10)
• Build on the foundational architecture
– Emphasize key interface specs: Web (HTML, HTTP,
CSS, JavaScript), TLS &X.509, OAuth2, OpenID
Connect, REST
– Continue leveraging Globus Auth and Transfer
– Broaden use of sharing and group features
– Keep executing (while refining) the engineering
process
• Encourage and enable others to build as well
– Documentation, training, examples
26
Priorities for PY6-PY10:
Community Infrastructure
• Continue to evolve the XSEDE infrastructure
– must provide support for and integration of “Track
2” resource
• Expose this architecture to the broader
community
– facilitate integration of broad range of services
• provide discoverability
– become the “connector of services” to support
the research enterprise
27
Priorities for PY6-PY10:
Toward “Sustainability”
• Developing services on offer to others: providing basic
cyberinfrastructure services
– expose services developed and put in place to operate the XSEDE
environment
– where necessary customize/extend for needs of other projects
– charge incremental costs for operating/supporting services for other
projects
• Objective is not to make money!
– provides mechanism for other NSF project investments to leverage the
XSEDE investment
– can lead to significant cost saving across NSF CI investments
– others can leverage this too: projects, institutions, regional consortia,
…
• Pilot under way: NCAR with XRAS
– will use XRAS to support allocation of NCAR resources
– expressions of interest from some campuses
28
Questions?
Overview of XSEDE Systems Engineering

Mais conteúdo relacionado

Mais procurados

Cs 1023 lec 6 architecture (week 1)
Cs 1023 lec 6 architecture (week 1)Cs 1023 lec 6 architecture (week 1)
Cs 1023 lec 6 architecture (week 1)stanbridge
 
Supporting Research Communities with XSEDE
Supporting Research Communities with XSEDESupporting Research Communities with XSEDE
Supporting Research Communities with XSEDEJohn Towns
 
Electronic Resource Management
Electronic Resource ManagementElectronic Resource Management
Electronic Resource Managementcatherin preethi
 
A Survey of Exploratory Search Systems Based on LOD Resources
A Survey of Exploratory Search Systems Based on LOD ResourcesA Survey of Exploratory Search Systems Based on LOD Resources
A Survey of Exploratory Search Systems Based on LOD ResourcesKarwan Jacksi
 
Supporting Research Communities with XSEDE
Supporting Research Communities with XSEDESupporting Research Communities with XSEDE
Supporting Research Communities with XSEDEJohn Towns
 
Introduction to XSEDE 2.0
Introduction to XSEDE 2.0Introduction to XSEDE 2.0
Introduction to XSEDE 2.0John Towns
 
A Method to Select e-Infrastructure Components to Sustain
A Method to Select e-Infrastructure Components to SustainA Method to Select e-Infrastructure Components to Sustain
A Method to Select e-Infrastructure Components to SustainDaniel S. Katz
 
XSEDE Overview (March 2014)
XSEDE Overview (March 2014)XSEDE Overview (March 2014)
XSEDE Overview (March 2014)John Towns
 

Mais procurados (8)

Cs 1023 lec 6 architecture (week 1)
Cs 1023 lec 6 architecture (week 1)Cs 1023 lec 6 architecture (week 1)
Cs 1023 lec 6 architecture (week 1)
 
Supporting Research Communities with XSEDE
Supporting Research Communities with XSEDESupporting Research Communities with XSEDE
Supporting Research Communities with XSEDE
 
Electronic Resource Management
Electronic Resource ManagementElectronic Resource Management
Electronic Resource Management
 
A Survey of Exploratory Search Systems Based on LOD Resources
A Survey of Exploratory Search Systems Based on LOD ResourcesA Survey of Exploratory Search Systems Based on LOD Resources
A Survey of Exploratory Search Systems Based on LOD Resources
 
Supporting Research Communities with XSEDE
Supporting Research Communities with XSEDESupporting Research Communities with XSEDE
Supporting Research Communities with XSEDE
 
Introduction to XSEDE 2.0
Introduction to XSEDE 2.0Introduction to XSEDE 2.0
Introduction to XSEDE 2.0
 
A Method to Select e-Infrastructure Components to Sustain
A Method to Select e-Infrastructure Components to SustainA Method to Select e-Infrastructure Components to Sustain
A Method to Select e-Infrastructure Components to Sustain
 
XSEDE Overview (March 2014)
XSEDE Overview (March 2014)XSEDE Overview (March 2014)
XSEDE Overview (March 2014)
 

Semelhante a Overview of XSEDE Systems Engineering

Overview of XSEDE and Introduction to XSEDE 2.0 and Beyond
Overview of XSEDE and Introduction to XSEDE 2.0 and BeyondOverview of XSEDE and Introduction to XSEDE 2.0 and Beyond
Overview of XSEDE and Introduction to XSEDE 2.0 and BeyondJohn Towns
 
Jisc Research Data Shared Service - Spring Update
Jisc Research Data Shared Service - Spring UpdateJisc Research Data Shared Service - Spring Update
Jisc Research Data Shared Service - Spring UpdateJisc RDM
 
System models in distributed system
System models in distributed systemSystem models in distributed system
System models in distributed systemishapadhy
 
Supporting Research Communities with XSEDE
Supporting Research Communities with XSEDESupporting Research Communities with XSEDE
Supporting Research Communities with XSEDEJohn Towns
 
PEARC17: The Community Software Repository from XSEDE: A Resource for the Nat...
PEARC17: The Community Software Repository from XSEDE: A Resource for the Nat...PEARC17: The Community Software Repository from XSEDE: A Resource for the Nat...
PEARC17: The Community Software Repository from XSEDE: A Resource for the Nat...John-Paul Navarro
 
ARCC National Perspective Panel: XSEDE (Towns)
ARCC National Perspective Panel: XSEDE (Towns)ARCC National Perspective Panel: XSEDE (Towns)
ARCC National Perspective Panel: XSEDE (Towns)John Towns
 
Open Source and Science at the National Science Foundation (NSF)
Open Source and Science at the National Science Foundation (NSF)Open Source and Science at the National Science Foundation (NSF)
Open Source and Science at the National Science Foundation (NSF)Daniel S. Katz
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesASIS&T
 
Jisc Research data shared service overview and update - May 2016
Jisc Research data shared service overview and update - May 2016Jisc Research data shared service overview and update - May 2016
Jisc Research data shared service overview and update - May 2016Jisc RDM
 
RDM shared services at IDCC
RDM shared services at IDCCRDM shared services at IDCC
RDM shared services at IDCCJisc RDM
 
State of XSEDE: July 2014
State of XSEDE: July 2014State of XSEDE: July 2014
State of XSEDE: July 2014John Towns
 
Jisc research data shared service overview IDCC 2016
Jisc research data shared service overview IDCC 2016Jisc research data shared service overview IDCC 2016
Jisc research data shared service overview IDCC 2016Jisc RDM
 
Competency framework: engineers, statisticians, data scientists, librarians, ...
Competency framework: engineers, statisticians, data scientists, librarians, ...Competency framework: engineers, statisticians, data scientists, librarians, ...
Competency framework: engineers, statisticians, data scientists, librarians, ...African Open Science Platform
 
eROSA Stakeholder WS1: EOSC Architecture
eROSA Stakeholder WS1: EOSC ArchitectureeROSA Stakeholder WS1: EOSC Architecture
eROSA Stakeholder WS1: EOSC Architecturee-ROSA
 
ORION Workshop: XSEDE and Building a National/International Cyberinfrastructure
ORION Workshop: XSEDE and Building a National/International CyberinfrastructureORION Workshop: XSEDE and Building a National/International Cyberinfrastructure
ORION Workshop: XSEDE and Building a National/International CyberinfrastructureJohn Towns
 
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...Research in Intelligent Systems and Data Science at the Knowledge Media Insti...
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...Enrico Motta
 
Advanced Automated Analytics Using OSS Tools, GA Tech FDA Conference 2016
Advanced Automated Analytics Using OSS Tools, GA Tech FDA Conference 2016Advanced Automated Analytics Using OSS Tools, GA Tech FDA Conference 2016
Advanced Automated Analytics Using OSS Tools, GA Tech FDA Conference 2016Grid Protection Alliance
 

Semelhante a Overview of XSEDE Systems Engineering (20)

Overview of XSEDE and Introduction to XSEDE 2.0 and Beyond
Overview of XSEDE and Introduction to XSEDE 2.0 and BeyondOverview of XSEDE and Introduction to XSEDE 2.0 and Beyond
Overview of XSEDE and Introduction to XSEDE 2.0 and Beyond
 
Jisc Research Data Shared Service - Spring Update
Jisc Research Data Shared Service - Spring UpdateJisc Research Data Shared Service - Spring Update
Jisc Research Data Shared Service - Spring Update
 
System models in distributed system
System models in distributed systemSystem models in distributed system
System models in distributed system
 
Supporting Research Communities with XSEDE
Supporting Research Communities with XSEDESupporting Research Communities with XSEDE
Supporting Research Communities with XSEDE
 
PEARC17: The Community Software Repository from XSEDE: A Resource for the Nat...
PEARC17: The Community Software Repository from XSEDE: A Resource for the Nat...PEARC17: The Community Software Repository from XSEDE: A Resource for the Nat...
PEARC17: The Community Software Repository from XSEDE: A Resource for the Nat...
 
ARCC National Perspective Panel: XSEDE (Towns)
ARCC National Perspective Panel: XSEDE (Towns)ARCC National Perspective Panel: XSEDE (Towns)
ARCC National Perspective Panel: XSEDE (Towns)
 
Open Source and Science at the National Science Foundation (NSF)
Open Source and Science at the National Science Foundation (NSF)Open Source and Science at the National Science Foundation (NSF)
Open Source and Science at the National Science Foundation (NSF)
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication Repositories
 
Jisc Research data shared service overview and update - May 2016
Jisc Research data shared service overview and update - May 2016Jisc Research data shared service overview and update - May 2016
Jisc Research data shared service overview and update - May 2016
 
Enabling Citizen-empowered Apps over Linked Data
Enabling Citizen-empowered Apps over Linked DataEnabling Citizen-empowered Apps over Linked Data
Enabling Citizen-empowered Apps over Linked Data
 
RDM shared services at IDCC
RDM shared services at IDCCRDM shared services at IDCC
RDM shared services at IDCC
 
State of XSEDE: July 2014
State of XSEDE: July 2014State of XSEDE: July 2014
State of XSEDE: July 2014
 
Jisc research data shared service overview IDCC 2016
Jisc research data shared service overview IDCC 2016Jisc research data shared service overview IDCC 2016
Jisc research data shared service overview IDCC 2016
 
Competency framework: engineers, statisticians, data scientists, librarians, ...
Competency framework: engineers, statisticians, data scientists, librarians, ...Competency framework: engineers, statisticians, data scientists, librarians, ...
Competency framework: engineers, statisticians, data scientists, librarians, ...
 
Sgci esip-7-20-18
Sgci esip-7-20-18Sgci esip-7-20-18
Sgci esip-7-20-18
 
Information Systems
Information SystemsInformation Systems
Information Systems
 
eROSA Stakeholder WS1: EOSC Architecture
eROSA Stakeholder WS1: EOSC ArchitectureeROSA Stakeholder WS1: EOSC Architecture
eROSA Stakeholder WS1: EOSC Architecture
 
ORION Workshop: XSEDE and Building a National/International Cyberinfrastructure
ORION Workshop: XSEDE and Building a National/International CyberinfrastructureORION Workshop: XSEDE and Building a National/International Cyberinfrastructure
ORION Workshop: XSEDE and Building a National/International Cyberinfrastructure
 
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...Research in Intelligent Systems and Data Science at the Knowledge Media Insti...
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...
 
Advanced Automated Analytics Using OSS Tools, GA Tech FDA Conference 2016
Advanced Automated Analytics Using OSS Tools, GA Tech FDA Conference 2016Advanced Automated Analytics Using OSS Tools, GA Tech FDA Conference 2016
Advanced Automated Analytics Using OSS Tools, GA Tech FDA Conference 2016
 

Mais de John Towns

Research IT @ Illinois: Establishing Service Responsive to Investigator Needs
Research IT @ Illinois: Establishing Service Responsive to Investigator NeedsResearch IT @ Illinois: Establishing Service Responsive to Investigator Needs
Research IT @ Illinois: Establishing Service Responsive to Investigator NeedsJohn Towns
 
Research IT @ Illinois: Establishing Service Responsive to Investigator Needs
Research IT @ Illinois: Establishing Service Responsive to Investigator NeedsResearch IT @ Illinois: Establishing Service Responsive to Investigator Needs
Research IT @ Illinois: Establishing Service Responsive to Investigator NeedsJohn Towns
 
XSEDE at Brainstorm HPCD Jan2015
XSEDE at Brainstorm HPCD Jan2015XSEDE at Brainstorm HPCD Jan2015
XSEDE at Brainstorm HPCD Jan2015John Towns
 
Year of ci overview
Year of ci overviewYear of ci overview
Year of ci overviewJohn Towns
 
CSC2013: Exascale in the US
CSC2013: Exascale in the USCSC2013: Exascale in the US
CSC2013: Exascale in the USJohn Towns
 
XSEDE13: State of XSEDE
XSEDE13: State of XSEDEXSEDE13: State of XSEDE
XSEDE13: State of XSEDEJohn Towns
 
CASC Sustainability Workshop, Panel 3: Models, challenges and opportunities
CASC Sustainability Workshop, Panel 3: Models, challenges and opportunitiesCASC Sustainability Workshop, Panel 3: Models, challenges and opportunities
CASC Sustainability Workshop, Panel 3: Models, challenges and opportunitiesJohn Towns
 
Tapia fireside chat-towns
Tapia fireside chat-townsTapia fireside chat-towns
Tapia fireside chat-townsJohn Towns
 

Mais de John Towns (8)

Research IT @ Illinois: Establishing Service Responsive to Investigator Needs
Research IT @ Illinois: Establishing Service Responsive to Investigator NeedsResearch IT @ Illinois: Establishing Service Responsive to Investigator Needs
Research IT @ Illinois: Establishing Service Responsive to Investigator Needs
 
Research IT @ Illinois: Establishing Service Responsive to Investigator Needs
Research IT @ Illinois: Establishing Service Responsive to Investigator NeedsResearch IT @ Illinois: Establishing Service Responsive to Investigator Needs
Research IT @ Illinois: Establishing Service Responsive to Investigator Needs
 
XSEDE at Brainstorm HPCD Jan2015
XSEDE at Brainstorm HPCD Jan2015XSEDE at Brainstorm HPCD Jan2015
XSEDE at Brainstorm HPCD Jan2015
 
Year of ci overview
Year of ci overviewYear of ci overview
Year of ci overview
 
CSC2013: Exascale in the US
CSC2013: Exascale in the USCSC2013: Exascale in the US
CSC2013: Exascale in the US
 
XSEDE13: State of XSEDE
XSEDE13: State of XSEDEXSEDE13: State of XSEDE
XSEDE13: State of XSEDE
 
CASC Sustainability Workshop, Panel 3: Models, challenges and opportunities
CASC Sustainability Workshop, Panel 3: Models, challenges and opportunitiesCASC Sustainability Workshop, Panel 3: Models, challenges and opportunities
CASC Sustainability Workshop, Panel 3: Models, challenges and opportunities
 
Tapia fireside chat-towns
Tapia fireside chat-townsTapia fireside chat-towns
Tapia fireside chat-towns
 

Último

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 

Último (20)

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 

Overview of XSEDE Systems Engineering

  • 1. June 3, 2015 Overview of XSEDE Systems Engineering John Towns PI and Project Director, XSEDE Executive Director, Science & Technology, NCSA jtowns@ncsa.illinois.edu
  • 2. License terms • Please cite as: Towns, John, Overview of XSEDE Systems Engineering, June 2015, [http://www.slideshare.net/jtownsil/overview-of-xsede-systems- engineering] • ORCID ID: http://orcid.org/0000-0001-7961-2277 • Except where otherwise noted, by inclusion of a source URL or some other note, the contents of this presentation are © by the Board of Trustees of University of Illinois. This content is released under the Creative Commons Attribution 3.0 Unported license (http://creativecommons.org/licenses/by/3.0/). This license includes the following terms: You are free to share – copy and redistribute the material in any medium or format; and to adapt – remix, transform, and build upon the material for any purpose, even commercially. • This can be done under the following conditions: attribution – You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use. 2
  • 3. XSEDE in Context • XSEDE is an award made under the eXtreme Digital solicitation – TeraGrid Phase III: eXtreme Digital Resources for Science and Engineering (XD), NSF 08-571 • Consistent with NSF’s vision and strategy statements 3
  • 4. XSEDE – accelerating scientific discovery XSEDE’s Vision: a world of digitally-enabled scholars, researchers, and engineers participating in multidisciplinary collaborations while seamlessly accessing computing resources and sharing data to tackle society’s grand challenges. XSEDE’s Mission: to substantially enhance the productivity of a growing community of scholars, researchers, and engineers through access to advanced digital services that support open research; and to coordinate and add significant value to the leading cyberinfrastructure resources funded by the NSF and other agencies. 4
  • 5. XSEDE Factoids: high-order bits • 5 year, US$121M project – plus US$9M, 5 year Technology Investigation Service • separate award from NSF – option for additional 5 years of funding upon major review after PY3 • No funding for major hardware – coordinate, support and create a national/international cyberinfrastructure – coordinate allocations, support, training and documentation for >$100M of concurrent project awards from NSF • ~112 FTE /~250 individuals funded across 20 partner institutions – this requires solid partnering! 5
  • 6. Total Research Funding Supported by XSEDE in Program Year 3 6 $767 million in research supported by XSEDE in PY3 (July 2013-June 2014) Research funding only. XSEDE leverages and integrates additional infrastructure, some funded by NSF (e.g. Track 2 systems) and some not (e.g. Internet2).
  • 7. What is XSEDE? • An ecosystem of advanced digital services accelerating scientific discovery – support a growing portfolio of resources and services • advanced computing, high-end visualization, data analysis, and other resources and services • interoperability with other infrastructures • A virtual organization (partnership!) providing – dynamic distributed infrastructure – support services and technical expertise to enable researchers engineers and scholars • addressing the most important and challenging problems facing the nation and world • More than just a project funded by the National Science Foundation – XSEDE is a path-finding experiment in how to develop, deploy and support e-science infrastructure 7
  • 8. XSEDEnet – Using Internet2’s AL2S Service
  • 10. What do you mean by “Advanced Digital Services?” • Often use the terms “resources” and “services” – these should be interpreted very broadly – most are likely not operated by XSEDE • Examples of resources – compute engines: HPC, HTC (high throughput computing), campus, departmental, research group, project, … – data: simulation output, input files, instrument data, repositories, public databases, private databases, … – instruments: telescopes, beam lines, sensor nets, shake tables, microscopes, … – infrastructure: local networks, wide-area networks, … • Examples of services – collaboration: wikis, forums, telepresence, … – data: data transport, data management, sharing, curation, provenance, … – access/used: authentication, authorization, accounting, … – coordination: meta-queuing, … – support: helpdesk, consulting, ECSS, training, … – And many more: education, outreach, community building, … 10
  • 11. XSEDE – What does it do? • Each individual XSEDE resource is designed to satisfy its user needs as fully as possible. • XSEDE’s system-wide functions support a more basic user need: the need to have a relationship with the collection of resources as a whole and to maintain that relationship over time as their specific data and computing needs change and as system components change. • In addition to staff and programs, XSEDE provides and operates a system that supports this mission. 11
  • 12. The XSEDE system 12 Six system functions Three levels of interaction
  • 13. How did we get this system? 1. We operated the legacy system: Legacy TeraGrid system + early XSEDE additions/replacements 2. We talked to users and solicited specific use cases: Engaged a broad set of users and user representatives to document (~70) stakeholder use cases detailing what users need to do using XSEDE digital ecosystem. 3. We looked for the “80%” commonality: Identified ten “canonical” use cases that provide most of what XSEDE users and communities need and offer clear strategies for the rest. 4. We defined a foundational architecture: Organized the architecture around canonicals (six basic system functions) and applied our expertise to define three levels of interaction: user access interfaces (Web, CLI, API), service interfaces (protocols), and resource-specific interfaces. 5. We redesigned the allocations and identity management functions: Agreement between all impacted XSEDE working groups on updates to the federated allocations system and the federated security mechanisms. 13
  • 14. From requirements to use cases • PY0/Proposal: User needs documented as formal Requirement Statements. • PY1 Q1-Q3: Pivoted to a stakeholder use case-driven process developed by Felix Bachmann from CMU’s Software Engineering Institute. Now have much clearer way to describe user needs and to trace the architecture to those user needs. • PY1 Q4–PY2 Q3: Engaged stakeholders to produce ~70 use cases in their project areas: – Data: Data Analytics, Data Management, and Visualization – Campus Bridging – Computing: HPC, HTC, and Scientific Workflows – Connecting Instrumentation – Federation and Interoperability – Science Gateways • PY2 Q3–PY4: Distilled a new set of 10 canonical use cases containing the common user needs that define the foundational architecture. 14
  • 15. What is the problem? • Campus Bridging use case 5 15
  • 16. 16
  • 17. 17
  • 18. From use cases to architecture 18 Six system functions Ten “canonical” use cases Identity Management Canonical 6.0 – authenticate to one or more SP resources, SP services, and XSEDE central services Canonical 9.0 – user & group management Submitting and Managing Computations Canonical 1.0 Transferring Datasets Canonical 2.0 Accessing Remote Files Canonical 3.0 Interactive Login Canonical 4.0 Discovering and Providing Resource Information Canonical 7.0 – subscribe for resource Information Canonical 8.0 – search for resource Information Canonical 11.0 – publish resource Information Canonical 12.0 – update resource Information
  • 19. XSEDE architecture documents •Six functional areas •Three levels of interaction •Key user-access interfaces for each functional area XSEDE Architecture Overview •Guide for Web/Cloud interactions •Guide for Open Standards-based Web Service interactions •Service interfaces (protocols) for each functional area •Descriptions of all software components •Deployment guides XSEDE Architecture Level-3 Decomposition •Identity Management •Submitting and Managing Computations •Interactive Login •Remote File Access •Transferring Datasets •Discovering and Providing Resource Information Canonical Use Cases and Responses •O(100) unique use cases (and growing) •For each, we know: •What the user wants to happen •How the system makes it happen •What is still missing, if anything Stakeholder-supplied Use Cases and Responses 19 140pp 14pp 150pp (est.) 300pp (est.)
  • 20. From architecture to components 1. Use case-driven process: let the users drive the requirements 2. Prioritization: UREP-prioritized use cases 3. Traceability: traceability from use cases to operational capabilities 4. Documentation: SWOT analysis identified deficient documentation and led to a new L1-2 architecture description 5. Metrics: established KPIs to measure and monitor performance 6. Tuned reviews: streamlined Active Design Reviews to focus on sufficient implementation details, leaving detailed design for later engineering steps 7. Tuned testing: joint SD&I and Operations testing where appropriate to accelerate delivery 8. Improved user documentation: formalized user documentation preparation in the engineering process 20
  • 21. Redesigned identity management Goals: • Support modern standards • Sustainability • Federate identities • Science gateways • Campus bridging Globus Auth: • Links identities from many sources • Supports existing OAuth2, OpenID Connect services • Robust funding model • Large existing user base • Already part of XSEDE’s data transfer services 21 Work-in-progress through early 2016…
  • 22. From components to a system 1. Operator coordination: Established Campus Bridging and Service Provider coordination. Conducting annual operations checklist reviews. 2. Engineering process: Established an end-to-end engineering process with S&SE, A&D, SD&I, and Operations collaboratively advancing XSEDE’s digital ecosystem. This process includes planning and prioritizing, architecture and design, development/integration, significant testing phases, deployment, and retirement; and engage stakeholders in reviewing engineering products as they transition through phases. 3. Engineering process adoption: Selected elements of the engineering process used developing the XSEDE Resource Allocation Service (XRAS). 4. Engineering tools: Deployed a set of engineering software and services to manage use cases and engineering activities, distribute software, and publish (engineering) documents to the community. 22
  • 23. 23 Architecture Design, Documentation Use Cases, Qualities Design, Development, Integration Acceptance Test Production Deployment Active Design Review Integration Test Area(s): A&D, S&SE Overview: An iterative engineering process Operations Readiness Review Area(s): SD&I Area(s): Operations Users XSEDE Service Providers User experience Operational changes Bug fixes, routine enhancements New Use Cases Implementation-ready architecture Deployment-ready components Operational capabilities
  • 24. User benefits from the engineering process  Traceability from users (use cases) to operational capabilities  Enhanced usability derived from architectural interoperability  Better user documentation and developer documentation  Service quality, scalability, and reliability Sustainability benefits  Written docs on what the system is supposed to do  Written docs on how the system is supposed to work  Record of key decisions (security choices, component evaluations, priorities)  A solid basis for future work (by us and by others) 24
  • 25. Ongoing challenges • Managing engineering documents • Stakeholder visibility into engineering process • Pace of getting components through the engineering process • Timely stakeholder participation of engineering reviews • Resources to address big data, clouds, advanced SaaS, and other emerging capabilities 25
  • 26. Looking ahead (years 6-10) • Build on the foundational architecture – Emphasize key interface specs: Web (HTML, HTTP, CSS, JavaScript), TLS &X.509, OAuth2, OpenID Connect, REST – Continue leveraging Globus Auth and Transfer – Broaden use of sharing and group features – Keep executing (while refining) the engineering process • Encourage and enable others to build as well – Documentation, training, examples 26
  • 27. Priorities for PY6-PY10: Community Infrastructure • Continue to evolve the XSEDE infrastructure – must provide support for and integration of “Track 2” resource • Expose this architecture to the broader community – facilitate integration of broad range of services • provide discoverability – become the “connector of services” to support the research enterprise 27
  • 28. Priorities for PY6-PY10: Toward “Sustainability” • Developing services on offer to others: providing basic cyberinfrastructure services – expose services developed and put in place to operate the XSEDE environment – where necessary customize/extend for needs of other projects – charge incremental costs for operating/supporting services for other projects • Objective is not to make money! – provides mechanism for other NSF project investments to leverage the XSEDE investment – can lead to significant cost saving across NSF CI investments – others can leverage this too: projects, institutions, regional consortia, … • Pilot under way: NCAR with XRAS – will use XRAS to support allocation of NCAR resources – expressions of interest from some campuses 28
  • 29.
  • 30.