SlideShare uma empresa Scribd logo
1 de 13
Baixar para ler offline
Exascale in the US
John Towns
Director, Collaborative eScience Program
Office, NCSA
jtowns@ncsa.illinois.edu

National Center for Supercomputing Applications
University of Illinois at Urbana–Champaign
Setting Some Context
• It is a bad idea to project too far in the future based on current
technology
• but provides a basis for worst case scenarios

• Challenge: build a system that will support sustained
1.0 EFLOP/s performance
• assume: code can sustain 5% of peak performance
• need a 20 Eflop/s system
• assume: no constraints on power, parallelism, MTBF, scales of
interconnects,…
• assume: application can be written
• i.e. ignore scaling, thread counts, message passing issues,
memory size constraints, languages, libraries, development
tools, …

• Let’s try to do this with today’s technology
Current Systems in the US:
Titan (Cray XK7 @ ORNL)
• Performance
• peak: 27 Pflop/s
• HPL: 17.6 Pflop/s
• #2 in 11/13 TOP500

• sustained: ??

• Facility Support
• 4,352 sqft / 404 m2
• 8.2 MW

• Floating Point Support:
• 18,688 compute nodes
• 299,008 AMD Opteron “cores”
• 16 “cores” /node

• 18,688 NVIDIA Kepler (K20) GPUs
• 1 GPU/node

• Data Support
• 710 TiB of memory
• 584 TiB with CPUs (2GB/core)
• 126 TiB on GPUs (6 GB/GPU)

• 10 PB of disk storage
Current Systems in the US:
Sequoia (IBM BlueGene/Q @ LLNL)
• Performance:

• Floating Point Support:

• peak: 20 Pflop/s
• HPL: 17.2 Pflop/s
• #3 in 11/13 TOP500

• 98,304 compute nodes
• 1.57M PowerPC A2 cores
• 16 cores /node

• sustained: ??

• Facility Support
• 3,000 sqft / 280
• 7.9 MW

m2

• Data Support
• 1.5 PiB of memory
• 1GB/core

• 50 PB of disk storage
Current Systems in the US:
Mira (IBM BlueGene/Q @ ANL)
• Performance:

• Floating Point Support:

• peak: 10 Pflop/s
• HPL: 8.59 Pflop/s
• #5 in 11/13 TOP500

• 49,152 compute nodes
• 786,432 PowerPC A2 cores
• 16 cores /node

• sustained: ??

• Facility Support
• 1,500 sqft / 140
(estimated)
• 4.0 MW

m2

• Data Support
• 768 TiB of memory
• 1GB/core

• 35 PiB of disk storage
Current Systems in the US:
Stampede (Dell C8220 @ TACC)
• Performance
• Floating Point Support:
• peak: 9.6 Pflop/s
• HPL: 5.17 Pflop/s
• #7 in 11/13 TOP500

• sustained: ??

• Facility Support
• 8,000 sqft / 745 m2
• 4.5 MW

• 6,400 compute nodes
• 102,400 Intel SandyBridge cores
• 16 cores/node

• 6,880 Xeon Phi co-processors
• 1 Phi/node

• Data Support
• 270 TiB of memory
• 2GB/core (for most nodes)

• 14 PB of disk storage
Current Systems in the US:
Blue Waters (Cray XE6/XK7 @ NCSA)
• Performance
• Floating Point Support:
• peak: 13.3 Pflop/s
• HPL: n/a
• NOT in TOP500

• sustained: 1.0 Pflop/s

• Facility Support
• sqft / m2
• 14 MW

• 26,864 compute nodes
• 396,032 AMD Opteron “cores”
• 16 “cores” /node

• 4,224 NVIDIA Kepler (K20) GPUs
• 1 GPU/node on those nodes

• Data Support
• 1.5 PiB of memory
• 2GB/core

• 26.4 PiB of disk storage
Brain Dead Projection
• Simply build larger versions of current systems
• of course this is stupid, but we can learn a few things

• Assume:
• real application can get 5% of peak of system (optimistic!)
• applications can be scaled to necessary levels
CPU
Execution
Cores Accelerators Threads Memory Power Space
(M)
(M)
(B)
(PiB)
(MW) (M sqft)
Titan

226.8

Stampede
Blue
Waters

35.6

525.9 6,220

3.3

1607.7

Sequoia

14.2
0.0

1.6

1536.0 8,090

3.1

218.5

14.7

1.9

562.5 9,600

17.1

609.8

6.5

16.8

2309.8 21,558

7.7

• No avoiding O(1B) threads of execution
• Commodity solution will not get there first
• Likely need some combination of lower power processors and
accelerators
• ARM + Phi anyone?

• Memory will likely need to be <<0.5 GB/thread
Step-by-Step-Instructions
• How to create an exascale computing system in 1,200
easy steps

....

10
Trying to move Forward in the US: Politics
• “Exascale” fell out of favor in the US political scene
• it is recovering, but this slowed things down
• recently, Congress has become much more interested in the
scale of systems being deployed elsewhere around the world

• Legislation recently introduced to provide funding to
support moving forward
• suggests DOE-University partnerships
• could result in US$220M
Trying to move Forward in the US: Actions
•

DARPA UHPC program: 1 PFLOPS rack at 57 KW in 2015 (50
GFLOPS/W)
•

http://www.darpa.mil/Our_Work/MTO/Programs/Ubiquitous_High_Performance_Comp
uting_(UHPC).aspx

• 3 phases, like HPCS
• pushed by previous DARPA leadership
• seems to have less support from current leadership

•

DARPA studies:
• ExtremeScale Hardware Study (Kogge)
• http://users.ece.gatech.edu/~mrichard/ExascaleComputingStudyReport
s/exascale_final_report_100208.pdf
• ExtremeScale Software Study (Sarkar)
• http://users.ece.gatech.edu/~mrichard/ExascaleComputingStudyReport
s/ECSS%20report%20101909.pdf
• ExtremeScale Resiliency Study (Elnozahy)
• http://institutes.lanl.gov/resilience/docs/IBM%20Mootaz%20White%20P
aper%20System%20Resilience.pdf

•

NSF Efforts: no word…
Trying to move Forward in the US:
Community Efforts
• NSF supporting Big Data and Extreme-Scale Computing
• http://www.exascale.org/bdec/
• US + EU + Japan
• builds on IESP
• looks at crosscutting issues of Big Data and Extreme-Scale
computing
• DOE has sponsored a series of workshops recently
• Workshop on Modeling & Simulation of Exascale Systems &
Applications (Sep 2013)
• http://hpc.pnl.gov/modsim/2013/

• Workshop on Applied Mathematics Research for Exascale
Computing (Aug 2013)
• https://collab.mcs.anl.gov/display/examath/ExaMath13+Workshop

• Productive Programming Models for Exascale (Aug 2012)
• http://xsci.pnnl.gov/ppme/

Mais conteúdo relacionado

Semelhante a CSC2013: Exascale in the US

Mauricio breteernitiz hpc-exascale-iscte
Mauricio breteernitiz hpc-exascale-iscteMauricio breteernitiz hpc-exascale-iscte
Mauricio breteernitiz hpc-exascale-isctembreternitz
 
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast DataDatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast DataHakka Labs
 
Monitoring Exascale Supercomputers With Tim Osborne | Current 2022
Monitoring Exascale Supercomputers With Tim Osborne | Current 2022Monitoring Exascale Supercomputers With Tim Osborne | Current 2022
Monitoring Exascale Supercomputers With Tim Osborne | Current 2022HostedbyConfluent
 
Enabling Insight to Support World-Class Supercomputing (Stefan Ceballos, Oak ...
Enabling Insight to Support World-Class Supercomputing (Stefan Ceballos, Oak ...Enabling Insight to Support World-Class Supercomputing (Stefan Ceballos, Oak ...
Enabling Insight to Support World-Class Supercomputing (Stefan Ceballos, Oak ...confluent
 
From the Benchtop to the Datacenter: HPC Requirements in Life Science Research
From the Benchtop to the Datacenter: HPC Requirements in Life Science ResearchFrom the Benchtop to the Datacenter: HPC Requirements in Life Science Research
From the Benchtop to the Datacenter: HPC Requirements in Life Science ResearchAri Berman
 
Ceph on the Brain: Storage and Data-Movement Supporting the Human Brain Project
Ceph on the Brain: Storage and Data-Movement Supporting the Human Brain ProjectCeph on the Brain: Storage and Data-Movement Supporting the Human Brain Project
Ceph on the Brain: Storage and Data-Movement Supporting the Human Brain Projectinside-BigData.com
 
Lessons Learned on Benchmarking Big Data Platforms
Lessons Learned on Benchmarking  Big Data PlatformsLessons Learned on Benchmarking  Big Data Platforms
Lessons Learned on Benchmarking Big Data Platformst_ivanov
 
Parallelism Processor Design
Parallelism Processor DesignParallelism Processor Design
Parallelism Processor DesignSri Prasanna
 
High Performance Computing Presentation
High Performance Computing PresentationHigh Performance Computing Presentation
High Performance Computing Presentationomar altayyan
 
How Texas Instruments Uses InfluxDB to Uphold Product Standards and to Improv...
How Texas Instruments Uses InfluxDB to Uphold Product Standards and to Improv...How Texas Instruments Uses InfluxDB to Uphold Product Standards and to Improv...
How Texas Instruments Uses InfluxDB to Uphold Product Standards and to Improv...DevOps.com
 
Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration ...
Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration ...Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration ...
Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration ...Alluxio, Inc.
 
Building a Distributed File System for the Cloud-Native Era
Building a Distributed File System for the Cloud-Native EraBuilding a Distributed File System for the Cloud-Native Era
Building a Distributed File System for the Cloud-Native EraAlluxio, Inc.
 
How Texas Instruments Uses InfluxDB to Uphold Product Standards and to Improv...
How Texas Instruments Uses InfluxDB to Uphold Product Standards and to Improv...How Texas Instruments Uses InfluxDB to Uphold Product Standards and to Improv...
How Texas Instruments Uses InfluxDB to Uphold Product Standards and to Improv...InfluxData
 
DARPA ERI Summit 2018: The End of Moore’s Law & Faster General Purpose Comput...
DARPA ERI Summit 2018: The End of Moore’s Law & Faster General Purpose Comput...DARPA ERI Summit 2018: The End of Moore’s Law & Faster General Purpose Comput...
DARPA ERI Summit 2018: The End of Moore’s Law & Faster General Purpose Comput...zionsaint
 

Semelhante a CSC2013: Exascale in the US (20)

supercomputer
supercomputersupercomputer
supercomputer
 
Mauricio breteernitiz hpc-exascale-iscte
Mauricio breteernitiz hpc-exascale-iscteMauricio breteernitiz hpc-exascale-iscte
Mauricio breteernitiz hpc-exascale-iscte
 
Pdc lecture1
Pdc lecture1Pdc lecture1
Pdc lecture1
 
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast DataDatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
 
Super Computers
Super ComputersSuper Computers
Super Computers
 
Monitoring Exascale Supercomputers With Tim Osborne | Current 2022
Monitoring Exascale Supercomputers With Tim Osborne | Current 2022Monitoring Exascale Supercomputers With Tim Osborne | Current 2022
Monitoring Exascale Supercomputers With Tim Osborne | Current 2022
 
Enabling Insight to Support World-Class Supercomputing (Stefan Ceballos, Oak ...
Enabling Insight to Support World-Class Supercomputing (Stefan Ceballos, Oak ...Enabling Insight to Support World-Class Supercomputing (Stefan Ceballos, Oak ...
Enabling Insight to Support World-Class Supercomputing (Stefan Ceballos, Oak ...
 
From the Benchtop to the Datacenter: HPC Requirements in Life Science Research
From the Benchtop to the Datacenter: HPC Requirements in Life Science ResearchFrom the Benchtop to the Datacenter: HPC Requirements in Life Science Research
From the Benchtop to the Datacenter: HPC Requirements in Life Science Research
 
Ceph on the Brain: Storage and Data-Movement Supporting the Human Brain Project
Ceph on the Brain: Storage and Data-Movement Supporting the Human Brain ProjectCeph on the Brain: Storage and Data-Movement Supporting the Human Brain Project
Ceph on the Brain: Storage and Data-Movement Supporting the Human Brain Project
 
Microprocessors
MicroprocessorsMicroprocessors
Microprocessors
 
Microprocessors
MicroprocessorsMicroprocessors
Microprocessors
 
Lessons Learned on Benchmarking Big Data Platforms
Lessons Learned on Benchmarking  Big Data PlatformsLessons Learned on Benchmarking  Big Data Platforms
Lessons Learned on Benchmarking Big Data Platforms
 
Supercomputer @ manarat university by reza
Supercomputer  @ manarat university by rezaSupercomputer  @ manarat university by reza
Supercomputer @ manarat university by reza
 
Parallelism Processor Design
Parallelism Processor DesignParallelism Processor Design
Parallelism Processor Design
 
High Performance Computing Presentation
High Performance Computing PresentationHigh Performance Computing Presentation
High Performance Computing Presentation
 
How Texas Instruments Uses InfluxDB to Uphold Product Standards and to Improv...
How Texas Instruments Uses InfluxDB to Uphold Product Standards and to Improv...How Texas Instruments Uses InfluxDB to Uphold Product Standards and to Improv...
How Texas Instruments Uses InfluxDB to Uphold Product Standards and to Improv...
 
Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration ...
Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration ...Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration ...
Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration ...
 
Building a Distributed File System for the Cloud-Native Era
Building a Distributed File System for the Cloud-Native EraBuilding a Distributed File System for the Cloud-Native Era
Building a Distributed File System for the Cloud-Native Era
 
How Texas Instruments Uses InfluxDB to Uphold Product Standards and to Improv...
How Texas Instruments Uses InfluxDB to Uphold Product Standards and to Improv...How Texas Instruments Uses InfluxDB to Uphold Product Standards and to Improv...
How Texas Instruments Uses InfluxDB to Uphold Product Standards and to Improv...
 
DARPA ERI Summit 2018: The End of Moore’s Law & Faster General Purpose Comput...
DARPA ERI Summit 2018: The End of Moore’s Law & Faster General Purpose Comput...DARPA ERI Summit 2018: The End of Moore’s Law & Faster General Purpose Comput...
DARPA ERI Summit 2018: The End of Moore’s Law & Faster General Purpose Comput...
 

Mais de John Towns

XSEDE National Cyberinfrastructure, NIST, and Supporting NCSI Objectives
XSEDE National Cyberinfrastructure, NIST, and Supporting NCSI Objectives XSEDE National Cyberinfrastructure, NIST, and Supporting NCSI Objectives
XSEDE National Cyberinfrastructure, NIST, and Supporting NCSI Objectives John Towns
 
Research IT @ Illinois: Establishing Service Responsive to Investigator Needs
Research IT @ Illinois: Establishing Service Responsive to Investigator NeedsResearch IT @ Illinois: Establishing Service Responsive to Investigator Needs
Research IT @ Illinois: Establishing Service Responsive to Investigator NeedsJohn Towns
 
Research IT @ Illinois: Establishing Service Responsive to Investigator Needs
Research IT @ Illinois: Establishing Service Responsive to Investigator NeedsResearch IT @ Illinois: Establishing Service Responsive to Investigator Needs
Research IT @ Illinois: Establishing Service Responsive to Investigator NeedsJohn Towns
 
XSEDE and National Cyberinfrastructure
XSEDE and National CyberinfrastructureXSEDE and National Cyberinfrastructure
XSEDE and National CyberinfrastructureJohn Towns
 
Supporting Research Communities with XSEDE
Supporting Research Communities with XSEDESupporting Research Communities with XSEDE
Supporting Research Communities with XSEDEJohn Towns
 
Supporting Research Communities with XSEDE
Supporting Research Communities with XSEDESupporting Research Communities with XSEDE
Supporting Research Communities with XSEDEJohn Towns
 
Introduction to XSEDE 2.0
Introduction to XSEDE 2.0Introduction to XSEDE 2.0
Introduction to XSEDE 2.0John Towns
 
Overview of XSEDE Systems Engineering
Overview of XSEDE Systems EngineeringOverview of XSEDE Systems Engineering
Overview of XSEDE Systems EngineeringJohn Towns
 
Overview of XSEDE and Introduction to XSEDE 2.0 and Beyond
Overview of XSEDE and Introduction to XSEDE 2.0 and BeyondOverview of XSEDE and Introduction to XSEDE 2.0 and Beyond
Overview of XSEDE and Introduction to XSEDE 2.0 and BeyondJohn Towns
 
XSEDE at Brainstorm HPCD Jan2015
XSEDE at Brainstorm HPCD Jan2015XSEDE at Brainstorm HPCD Jan2015
XSEDE at Brainstorm HPCD Jan2015John Towns
 
Supporting Research Communities with XSEDE
Supporting Research Communities with XSEDESupporting Research Communities with XSEDE
Supporting Research Communities with XSEDEJohn Towns
 
Year of ci overview
Year of ci overviewYear of ci overview
Year of ci overviewJohn Towns
 
XSEDE: an ecosystem of advanced digital services accelerating scientific disc...
XSEDE: an ecosystem of advanced digital services accelerating scientific disc...XSEDE: an ecosystem of advanced digital services accelerating scientific disc...
XSEDE: an ecosystem of advanced digital services accelerating scientific disc...John Towns
 
State of XSEDE: July 2014
State of XSEDE: July 2014State of XSEDE: July 2014
State of XSEDE: July 2014John Towns
 
XSEDE Overview (March 2014)
XSEDE Overview (March 2014)XSEDE Overview (March 2014)
XSEDE Overview (March 2014)John Towns
 
ARCC National Perspective Panel: XSEDE (Towns)
ARCC National Perspective Panel: XSEDE (Towns)ARCC National Perspective Panel: XSEDE (Towns)
ARCC National Perspective Panel: XSEDE (Towns)John Towns
 
XSEDE13: State of XSEDE
XSEDE13: State of XSEDEXSEDE13: State of XSEDE
XSEDE13: State of XSEDEJohn Towns
 
Xsede for-nlhpc
Xsede for-nlhpcXsede for-nlhpc
Xsede for-nlhpcJohn Towns
 
CASC Sustainability Workshop, Panel 3: Models, challenges and opportunities
CASC Sustainability Workshop, Panel 3: Models, challenges and opportunitiesCASC Sustainability Workshop, Panel 3: Models, challenges and opportunities
CASC Sustainability Workshop, Panel 3: Models, challenges and opportunitiesJohn Towns
 
Tapia fireside chat-towns
Tapia fireside chat-townsTapia fireside chat-towns
Tapia fireside chat-townsJohn Towns
 

Mais de John Towns (20)

XSEDE National Cyberinfrastructure, NIST, and Supporting NCSI Objectives
XSEDE National Cyberinfrastructure, NIST, and Supporting NCSI Objectives XSEDE National Cyberinfrastructure, NIST, and Supporting NCSI Objectives
XSEDE National Cyberinfrastructure, NIST, and Supporting NCSI Objectives
 
Research IT @ Illinois: Establishing Service Responsive to Investigator Needs
Research IT @ Illinois: Establishing Service Responsive to Investigator NeedsResearch IT @ Illinois: Establishing Service Responsive to Investigator Needs
Research IT @ Illinois: Establishing Service Responsive to Investigator Needs
 
Research IT @ Illinois: Establishing Service Responsive to Investigator Needs
Research IT @ Illinois: Establishing Service Responsive to Investigator NeedsResearch IT @ Illinois: Establishing Service Responsive to Investigator Needs
Research IT @ Illinois: Establishing Service Responsive to Investigator Needs
 
XSEDE and National Cyberinfrastructure
XSEDE and National CyberinfrastructureXSEDE and National Cyberinfrastructure
XSEDE and National Cyberinfrastructure
 
Supporting Research Communities with XSEDE
Supporting Research Communities with XSEDESupporting Research Communities with XSEDE
Supporting Research Communities with XSEDE
 
Supporting Research Communities with XSEDE
Supporting Research Communities with XSEDESupporting Research Communities with XSEDE
Supporting Research Communities with XSEDE
 
Introduction to XSEDE 2.0
Introduction to XSEDE 2.0Introduction to XSEDE 2.0
Introduction to XSEDE 2.0
 
Overview of XSEDE Systems Engineering
Overview of XSEDE Systems EngineeringOverview of XSEDE Systems Engineering
Overview of XSEDE Systems Engineering
 
Overview of XSEDE and Introduction to XSEDE 2.0 and Beyond
Overview of XSEDE and Introduction to XSEDE 2.0 and BeyondOverview of XSEDE and Introduction to XSEDE 2.0 and Beyond
Overview of XSEDE and Introduction to XSEDE 2.0 and Beyond
 
XSEDE at Brainstorm HPCD Jan2015
XSEDE at Brainstorm HPCD Jan2015XSEDE at Brainstorm HPCD Jan2015
XSEDE at Brainstorm HPCD Jan2015
 
Supporting Research Communities with XSEDE
Supporting Research Communities with XSEDESupporting Research Communities with XSEDE
Supporting Research Communities with XSEDE
 
Year of ci overview
Year of ci overviewYear of ci overview
Year of ci overview
 
XSEDE: an ecosystem of advanced digital services accelerating scientific disc...
XSEDE: an ecosystem of advanced digital services accelerating scientific disc...XSEDE: an ecosystem of advanced digital services accelerating scientific disc...
XSEDE: an ecosystem of advanced digital services accelerating scientific disc...
 
State of XSEDE: July 2014
State of XSEDE: July 2014State of XSEDE: July 2014
State of XSEDE: July 2014
 
XSEDE Overview (March 2014)
XSEDE Overview (March 2014)XSEDE Overview (March 2014)
XSEDE Overview (March 2014)
 
ARCC National Perspective Panel: XSEDE (Towns)
ARCC National Perspective Panel: XSEDE (Towns)ARCC National Perspective Panel: XSEDE (Towns)
ARCC National Perspective Panel: XSEDE (Towns)
 
XSEDE13: State of XSEDE
XSEDE13: State of XSEDEXSEDE13: State of XSEDE
XSEDE13: State of XSEDE
 
Xsede for-nlhpc
Xsede for-nlhpcXsede for-nlhpc
Xsede for-nlhpc
 
CASC Sustainability Workshop, Panel 3: Models, challenges and opportunities
CASC Sustainability Workshop, Panel 3: Models, challenges and opportunitiesCASC Sustainability Workshop, Panel 3: Models, challenges and opportunities
CASC Sustainability Workshop, Panel 3: Models, challenges and opportunities
 
Tapia fireside chat-towns
Tapia fireside chat-townsTapia fireside chat-towns
Tapia fireside chat-towns
 

Último

Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 

Último (20)

Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 

CSC2013: Exascale in the US

  • 1. Exascale in the US John Towns Director, Collaborative eScience Program Office, NCSA jtowns@ncsa.illinois.edu National Center for Supercomputing Applications University of Illinois at Urbana–Champaign
  • 2. Setting Some Context • It is a bad idea to project too far in the future based on current technology • but provides a basis for worst case scenarios • Challenge: build a system that will support sustained 1.0 EFLOP/s performance • assume: code can sustain 5% of peak performance • need a 20 Eflop/s system • assume: no constraints on power, parallelism, MTBF, scales of interconnects,… • assume: application can be written • i.e. ignore scaling, thread counts, message passing issues, memory size constraints, languages, libraries, development tools, … • Let’s try to do this with today’s technology
  • 3. Current Systems in the US: Titan (Cray XK7 @ ORNL) • Performance • peak: 27 Pflop/s • HPL: 17.6 Pflop/s • #2 in 11/13 TOP500 • sustained: ?? • Facility Support • 4,352 sqft / 404 m2 • 8.2 MW • Floating Point Support: • 18,688 compute nodes • 299,008 AMD Opteron “cores” • 16 “cores” /node • 18,688 NVIDIA Kepler (K20) GPUs • 1 GPU/node • Data Support • 710 TiB of memory • 584 TiB with CPUs (2GB/core) • 126 TiB on GPUs (6 GB/GPU) • 10 PB of disk storage
  • 4. Current Systems in the US: Sequoia (IBM BlueGene/Q @ LLNL) • Performance: • Floating Point Support: • peak: 20 Pflop/s • HPL: 17.2 Pflop/s • #3 in 11/13 TOP500 • 98,304 compute nodes • 1.57M PowerPC A2 cores • 16 cores /node • sustained: ?? • Facility Support • 3,000 sqft / 280 • 7.9 MW m2 • Data Support • 1.5 PiB of memory • 1GB/core • 50 PB of disk storage
  • 5. Current Systems in the US: Mira (IBM BlueGene/Q @ ANL) • Performance: • Floating Point Support: • peak: 10 Pflop/s • HPL: 8.59 Pflop/s • #5 in 11/13 TOP500 • 49,152 compute nodes • 786,432 PowerPC A2 cores • 16 cores /node • sustained: ?? • Facility Support • 1,500 sqft / 140 (estimated) • 4.0 MW m2 • Data Support • 768 TiB of memory • 1GB/core • 35 PiB of disk storage
  • 6. Current Systems in the US: Stampede (Dell C8220 @ TACC) • Performance • Floating Point Support: • peak: 9.6 Pflop/s • HPL: 5.17 Pflop/s • #7 in 11/13 TOP500 • sustained: ?? • Facility Support • 8,000 sqft / 745 m2 • 4.5 MW • 6,400 compute nodes • 102,400 Intel SandyBridge cores • 16 cores/node • 6,880 Xeon Phi co-processors • 1 Phi/node • Data Support • 270 TiB of memory • 2GB/core (for most nodes) • 14 PB of disk storage
  • 7. Current Systems in the US: Blue Waters (Cray XE6/XK7 @ NCSA) • Performance • Floating Point Support: • peak: 13.3 Pflop/s • HPL: n/a • NOT in TOP500 • sustained: 1.0 Pflop/s • Facility Support • sqft / m2 • 14 MW • 26,864 compute nodes • 396,032 AMD Opteron “cores” • 16 “cores” /node • 4,224 NVIDIA Kepler (K20) GPUs • 1 GPU/node on those nodes • Data Support • 1.5 PiB of memory • 2GB/core • 26.4 PiB of disk storage
  • 8. Brain Dead Projection • Simply build larger versions of current systems • of course this is stupid, but we can learn a few things • Assume: • real application can get 5% of peak of system (optimistic!) • applications can be scaled to necessary levels
  • 9. CPU Execution Cores Accelerators Threads Memory Power Space (M) (M) (B) (PiB) (MW) (M sqft) Titan 226.8 Stampede Blue Waters 35.6 525.9 6,220 3.3 1607.7 Sequoia 14.2 0.0 1.6 1536.0 8,090 3.1 218.5 14.7 1.9 562.5 9,600 17.1 609.8 6.5 16.8 2309.8 21,558 7.7 • No avoiding O(1B) threads of execution • Commodity solution will not get there first • Likely need some combination of lower power processors and accelerators • ARM + Phi anyone? • Memory will likely need to be <<0.5 GB/thread
  • 10. Step-by-Step-Instructions • How to create an exascale computing system in 1,200 easy steps .... 10
  • 11. Trying to move Forward in the US: Politics • “Exascale” fell out of favor in the US political scene • it is recovering, but this slowed things down • recently, Congress has become much more interested in the scale of systems being deployed elsewhere around the world • Legislation recently introduced to provide funding to support moving forward • suggests DOE-University partnerships • could result in US$220M
  • 12. Trying to move Forward in the US: Actions • DARPA UHPC program: 1 PFLOPS rack at 57 KW in 2015 (50 GFLOPS/W) • http://www.darpa.mil/Our_Work/MTO/Programs/Ubiquitous_High_Performance_Comp uting_(UHPC).aspx • 3 phases, like HPCS • pushed by previous DARPA leadership • seems to have less support from current leadership • DARPA studies: • ExtremeScale Hardware Study (Kogge) • http://users.ece.gatech.edu/~mrichard/ExascaleComputingStudyReport s/exascale_final_report_100208.pdf • ExtremeScale Software Study (Sarkar) • http://users.ece.gatech.edu/~mrichard/ExascaleComputingStudyReport s/ECSS%20report%20101909.pdf • ExtremeScale Resiliency Study (Elnozahy) • http://institutes.lanl.gov/resilience/docs/IBM%20Mootaz%20White%20P aper%20System%20Resilience.pdf • NSF Efforts: no word…
  • 13. Trying to move Forward in the US: Community Efforts • NSF supporting Big Data and Extreme-Scale Computing • http://www.exascale.org/bdec/ • US + EU + Japan • builds on IESP • looks at crosscutting issues of Big Data and Extreme-Scale computing • DOE has sponsored a series of workshops recently • Workshop on Modeling & Simulation of Exascale Systems & Applications (Sep 2013) • http://hpc.pnl.gov/modsim/2013/ • Workshop on Applied Mathematics Research for Exascale Computing (Aug 2013) • https://collab.mcs.anl.gov/display/examath/ExaMath13+Workshop • Productive Programming Models for Exascale (Aug 2012) • http://xsci.pnnl.gov/ppme/