How Parallelware technology eases
HPC software development for
POWER systems
Parallelware Analyzer and GPU programming for big code bases
Manuel Arenaz
manuel.arenaz@appentra.com
OpenPOWER Academic Discussion Group Workshop 2018
Saturday, 10 November 2018 | Dallas, US
https://indico-jsc.fz-juelich.de/event/76/
Index
● Is There a Need for Parallelware Tools on POWER Systems?
● The Parallelware Tools Suite: Design of Software Components
● Parallelware Technology: Roadmap 2018-2020
● Parallelware Trainer: Roadmap 2018-2019
● Parallelware Analyzer: Roadmap 2018-2019
● Conclusions & Future Work
Is there a need for Parallelware tools on Power?
Incredible computational power in one full node of Summit!
Full Node Capabilities
Processor  | POWER9                                | V100
Count      | 2                                     | 6
FLOPS (SP) | 2.161 TFLOPS (2 × 22 × 49.12 GFLOPS)  | 94.2 TFLOPS (6 × 15.7 TFLOPS)
FLOPS (DP) | 1.081 TFLOPS (2 × 22 × 24.56 GFLOPS)  | 46.8 TFLOPS (6 × 7.8 TFLOPS)
AI FLOPS   | -                                     | 750 TFLOPS (6 × 125 TFLOPS)
Memory     | 512 GiB DDR4 (16 × 32 GiB)            | 96 GiB HBM2 (6 × 16 GiB)
Bandwidth  | 341.33 GB/s (16 × 21.33 GB/s)         | 900 GB/s per GPU
What are Summit’s parallel programming challenges?
● Parallel programming of many-core processors
● Parallel programming of multiple GPUs (multi-GPU)
● Data movement through a heterogeneous complex memory hierarchy
● Training of computational researchers and engineers
● Porting of existing codes to (pre-)exascale systems
How can Parallelware tools help to address these challenges?
Parallelware Tools Suite: Software Components
1. Parallelware Trainer: GUI desktop tool (Emerging Technology)
2. Parallelware Analyzer: command-line tool (Emerging Technology)
Parallelware Technology (libpw)
● Parallelware front-end: C
● Parallelware middle-end: Semantic Analysis Engine
● Parallelware back-end: OpenMP 4.5 and OpenACC 2.0 (multi-threading, offloading)
Parallelware Technology: Roadmap 2018-2020
The H2020 FETHPC projects MAESTRO and EPEEC will enable an incremental development following a co-design approach guided by (pre-)exascale applications.
● GUI desktop tool (Emerging Technology)
● Command-line tool (Emerging Technology)
Parallelware Technology (libpw)
● Parallelware front-end: C; added on the roadmap: C++, Fortran
● Parallelware middle-end: Semantic Analysis Engine
● Parallelware back-end: OpenMP 4.5 and OpenACC 2.0 (multi-threading, offloading); added on the roadmap: tasking, OmpSs, FPGAs
http://www.prace-ri.eu/pracesc18-presentations/
An interactive tool that acts as your mentor
“Tell me, I will forget,
Show me, I may remember,
Involve me, I will understand.”
Emerging Technologies
Parallelware Trainer (v1.0 Sep 2018)
Technical features:
● Identification of parallelization opportunities.
● Assistance with the introduction of correct OpenMP and OpenACC directives.
● Correct data scoping, including private/shared variables.
● Support for the C programming language.
● Use any compiler and any build/compilation tool on Windows, Linux and macOS.
● Develop, test and benchmark all within the same interface.
Benefits:
● Faster, more effective learning.
● Work on realistic codes rather than toy examples, including your own code.
● Reduced learning curve.
● Parallelize code within minutes.
● Immediate identification of where and how to parallelize.
● Support for multithreading and offloading to GPUs.
https://www.appentra.com/products/parallelware-trainer/
[Screenshot: Parallelware Trainer interface with Project Explorer, Code Editor, Version Manager and Output Consoles]
THE HPC EDUCATION & TRAINING PYRAMID
[Figure: pyramid with three levels: passive learning (lectures); interactive learning (exercises, demonstrations); learning by doing with mentors (problem solving). Training formats shown: courses, workshops, hackathons. Axes: interactivity, learning effort, scalability.]
Parallelware Trainer
New knowledge base (glossary of parallel programming)
A command-line reporting tool to improve productivity of HPC application developers
Emerging Technologies
Methodological Approach to Parallel Programming
[Figure: milestones MS0 to MS8. Stage labels shown: Preparing for the hackathon; Working version; Profiled version; Code Analysis; Working OpenMP parallel version; Working OpenACC parallel version; Tuned OpenMP parallel version; Tuned OpenACC parallel version; Highly optimized OpenMP+OpenACC parallel version]
The biggest parallelisation barrier is “Code Analysis”: data scoping across procedure boundaries in codes that use complex in-memory data layouts.
Parallelware Analyzer (Beta)
● Help to understand where and how to parallelize in real codes.
● Reports to facilitate understanding the code from different perspectives.
● Batch processing of files/directories of big code bases.
Report “--datascoping”
Parallelware Analyzer: Roadmap 2019+
www.appentra.com/products/parallelware-analyzer
● Early Access Program (EAP) Q1-Q2 2019
○ Students/Researchers/Developers
○ Academia/Industry
● Create a community to ensure development is aligned with the user communities' needs
○ Discussion forums
● Benefits
○ Free access to the tool
○ Early adopter discounts when the product is officially launched
● Official launch of Parallelware Analyzer:
○ Expected to be at SC19
● Sign up now for the EAP starting 1 Jan 2019
Conclusions & Future Work
Conclusions
● Parallelware tools are advancing the state of the art in addressing parallel programming challenges, and can ease parallel programming for POWER systems too.
○ Parallelware Trainer can help in training for POWER systems using OpenMP and OpenACC for multicores and GPUs.
● Parallelware Analyzer (Beta) helps to dive into the complexity of developing HPC codes, covering the computational, control-flow and memory perspectives.
○ Data scoping in big code bases is probably pain point #1.
Coming soon...
● Distribute binary packages of Parallelware tools for POWER systems.
● Certify Parallelware tools as OpenPOWER Ready.
SC18 Emerging Technologies: booth #619
SC18 Startup Pavilion: booth #3869
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 

Parallelware Tools Ease HPC Development on POWER

  • 1. How Parallelware technology eases HPC software development for POWER systems Parallelware Analyzer and GPU programming for big code bases Manuel Arenaz manuel.arenaz@appentra.com OpenPOWER Academic Discussion Group Workshop 2018 Saturday, 10 November 2018 | Dallas, US https://indico-jsc.fz-juelich.de/event/76/
  • 2. Index ● Is There a Need for Parallelware Tools on POWER Systems? ● The Parallelware Tools Suite: Design of Software Components ● Parallelware Technology: Roadmap 2018-2020 ● Parallelware Trainer: Roadmap 2018-2019 ● Parallelware Analyzer: Roadmap 2018-2019 ● Conclusions & Future Work
  • 4. Is there a need for Parallelware tools on Power?
  • 5. Is there a need for Parallelware tools on Power? Incredible computational power in one full node of Summit!
    Full Node Capabilities (POWER9 | V100):
    Count: 2 | 6
    FLOPS (SP): 2.161 TFLOPS (2 × 22 × 49.12 GFLOPS) | 94.2 TFLOPS (6 × 15.7 TFLOPS)
    FLOPS (DP): 1.081 TFLOPS (2 × 22 × 24.56 GFLOPS) | 46.8 TFLOPS (6 × 7.8 TFLOPS)
    AI FLOPS: - | 750 TFLOPS (6 × 125 TFLOPS)
    Memory: 512 GiB DDR4 (16 × 32 GiB) | 96 GiB HBM2 (6 × 16 GiB)
    Bandwidth: 341.33 GB/s (16 × 21.33 GB/s) | 900 GB/s per GPU
    What are Summit’s parallel programming challenges?
  • 6. Is there a need for Parallelware tools on Power? ● Parallel programming of many-core processors ● Parallel programming of multiple GPUs (multi-GPU) ● Data movement through a heterogeneous complex memory hierarchy ● Training of computational researchers and engineers ● Porting of existing codes to (pre-)exascale systems How can Parallelware tools help to address these challenges?
  • 8. Parallelware Tools Suite: Software Components. Tools: 1. Parallelware Trainer (GUI desktop, Emerging Technology); 2. Parallelware Analyzer (command-line tool, Emerging Technology). Parallelware Technology (libpw): Parallelware front-end, Parallelware middle-end, Parallelware back-end. Diagram labels: Semantic Analysis Engine, C, OpenMP 4.5, OpenACC 2.0, Multi-Threading, Offloading.
  • 10. Parallelware Technology: Roadmap 2018-2020. The H2020 FETHPC projects MAESTRO and EPEEC will enable an incremental development following a co-design approach guided by (pre-)exascale applications. Tools: Parallelware Trainer (GUI desktop, Emerging Technology); Parallelware Analyzer (command-line tool, Emerging Technology). Parallelware Technology (libpw): Parallelware front-end, Parallelware middle-end, Parallelware back-end. Diagram labels: Semantic Analysis Engine, C, OpenMP 4.5, OpenACC 2.0, Multi-Threading, Offloading, C++, Fortran, Tasking, OmpSs, FPGAs. http://www.prace-ri.eu/pracesc18-presentations/
  • 12. An interactive tool that acts as your mentor. “Tell me, I will forget. Show me, I may remember. Involve me, I will understand.” Emerging Technologies
  • 13. Parallelware Trainer (v1.0 Sep 2018) Technical features: ● Identification of parallelization opportunities. ● Assistance with the introduction of correct OpenMP and OpenACC directives. ● Correct data scoping, including private/shared variables. ● Support for C programming language. ● Use any compiler and any build/compilation tool in Windows, Linux and MacOSX. ● Develop, test and benchmark all within the same interface. Benefits: ● Faster, more effective learning. ● Work on realistic codes rather than toy examples, including your own code. ● Reduced learning curve. ● Parallelize code within minutes. ● Immediate identification of where and how to parallelize. ● Support for multithreading, offloading to GPUs. https://www.appentra.com/products/parallelware-trainer/
  • 14. Parallelware Trainer (v1.0 Sep 2018). Interface panels: Project Explorer, Code Editor, Version Manager, Output Consoles.
  • 15. The HPC Education & Training Pyramid (Parallelware Trainer). Levels: passive learning (lectures); interactive learning (exercises, demonstrations); learning by doing with mentors (problem solving). Formats: courses, workshops, hackathons. Axes: interactivity, learning effort, scalability.
  • 16. New knowledge base (glossary of parallel programming) Parallelware Trainer
  • 18. A command-line reporting tool to improve productivity of HPC application developers Emerging Technologies
  • 19. Methodological Approach to Parallel Programming. Milestones MS0 to MS8. Stages shown: Preparing for the hackathon; Working version; Profiled version; Code Analysis; Working OpenMP parallel version; Working OpenACC parallel version; Tuned OpenMP parallel version; Tuned OpenACC parallel version; Highly optimized OpenMP+OpenACC parallel version. The biggest parallelisation barrier is “Code Analysis”: data scoping across procedure boundaries in codes using complex in-memory data layouts.
  • 20. Parallelware Analyzer (Beta) ● Help to understand where and how to parallelize in real codes. ● Reports to facilitate understanding the code from different perspectives. ● Batch processing of files/directories of big code bases.
  • 22. Parallelware Analyzer: Roadmap 2019+ www.appentra.com/products/parallelware-analyzer ● Early Access Program (EAP) Q1-Q2 2019 ○ Students/Researchers/Developers ○ Academia/Industry ● Create a community to keep development aligned with the needs of the user communities ○ Discussion forums ● Benefits ○ Free access to the tool ○ Early adopter discounts when the product is officially launched ● Official launch of Parallelware Analyzer: ○ Expected to be at SC19 ● Sign up now for the EAP starting 1 Jan 2019
  • 24. Conclusions & Future Work. Conclusions ● Parallelware tools advance the state of the art in addressing parallel programming challenges, and can ease parallel programming on POWER systems too. ○ Parallelware Trainer can help in training for POWER systems using OpenMP and OpenACC for multicores and GPUs. ● Parallelware Analyzer (BETA) helps to dive into the complexity of developing HPC codes, covering the computational, control-flow and memory perspectives. ○ Data scoping in big code bases is probably pain point #1. Coming soon... ● Distribute binary packages of Parallelware tools for POWER systems. ● Certify Parallelware tools as OpenPOWER Ready. SC18 Emerging Technologies booth #619. SC18 Startup Pavilion booth #3869.
  • 25. How Parallelware technology eases HPC software development for POWER systems. Parallelware Analyzer and GPU programming for big code bases Manuel Arenaz manuel.arenaz@appentra.com OpenPOWER Academic Discussion Group Workshop 2018 Saturday, 10 November 2018 | Dallas, US https://indico-jsc.fz-juelich.de/event/76/
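
The slides name correct data scoping (private/shared variables) as a Trainer feature and call data scoping in big code bases "probably pain point #1". As a minimal sketch of what explicit scoping looks like in C with an OpenMP directive, the function below scopes every variable by hand. The kernel and the name `scaled_sum_of_squares` are hypothetical illustrations, not taken from the talk and not output of any Parallelware tool.

```c
#include <stddef.h>

/* Hypothetical kernel (an assumption for illustration only): computes the
 * sum of (scale * x[i])^2. It needs three scoping classes at once. */
double scaled_sum_of_squares(const double *x, size_t n, double scale)
{
    double total = 0.0; /* reduction: per-thread partial sums, combined at the end */
    double tmp;         /* private: scratch value rewritten in every iteration */

    /* default(none) makes the compiler reject any variable that is not
     * scoped explicitly, so a forgotten variable is a build error rather
     * than a silent data race. The loop index i is private automatically. */
    #pragma omp parallel for default(none) shared(x, n, scale) private(tmp) reduction(+:total)
    for (size_t i = 0; i < n; i++) {
        tmp = scale * x[i];
        total += tmp * tmp;
    }
    return total;
}
```

With GCC, for example, the directive takes effect when compiling with `-fopenmp`; without that flag the pragma is ignored and the loop runs serially. Leaving `tmp` shared instead of private would introduce a race between threads, which is the kind of mistake that becomes hard to spot once the variables cross procedure boundaries.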