SlideShare uma empresa Scribd logo
1 de 24
Baixar para ler offline
34324 - Measurement Tools and Techniques



              Instrumenting
            the MG application
        of NAS Parallel Benchmark


               Maria Stylianou
             marsty5@gmail.com
                 20-APR-2012
Outline
●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

                                     2
Outline

●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

    1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   3
                               (c) with Histograms, 3 - Conclusions
Basic Information
Execution Environments
●   Personal Laptop
    ●   Ubuntu 11.10, 64-bit
    ●   Intel Quad Core i5
    ●   4GB RAM


●   Boada Server
    ●   Intel(R) Xeon(R) CPU E5645 @ 2.40GHz
    ●   12 Cores with HT support
    ●   24 GΒ RAM
     1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   4
                                (c) with Histograms, 3 - Conclusions
Basic Information
NAS Parallel Benchmark
●   Evaluate the performance of parallel supercomputers

●   Several Applications                                   MG – MPI Version
    ●    IS, EP, CG, MG                                   Multi-Grid on a sequence
    ●    FT, BT, SP, LU                                              of meshes



●   Extrae → Produce traces
●   Paraver → Analyse traces
        1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   5
                                   (c) with Histograms, 3 - Conclusions
Outline
●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

    1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   6
                               (c) with Histograms, 3 - Conclusions
Instrumentation by Observation




1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   7
                           (c) with Histograms, 3 - Conclusions
Instrumentation by Observation




1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   8
                           (c) with Histograms, 3 - Conclusions
Instrumentation by Observation




1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   9
                           (c) with Histograms, 3 - Conclusions
Instrumentation by Observation
Initialization




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   10
                             (c) with Histograms, 3 - Conclusions
Instrumentation by Observation
Execution




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   11
                             (c) with Histograms, 3 - Conclusions
Instrumentation by Observation
Finalization




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   12
                             (c) with Histograms, 3 - Conclusions
Outline
●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

    1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   13
                                (c) with Histograms, 3 - Conclusions
Instrumentation
            using Performance Counters
Instructions




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   14
                              (c) with Histograms, 3 - Conclusions
Instrumentation
            using Performance Counters
Cycles




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   15
                              (c) with Histograms, 3 - Conclusions
Instrumentation
            using Performance Counters
IPC: Instructions Per Cycle




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   16
                              (c) with Histograms, 3 - Conclusions
Instrumentation
            using Performance Counters
L1 Cache Misses




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   17
                              (c) with Histograms, 3 - Conclusions
Outline
●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

    1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   18
                              (c) with Histograms, 3 - Conclusions
Instrumentation using Histograms
Time
Histogram




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   19
                            (c) with Histograms, 3 - Conclusions
Instrumentation using Histograms
Percentage
Histogram




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   20
                            (c) with Histograms, 3 - Conclusions
Instrumentation using Histograms
Percentage
Histogram




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   21
                            (c) with Histograms, 3 - Conclusions
Instrumentation using Histograms
Percentage
Histogram




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   22
                            (c) with Histograms, 3 - Conclusions
Conclusions
●   Scalability
    ●   In laptop: No way!
    ●   In Boada: Yes!

●   #Processors Increase
        → L1 Cache Misses Increase


●   Useful information very fast → Histograms!

     1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   23
                               (c) with Histograms, 3 - Conclusions
34324 - Measurement Tools and Techniques



              Instrumenting
            the MG application
        of NAS Parallel Benchmark


               Maria Stylianou
             marsty5@gmail.com
               20-APR-2012
                                           24

Mais conteúdo relacionado

Destaque

Destaque (10)

A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
 
How Companies Learn Your Secrets
How Companies Learn Your SecretsHow Companies Learn Your Secrets
How Companies Learn Your Secrets
 
Pregel - Paper Review
Pregel - Paper ReviewPregel - Paper Review
Pregel - Paper Review
 
Erlang in 10 minutes
Erlang in 10 minutesErlang in 10 minutes
Erlang in 10 minutes
 
Automatic Energy-based Scheduling
Automatic Energy-based SchedulingAutomatic Energy-based Scheduling
Automatic Energy-based Scheduling
 
SPARJA: a Distributed Social Graph Partitioning and Replication Middleware
SPARJA: a Distributed Social Graph Partitioning and Replication MiddlewareSPARJA: a Distributed Social Graph Partitioning and Replication Middleware
SPARJA: a Distributed Social Graph Partitioning and Replication Middleware
 
Google's Dremel
Google's DremelGoogle's Dremel
Google's Dremel
 
Performance Analysis of multithreaded applications based on Hardware Simulati...
Performance Analysis of multithreaded applications based on Hardware Simulati...Performance Analysis of multithreaded applications based on Hardware Simulati...
Performance Analysis of multithreaded applications based on Hardware Simulati...
 
Data Streaming with Apache Kafka & MongoDB
Data Streaming with Apache Kafka & MongoDBData Streaming with Apache Kafka & MongoDB
Data Streaming with Apache Kafka & MongoDB
 
Apache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platformApache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platform
 

Semelhante a Instrumenting the MG applicaiton of NAS Parallel Benchmark

Real time-image-processing-applied-to-traffic-queue-detection-algorithm
Real time-image-processing-applied-to-traffic-queue-detection-algorithmReal time-image-processing-applied-to-traffic-queue-detection-algorithm
Real time-image-processing-applied-to-traffic-queue-detection-algorithm
ajayrampelli
 
Mi rna data analysis 2013
Mi rna data analysis 2013Mi rna data analysis 2013
Mi rna data analysis 2013
Elsa von Licy
 

Semelhante a Instrumenting the MG applicaiton of NAS Parallel Benchmark (8)

Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...
 
SRA final project
SRA final projectSRA final project
SRA final project
 
Incheon National University - EATED SRA
Incheon National University - EATED SRAIncheon National University - EATED SRA
Incheon National University - EATED SRA
 
Dongliang_Slides
Dongliang_SlidesDongliang_Slides
Dongliang_Slides
 
Real time-image-processing-applied-to-traffic-queue-detection-algorithm
Real time-image-processing-applied-to-traffic-queue-detection-algorithmReal time-image-processing-applied-to-traffic-queue-detection-algorithm
Real time-image-processing-applied-to-traffic-queue-detection-algorithm
 
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
 
Mi rna data analysis 2013
Mi rna data analysis 2013Mi rna data analysis 2013
Mi rna data analysis 2013
 
Design and Implementation of Modules for the Extraction of Biometric Paramete...
Design and Implementation of Modules for the Extraction of Biometric Paramete...Design and Implementation of Modules for the Extraction of Biometric Paramete...
Design and Implementation of Modules for the Extraction of Biometric Paramete...
 

Último

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 

Instrumenting the MG applicaiton of NAS Parallel Benchmark

  • 1. 34324 - Measurement Tools and Techniques Instrumenting the MG application of NAS Parallel Benchmark Maria Stylianou marsty5@gmail.com 20-APR-2012
  • 2. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 2
  • 3. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 3 (c) with Histograms, 3 - Conclusions
  • 4. Basic Information Execution Environments ● Personal Laptop ● Ubuntu 11.10, 64-bit ● Intel Quad Core i5 ● 4GB RAM ● Boada Server ● Intel(R) Xeon(R) CPU E5645 @ 2.40GHz ● 12 Cores with HT support ● 24 GΒ RAM 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 4 (c) with Histograms, 3 - Conclusions
  • 5. Basic Information NAS Parallel Benchmark ● Evaluate the performance of parallel supercomputers ● Several Applications MG – MPI Version ● IS, EP, CG, MG Multi-Grid on a sequence ● FT, BT, SP, LU of meshes ● Extrae → Produce traces ● Paraver → Analyse traces 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 5 (c) with Histograms, 3 - Conclusions
  • 6. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 6 (c) with Histograms, 3 - Conclusions
  • 7. Instrumentation by Observation 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 7 (c) with Histograms, 3 - Conclusions
  • 8. Instrumentation by Observation 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 8 (c) with Histograms, 3 - Conclusions
  • 9. Instrumentation by Observation 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 9 (c) with Histograms, 3 - Conclusions
  • 10. Instrumentation by Observation Initialization 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 10 (c) with Histograms, 3 - Conclusions
  • 11. Instrumentation by Observation Execution 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 11 (c) with Histograms, 3 - Conclusions
  • 12. Instrumentation by Observation Finalization 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 12 (c) with Histograms, 3 - Conclusions
  • 13. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 13 (c) with Histograms, 3 - Conclusions
  • 14. Instrumentation using Performance Counters Instructions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 14 (c) with Histograms, 3 - Conclusions
  • 15. Instrumentation using Performance Counters Cycles 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 15 (c) with Histograms, 3 - Conclusions
  • 16. Instrumentation using Performance Counters IPC: Instructions Per Cycle 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 16 (c) with Histograms, 3 - Conclusions
  • 17. Instrumentation using Performance Counters L1 Cache Misses 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 17 (c) with Histograms, 3 - Conclusions
  • 18. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 18 (c) with Histograms, 3 - Conclusions
  • 19. Instrumentation using Histograms Time Histogram 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 19 (c) with Histograms, 3 - Conclusions
  • 20. Instrumentation using Histograms Percentage Histogram 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 20 (c) with Histograms, 3 - Conclusions
  • 21. Instrumentation using Histograms Percentage Histogram 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 21 (c) with Histograms, 3 - Conclusions
  • 22. Instrumentation using Histograms Percentage Histogram 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 22 (c) with Histograms, 3 - Conclusions
  • 23. Conclusions ● Scalability ● In laptop: No way! ● In Boada: Yes! ● #Processors Increase → L1 Cache Misses Increase ● Useful information very fast → Histograms! 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 23 (c) with Histograms, 3 - Conclusions
  • 24. 34324 - Measurement Tools and Techniques Instrumenting the MG application of NAS Parallel Benchmark Maria Stylianou marsty5@gmail.com 20-APR-2012 24