SlideShare uma empresa Scribd logo
1 de 38
A Methodology to Develop High Performance
    Applications on GPGPU Architectures:
Application to Simulation of Electrical Machines

                 THESIS DEFENSE


      Antonio Wendell DE OLIVEIRA RODRIGUES
            Advisor : Jean-Luc DEKEYSER
          Co-Advisor : Frédéric GUYOMARC'H
Context: Numerical Methods


                Physics                    Software




                         Architecture



April 3, 2012   Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   2 of 38
Context: Problem
          How to help non-specialists in
          programming/architecture to develop their
          applications
          How to generate automatic code enough
          efficient w.r.t. manually written code
               Taking advantage of available resources
          How to integrate profiling and development
          tools


3 avril 2012                Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   3 of 38
Context: GPGPU
          General-Purpose Computation on GPUs
               Massively Parallel Processing


                                   Physics                  Software




                                                                                    Tsubame 2.0
                                                                                      958 MFlops/watt

                                                                                    Cielo Cray
                                             Architecture
 Green Computing                                                                      278 MFlops/watt


                                             GPGPU

3 avril 2012                Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense         4 of 38
Context: GPGPU (Programming)
          CUDA
               Nvidia’s solution
               1st real high level GPGPU programming
               Large number of applications, libraries,
               developers
               Achieves better performance on Nvidia’s
               hardware
          OpenCL (Open Computing Language)
               Open Standard proposed by Khronos GroupTM
               Multi-vendors (including Nvidia)
               Not only for GPUs
3 avril 2012                Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   5 of 38
Context: Related Work
          Code-Level Specification
               Directives
                   – PGI, OpenHMPP, Annotated C
               Interfaces, translation
                   – Java, Python, Matlab
               Specific Language
                   – SAC (Single Assignment C)      Simulink
                                                   Mindstorms
                      » WITH-loop expressions (CUDA Backend)
                                                  OpenModelica

          High-Level Specification
               Simulink, OpenModelica, Mindstorms
               (Labview)
               Gaspard: OpenMP Branch (Julien Taillard)
                • Programming model, Specification
3 avril 2012                    Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   6 of 38
Context: High Level Specification
          How to separate the algorithm and the hardware
          specifications
                                                              MDE
               •   specify an application
                                                          (UML/MARTE)
               •   the expression of its potential parallelism
                                        Physics      Software
               •   the platform architecture
               •   the link between logical and physical parts
          Model Driven Engineering
               • Clear separation between hardware and software
                 specifications
               • UML: diagrams, tools
                                        Architecture
               • UML profile for MARTE: Parallel expressiveness inspired by
                 ArrayOL
                                                  GPGPU
                     – enables factorization of repeated elements.



3 avril 2012                         Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   7 of 38
Contribs: Methodology
          Points of View (according to MARTE)
               MARTE Specification
                                                              Define
                                                            Methodology

                       Build Model                               <<include>>

                                                    Adapt MARTE
                                                    specification
                                                                <<include>>

                      Annotate Model
                        for Analysis                       Build Execution
                                                           Platform Model


                      Analyze Model                            Provide
                                                              Execution
                                                              Platform
                                                                                               OpenCL, GPU
                                                                                             Cards, Drivers, etc.


3 avril 2012                 Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense                 8 of 38
Contribs: Methodology
          Code Generation from Higher Level Models
          (Gaspard2)




                                           Compilation of Models
                          OpenCL             OpenMP           Pthread         VHDL




                               Program (source code, makefiles, etc.)



3 avril 2012           Wendell Rodrigues   MDE Methodology for GPGPU: MEDEE Meeting   9 of 38
Contribs: Building a Model
          This is the model designer’s point of view


                                          Allocation


               Application                                             Architecture

                                        Deployment


                                          Virtual IP
                                         Software IP
                                          Artifacts
                             Global View of a Whole Model

3 avril 2012                   Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   10 of 38
Contribs: Building an Application
          Application: Code_CARMEL(L2EP-
          lab/EDF)
               • Electrical Machines Modeling and Simulation
               • Sparse Matrices TCarmel/FCarmel
                                  in CSR format


                                      Matrix Assembly




                                                                   PostProcess
                          GenPARAM
                           GenPHYS
                           GenDOF




               Input                                     Solver                       Output




3 avril 2012                  Wendell Rodrigues         MDE Methodology for GPGPU: Thesis Defense   11 of 38
Contribs: CG Example
          Solver: Conjugate Gradient
               • Numerical Method to solve a System of Linear
                 Equations




3 avril 2012                 Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   12 of 38
Contribs: CG Example
          UML/MARTE modeling tool
          (Eclipse/Papyrus)
        Application



                                                                      dotProd
       Architecture


                      A: Real {1000}
                                       <<tiler>>
         Allocation                                    Mult
                                                   m: Mult {1000}                   r: Reduc   {1}
                                       el1: Real   {1}                <<reshape>>
                                                               {1}
                                                                {1}   res: Real     {1000}     {1}      C: Real {1}

                                       el2: Real   {1}

                                       <<tiler>>
       Deployment     B: Real {1000}




3 avril 2012                            Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense    13 of 38
Contribs: CG Example
          The whole algorithm

                                                                                                                              CG_Module_GPU


        Application                                               norm_r0: norm
                                                            r_0 : Real{132651}

                                                                    norm_r0 : Real{1}


                                                                                                                                                   <<interrepetition>>
                                                                            <<defaultlink>>
                                                                                                                                        CGLoop     <<interrepetition>>
                                              b : Real {132651}
                         r_k : Real{132651}                   rr: dotProd
                                                                                        alpha: ScalarDiv
                                                                                                                  <<shaped>>
                                                                                                                                                      <<shaped>>
                                                                                                                                              cg: CGLoop        {132651}
       Architecture                                                 <<defaultlink>>
                                                                                                                   x: DAXPY                  beta: ScalarDiv
                                                                                                                              r_k : Real{132651}
                                                                                                                                                                                           <<shaped>>
                                                                                                                                                                                             p: DAXPY

                                                                                                                                                                                error : Real{1}
                                                                                                                                                                                                                                             r_k1 : Real{132651}



                                                                                                                              norm_r0 : Real{1}                                                                         error : Real{1}
                                                                                                                                                                                                                     error: ScalarDivSqrt    x_k1 : Real{132651}
                          norm_r0 : Real{1}
                                                                      init: InitVars
                                                                                                                              x_k : Real{132651}                          r_k1 : Real{132651}
                                                                                                <<defaultlink>>
                                                                       x0 : Real{132651}                                                                                                                                                     p_k1 : Real{132651}
                         x_k : Real{132651}
                                                                                                                              A : Real{3442951}

                                         A : Real{3442951}                                                                    iA : Integer{132652}                   x_k1 : Real{132651}
                                                                    <<shaped>>                                                                                                                                                x_out          error : Real{1}
                                                                   Ap: dgemvCSR                   pAp: dotProd                    minusalpha: Negative              <<shaped>>
                                                                                                                                                                         r: DAXPY
                         A : Real{3442951}                                                                                                                                                              rrnew: dotProd

         Allocation    iA : Integer{132652}
                      jA : Integer{3442951}
                                              iA : Real{132651}
                                                                                                                              jA : Integer{3442951}                  p_k1 : Real{132651}


                        p_k : Real{132651}
                                                                                                                              p_k : Real{132651}
                                              jA : Real{132651}

                                                                                                                                                      <<interrepetition>>




       Deployment



3 avril 2012                                                                Wendell Rodrigues                     MDE Methodology for GPGPU: Thesis Defense                                                                                 14 of 38
Contribs: CG Example
          Defining the Architecture

        Application



       Architecture



         Allocation



       Deployment



3 avril 2012            Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   15 of 38
Contribs: CG Example
          Allocating Tasks

        Application



       Architecture



         Allocation



       Deployment



3 avril 2012            Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   16 of 38
Contribs: CG Example
          Allocating FlowPorts

        Application



       Architecture



         Allocation



       Deployment



3 avril 2012            Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   17 of 38
Contribs: CG Example
          Elementary Tasks and Software IP

        Application      Sip_Mult
                                                  • Code of an elementary
                                                    function
                                                  • Parameter order
                                                  • Possible header files or
       Architecture                                 libraries, compiling
                                                    directives, so on.
                              <<manifest>>


         Allocation     <<artifact>>
                       <<codeFile>>
                         MultCF


       Deployment



3 avril 2012           Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   18 of 38
Contribs: Execution Test and
   Results
          The model designer starts the code
          generation
          The model compiler generates a program
               Makefile and source files

                                                                                                                                                                                                                                                                                                                                         CGLoop


                                                                                                                                                                                                                                         r_k : Real{132651}    rr: dotProd
                                                                                                                                                                                                                                                                                    alpha: ScalarDiv
                                                                                                                                                                                                                                                                                                                   <<shaped>>
                                                                                                                                                                                                                                                                                                                      x: DAXPY               beta: ScalarDiv                   <<shaped>>
                                                                                                                                                                                                                                                                                                                                                                                p: DAXPY                                        r_k1 : Real{132651}



                                                                                                                                                                                                                                                                                                                                                                                                         error: ScalarDivSqrt   x_k1 : Real{132651}
                                                                                                                                                                                                                                          norm_r0 : Real{1}


                                                                                                                                                                                                                                                                                                                                                                                                                                p_k1 : Real{132651}
                                                                                                                                                                                                                                         x_k : Real{132651}


                                                                                                                                                                                                                                                                     <<shaped>>                                                                                                                                                 error : Real{1}
                                                                                                                                                                                                                                                                    Ap: dgemvCSR              pAp: dotProd                         minusalpha: Negative           <<shaped>>
                                                                                                                                                                                                                                                                                                                                                                   r: DAXPY
                                                                                                                                                                                                                                         A : Real{3442951}                                                                                                                                  rrnew: dotProd
                                                                                                                                                                                                                                       iA : Integer{132652}
                                                                                                                                                                                                                                      jA : Integer{3442951}
                                                                                                                                                                                                                                        p_k : Real{132651}




                                                                                                                                                                                                                                                                                                                                                  <<allocate>> <<abstract>>
                                                                                                                                                                                                                                                                                                                          Architecture



                                                                                                                                                                                                                                                                                                         <<shaped>>                         <<shaped>>
                                                                                                                                                                                                                                                                                                        h1: HOST {1}                       d1: DEVICE {1}

                                                                                                                                                                                                                                                              <<allocate>> <<abstract>>                      mp: Memory                           mgp: Memory




                                                                                                                        CGLoop


                        r_k : Real{132651}    rr: dotProd
                                                                   alpha: ScalarDiv
                                                                                                  <<shaped>>
                                                                                                     x: DAXPY               beta: ScalarDiv                   <<shaped>>
                                                                                                                                                               p: DAXPY                                        r_k1 : Real{132651}



                                                                                                                                                                                        error: ScalarDivSqrt   x_k1 : Real{132651}
                         norm_r0 : Real{1}


                                                                                                                                                                                                               p_k1 : Real{132651}
                        x_k : Real{132651}


                                                    <<shaped>>                                                                                                                                                 error : Real{1}
                                                   Ap: dgemvCSR              pAp: dotProd                         minusalpha: Negative           <<shaped>>
                                                                                                                                                  r: DAXPY
                        A : Real{3442951}                                                                                                                                  rrnew: dotProd
                      iA : Integer{132652}
                     jA : Integer{3442951}
                       p_k : Real{132651}




                                                                                                                                 <<allocate>> <<abstract>>
                                                                                                         Architecture



                                                                                        <<shaped>>                         <<shaped>>
                                                                                       h1: HOST {1}                       d1: DEVICE {1}

                                             <<allocate>> <<abstract>>                      mp: Memory                           mgp: Memory




3 avril 2012                                                                                                                                                                                                                         Wendell Rodrigues                                                                                   MDE Methodology for GPGPU: Thesis Defense                                                                    19 of 38
Contribs: Execution Test and
   Results
          CG Program to CG Module for
          Code_CARMEL: Adaptation

                  GenDOF: Fortran

                 GenPHYS: Fortran                                          C/C++

                GenPARAM: Fortran


                     T/FCarmel:
                       Fortran     Interface C




               PostProcessing: Fortran


3 avril 2012                  Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   20 of 38
Contribs: Execution Test and
   Results
          Evaluating Scalability: FEM on different
          meshes




3 avril 2012            Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   21 of 38
Contribs: Execution Test and
   Results
                                                    CPU: AMD Opteron, 8-core
          Results                                   @2.4GHz and 64GB RAM.

               Execution Time                       GPU: NVidia S1070 4
                                                    devices Tesla T10 (4GB
                                                    RAM each) – Compute
                                                    Capability 1.3

                                                                              Performance




                                                               1




3 avril 2012                    Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   22 of 38
Contribs: How It Works
          This is the methodology provider’s point of
          view (the UML/MARTE-to-OpenCL chain)

                             3                      6                     9




                     2                  5                      8


                                                                     #include b.h
                                                                     func(a,b){
                 1           4                      7                  c=a+b;
                                                                     }




3 avril 2012             Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   23 of 38
Contribs: UML/MARTE to OpenCL
                 UML-to-MARTE Transformation
                    • avoids the UML complexity
                    • keeps only the essential elements of MARTE
                 Port Instance Transformation
                    • UML does not implement instances of FlowPorts
                      when we instantiate a part (tasks)

                                 Mult                                 m: Mult {100}             k: Mult {20}
               el1: Real   {1}                                         {1}                       {1}
                                        {1}   res: Real                           {1}                       {1}
               el2: Real   {1}                                         {1}                       {1}




     1


3 avril 2012                                    Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense     24 of 38
Contribs: UML/MARTE to OpenCL
               Tiler-to-Task Transformation
                • Expressed in ArrayOL as stereotype of connectors
                • Special tasks allocated available processors




3 avril 2012                  Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   25 of 38
Contribs: UML/MARTE to OpenCL
               Local and Global Graphs Transformations
               Scheduling Policy Transformation
                                       globalDependencies
                                                            p1_Task

                                                                      Start
                                          StartTask                   Task


                                                            IPTask             IPTask
                                                             vec1               vec2
                                           Global
                                           Graph:                     Global
                                          p1_Task                     Graph
                                                                                  contains other
                                                                       dev
                                                                                sub-graphs


                                                                      IPTask
                                                                       v1v2
                                          EndTask


                                                                      End
                                                                      Task




3 avril 2012               Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense           26 of 38
Contribs: UML/MARTE to OpenCL
               Memory Mapping Transformation

                                                              main




                         1                2                              3                         4

               addMemoryMap           defineScope            propagateDataAllocation            createTilerTaskDA

                                                    X                                      5

          defineBasicDataAllocations    createAffectationDataAllocation        createVirtualIPSoftIPDA




3 avril 2012                            Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense              27 of 38
Contribs: UML/MARTE to OpenCL
                    Hybrid Transformation
                                                                   main                                       HorizontalFilter                                     VerticalFilter
                                                                                                                  <<shaped>>           same allocation                <<shaped>>
                                                                                                               rhf: RHF {288,44}                                   rvf: RVF {32,132}
                                                                                                                                   «tiler»           «tiler»
   Thread (work-item)                                      createHybridApp                                                         1       2     3             4
                                                                                                    «tiler»                                                                            «tiler»
     Grid definition
                                                             toHybridApp
                                                                                                                          refersTo             refersTo
                              1                      2                             3

                             toDevSide               toHostSide           Schedule Host

                                                                                             4

                              toKernel          toMainFunction                    Schedule Device




               kernelVars   toIPFunction        toTilerFunctions            mainVars




                                         defineVars




                                     optimizeTransfer




3 avril 2012                                                      Wendell Rodrigues           MDE Methodology for GPGPU: Thesis Defense                                                28 of 38
Contribs: UML/MARTE to OpenCL
          Code Generation Model to Text
          Transformation
               Based on Acceleo Templates
               Functionalities
               • IP insertions
               • Tiler notation to Memory Address Computation in C
               • Implements the memory transfer optimization




3 avril 2012                 Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   29 of 38
Contribs: UML/MARTE to OpenCL
          Code Generation Model to Text
          Transformation                                                                       send(dataaddress)
                                                                                               with size data to Device;




   <<shaped>>
                    Multiple Devices                                                           Launch Kernel on Device with grid (WG,WI)
p: DAXPY {100}


                                                                                               recv(dataaddress)
                                             <<hwResource>> <<shaped>>                         with size data from Device;
                                                    d1: Device      {4}


                                                                                                 for (i = 0; i < numDev; i++)
                                              gp: GPU     mgp: Memory
                                                                                                    send(dataaddress + i*data/numDev)
                                                                                                    with size data/numDev to Device i;


                                                                                                 for (i = 0; i < numDev; i++)

               <<abstraction>><<allocate>>
                                                                                                    Launch Kernel on Device i with grid (WG/numDev,WI)



                                                                                                 for (i = 0; i < numDev; i++)

                                                                                                    recv(dataaddress + i*data/numDev)
                                                                                                    with size data/numDev from Device i;




3 avril 2012                                               Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense                                 30 of 38
Contribs: UML/MARTE to OpenCL
          Code Generation Model to Text
          Transformation
               • Tiler Analysis (Shared Memory Use)




3 avril 2012                 Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   31 of 38
Contribs: Profiling Analysis
          Integrating Profiler and Models
                 High Level Abstraction          7
                                                                                Profiling and
                  Model of Application,                                            Advice
                                                             Profiling and
               Architecture and Allocation                 Optimization Hints      Model
                             Transformation
                                                                   Vincent Aranega’s
                                                             Annotations          Profiling and Advices
                                                                              6
                       1
                                 Chain                               Thesis (2011) Model Production
                                                                          Domain Specific Profiling Analisys
                                                                              Transformation Library



                 Generated Code Files                Trace
                (Makefile, *.cl, *.cpp, *.h)          Models
                                                                        Profiling Log         Device Features
                                                                           Model             Database Model
                                  SDK
                        2      Compilation
                                Process           UID base link                 5      Log Parser


               Binaries and Runtime Files
                                                                            Logs

                                Software
                        3       Execution       Profiling Logs Production        4

                                     Hybrid Execution Platform

3 avril 2012                           Wendell Rodrigues     MDE Methodology for GPGPU: Thesis Defense         32 of 38
Contribs: Profiling Analysis
          Integrating Profiler and Models (Case
          Study)


                   {16,1000000}




3 avril 2012                Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   33 of 38
Contribs: Profiling Analysis
          Integrating Profiler and Models (Case
          Study)




                                                 ~ 60%




3 avril 2012            Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   34 of 38
Experimental Validation: Alternator from Valeo

          Generated Code for PCG in
          Code_CARMEL for an industrial application




3 avril 2012           Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   35 of 38
Experimental Validation: Alternator from Valeo

          Sparse Matrix
                  • N=775,689
                  • NNZ=12,502,443
          Solution: Preconditioned Conjugate
          Gradient (PCG) in 10,000 iterations
                                            Time (s)                   Speedup


               CPU (AMD Opteron)            2300 (~38min)              1


               GPU (S1070)                  250 (~4min)                9.2



3 avril 2012                       Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   36 of 38
Conclusions and Perspectives
          Developing Methodology
               • Non-specialists can develop their applications from
                 higher levels specification
          Optimizations and MultiGPU
               • Memory Issues: Efficient code
               • Profiling Integration
               • Scaling according to hardware
          Numerical Methods (Industrial Applications)
               • Speedups > 9x
               • Multiple Simulations
                  – 10 hours/simulation            ~ 1 hour
               • High Performance with low investment in hardware
          Code_CARMEL Integration
3 avril 2012                   Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   37 of 38
Conclusions and Perspectives
          GPU Clusters
               For instance, Tianhe in China
               MPI as solution for inter-node communication
                • Issues: distributed memory, communication,
                  synchronization
          High-Level Control on the Code Generation
          Chain
                • Optimization levels, dynamic parameters




3 avril 2012                  Wendell Rodrigues   MDE Methodology for GPGPU: Thesis Defense   38 of 38

Mais conteúdo relacionado

Mais procurados

Optimization problems and algorithms
Optimization problems and  algorithmsOptimization problems and  algorithms
Optimization problems and algorithmsAboul Ella Hassanien
 
Uncertainty Quantification with Unsupervised Deep learning and Multi Agent Sy...
Uncertainty Quantification with Unsupervised Deep learning and Multi Agent Sy...Uncertainty Quantification with Unsupervised Deep learning and Multi Agent Sy...
Uncertainty Quantification with Unsupervised Deep learning and Multi Agent Sy...Bang Xiang Yong
 
CapellaDays2022 | Thales | Stairway to heaven: Climbing the very first steps
CapellaDays2022 | Thales | Stairway to heaven: Climbing the very first stepsCapellaDays2022 | Thales | Stairway to heaven: Climbing the very first steps
CapellaDays2022 | Thales | Stairway to heaven: Climbing the very first stepsObeo
 
Fast Multi-frame Stereo Scene Flow with Motion Segmentation (CVPR 2017)
Fast Multi-frame Stereo Scene Flow with Motion Segmentation (CVPR 2017)Fast Multi-frame Stereo Scene Flow with Motion Segmentation (CVPR 2017)
Fast Multi-frame Stereo Scene Flow with Motion Segmentation (CVPR 2017)Tatsunori Taniai
 
Lecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-TheoryLecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-TheoryAlbert Orriols-Puig
 
Masters Thesis Defense
Masters Thesis DefenseMasters Thesis Defense
Masters Thesis Defenseelliotk
 
PR-409: Denoising Diffusion Probabilistic Models
PR-409: Denoising Diffusion Probabilistic ModelsPR-409: Denoising Diffusion Probabilistic Models
PR-409: Denoising Diffusion Probabilistic ModelsHyeongmin Lee
 
15 Machine Learning Multilayer Perceptron
15 Machine Learning Multilayer Perceptron15 Machine Learning Multilayer Perceptron
15 Machine Learning Multilayer PerceptronAndres Mendez-Vazquez
 
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...Jinwon Lee
 
Distributed machine learning
Distributed machine learningDistributed machine learning
Distributed machine learningStanley Wang
 
Leveraging Alf for SysML, Part 1: Better Simulation Modeling
Leveraging Alf for SysML, Part 1: Better Simulation ModelingLeveraging Alf for SysML, Part 1: Better Simulation Modeling
Leveraging Alf for SysML, Part 1: Better Simulation ModelingEd Seidewitz
 
Metaheuristic Optimization: Algorithm Analysis and Open Problems
Metaheuristic Optimization: Algorithm Analysis and Open ProblemsMetaheuristic Optimization: Algorithm Analysis and Open Problems
Metaheuristic Optimization: Algorithm Analysis and Open ProblemsXin-She Yang
 
Master Thesis Presentation
Master Thesis PresentationMaster Thesis Presentation
Master Thesis PresentationEvaBernat4
 
Creo tookit geometric traversal and evaluator
Creo tookit  geometric traversal and evaluatorCreo tookit  geometric traversal and evaluator
Creo tookit geometric traversal and evaluatorSandip Jadhav
 
Image Classification And Support Vector Machine
Image Classification And Support Vector MachineImage Classification And Support Vector Machine
Image Classification And Support Vector MachineShao-Chuan Wang
 
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic...
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic...Quantization and Training of Neural Networks for Efficient Integer-Arithmetic...
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic...Ryo Takahashi
 
Do wide and deep networks learn the same things? Uncovering how neural networ...
Do wide and deep networks learn the same things? Uncovering how neural networ...Do wide and deep networks learn the same things? Uncovering how neural networ...
Do wide and deep networks learn the same things? Uncovering how neural networ...Seunghyun Hwang
 
Incose Systems Engineering Vision-2035
Incose Systems Engineering Vision-2035Incose Systems Engineering Vision-2035
Incose Systems Engineering Vision-2035AngelCepedanoBeteta1
 

Mais procurados (20)

Optimization problems and algorithms
Optimization problems and  algorithmsOptimization problems and  algorithms
Optimization problems and algorithms
 
ENERGY EFFICIENCY IN CLOUD COMPUTING
ENERGY EFFICIENCY IN CLOUD COMPUTINGENERGY EFFICIENCY IN CLOUD COMPUTING
ENERGY EFFICIENCY IN CLOUD COMPUTING
 
Uncertainty Quantification with Unsupervised Deep learning and Multi Agent Sy...
Uncertainty Quantification with Unsupervised Deep learning and Multi Agent Sy...Uncertainty Quantification with Unsupervised Deep learning and Multi Agent Sy...
Uncertainty Quantification with Unsupervised Deep learning and Multi Agent Sy...
 
CapellaDays2022 | Thales | Stairway to heaven: Climbing the very first steps
CapellaDays2022 | Thales | Stairway to heaven: Climbing the very first stepsCapellaDays2022 | Thales | Stairway to heaven: Climbing the very first steps
CapellaDays2022 | Thales | Stairway to heaven: Climbing the very first steps
 
Fast Multi-frame Stereo Scene Flow with Motion Segmentation (CVPR 2017)
Fast Multi-frame Stereo Scene Flow with Motion Segmentation (CVPR 2017)Fast Multi-frame Stereo Scene Flow with Motion Segmentation (CVPR 2017)
Fast Multi-frame Stereo Scene Flow with Motion Segmentation (CVPR 2017)
 
Lecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-TheoryLecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-Theory
 
Masters Thesis Defense
Masters Thesis DefenseMasters Thesis Defense
Masters Thesis Defense
 
PR-409: Denoising Diffusion Probabilistic Models
PR-409: Denoising Diffusion Probabilistic ModelsPR-409: Denoising Diffusion Probabilistic Models
PR-409: Denoising Diffusion Probabilistic Models
 
15 Machine Learning Multilayer Perceptron
15 Machine Learning Multilayer Perceptron15 Machine Learning Multilayer Perceptron
15 Machine Learning Multilayer Perceptron
 
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
 
Distributed machine learning
Distributed machine learningDistributed machine learning
Distributed machine learning
 
Leveraging Alf for SysML, Part 1: Better Simulation Modeling
Leveraging Alf for SysML, Part 1: Better Simulation ModelingLeveraging Alf for SysML, Part 1: Better Simulation Modeling
Leveraging Alf for SysML, Part 1: Better Simulation Modeling
 
Metaheuristic Optimization: Algorithm Analysis and Open Problems
Metaheuristic Optimization: Algorithm Analysis and Open ProblemsMetaheuristic Optimization: Algorithm Analysis and Open Problems
Metaheuristic Optimization: Algorithm Analysis and Open Problems
 
Master Thesis Presentation
Master Thesis PresentationMaster Thesis Presentation
Master Thesis Presentation
 
Creo tookit geometric traversal and evaluator
Creo tookit  geometric traversal and evaluatorCreo tookit  geometric traversal and evaluator
Creo tookit geometric traversal and evaluator
 
Image Classification And Support Vector Machine
Image Classification And Support Vector MachineImage Classification And Support Vector Machine
Image Classification And Support Vector Machine
 
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic...
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic...Quantization and Training of Neural Networks for Efficient Integer-Arithmetic...
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic...
 
MDA
MDAMDA
MDA
 
Do wide and deep networks learn the same things? Uncovering how neural networ...
Do wide and deep networks learn the same things? Uncovering how neural networ...Do wide and deep networks learn the same things? Uncovering how neural networ...
Do wide and deep networks learn the same things? Uncovering how neural networ...
 
Incose Systems Engineering Vision-2035
Incose Systems Engineering Vision-2035Incose Systems Engineering Vision-2035
Incose Systems Engineering Vision-2035
 

Destaque

Vlad Trifa - Final PhD Thesis Defense at ETH Zurich
Vlad Trifa - Final PhD Thesis Defense at ETH ZurichVlad Trifa - Final PhD Thesis Defense at ETH Zurich
Vlad Trifa - Final PhD Thesis Defense at ETH ZurichVlad Trifa
 
Thesis defense
Thesis defenseThesis defense
Thesis defenseMichal Tal
 
Ph.D Thesis Defense: Magnetic Reconnection as a Chondrule Heating Mechanism
Ph.D Thesis Defense: Magnetic Reconnection as a Chondrule Heating MechanismPh.D Thesis Defense: Magnetic Reconnection as a Chondrule Heating Mechanism
Ph.D Thesis Defense: Magnetic Reconnection as a Chondrule Heating MechanismSamuel Lazerson
 
PhD Thesis Defense - Enhancing Software Quality and Quality of Experience thr...
PhD Thesis Defense - Enhancing Software Quality and Quality of Experience thr...PhD Thesis Defense - Enhancing Software Quality and Quality of Experience thr...
PhD Thesis Defense - Enhancing Software Quality and Quality of Experience thr...Pedro Luis Mateo Navarro
 
Thesis defence of Dall'Olio Giovanni Marco. Applications of network theory to...
Thesis defence of Dall'Olio Giovanni Marco. Applications of network theory to...Thesis defence of Dall'Olio Giovanni Marco. Applications of network theory to...
Thesis defence of Dall'Olio Giovanni Marco. Applications of network theory to...Giovanni Marco Dall'Olio
 
Thesis Defense Presentation
Thesis Defense PresentationThesis Defense Presentation
Thesis Defense PresentationJamesDavie
 
My Thesis Defense Presentation
My Thesis Defense PresentationMy Thesis Defense Presentation
My Thesis Defense PresentationOnur Taylan
 
Powerpoint presentation M.A. Thesis Defence
Powerpoint presentation M.A. Thesis DefencePowerpoint presentation M.A. Thesis Defence
Powerpoint presentation M.A. Thesis DefenceCatie Chase
 
Masters Defense presentation
Masters Defense presentationMasters Defense presentation
Masters Defense presentationProyag Datta
 
Prepare your Ph.D. Defense Presentation
Prepare your Ph.D. Defense PresentationPrepare your Ph.D. Defense Presentation
Prepare your Ph.D. Defense PresentationChristian Glahn
 
Thesis Power Point Presentation
Thesis Power Point PresentationThesis Power Point Presentation
Thesis Power Point Presentationriddhikapandya1985
 
10 23 & 10 28 2011 hayward bush email - notepad
10 23 & 10 28 2011 hayward bush email - notepad10 23 & 10 28 2011 hayward bush email - notepad
10 23 & 10 28 2011 hayward bush email - notepadscameronwriting
 
Slide Ffs + Tips Singkat
Slide Ffs + Tips SingkatSlide Ffs + Tips Singkat
Slide Ffs + Tips Singkatmuchnigroups
 
Financial & mgt. accounting
Financial & mgt. accountingFinancial & mgt. accounting
Financial & mgt. accountingRohit Mishra
 
الاب متى المسكين شرح انجيل متى - المقابلات الست بين الناموس والمسيح
الاب متى المسكين  شرح انجيل متى - المقابلات الست بين الناموس والمسيحالاب متى المسكين  شرح انجيل متى - المقابلات الست بين الناموس والمسيح
الاب متى المسكين شرح انجيل متى - المقابلات الست بين الناموس والمسيحIbrahimia Church Ftriends
 

Destaque (20)

Thesis defense
Thesis defenseThesis defense
Thesis defense
 
Vlad Trifa - Final PhD Thesis Defense at ETH Zurich
Vlad Trifa - Final PhD Thesis Defense at ETH ZurichVlad Trifa - Final PhD Thesis Defense at ETH Zurich
Vlad Trifa - Final PhD Thesis Defense at ETH Zurich
 
Thesis defense
Thesis defenseThesis defense
Thesis defense
 
Ph.D Thesis Defense: Magnetic Reconnection as a Chondrule Heating Mechanism
Ph.D Thesis Defense: Magnetic Reconnection as a Chondrule Heating MechanismPh.D Thesis Defense: Magnetic Reconnection as a Chondrule Heating Mechanism
Ph.D Thesis Defense: Magnetic Reconnection as a Chondrule Heating Mechanism
 
PhD Thesis Defense - Enhancing Software Quality and Quality of Experience thr...
PhD Thesis Defense - Enhancing Software Quality and Quality of Experience thr...PhD Thesis Defense - Enhancing Software Quality and Quality of Experience thr...
PhD Thesis Defense - Enhancing Software Quality and Quality of Experience thr...
 
Thesis defence of Dall'Olio Giovanni Marco. Applications of network theory to...
Thesis defence of Dall'Olio Giovanni Marco. Applications of network theory to...Thesis defence of Dall'Olio Giovanni Marco. Applications of network theory to...
Thesis defence of Dall'Olio Giovanni Marco. Applications of network theory to...
 
Thesis Defense Presentation
Thesis Defense PresentationThesis Defense Presentation
Thesis Defense Presentation
 
Thesis Defense Presentation
Thesis Defense PresentationThesis Defense Presentation
Thesis Defense Presentation
 
My Thesis Defense Presentation
My Thesis Defense PresentationMy Thesis Defense Presentation
My Thesis Defense Presentation
 
Powerpoint presentation M.A. Thesis Defence
Powerpoint presentation M.A. Thesis DefencePowerpoint presentation M.A. Thesis Defence
Powerpoint presentation M.A. Thesis Defence
 
Thesis powerpoint
Thesis powerpointThesis powerpoint
Thesis powerpoint
 
Masters Defense presentation
Masters Defense presentationMasters Defense presentation
Masters Defense presentation
 
Prepare your Ph.D. Defense Presentation
Prepare your Ph.D. Defense PresentationPrepare your Ph.D. Defense Presentation
Prepare your Ph.D. Defense Presentation
 
Thesis Power Point Presentation
Thesis Power Point PresentationThesis Power Point Presentation
Thesis Power Point Presentation
 
10 23 & 10 28 2011 hayward bush email - notepad
10 23 & 10 28 2011 hayward bush email - notepad10 23 & 10 28 2011 hayward bush email - notepad
10 23 & 10 28 2011 hayward bush email - notepad
 
Slide Ffs + Tips Singkat
Slide Ffs + Tips SingkatSlide Ffs + Tips Singkat
Slide Ffs + Tips Singkat
 
Self deception 1
Self deception 1Self deception 1
Self deception 1
 
Financial & mgt. accounting
Financial & mgt. accountingFinancial & mgt. accounting
Financial & mgt. accounting
 
الاب متى المسكين شرح انجيل متى - المقابلات الست بين الناموس والمسيح
الاب متى المسكين  شرح انجيل متى - المقابلات الست بين الناموس والمسيحالاب متى المسكين  شرح انجيل متى - المقابلات الست بين الناموس والمسيح
الاب متى المسكين شرح انجيل متى - المقابلات الست بين الناموس والمسيح
 
Jfs retirement
Jfs retirementJfs retirement
Jfs retirement
 

Semelhante a Thesis Defense

Orthogonal Matching Pursuit in 2D for Java with GPGPU Prospectives
Orthogonal Matching Pursuit in 2D for Java with GPGPU ProspectivesOrthogonal Matching Pursuit in 2D for Java with GPGPU Prospectives
Orthogonal Matching Pursuit in 2D for Java with GPGPU ProspectivesMatt Simons
 
Directive-based approach to Heterogeneous Computing
Directive-based approach to Heterogeneous ComputingDirective-based approach to Heterogeneous Computing
Directive-based approach to Heterogeneous ComputingRuymán Reyes
 
OpenACC Monthly Highlights: October2020
OpenACC Monthly Highlights: October2020OpenACC Monthly Highlights: October2020
OpenACC Monthly Highlights: October2020OpenACC
 
OpenACC Monthly Highlights: January 2021
OpenACC Monthly Highlights: January 2021OpenACC Monthly Highlights: January 2021
OpenACC Monthly Highlights: January 2021OpenACC
 
Scalable Algorithm Design with MapReduce
Scalable Algorithm Design with MapReduceScalable Algorithm Design with MapReduce
Scalable Algorithm Design with MapReducePietro Michiardi
 
Integrating profiling into mde compilers
Integrating profiling into mde compilersIntegrating profiling into mde compilers
Integrating profiling into mde compilersijseajournal
 
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHYSPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHYcsandit
 
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY cscpconf
 
SEAA'2012: An MDE approach for Runtime Monitoring and Adapting Component-base...
SEAA'2012: An MDE approach for Runtime Monitoring and Adapting Component-base...SEAA'2012: An MDE approach for Runtime Monitoring and Adapting Component-base...
SEAA'2012: An MDE approach for Runtime Monitoring and Adapting Component-base...Applied Computing Group
 
The road to multi/many core computing
The road to multi/many core computingThe road to multi/many core computing
The road to multi/many core computingOsvaldo Gervasi
 
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONSA SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONScseij
 
DAE Tools 1.8.0 - Introduction
DAE Tools 1.8.0 - IntroductionDAE Tools 1.8.0 - Introduction
DAE Tools 1.8.0 - IntroductionDragan Nikolić
 
OGCE RT Rroject Review
OGCE RT Rroject ReviewOGCE RT Rroject Review
OGCE RT Rroject Reviewmarpierc
 
OGCE Review for Indiana University Research Technologies
OGCE Review for Indiana University Research TechnologiesOGCE Review for Indiana University Research Technologies
OGCE Review for Indiana University Research Technologiesmarpierc
 
C++ Data-flow Parallelism sounds great! But how practical is it? Let’s see ho...
C++ Data-flow Parallelism sounds great! But how practical is it? Let’s see ho...C++ Data-flow Parallelism sounds great! But how practical is it? Let’s see ho...
C++ Data-flow Parallelism sounds great! But how practical is it? Let’s see ho...Jason Hearne-McGuiness
 
Accelerating Deep Learning Training with BigDL and Drizzle on Apache Spark wi...
Accelerating Deep Learning Training with BigDL and Drizzle on Apache Spark wi...Accelerating Deep Learning Training with BigDL and Drizzle on Apache Spark wi...
Accelerating Deep Learning Training with BigDL and Drizzle on Apache Spark wi...Databricks
 
Applying Model Driven Development in MAS for limited devices
Applying Model Driven Development in MAS for limited devicesApplying Model Driven Development in MAS for limited devices
Applying Model Driven Development in MAS for limited devicesMiguel Rebollo
 
Deepcoder to Self-Code with Machine Learning
Deepcoder to Self-Code with Machine LearningDeepcoder to Self-Code with Machine Learning
Deepcoder to Self-Code with Machine LearningIRJET Journal
 

Semelhante a Thesis Defense (20)

Orthogonal Matching Pursuit in 2D for Java with GPGPU Prospectives
Orthogonal Matching Pursuit in 2D for Java with GPGPU ProspectivesOrthogonal Matching Pursuit in 2D for Java with GPGPU Prospectives
Orthogonal Matching Pursuit in 2D for Java with GPGPU Prospectives
 
Directive-based approach to Heterogeneous Computing
Directive-based approach to Heterogeneous ComputingDirective-based approach to Heterogeneous Computing
Directive-based approach to Heterogeneous Computing
 
OpenACC Monthly Highlights: October2020
OpenACC Monthly Highlights: October2020OpenACC Monthly Highlights: October2020
OpenACC Monthly Highlights: October2020
 
OpenACC Monthly Highlights: January 2021
OpenACC Monthly Highlights: January 2021OpenACC Monthly Highlights: January 2021
OpenACC Monthly Highlights: January 2021
 
Scalable Algorithm Design with MapReduce
Scalable Algorithm Design with MapReduceScalable Algorithm Design with MapReduce
Scalable Algorithm Design with MapReduce
 
Integrating profiling into mde compilers
Integrating profiling into mde compilersIntegrating profiling into mde compilers
Integrating profiling into mde compilers
 
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHYSPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
 
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
 
SEAA'2012: An MDE approach for Runtime Monitoring and Adapting Component-base...
SEAA'2012: An MDE approach for Runtime Monitoring and Adapting Component-base...SEAA'2012: An MDE approach for Runtime Monitoring and Adapting Component-base...
SEAA'2012: An MDE approach for Runtime Monitoring and Adapting Component-base...
 
The road to multi/many core computing
The road to multi/many core computingThe road to multi/many core computing
The road to multi/many core computing
 
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONSA SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
 
DAE Tools 1.8.0 - Introduction
DAE Tools 1.8.0 - IntroductionDAE Tools 1.8.0 - Introduction
DAE Tools 1.8.0 - Introduction
 
OGCE RT Rroject Review
OGCE RT Rroject ReviewOGCE RT Rroject Review
OGCE RT Rroject Review
 
OGCE Review for Indiana University Research Technologies
OGCE Review for Indiana University Research TechnologiesOGCE Review for Indiana University Research Technologies
OGCE Review for Indiana University Research Technologies
 
HPPS - Final - 06/14/2007
HPPS - Final - 06/14/2007HPPS - Final - 06/14/2007
HPPS - Final - 06/14/2007
 
C++ Data-flow Parallelism sounds great! But how practical is it? Let’s see ho...
C++ Data-flow Parallelism sounds great! But how practical is it? Let’s see ho...C++ Data-flow Parallelism sounds great! But how practical is it? Let’s see ho...
C++ Data-flow Parallelism sounds great! But how practical is it? Let’s see ho...
 
Accelerating Deep Learning Training with BigDL and Drizzle on Apache Spark wi...
Accelerating Deep Learning Training with BigDL and Drizzle on Apache Spark wi...Accelerating Deep Learning Training with BigDL and Drizzle on Apache Spark wi...
Accelerating Deep Learning Training with BigDL and Drizzle on Apache Spark wi...
 
In-Memory Compute Grids… Explained
In-Memory Compute Grids… ExplainedIn-Memory Compute Grids… Explained
In-Memory Compute Grids… Explained
 
Applying Model Driven Development in MAS for limited devices
Applying Model Driven Development in MAS for limited devicesApplying Model Driven Development in MAS for limited devices
Applying Model Driven Development in MAS for limited devices
 
Deepcoder to Self-Code with Machine Learning
Deepcoder to Self-Code with Machine LearningDeepcoder to Self-Code with Machine Learning
Deepcoder to Self-Code with Machine Learning
 

Último

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 

Último (20)

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 

Thesis Defense

  • 1. A Methodology to Develop High Performance Applications on GPGPU Architectures: Application to Simulation of Electrical Machines THESIS DEFENSE Antonio Wendell DE OLIVEIRA RODRIGUES Advisor : Jean-Luc DEKEYSER Co-Advisor : Frédéric GUYOMARC'H
  • 2. Context: Numerical Methods Physics Software Architecture April 3, 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 2 of 38
  • 3. Context: Problem How to help non-specialists in programming/architecture to develop their applications How to generate automatic code enough efficient w.r.t. manually written code Taking advantage of available resources How to integrate profiling and development tools 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 3 of 38
  • 4. Context: GPGPU General-Purpose Computation on GPUs Massively Parallel Processing Physics Software Tsubame 2.0 958 MFlops/watt Cielo Cray Architecture Green Computing 278 MFlops/watt GPGPU 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 4 of 38
  • 5. Context: GPGPU (Programming) CUDA Nvidia’s solution 1st real high level GPGPU programming Large number of applications, libraries, developers Achieves better performance on Nvidia’s hardware OpenCL (Open Computing Language) Open Standard proposed by Khronos GroupTM Multi-vendors (including Nvidia) Not only for GPUs 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 5 of 38
  • 6. Context: Related Work Code-Level Specification Directives – PGI, OpenHMPP, Annotated C Interfaces, translation – Java, Python, Matlab Specific Language – SAC (Single Assignment C) Simulink Mindstorms » WITH-loop expressions (CUDA Backend) OpenModelica High-Level Specification Simulink, OpenModelica, Mindstorms (Labview) Gaspard: OpenMP Branch (Julien Taillard) • Programming model, Specification 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 6 of 38
  • 7. Context: High Level Specification How to separate the algorithm and the hardware specifications MDE • specify an application (UML/MARTE) • the expression of its potential parallelism Physics Software • the platform architecture • the link between logical and physical parts Model Driven Engineering • Clear separation between hardware and software specifications • UML: diagrams, tools Architecture • UML profile for MARTE: Parallel expressiveness inspired by ArrayOL GPGPU – enables factorization of repeated elements. 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 7 of 38
  • 8. Contribs: Methodology Points of View (according to MARTE) MARTE Specification Define Methodology Build Model <<include>> Adapt MARTE specification <<include>> Annotate Model for Analysis Build Execution Platform Model Analyze Model Provide Execution Platform OpenCL, GPU Cards, Drivers, etc. 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 8 of 38
  • 9. Contribs: Methodology Code Generation from Higher Level Models (Gaspard2) Compilation of Models OpenCL OpenMP Pthread VHDL Program (source code, makefiles, etc.) 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: MEDEE Meeting 9 of 38
  • 10. Contribs: Building a Model This is the model designer’s point of view Allocation Application Architecture Deployment Virtual IP Software IP Artifacts Global View of a Whole Model 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 10 of 38
  • 11. Contribs: Building an Application Application: Code_CARMEL(L2EP- lab/EDF) • Electrical Machines Modeling and Simulation • Sparse Matrices TCarmel/FCarmel in CSR format Matrix Assembly PostProcess GenPARAM GenPHYS GenDOF Input Solver Output 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 11 of 38
  • 12. Contribs: CG Example Solver: Conjugate Gradient • Numerical Method to solve a System of Linear Equations 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 12 of 38
  • 13. Contribs: CG Example UML/MARTE modeling tool (Eclipse/Papyrus) Application dotProd Architecture A: Real {1000} <<tiler>> Allocation Mult m: Mult {1000} r: Reduc {1} el1: Real {1} <<reshape>> {1} {1} res: Real {1000} {1} C: Real {1} el2: Real {1} <<tiler>> Deployment B: Real {1000} 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 13 of 38
  • 14. Contribs: CG Example The whole algorithm CG_Module_GPU Application norm_r0: norm r_0 : Real{132651} norm_r0 : Real{1} <<interrepetition>> <<defaultlink>> CGLoop <<interrepetition>> b : Real {132651} r_k : Real{132651} rr: dotProd alpha: ScalarDiv <<shaped>> <<shaped>> cg: CGLoop {132651} Architecture <<defaultlink>> x: DAXPY beta: ScalarDiv r_k : Real{132651} <<shaped>> p: DAXPY error : Real{1} r_k1 : Real{132651} norm_r0 : Real{1} error : Real{1} error: ScalarDivSqrt x_k1 : Real{132651} norm_r0 : Real{1} init: InitVars x_k : Real{132651} r_k1 : Real{132651} <<defaultlink>> x0 : Real{132651} p_k1 : Real{132651} x_k : Real{132651} A : Real{3442951} A : Real{3442951} iA : Integer{132652} x_k1 : Real{132651} <<shaped>> x_out error : Real{1} Ap: dgemvCSR pAp: dotProd minusalpha: Negative <<shaped>> r: DAXPY A : Real{3442951} rrnew: dotProd Allocation iA : Integer{132652} jA : Integer{3442951} iA : Real{132651} jA : Integer{3442951} p_k1 : Real{132651} p_k : Real{132651} p_k : Real{132651} jA : Real{132651} <<interrepetition>> Deployment 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 14 of 38
  • 15. Contribs: CG Example Defining the Architecture Application Architecture Allocation Deployment 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 15 of 38
  • 16. Contribs: CG Example Allocating Tasks Application Architecture Allocation Deployment 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 16 of 38
  • 17. Contribs: CG Example Allocating FlowPorts Application Architecture Allocation Deployment 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 17 of 38
  • 18. Contribs: CG Example Elementary Tasks and Software IP Application Sip_Mult • Code of an elementary function • Parameter order • Possible header files or Architecture libraries, compiling directives, so on. <<manifest>> Allocation <<artifact>> <<codeFile>> MultCF Deployment 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 18 of 38
  • 19. Contribs: Execution Test and Results The model designer starts the code generation The model compiler generates a program Makefile and source files CGLoop r_k : Real{132651} rr: dotProd alpha: ScalarDiv <<shaped>> x: DAXPY beta: ScalarDiv <<shaped>> p: DAXPY r_k1 : Real{132651} error: ScalarDivSqrt x_k1 : Real{132651} norm_r0 : Real{1} p_k1 : Real{132651} x_k : Real{132651} <<shaped>> error : Real{1} Ap: dgemvCSR pAp: dotProd minusalpha: Negative <<shaped>> r: DAXPY A : Real{3442951} rrnew: dotProd iA : Integer{132652} jA : Integer{3442951} p_k : Real{132651} <<allocate>> <<abstract>> Architecture <<shaped>> <<shaped>> h1: HOST {1} d1: DEVICE {1} <<allocate>> <<abstract>> mp: Memory mgp: Memory CGLoop r_k : Real{132651} rr: dotProd alpha: ScalarDiv <<shaped>> x: DAXPY beta: ScalarDiv <<shaped>> p: DAXPY r_k1 : Real{132651} error: ScalarDivSqrt x_k1 : Real{132651} norm_r0 : Real{1} p_k1 : Real{132651} x_k : Real{132651} <<shaped>> error : Real{1} Ap: dgemvCSR pAp: dotProd minusalpha: Negative <<shaped>> r: DAXPY A : Real{3442951} rrnew: dotProd iA : Integer{132652} jA : Integer{3442951} p_k : Real{132651} <<allocate>> <<abstract>> Architecture <<shaped>> <<shaped>> h1: HOST {1} d1: DEVICE {1} <<allocate>> <<abstract>> mp: Memory mgp: Memory 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 19 of 38
  • 20. Contribs: Execution Test and Results CG Program to CG Module for Code_CARMEL: Adaptation GenDOF: Fortran GenPHYS: Fortran C/C++ GenPARAM: Fortran T/FCarmel: Fortran Interface C PostProcessing: Fortran 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 20 of 38
  • 21. Contribs: Execution Test and Results Evaluating Scalability: FEM on different meshes 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 21 of 38
  • 22. Contribs: Execution Test and Results CPU: AMD Opteron, 8-core Results @2.4GHz and 64GB RAM. Execution Time GPU: NVidia S1070 4 devices Tesla T10 (4GB RAM each) – Compute Capability 1.3 Performance 1 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 22 of 38
  • 23. Contribs: How It Works This is the methodology provider’s point of view (the UML/MARTE-to-OpenCL chain) 3 6 9 2 5 8 #include b.h func(a,b){ 1 4 7 c=a+b; } 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 23 of 38
  • 24. Contribs: UML/MARTE to OpenCL UML-to-MARTE Transformation • avoids the UML complexity • keeps only the essential elements of MARTE Port Instance Transformation • UML does not implement instances of FlowPorts when we instantiate a part (tasks) Mult m: Mult {100} k: Mult {20} el1: Real {1} {1} {1} {1} res: Real {1} {1} el2: Real {1} {1} {1} 1 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 24 of 38
  • 25. Contribs: UML/MARTE to OpenCL Tiler-to-Task Transformation • Expressed in ArrayOL as stereotype of connectors • Special tasks allocated available processors 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 25 of 38
  • 26. Contribs: UML/MARTE to OpenCL Local and Global Graphs Transformations Scheduling Policy Transformation globalDependencies p1_Task Start StartTask Task IPTask IPTask vec1 vec2 Global Graph: Global p1_Task Graph contains other dev sub-graphs IPTask v1v2 EndTask End Task 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 26 of 38
  • 27. Contribs: UML/MARTE to OpenCL Memory Mapping Transformation main 1 2 3 4 addMemoryMap defineScope propagateDataAllocation createTilerTaskDA X 5 defineBasicDataAllocations createAffectationDataAllocation createVirtualIPSoftIPDA 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 27 of 38
  • 28. Contribs: UML/MARTE to OpenCL Hybrid Transformation main HorizontalFilter VerticalFilter <<shaped>> same allocation <<shaped>> rhf: RHF {288,44} rvf: RVF {32,132} «tiler» «tiler» Thread (work-item) createHybridApp 1 2 3 4 «tiler» «tiler» Grid definition toHybridApp refersTo refersTo 1 2 3 toDevSide toHostSide Schedule Host 4 toKernel toMainFunction Schedule Device kernelVars toIPFunction toTilerFunctions mainVars defineVars optimizeTransfer 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 28 of 38
  • 29. Contribs: UML/MARTE to OpenCL Code Generation Model to Text Transformation Based on Acceleo Templates Functionalities • IP insertions • Tiler notation to Memory Address Computation in C • Implements the memory transfer optimization 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 29 of 38
  • 30. Contribs: UML/MARTE to OpenCL Code Generation Model to Text Transformation send(dataaddress) with size data to Device; <<shaped>> Multiple Devices Launch Kernel on Device with grid (WG,WI) p: DAXPY {100} recv(dataaddress) <<hwResource>> <<shaped>> with size data from Device; d1: Device {4} for (i = 0; i < numDev; i++) gp: GPU mgp: Memory send(dataaddress + i*data/numDev) with size data/numDev to Device i; for (i = 0; i < numDev; i++) <<abstraction>><<allocate>> Launch Kernel on Device i with grid (WG/numDev,WI) for (i = 0; i < numDev; i++) recv(dataaddress + i*data/numDev) with size data/numDev from Device i; 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 30 of 38
  • 31. Contribs: UML/MARTE to OpenCL Code Generation Model to Text Transformation • Tiler Analysis (Shared Memory Use) 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 31 of 38
  • 32. Contribs: Profiling Analysis Integrating Profiler and Models High Level Abstraction 7 Profiling and Model of Application, Advice Profiling and Architecture and Allocation Optimization Hints Model Transformation Vincent Aranega’s Annotations Profiling and Advices 6 1 Chain Thesis (2011) Model Production Domain Specific Profiling Analisys Transformation Library Generated Code Files Trace (Makefile, *.cl, *.cpp, *.h) Models Profiling Log Device Features Model Database Model SDK 2 Compilation Process UID base link 5 Log Parser Binaries and Runtime Files Logs Software 3 Execution Profiling Logs Production 4 Hybrid Execution Platform 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 32 of 38
  • 33. Contribs: Profiling Analysis Integrating Profiler and Models (Case Study) {16,1000000} 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 33 of 38
  • 34. Contribs: Profiling Analysis Integrating Profiler and Models (Case Study) ~ 60% 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 34 of 38
  • 35. Experimental Validation: Alternator from Valeo Generated Code for PCG in Code_CARMEL for an industrial application 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 35 of 38
  • 36. Experimental Validation: Alternator from Valeo Sparse Matrix • N=775,689 • NNZ=12,502,443 Solution: Preconditioned Conjugate Gradient (PCG) in 10,000 iterations Time (s) Speedup CPU (AMD Opteron) 2300 (~38min) 1 GPU (S1070) 250 (~4min) 9.2 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 36 of 38
  • 37. Conclusions and Perspectives Developing Methodology • Non-specialists can develop their applications from higher levels specification Optimizations and MultiGPU • Memory Issues: Efficient code • Profiling Integration • Scaling according to hardware Numerical Methods (Industrial Applications) • Speedups > 9x • Multiple Simulations – 10 hours/simulation ~ 1 hour • High Performance with low investment in hardware Code_CARMEL Integration 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 37 of 38
  • 38. Conclusions and Perspectives GPU Clusters For instance, Tianhe in China MPI as solution for inter-node communication • Issues: distributed memory, communication, synchronization High-Level Control on the Code Generation Chain • Optimization levels, dynamic parameters 3 avril 2012 Wendell Rodrigues MDE Methodology for GPGPU: Thesis Defense 38 of 38

Notas do Editor

  1. ----- Meeting Notes (19/01/12 11:16) -----slides bienconclusion valeo/region ils ont paye!!!au sein de lequipeinvertir les logos