SlideShare uma empresa Scribd logo
1 de 18
INTRODUCTION
• THE GRAPHICS PROCESSING UNIT (GPU) HAS BECOME AN INTEGRAL PART OF
TODAY’S MAINSTREAM COMPUTING SYSTEMS. OVER THE PAST SIX YEARS, THERE
HAS BEEN A MARKED INCREASE IN THE PERFORMANCE AND CAPABILITIES OF
GPUS.
• GPU IS A GRAPHICAL PROCESSING UNIT WHICH ENABLES YOU RUN HIGH
DEFINITIONS GRAPHICS ON YOUR PC, WHICH ARE THE EMEND OF MODERN
COMPUTING. LIKE THE CPU (CENTRAL PROCESSING UNIT), IT IS A SINGLE-CHIP
PROCESSOR. THE GPU HAS HUNDREDS OF CORES AS COMPARED TO THE 4 OR 8 IN
THE LATEST CPUS. THE PRIMARY JOB OF THE GPU IS TO COMPUTE 3D FUNCTIONS.
PICTURES OF THE GPU
Nvidia Geforce GTX 1070 FTW 8 GB
AMD Radeon R9 295X2
GPU ARCHITECTURE
• CONTROL HARDWARE DOMINATES PROCESSORS
• COMPLEX, DIFFICULT TO BUILD AND VERIFY
• TAKES SUBSTANTIAL FRACTION OF DIE SCALES POORLY
• PAY FOR MAX THROUGHPUT, SUSTAIN AVERAGE THROUGHPUT
• QUADRATIC DEPENDENCY CHECKING
• CONTROL HARDWARE DOESN’T DO ANY MATH!
• OVER THE PAST FEW YEARS, THE GPU HAS EVOLVED FROM A FIXED-FUNCTION
SPECIAL-PURPOSE PROCESSOR INTO A FULL-FLEDGED PARALLEL PROGRAMMABLE
PROCESSOR WITH ADDITIONAL FIXED-FUNCTION SPECIAL-PURPOSE
FUNCTIONALITY.
GPU ARCHITECTURE
• THE GRAPHICS PIPELINE
- THE INPUT TO THE GPU IS A LIST OF GEOMETRIC PRIMITIVES,
TYPICALLY TRIANGLES, IN A 3-D WORLD COORDINATE SYSTEM. THROUGH MANY
STEPS, VERTEX OPERATIONS: THE INPUT PRIMITIVES ARE FORMED FROM
INDIVIDUAL VERTICES. EACH VERTEX MUST BE TRANSFORMED INTO SCREEN
SPACE AND SHADED, TYPICALLY THROUGH COMPUTING THEIR INTERACTION
WITH THE LIGHTS IN THE SCENE. BECAUSE TYPICAL SCENES HAVE TENS TO
HUNDREDS OF THOUSANDS OF VERTICES, AND EACH VERTEX CAN BE COMPUTED
INDEPENDENTLY, THIS STAGE IS WELL SUITED FOR PARALLEL HARDWARE.
GPU ARCHITECTURE
GPU ARCHITECTURE
• EVOLUTION OF GPU ARCHITECTURE
- THE FIXED-FUNCTION PIPELINE LACKED THE GENERALITY TO
EFFICIENTLY EXPRESS MORE COMPLICATED SHADING AND LIGHTING
OPERATIONS THAT ARE ESSENTIAL FOR COMPLEX EFFECTS. THE KEY STEP WAS
REPLACING THE FIXED-FUNCTION PER-VERTEX AND PER-FRAGMENT OPERATIONS
WITH USER-SPECIFIED PROGRAMS RUN ON EACH VERTEX AND FRAGMENT. OVER
THE PAST SIX YEARS, THESE VERTEX PROGRAMS AND FRAGMENT PROGRAMS
HAVE BECOME INCREASINGLY MORE CAPABLE, WITH LARGER LIMITS ON THEIR
SIZE AND RESOURCE CONSUMPTION, WITH MORE FULLY FEATURED
INSTRUCTION SETS, AND WITH MORE FLEXIBLE CONTROL-FLOW OPERATIONS.
GPU ARCHITECTURE
• ARCHITECTURE OF MODERN GPU
- WE NOTED THAT THE GPU IS BUILT FOR DIFFERENT APPLICATION
DEMANDS THAN THE CPU: LARGE PARALLEL COMPUTATION REQUIREMENTS WITH
AN EMPHASIS ON THROUGHPUT RATHER THAN LATENCY. CONSEQUENTLY, THE
ARCHITECTURE OF THE GPU HAS PROGRESSED IN A DIFFERENT DIRECTION THAN
THAT OF THE CPU.
- THE CPU DIVIDES THE PIPELINE IN TIME, APPLYING ALL RESOURCES IN
THE PROCESSOR TO EACH STAGE IN TURN. GPUS HAVE HISTORICALLY TAKEN A
DIFFERENT APPROACH. THE GPU DIVIDES THE RESOURCES OF THE PROCESSOR
AMONG THE DIFFERENT STAGES, SUCH THAT THE PIPELINE IS DIVIDED IN SPACE, NOT
TIME. THE PART OF THE PROCESSOR WORKING ON ONE STAGE FEEDS ITS OUTPUT
DIRECTLY INTO A DIFFERENT PART THAT WORKS ON THE NEXT STAGE.
GPU ARCHITECTURE
LATEST TECHNOLOGIES USED
• NVIDIA
– TESLA HPC SPECIFIC GPUS HAVE EVOLVED FROM GEFORCE SERIES
• AMD
– FIRE STREAM HPC SPECIFIC GPUS HAVE EVOLVED FROM (ATI)
RADEON SERIES
• INTEL
– KNIGHTS CORNER MANY-CORE X86 CHIP IS LIKE HYBRID BETWEEN
A GPU AND MANY-CORE CPU
CHARACTERISTICS OF
GRAPHICS IN GPU
• LARGE COMPUTATIONAL REQUIREMENTS
• MASSIVE PARALLELISM
• GRAPHICS PIPELINE DESIGNED FOR INDEPENDENT OPERATIONS
• LONG LATENCIES TOLERABLE
• DEEP, FEED-FORWARD PIPELINES
• HACKS ARE OK—CAN TOLERATE LACK OF ACCURACY
• GPUS ARE GOOD AT PARALLEL, ARITHMETICALLY INTENSE, STREAMING-
MEMORY PROBLEMS
ASSUMPTIONS AND
LIMITATIONS IN GPU
• IN ADDITION TO QUERY PROCESSING, LARGE WEB SEARCH ENGINES NEED TO
PERFORM MANY OTHER OPERATIONS INCLUDING WEB CRAWL-ING, INDEX
BUILDING, AND DATA MINING STEPS FOR TASKS SUCH AS LINK ANALYSIS AND
SPAM AND DUPLICATE DETECTION. WE FOCUS HERE ON QUERY PROCESSING AND
IN PARTICULAR ON ONE PHASE OF THIS STEP AS EXPLAINED FURTHER BELOW. WE
BELIEVE THAT THIS PART IS SUITABLE FOR IMPLEMENTATION ON GPUS AS IT IS
FAIRLY SIMPLE IN STRUCTURE BUT NONETHELESS CONSUMES A
DISPROPORTIONATE AMOUNT OF THE OVERALL SYSTEM RESOURCES. IN
CONTRAST, WE DO NOT THINK THAT IMPLEMENTATION OF A COMPLETE SEARCH
ENGINE ON A GPU IS CURRENTLY REALISTIC.
GPU COMPUTING
• THE GPU PROGRAMMING MODEL
- THE PROGRAMMABLE UNITS OF THE GPU FOLLOW A SINGLE
PROGRAM MULTIPLE-DATA (SPMD) PROGRAMMING MODEL. FOR EFFICIENCY,
THE GPU PROCESSES MANY ELEMENTS (VERTICES OR FRAGMENTS) IN PARALLEL
USING THE SAME PROGRAM. EACH ELEMENT IS INDEPENDENT FROM THE OTHER
ELEMENTS, AND IN THE BASE PROGRAMMING MODEL, ELEMENTS CANNOT
COMMUNICATE WITH EACH OTHER. ALL GPU PROGRAMS MUST BE STRUCTURED
IN THIS WAY: MANY PARALLEL ELEMENTS, EACH PROCESSED IN PARALLEL BY A
SINGLE PROGRAM.
GPU COMPUTING
• GENERAL-PURPOSE COMPUTING ON THE GPU
- STEPS TO SHOW THE SIMPLER AND DIRECT WAY THAT TODAY’S GPU COMPUTING
APPLICATIONS ARE WRITTEN.
1. PROGRAMMING A GPU FOR GRAPHICS: WE BEGIN WITH THE SAME GPU PIPELINE THAT WE
DESCRIBED IN SECTION II, CONCENTRATING ON THE PROGRAMMABLE ASPECTS OF THIS PIPELINE.
2. THE PROGRAMMER SPECIFIES GEOMETRY THAT COVERS A REGION ON THE SCREEN. THE
RASTERIZER GENERATES A FRAGMENT AT EACH PIXEL LOCATION COVERED BY THAT GEOMETRY.
3. EACH FRAGMENT IS SHADED BY THE FRAGMENT PROGRAM.
4. THE FRAGMENT PROGRAM COMPUTES THE VALUE OF THE FRAGMENT BY A COMBINATION OF MATH
OPERATIONS AND GLOBAL MEMORY READS FROM A GLOBAL BTEXTURE [MEMORY].
5. THE RESULTING IMAGE CAN THEN BE USED AS TEXTURE ON FUTURE PASSES THROUGH THE
GRAPHICS PIPELINE.
GPU VIRTUALIZATION TAXONOMY
• WE OBSERVE THAT DIFFERENT USE CASES WEIGHT THE CRITERIA
DIFFERENTLY—FOR EXAMPLE A VDI DEPLOYMENT VALUES HIGH VM-TO-GPU
CONSOLIDATION RATIOS (E.G., MULTIPLEXING) WHILE A CONSUMER RUNNING
A VM TO ACCESS A GAME OR CAD APPLICATION UNAVAILABLE ON HIS HOST
VALUES PERFORMANCE AND LIKELY FIDELITY. A TECH SUPPORT PERSON
MAINTAINING A LIBRARY OF DIFFERENT CONFIGURATIONS AND AN IT
ADMINISTRATOR RUNNING SERVER VMS ARE BOTH LIKELY TO VALUE
PORTABILITY AND SECURE ISOLATION (INTERPOSITION).
GPU VIRTUALIZATION TAXONOMY
• FRONT-END VIRTUALIZATION
- FRONT-END VIRTUALIZATION INTRODUCES A VIRTUALIZATION BOUNDARY AT A
RELATIVELY HIGH LEVEL IN THE STACK, AND RUNS THE GRAPHICS DRIVER IN THE
HOST/HYPERVISOR. THIS APPROACH DOES NOT RELY ON ANY GPU VENDOR- OR MODEL-
SPECIFIC DE- TAILS.
• BACK-END VIRTUALIZATION
-THE MOST OBVIOUS BACK-END VIRTUALIZATION TECHNIQUE IS FIXED PASS-
THROUGH: THE PERMANENT ASSOCIATION OF A VIRTUAL MACHINE WITH FULL EXCLUSIVE
ACCESS TO A PHYSICAL GPU. RECENT CHIPSET FEATURES, SUCH AS INTEL’S VT-D, MAKE FIXED
PASS-THROUGH PRACTICAL WITHOUT REQUIRING ANY SPECIAL KNOWLEDGE OF A GPU’S
PROGRAMMING INTERFACES. HOWEVER, FIXED PASS-THROUGH IS NOT A GENERAL SOLUTION.
IT COMPLETELY FORGOES ANY MULTIPLEXING AND PACKING MACHINES WITH ONE GPU PER
VIRTUAL MACHINE (PLUS ONE FOR THE HOST) IS NOT FEASIBLE.
Graphic Processing Unit (GPU)

Mais conteúdo relacionado

Mais procurados

Gpu presentation
Gpu presentationGpu presentation
Gpu presentationJosiah Lund
 
Graphics processing unit (GPU)
Graphics processing unit (GPU)Graphics processing unit (GPU)
Graphics processing unit (GPU)Amal R
 
Nvidia (History, GPU Architecture and New Pascal Architecture)
Nvidia (History, GPU Architecture and New Pascal Architecture)Nvidia (History, GPU Architecture and New Pascal Architecture)
Nvidia (History, GPU Architecture and New Pascal Architecture)Saksham Tanwar
 
GPU Architecture NVIDIA (GTX GeForce 480)
GPU Architecture NVIDIA (GTX GeForce 480)GPU Architecture NVIDIA (GTX GeForce 480)
GPU Architecture NVIDIA (GTX GeForce 480)Fatima Qayyum
 
Graphics Processing Unit by Saurabh
Graphics Processing Unit by SaurabhGraphics Processing Unit by Saurabh
Graphics Processing Unit by SaurabhSaurabh Kumar
 
Graphics Processing Unit - GPU
Graphics Processing Unit - GPUGraphics Processing Unit - GPU
Graphics Processing Unit - GPUChetan Gole
 
GPU Virtualization on VMware's Hosted I/O Architecture
GPU Virtualization on VMware's Hosted I/O ArchitectureGPU Virtualization on VMware's Hosted I/O Architecture
GPU Virtualization on VMware's Hosted I/O Architectureguestb3fc97
 
Introduction to parallel computing using CUDA
Introduction to parallel computing using CUDAIntroduction to parallel computing using CUDA
Introduction to parallel computing using CUDAMartin Peniak
 
Parallel computing with Gpu
Parallel computing with GpuParallel computing with Gpu
Parallel computing with GpuRohit Khatana
 
CPU vs. GPU presentation
CPU vs. GPU presentationCPU vs. GPU presentation
CPU vs. GPU presentationVishal Singh
 
Game Engine Architecture
Game Engine ArchitectureGame Engine Architecture
Game Engine ArchitectureAttila Jenei
 
Delivering a new level of visual performance in an SoC AMD "Raven Ridge" APU
Delivering a new level of visual performance in an SoC AMD "Raven Ridge" APUDelivering a new level of visual performance in an SoC AMD "Raven Ridge" APU
Delivering a new level of visual performance in an SoC AMD "Raven Ridge" APUAMD
 

Mais procurados (20)

Gpu presentation
Gpu presentationGpu presentation
Gpu presentation
 
Graphics processing unit (GPU)
Graphics processing unit (GPU)Graphics processing unit (GPU)
Graphics processing unit (GPU)
 
GPU - Basic Working
GPU - Basic WorkingGPU - Basic Working
GPU - Basic Working
 
Nvidia (History, GPU Architecture and New Pascal Architecture)
Nvidia (History, GPU Architecture and New Pascal Architecture)Nvidia (History, GPU Architecture and New Pascal Architecture)
Nvidia (History, GPU Architecture and New Pascal Architecture)
 
GPU Architecture NVIDIA (GTX GeForce 480)
GPU Architecture NVIDIA (GTX GeForce 480)GPU Architecture NVIDIA (GTX GeForce 480)
GPU Architecture NVIDIA (GTX GeForce 480)
 
Graphics Processing Unit by Saurabh
Graphics Processing Unit by SaurabhGraphics Processing Unit by Saurabh
Graphics Processing Unit by Saurabh
 
It's Time to ROCm!
It's Time to ROCm!It's Time to ROCm!
It's Time to ROCm!
 
Graphics Processing Unit - GPU
Graphics Processing Unit - GPUGraphics Processing Unit - GPU
Graphics Processing Unit - GPU
 
GPU Virtualization on VMware's Hosted I/O Architecture
GPU Virtualization on VMware's Hosted I/O ArchitectureGPU Virtualization on VMware's Hosted I/O Architecture
GPU Virtualization on VMware's Hosted I/O Architecture
 
GPU Computing
GPU ComputingGPU Computing
GPU Computing
 
Introduction to parallel computing using CUDA
Introduction to parallel computing using CUDAIntroduction to parallel computing using CUDA
Introduction to parallel computing using CUDA
 
Graphics card ppt
Graphics card pptGraphics card ppt
Graphics card ppt
 
Parallel computing with Gpu
Parallel computing with GpuParallel computing with Gpu
Parallel computing with Gpu
 
Parallel Computing on the GPU
Parallel Computing on the GPUParallel Computing on the GPU
Parallel Computing on the GPU
 
GPU
GPUGPU
GPU
 
CPU vs. GPU presentation
CPU vs. GPU presentationCPU vs. GPU presentation
CPU vs. GPU presentation
 
Mobile Processors
Mobile ProcessorsMobile Processors
Mobile Processors
 
Introduction to GPU Programming
Introduction to GPU ProgrammingIntroduction to GPU Programming
Introduction to GPU Programming
 
Game Engine Architecture
Game Engine ArchitectureGame Engine Architecture
Game Engine Architecture
 
Delivering a new level of visual performance in an SoC AMD "Raven Ridge" APU
Delivering a new level of visual performance in an SoC AMD "Raven Ridge" APUDelivering a new level of visual performance in an SoC AMD "Raven Ridge" APU
Delivering a new level of visual performance in an SoC AMD "Raven Ridge" APU
 

Destaque

NVIDIA – Inventor of the GPU
NVIDIA – Inventor of the GPUNVIDIA – Inventor of the GPU
NVIDIA – Inventor of the GPUNVIDIA
 
Graphics processing unit (gpu)
Graphics processing unit (gpu)Graphics processing unit (gpu)
Graphics processing unit (gpu)junliwanag
 
Gpu presentation
Gpu presentationGpu presentation
Gpu presentationspartasoft
 
Gpu and The Brick Wall
Gpu and The Brick WallGpu and The Brick Wall
Gpu and The Brick Wallugur candan
 
Introduction to CUDA
Introduction to CUDAIntroduction to CUDA
Introduction to CUDARaymond Tay
 
Introduction to heterogeneous_computing_for_hpc
Introduction to heterogeneous_computing_for_hpcIntroduction to heterogeneous_computing_for_hpc
Introduction to heterogeneous_computing_for_hpcSupasit Kajkamhaeng
 
Hardware Shaders
Hardware ShadersHardware Shaders
Hardware Shadersgueste52f1b
 
Blue sky – red sunset
Blue sky – red sunsetBlue sky – red sunset
Blue sky – red sunsettg2011
 
surface computer ppt
surface computer pptsurface computer ppt
surface computer pptkabego18audry
 
Microsoft surface by NIRAV RANA
Microsoft surface by NIRAV RANAMicrosoft surface by NIRAV RANA
Microsoft surface by NIRAV RANANirav Rana
 
Wireless_Sensor_security
Wireless_Sensor_securityWireless_Sensor_security
Wireless_Sensor_securityTosha Shah
 

Destaque (17)

NVIDIA – Inventor of the GPU
NVIDIA – Inventor of the GPUNVIDIA – Inventor of the GPU
NVIDIA – Inventor of the GPU
 
Graphics processing unit (gpu)
Graphics processing unit (gpu)Graphics processing unit (gpu)
Graphics processing unit (gpu)
 
Lec04 gpu architecture
Lec04 gpu architectureLec04 gpu architecture
Lec04 gpu architecture
 
Gpu presentation
Gpu presentationGpu presentation
Gpu presentation
 
Gpu and The Brick Wall
Gpu and The Brick WallGpu and The Brick Wall
Gpu and The Brick Wall
 
Introduction to CUDA
Introduction to CUDAIntroduction to CUDA
Introduction to CUDA
 
Introduction to heterogeneous_computing_for_hpc
Introduction to heterogeneous_computing_for_hpcIntroduction to heterogeneous_computing_for_hpc
Introduction to heterogeneous_computing_for_hpc
 
Hardware Shaders
Hardware ShadersHardware Shaders
Hardware Shaders
 
Blue sky – red sunset
Blue sky – red sunsetBlue sky – red sunset
Blue sky – red sunset
 
Surface Computer
Surface ComputerSurface Computer
Surface Computer
 
Sc13 gpu
Sc13 gpuSc13 gpu
Sc13 gpu
 
surface computer ppt
surface computer pptsurface computer ppt
surface computer ppt
 
Microsoft surface by NIRAV RANA
Microsoft surface by NIRAV RANAMicrosoft surface by NIRAV RANA
Microsoft surface by NIRAV RANA
 
Nvidia SC13 Podcast
Nvidia SC13 PodcastNvidia SC13 Podcast
Nvidia SC13 Podcast
 
Wireless_Sensor_security
Wireless_Sensor_securityWireless_Sensor_security
Wireless_Sensor_security
 
why to know asean
why to know aseanwhy to know asean
why to know asean
 
Egito
EgitoEgito
Egito
 

Semelhante a Graphic Processing Unit (GPU)

Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Editor IJARCET
 
Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Editor IJARCET
 
GPU Computing: A brief overview
GPU Computing: A brief overviewGPU Computing: A brief overview
GPU Computing: A brief overviewRajiv Kumar
 
OpenPOWER Supercomputing Recap Day Two: Innovating Across the Stack
OpenPOWER Supercomputing Recap Day Two: Innovating Across the StackOpenPOWER Supercomputing Recap Day Two: Innovating Across the Stack
OpenPOWER Supercomputing Recap Day Two: Innovating Across the StackOpenPOWERorg
 
Challenges and Opportunities of FPGA Acceleration in Big Data
Challenges and Opportunities of FPGA Acceleration in Big DataChallenges and Opportunities of FPGA Acceleration in Big Data
Challenges and Opportunities of FPGA Acceleration in Big DataIRJET Journal
 
GPU Support In Spark And GPU/CPU Mixed Resource Scheduling At Production Scale
GPU Support In Spark And GPU/CPU Mixed Resource Scheduling At Production ScaleGPU Support In Spark And GPU/CPU Mixed Resource Scheduling At Production Scale
GPU Support In Spark And GPU/CPU Mixed Resource Scheduling At Production ScaleSpark Summit
 
GPU Support in Spark and GPU/CPU Mixed Resource Scheduling at Production Scale
GPU Support in Spark and GPU/CPU Mixed Resource Scheduling at Production ScaleGPU Support in Spark and GPU/CPU Mixed Resource Scheduling at Production Scale
GPU Support in Spark and GPU/CPU Mixed Resource Scheduling at Production Scalesparktc
 
Cluster Technique used in Advanced Computer Architecture.pptx
Cluster Technique used in Advanced Computer Architecture.pptxCluster Technique used in Advanced Computer Architecture.pptx
Cluster Technique used in Advanced Computer Architecture.pptxtiwarirajan1
 
FYP1 Progress Report (final)
FYP1 Progress Report (final)FYP1 Progress Report (final)
FYP1 Progress Report (final)waqas khan
 
37248136-Nano-Technology.pdf
37248136-Nano-Technology.pdf37248136-Nano-Technology.pdf
37248136-Nano-Technology.pdfTB107thippeswamyM
 
MapReduce:Simplified Data Processing on Large Cluster Presented by Areej Qas...
MapReduce:Simplified Data Processing on Large Cluster  Presented by Areej Qas...MapReduce:Simplified Data Processing on Large Cluster  Presented by Areej Qas...
MapReduce:Simplified Data Processing on Large Cluster Presented by Areej Qas...areej qasrawi
 
X13 Products + Intel® Xeon® CPU Max Series–An Applications & Performance View
 X13 Products + Intel® Xeon® CPU Max Series–An Applications & Performance View X13 Products + Intel® Xeon® CPU Max Series–An Applications & Performance View
X13 Products + Intel® Xeon® CPU Max Series–An Applications & Performance ViewRebekah Rodriguez
 
X13 Products + Intel® Xeon® CPU Max Series–An Applications & Performance View
X13 Products + Intel® Xeon® CPU Max Series–An Applications & Performance ViewX13 Products + Intel® Xeon® CPU Max Series–An Applications & Performance View
X13 Products + Intel® Xeon® CPU Max Series–An Applications & Performance ViewRebekah Rodriguez
 
googlecluster-ieee
googlecluster-ieeegooglecluster-ieee
googlecluster-ieeeHiroshi Ono
 
Exploring emerging technologies in the HPC co-design space
Exploring emerging technologies in the HPC co-design spaceExploring emerging technologies in the HPC co-design space
Exploring emerging technologies in the HPC co-design spacejsvetter
 
Applying Cloud Techniques to Address Complexity in HPC System Integrations
Applying Cloud Techniques to Address Complexity in HPC System IntegrationsApplying Cloud Techniques to Address Complexity in HPC System Integrations
Applying Cloud Techniques to Address Complexity in HPC System Integrationsinside-BigData.com
 

Semelhante a Graphic Processing Unit (GPU) (20)

Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045
 
Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045
 
GPU Computing: A brief overview
GPU Computing: A brief overviewGPU Computing: A brief overview
GPU Computing: A brief overview
 
OpenPOWER Supercomputing Recap Day Two: Innovating Across the Stack
OpenPOWER Supercomputing Recap Day Two: Innovating Across the StackOpenPOWER Supercomputing Recap Day Two: Innovating Across the Stack
OpenPOWER Supercomputing Recap Day Two: Innovating Across the Stack
 
Challenges and Opportunities of FPGA Acceleration in Big Data
Challenges and Opportunities of FPGA Acceleration in Big DataChallenges and Opportunities of FPGA Acceleration in Big Data
Challenges and Opportunities of FPGA Acceleration in Big Data
 
GPU Support In Spark And GPU/CPU Mixed Resource Scheduling At Production Scale
GPU Support In Spark And GPU/CPU Mixed Resource Scheduling At Production ScaleGPU Support In Spark And GPU/CPU Mixed Resource Scheduling At Production Scale
GPU Support In Spark And GPU/CPU Mixed Resource Scheduling At Production Scale
 
Amd fusion apus
Amd fusion apusAmd fusion apus
Amd fusion apus
 
GPU Support in Spark and GPU/CPU Mixed Resource Scheduling at Production Scale
GPU Support in Spark and GPU/CPU Mixed Resource Scheduling at Production ScaleGPU Support in Spark and GPU/CPU Mixed Resource Scheduling at Production Scale
GPU Support in Spark and GPU/CPU Mixed Resource Scheduling at Production Scale
 
Cluster Technique used in Advanced Computer Architecture.pptx
Cluster Technique used in Advanced Computer Architecture.pptxCluster Technique used in Advanced Computer Architecture.pptx
Cluster Technique used in Advanced Computer Architecture.pptx
 
FYP1 Progress Report (final)
FYP1 Progress Report (final)FYP1 Progress Report (final)
FYP1 Progress Report (final)
 
37248136-Nano-Technology.pdf
37248136-Nano-Technology.pdf37248136-Nano-Technology.pdf
37248136-Nano-Technology.pdf
 
MapReduce:Simplified Data Processing on Large Cluster Presented by Areej Qas...
MapReduce:Simplified Data Processing on Large Cluster  Presented by Areej Qas...MapReduce:Simplified Data Processing on Large Cluster  Presented by Areej Qas...
MapReduce:Simplified Data Processing on Large Cluster Presented by Areej Qas...
 
X13 Products + Intel® Xeon® CPU Max Series–An Applications & Performance View
 X13 Products + Intel® Xeon® CPU Max Series–An Applications & Performance View X13 Products + Intel® Xeon® CPU Max Series–An Applications & Performance View
X13 Products + Intel® Xeon® CPU Max Series–An Applications & Performance View
 
X13 Products + Intel® Xeon® CPU Max Series–An Applications & Performance View
X13 Products + Intel® Xeon® CPU Max Series–An Applications & Performance ViewX13 Products + Intel® Xeon® CPU Max Series–An Applications & Performance View
X13 Products + Intel® Xeon® CPU Max Series–An Applications & Performance View
 
googlecluster-ieee
googlecluster-ieeegooglecluster-ieee
googlecluster-ieee
 
Exploring emerging technologies in the HPC co-design space
Exploring emerging technologies in the HPC co-design spaceExploring emerging technologies in the HPC co-design space
Exploring emerging technologies in the HPC co-design space
 
Applying Cloud Techniques to Address Complexity in HPC System Integrations
Applying Cloud Techniques to Address Complexity in HPC System IntegrationsApplying Cloud Techniques to Address Complexity in HPC System Integrations
Applying Cloud Techniques to Address Complexity in HPC System Integrations
 
Gpu
GpuGpu
Gpu
 
Gpu
GpuGpu
Gpu
 
Cloud Networking Trends
Cloud Networking TrendsCloud Networking Trends
Cloud Networking Trends
 

Último

在线制作(ANU毕业证书)澳大利亚国立大学毕业证成绩单原版一比一
在线制作(ANU毕业证书)澳大利亚国立大学毕业证成绩单原版一比一在线制作(ANU毕业证书)澳大利亚国立大学毕业证成绩单原版一比一
在线制作(ANU毕业证书)澳大利亚国立大学毕业证成绩单原版一比一ougvy
 
Get Premium Pimple Saudagar Call Girls (8005736733) 24x7 Rate 15999 with A/c ...
Get Premium Pimple Saudagar Call Girls (8005736733) 24x7 Rate 15999 with A/c ...Get Premium Pimple Saudagar Call Girls (8005736733) 24x7 Rate 15999 with A/c ...
Get Premium Pimple Saudagar Call Girls (8005736733) 24x7 Rate 15999 with A/c ...MOHANI PANDEY
 
怎样办理斯威本科技大学毕业证(SUT毕业证书)成绩单留信认证
怎样办理斯威本科技大学毕业证(SUT毕业证书)成绩单留信认证怎样办理斯威本科技大学毕业证(SUT毕业证书)成绩单留信认证
怎样办理斯威本科技大学毕业证(SUT毕业证书)成绩单留信认证tufbav
 
Abortion Pill for sale in Riyadh ((+918761049707) Get Cytotec in Dammam
Abortion Pill for sale in Riyadh ((+918761049707) Get Cytotec in DammamAbortion Pill for sale in Riyadh ((+918761049707) Get Cytotec in Dammam
Abortion Pill for sale in Riyadh ((+918761049707) Get Cytotec in Dammamahmedjiabur940
 
Just Call Vip call girls daman Escorts ☎️9352988975 Two shot with one girl (d...
Just Call Vip call girls daman Escorts ☎️9352988975 Two shot with one girl (d...Just Call Vip call girls daman Escorts ☎️9352988975 Two shot with one girl (d...
Just Call Vip call girls daman Escorts ☎️9352988975 Two shot with one girl (d...gajnagarg
 
Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...
Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...
Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...amitlee9823
 
Call Girls Pimple Saudagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Pimple Saudagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Pimple Saudagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Pimple Saudagar Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
➥🔝 7737669865 🔝▻ kakinada Call-girls in Women Seeking Men 🔝kakinada🔝 Escor...
➥🔝 7737669865 🔝▻ kakinada Call-girls in Women Seeking Men  🔝kakinada🔝   Escor...➥🔝 7737669865 🔝▻ kakinada Call-girls in Women Seeking Men  🔝kakinada🔝   Escor...
➥🔝 7737669865 🔝▻ kakinada Call-girls in Women Seeking Men 🔝kakinada🔝 Escor...amitlee9823
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In Yusuf Sarai ≼🔝 Delhi door step delevry≼🔝
Call Now ≽ 9953056974 ≼🔝 Call Girls In Yusuf Sarai ≼🔝 Delhi door step delevry≼🔝Call Now ≽ 9953056974 ≼🔝 Call Girls In Yusuf Sarai ≼🔝 Delhi door step delevry≼🔝
Call Now ≽ 9953056974 ≼🔝 Call Girls In Yusuf Sarai ≼🔝 Delhi door step delevry≼🔝9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Just Call Vip call girls chhindwara Escorts ☎️9352988975 Two shot with one gi...
Just Call Vip call girls chhindwara Escorts ☎️9352988975 Two shot with one gi...Just Call Vip call girls chhindwara Escorts ☎️9352988975 Two shot with one gi...
Just Call Vip call girls chhindwara Escorts ☎️9352988975 Two shot with one gi...gajnagarg
 
Just Call Vip call girls Bhiwandi Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls Bhiwandi Escorts ☎️9352988975 Two shot with one girl...Just Call Vip call girls Bhiwandi Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls Bhiwandi Escorts ☎️9352988975 Two shot with one girl...gajnagarg
 
Call Girls Banashankari Just Call 👗 7737669865 👗 Top Class Call Girl Service ...
Call Girls Banashankari Just Call 👗 7737669865 👗 Top Class Call Girl Service ...Call Girls Banashankari Just Call 👗 7737669865 👗 Top Class Call Girl Service ...
Call Girls Banashankari Just Call 👗 7737669865 👗 Top Class Call Girl Service ...amitlee9823
 
➥🔝 7737669865 🔝▻ Vijayawada Call-girls in Women Seeking Men 🔝Vijayawada🔝 E...
➥🔝 7737669865 🔝▻ Vijayawada Call-girls in Women Seeking Men  🔝Vijayawada🔝   E...➥🔝 7737669865 🔝▻ Vijayawada Call-girls in Women Seeking Men  🔝Vijayawada🔝   E...
➥🔝 7737669865 🔝▻ Vijayawada Call-girls in Women Seeking Men 🔝Vijayawada🔝 E...amitlee9823
 
Just Call Vip call girls godhra Escorts ☎️9352988975 Two shot with one girl (...
Just Call Vip call girls godhra Escorts ☎️9352988975 Two shot with one girl (...Just Call Vip call girls godhra Escorts ☎️9352988975 Two shot with one girl (...
Just Call Vip call girls godhra Escorts ☎️9352988975 Two shot with one girl (...gajnagarg
 
一比一原版(Otago毕业证书)奥塔哥理工学院毕业证成绩单学位证靠谱定制
一比一原版(Otago毕业证书)奥塔哥理工学院毕业证成绩单学位证靠谱定制一比一原版(Otago毕业证书)奥塔哥理工学院毕业证成绩单学位证靠谱定制
一比一原版(Otago毕业证书)奥塔哥理工学院毕业证成绩单学位证靠谱定制uodye
 
Call Girls Kothrud Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Kothrud Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Kothrud Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Kothrud Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
SM-N975F esquematico completo - reparación.pdf
SM-N975F esquematico completo - reparación.pdfSM-N975F esquematico completo - reparación.pdf
SM-N975F esquematico completo - reparación.pdfStefanoBiamonte1
 

Último (20)

在线制作(ANU毕业证书)澳大利亚国立大学毕业证成绩单原版一比一
在线制作(ANU毕业证书)澳大利亚国立大学毕业证成绩单原版一比一在线制作(ANU毕业证书)澳大利亚国立大学毕业证成绩单原版一比一
在线制作(ANU毕业证书)澳大利亚国立大学毕业证成绩单原版一比一
 
(INDIRA) Call Girl Napur Call Now 8617697112 Napur Escorts 24x7
(INDIRA) Call Girl Napur Call Now 8617697112 Napur Escorts 24x7(INDIRA) Call Girl Napur Call Now 8617697112 Napur Escorts 24x7
(INDIRA) Call Girl Napur Call Now 8617697112 Napur Escorts 24x7
 
Get Premium Pimple Saudagar Call Girls (8005736733) 24x7 Rate 15999 with A/c ...
Get Premium Pimple Saudagar Call Girls (8005736733) 24x7 Rate 15999 with A/c ...Get Premium Pimple Saudagar Call Girls (8005736733) 24x7 Rate 15999 with A/c ...
Get Premium Pimple Saudagar Call Girls (8005736733) 24x7 Rate 15999 with A/c ...
 
怎样办理斯威本科技大学毕业证(SUT毕业证书)成绩单留信认证
怎样办理斯威本科技大学毕业证(SUT毕业证书)成绩单留信认证怎样办理斯威本科技大学毕业证(SUT毕业证书)成绩单留信认证
怎样办理斯威本科技大学毕业证(SUT毕业证书)成绩单留信认证
 
Abortion Pill for sale in Riyadh ((+918761049707) Get Cytotec in Dammam
Abortion Pill for sale in Riyadh ((+918761049707) Get Cytotec in DammamAbortion Pill for sale in Riyadh ((+918761049707) Get Cytotec in Dammam
Abortion Pill for sale in Riyadh ((+918761049707) Get Cytotec in Dammam
 
Just Call Vip call girls daman Escorts ☎️9352988975 Two shot with one girl (d...
Just Call Vip call girls daman Escorts ☎️9352988975 Two shot with one girl (d...Just Call Vip call girls daman Escorts ☎️9352988975 Two shot with one girl (d...
Just Call Vip call girls daman Escorts ☎️9352988975 Two shot with one girl (d...
 
Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...
Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...
Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...
 
Call Girls Pimple Saudagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Pimple Saudagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Pimple Saudagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Pimple Saudagar Call Me 7737669865 Budget Friendly No Advance Booking
 
➥🔝 7737669865 🔝▻ kakinada Call-girls in Women Seeking Men 🔝kakinada🔝 Escor...
➥🔝 7737669865 🔝▻ kakinada Call-girls in Women Seeking Men  🔝kakinada🔝   Escor...➥🔝 7737669865 🔝▻ kakinada Call-girls in Women Seeking Men  🔝kakinada🔝   Escor...
➥🔝 7737669865 🔝▻ kakinada Call-girls in Women Seeking Men 🔝kakinada🔝 Escor...
 
CHEAP Call Girls in Hauz Quazi (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Hauz Quazi  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Hauz Quazi  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Hauz Quazi (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Abortion pills in Jeddah |+966572737505 | Get Cytotec
Abortion pills in Jeddah |+966572737505 | Get CytotecAbortion pills in Jeddah |+966572737505 | Get Cytotec
Abortion pills in Jeddah |+966572737505 | Get Cytotec
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In Yusuf Sarai ≼🔝 Delhi door step delevry≼🔝
Call Now ≽ 9953056974 ≼🔝 Call Girls In Yusuf Sarai ≼🔝 Delhi door step delevry≼🔝Call Now ≽ 9953056974 ≼🔝 Call Girls In Yusuf Sarai ≼🔝 Delhi door step delevry≼🔝
Call Now ≽ 9953056974 ≼🔝 Call Girls In Yusuf Sarai ≼🔝 Delhi door step delevry≼🔝
 
Just Call Vip call girls chhindwara Escorts ☎️9352988975 Two shot with one gi...
Just Call Vip call girls chhindwara Escorts ☎️9352988975 Two shot with one gi...Just Call Vip call girls chhindwara Escorts ☎️9352988975 Two shot with one gi...
Just Call Vip call girls chhindwara Escorts ☎️9352988975 Two shot with one gi...
 
Just Call Vip call girls Bhiwandi Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls Bhiwandi Escorts ☎️9352988975 Two shot with one girl...Just Call Vip call girls Bhiwandi Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls Bhiwandi Escorts ☎️9352988975 Two shot with one girl...
 
Call Girls Banashankari Just Call 👗 7737669865 👗 Top Class Call Girl Service ...
Call Girls Banashankari Just Call 👗 7737669865 👗 Top Class Call Girl Service ...Call Girls Banashankari Just Call 👗 7737669865 👗 Top Class Call Girl Service ...
Call Girls Banashankari Just Call 👗 7737669865 👗 Top Class Call Girl Service ...
 
➥🔝 7737669865 🔝▻ Vijayawada Call-girls in Women Seeking Men 🔝Vijayawada🔝 E...
➥🔝 7737669865 🔝▻ Vijayawada Call-girls in Women Seeking Men  🔝Vijayawada🔝   E...➥🔝 7737669865 🔝▻ Vijayawada Call-girls in Women Seeking Men  🔝Vijayawada🔝   E...
➥🔝 7737669865 🔝▻ Vijayawada Call-girls in Women Seeking Men 🔝Vijayawada🔝 E...
 
Just Call Vip call girls godhra Escorts ☎️9352988975 Two shot with one girl (...
Just Call Vip call girls godhra Escorts ☎️9352988975 Two shot with one girl (...Just Call Vip call girls godhra Escorts ☎️9352988975 Two shot with one girl (...
Just Call Vip call girls godhra Escorts ☎️9352988975 Two shot with one girl (...
 
一比一原版(Otago毕业证书)奥塔哥理工学院毕业证成绩单学位证靠谱定制
一比一原版(Otago毕业证书)奥塔哥理工学院毕业证成绩单学位证靠谱定制一比一原版(Otago毕业证书)奥塔哥理工学院毕业证成绩单学位证靠谱定制
一比一原版(Otago毕业证书)奥塔哥理工学院毕业证成绩单学位证靠谱定制
 
Call Girls Kothrud Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Kothrud Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Kothrud Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Kothrud Call Me 7737669865 Budget Friendly No Advance Booking
 
SM-N975F esquematico completo - reparación.pdf
SM-N975F esquematico completo - reparación.pdfSM-N975F esquematico completo - reparación.pdf
SM-N975F esquematico completo - reparación.pdf
 

Graphic Processing Unit (GPU)

  • 1.
  • 2. INTRODUCTION • THE GRAPHICS PROCESSING UNIT (GPU) HAS BECOME AN INTEGRAL PART OF TODAY’S MAINSTREAM COMPUTING SYSTEMS. OVER THE PAST SIX YEARS, THERE HAS BEEN A MARKED INCREASE IN THE PERFORMANCE AND CAPABILITIES OF GPUS. • GPU IS A GRAPHICAL PROCESSING UNIT WHICH ENABLES YOU RUN HIGH DEFINITIONS GRAPHICS ON YOUR PC, WHICH ARE THE EMEND OF MODERN COMPUTING. LIKE THE CPU (CENTRAL PROCESSING UNIT), IT IS A SINGLE-CHIP PROCESSOR. THE GPU HAS HUNDREDS OF CORES AS COMPARED TO THE 4 OR 8 IN THE LATEST CPUS. THE PRIMARY JOB OF THE GPU IS TO COMPUTE 3D FUNCTIONS.
  • 3. PICTURES OF THE GPU Nvidia Geforce GTX 1070 FTW 8 GB
  • 5. GPU ARCHITECTURE • CONTROL HARDWARE DOMINATES PROCESSORS • COMPLEX, DIFFICULT TO BUILD AND VERIFY • TAKES SUBSTANTIAL FRACTION OF DIE SCALES POORLY • PAY FOR MAX THROUGHPUT, SUSTAIN AVERAGE THROUGHPUT • QUADRATIC DEPENDENCY CHECKING • CONTROL HARDWARE DOESN’T DO ANY MATH! • OVER THE PAST FEW YEARS, THE GPU HAS EVOLVED FROM A FIXED-FUNCTION SPECIAL-PURPOSE PROCESSOR INTO A FULL-FLEDGED PARALLEL PROGRAMMABLE PROCESSOR WITH ADDITIONAL FIXED-FUNCTION SPECIAL-PURPOSE FUNCTIONALITY.
  • 6. GPU ARCHITECTURE • THE GRAPHICS PIPELINE - THE INPUT TO THE GPU IS A LIST OF GEOMETRIC PRIMITIVES, TYPICALLY TRIANGLES, IN A 3-D WORLD COORDINATE SYSTEM. THROUGH MANY STEPS, VERTEX OPERATIONS: THE INPUT PRIMITIVES ARE FORMED FROM INDIVIDUAL VERTICES. EACH VERTEX MUST BE TRANSFORMED INTO SCREEN SPACE AND SHADED, TYPICALLY THROUGH COMPUTING THEIR INTERACTION WITH THE LIGHTS IN THE SCENE. BECAUSE TYPICAL SCENES HAVE TENS TO HUNDREDS OF THOUSANDS OF VERTICES, AND EACH VERTEX CAN BE COMPUTED INDEPENDENTLY, THIS STAGE IS WELL SUITED FOR PARALLEL HARDWARE.
  • 8. GPU ARCHITECTURE • EVOLUTION OF GPU ARCHITECTURE - THE FIXED-FUNCTION PIPELINE LACKED THE GENERALITY TO EFFICIENTLY EXPRESS MORE COMPLICATED SHADING AND LIGHTING OPERATIONS THAT ARE ESSENTIAL FOR COMPLEX EFFECTS. THE KEY STEP WAS REPLACING THE FIXED-FUNCTION PER-VERTEX AND PER-FRAGMENT OPERATIONS WITH USER-SPECIFIED PROGRAMS RUN ON EACH VERTEX AND FRAGMENT. OVER THE PAST SIX YEARS, THESE VERTEX PROGRAMS AND FRAGMENT PROGRAMS HAVE BECOME INCREASINGLY MORE CAPABLE, WITH LARGER LIMITS ON THEIR SIZE AND RESOURCE CONSUMPTION, WITH MORE FULLY FEATURED INSTRUCTION SETS, AND WITH MORE FLEXIBLE CONTROL-FLOW OPERATIONS.
  • 9. GPU ARCHITECTURE • ARCHITECTURE OF MODERN GPU - WE NOTED THAT THE GPU IS BUILT FOR DIFFERENT APPLICATION DEMANDS THAN THE CPU: LARGE PARALLEL COMPUTATION REQUIREMENTS WITH AN EMPHASIS ON THROUGHPUT RATHER THAN LATENCY. CONSEQUENTLY, THE ARCHITECTURE OF THE GPU HAS PROGRESSED IN A DIFFERENT DIRECTION THAN THAT OF THE CPU. - THE CPU DIVIDES THE PIPELINE IN TIME, APPLYING ALL RESOURCES IN THE PROCESSOR TO EACH STAGE IN TURN. GPUS HAVE HISTORICALLY TAKEN A DIFFERENT APPROACH. THE GPU DIVIDES THE RESOURCES OF THE PROCESSOR AMONG THE DIFFERENT STAGES, SUCH THAT THE PIPELINE IS DIVIDED IN SPACE, NOT TIME. THE PART OF THE PROCESSOR WORKING ON ONE STAGE FEEDS ITS OUTPUT DIRECTLY INTO A DIFFERENT PART THAT WORKS ON THE NEXT STAGE.
  • 11. LATEST TECHNOLOGIES USED • NVIDIA – TESLA HPC SPECIFIC GPUS HAVE EVOLVED FROM GEFORCE SERIES • AMD – FIRE STREAM HPC SPECIFIC GPUS HAVE EVOLVED FROM (ATI) RADEON SERIES • INTEL – KNIGHTS CORNER MANY-CORE X86 CHIP IS LIKE HYBRID BETWEEN A GPU AND MANY-CORE CPU
  • 12. CHARACTERISTICS OF GRAPHICS IN GPU • LARGE COMPUTATIONAL REQUIREMENTS • MASSIVE PARALLELISM • GRAPHICS PIPELINE DESIGNED FOR INDEPENDENT OPERATIONS • LONG LATENCIES TOLERABLE • DEEP, FEED-FORWARD PIPELINES • HACKS ARE OK—CAN TOLERATE LACK OF ACCURACY • GPUS ARE GOOD AT PARALLEL, ARITHMETICALLY INTENSE, STREAMING- MEMORY PROBLEMS
  • 13. ASSUMPTIONS AND LIMITATIONS IN GPU • IN ADDITION TO QUERY PROCESSING, LARGE WEB SEARCH ENGINES NEED TO PERFORM MANY OTHER OPERATIONS INCLUDING WEB CRAWL-ING, INDEX BUILDING, AND DATA MINING STEPS FOR TASKS SUCH AS LINK ANALYSIS AND SPAM AND DUPLICATE DETECTION. WE FOCUS HERE ON QUERY PROCESSING AND IN PARTICULAR ON ONE PHASE OF THIS STEP AS EXPLAINED FURTHER BELOW. WE BELIEVE THAT THIS PART IS SUITABLE FOR IMPLEMENTATION ON GPUS AS IT IS FAIRLY SIMPLE IN STRUCTURE BUT NONETHELESS CONSUMES A DISPROPORTIONATE AMOUNT OF THE OVERALL SYSTEM RESOURCES. IN CONTRAST, WE DO NOT THINK THAT IMPLEMENTATION OF A COMPLETE SEARCH ENGINE ON A GPU IS CURRENTLY REALISTIC.
  • 14. GPU COMPUTING • THE GPU PROGRAMMING MODEL - THE PROGRAMMABLE UNITS OF THE GPU FOLLOW A SINGLE PROGRAM MULTIPLE-DATA (SPMD) PROGRAMMING MODEL. FOR EFFICIENCY, THE GPU PROCESSES MANY ELEMENTS (VERTICES OR FRAGMENTS) IN PARALLEL USING THE SAME PROGRAM. EACH ELEMENT IS INDEPENDENT FROM THE OTHER ELEMENTS, AND IN THE BASE PROGRAMMING MODEL, ELEMENTS CANNOT COMMUNICATE WITH EACH OTHER. ALL GPU PROGRAMS MUST BE STRUCTURED IN THIS WAY: MANY PARALLEL ELEMENTS, EACH PROCESSED IN PARALLEL BY A SINGLE PROGRAM.
  • 15. GPU COMPUTING • GENERAL-PURPOSE COMPUTING ON THE GPU - STEPS TO SHOW THE SIMPLER AND DIRECT WAY THAT TODAY’S GPU COMPUTING APPLICATIONS ARE WRITTEN. 1. PROGRAMMING A GPU FOR GRAPHICS: WE BEGIN WITH THE SAME GPU PIPELINE THAT WE DESCRIBED IN SECTION II, CONCENTRATING ON THE PROGRAMMABLE ASPECTS OF THIS PIPELINE. 2. THE PROGRAMMER SPECIFIES GEOMETRY THAT COVERS A REGION ON THE SCREEN. THE RASTERIZER GENERATES A FRAGMENT AT EACH PIXEL LOCATION COVERED BY THAT GEOMETRY. 3. EACH FRAGMENT IS SHADED BY THE FRAGMENT PROGRAM. 4. THE FRAGMENT PROGRAM COMPUTES THE VALUE OF THE FRAGMENT BY A COMBINATION OF MATH OPERATIONS AND GLOBAL MEMORY READS FROM A GLOBAL BTEXTURE [MEMORY]. 5. THE RESULTING IMAGE CAN THEN BE USED AS TEXTURE ON FUTURE PASSES THROUGH THE GRAPHICS PIPELINE.
  • 16. GPU VIRTUALIZATION TAXONOMY • WE OBSERVE THAT DIFFERENT USE CASES WEIGHT THE CRITERIA DIFFERENTLY—FOR EXAMPLE A VDI DEPLOYMENT VALUES HIGH VM-TO-GPU CONSOLIDATION RATIOS (E.G., MULTIPLEXING) WHILE A CONSUMER RUNNING A VM TO ACCESS A GAME OR CAD APPLICATION UNAVAILABLE ON HIS HOST VALUES PERFORMANCE AND LIKELY FIDELITY. A TECH SUPPORT PERSON MAINTAINING A LIBRARY OF DIFFERENT CONFIGURATIONS AND AN IT ADMINISTRATOR RUNNING SERVER VMS ARE BOTH LIKELY TO VALUE PORTABILITY AND SECURE ISOLATION (INTERPOSITION).
  • 17. GPU VIRTUALIZATION TAXONOMY • FRONT-END VIRTUALIZATION - FRONT-END VIRTUALIZATION INTRODUCES A VIRTUALIZATION BOUNDARY AT A RELATIVELY HIGH LEVEL IN THE STACK, AND RUNS THE GRAPHICS DRIVER IN THE HOST/HYPERVISOR. THIS APPROACH DOES NOT RELY ON ANY GPU VENDOR- OR MODEL- SPECIFIC DE- TAILS. • BACK-END VIRTUALIZATION -THE MOST OBVIOUS BACK-END VIRTUALIZATION TECHNIQUE IS FIXED PASS- THROUGH: THE PERMANENT ASSOCIATION OF A VIRTUAL MACHINE WITH FULL EXCLUSIVE ACCESS TO A PHYSICAL GPU. RECENT CHIPSET FEATURES, SUCH AS INTEL’S VT-D, MAKE FIXED PASS-THROUGH PRACTICAL WITHOUT REQUIRING ANY SPECIAL KNOWLEDGE OF A GPU’S PROGRAMMING INTERFACES. HOWEVER, FIXED PASS-THROUGH IS NOT A GENERAL SOLUTION. IT COMPLETELY FORGOES ANY MULTIPLEXING AND PACKING MACHINES WITH ONE GPU PER VIRTUAL MACHINE (PLUS ONE FOR THE HOST) IS NOT FEASIBLE.