SlideShare uma empresa Scribd logo
1 de 28
Baixar para ler offline
Shinnosuke Furuya, Ph.D., HPC Developer Relations, NVIDIA
2021/08/26
組み込みから HPC まで
ARM コアで実現するエコシステム
2
Automotive
NVIDIA Drive
HPC
NVIDIA Grace CPU
Network
NVIDIA BlueField DPU
AGENDA
3
“World’s Best Performing CEO”
HARVARD BUSINESS REVIEW
“100 Best Companies to Work For”
FORTUNE
“Best Places to Work in 2021”
GLASSDOOR
“World’s Best CEOs”
BARRON’S
“50 Smartest Companies”
MIT TECH REVIEW
“Most Innovative Companies”
FAST COMPANY
Founded in 1993 Jensen Huang, Founder & CEO 19,000 Employees $16.7B in FY21
4
AUTOMOTIVE
5
TEGRA SOC GENERATIONS
Tegra K1 Tegra X1 Parker
CPU Arm Cortex A15 (4 cores) Arm Cortex A57 (4 cores)
NVIDIA Denver 2 (2 cores)
Arm Cortex A57 (4 cores)
GPU Kepler (192 cores) Maxwell (256 cores) Pascal (256 cores)
Products
NVIDIA SHIELD Tablet
NVIDIA Jetson TK1
Google Chromebook
Nintendo Switch
Nintendo Switch Lite
NVIDIA SHIELD TV
NVIDIA Jetson TX1
NVIDIA Jetson Nano
Tesla
Mercedes Benz
Magic Leap 1
NVIDIA Jetson TX2
Xavier Orin Atlan
CPU NVIDIA Carmel (8 cores) Arm Cortex A78AE (12 cores) new Arm
GPU Volta (384 cores) next generation next generation
Products
Toyota
NVIDIA Jetson AGX Xavier
NVIDIA Jetson Xavier NX
NVIDIA DRIVE Pegasus
Mercedes Benz
Volvo
(many automotive companies)
6
NVIDIA ORIN
24.5 billion transistors
12 A78 (Hercules) ARM64 CPUs
254 INT8 TOPS - CUDA Tensor Core GPU + DLA
205 GB/s memory bandwidth
4 10Gbps ENET
8K 30 Dec | 4K 60 Enc – H264 / H265 / VP9
4 R52 Lock-step Pairs Integrated Safety Island ASIL-D
Secure key storage
FUSA ASIL-B Chip | ASIL-D Systematic
Advanced, Software-defined Platform
for Autonomous Machines
7
NVIDIA DRIVE ATLAN
Fusing Next Generation AI and BlueField
Industry’s First 1,000 TOPS SoC
400 Gbps Networking with Secure Gateway
ASIL-D Safety Island
TOPS is the New Horsepower
8
HYPERION 8
AV PLATFORM
2x Orin AV Computer
1x Orin IX Computer
4x Orin + 4x MLNX 3D GT Data Recorder
Sensor Suite: 8 Cameras [8MP], 4 Fisheyes [3MP], 3 In-
Cabin, 9 Radar, 2 Lidar
State-of-the-Art Advances for Data Collection,
Development and Testing
9
THE FUTURE CAR IS SOFTWARE DEFINED
10
NVIDIA DRIVE AV
11
HPC
12
GIANT MODELS PUSHING LIMITS OF EXISTING ARCHITECTURE
Requires a New Architecture
GPU 8,000 GB/sec
CPU 200 GB/sec
PCIE Gen4 (Effective Per GPU) 16 GB/sec
Mem-to-GPU 64 GB/sec
System Bandwidth Bottleneck
DDR4 HBM2e
GPU
GPU
GPU
GPU
x86
ELMo (94M)
BERT-Large
(340M)
GPT-2
(1.5B)
Megatron-LM
(8.3B)
T5 (11B)
Turing-NLG
(17.2B)
GPT-3 (175B)
0.00001
0.0001
0.001
0.01
0.1
1
10
100
1000
2018 2019 2020 2021 2022 2023
Model
Size
(Trillions
of
Parameters)
100 TRILLION PARAMETER MODELS BY 2023
13
ANNOUNCING NVIDIA GRACE
Breakthrough CPU Designed for Giant-Scale AI and HPC Applications
FASTEST INTERCONNECTS
>900 GB/s Cache Coherent NVLink CPU To GPU (14x)
>600GB/s CPU To CPU (2x)
NEXT GENERATION ARM NEOVERSE CORES
>300 SPECrate2017_int_base est.
Availability 2023
HIGHEST MEMORY BANDWIDTH
>500GB/s LPDDR5x w/ ECC
>2x Higher B/W
10x Higher Energy Efficiency
14
TURBOCHARGED TERABYTE SCALE ACCELERATED COMPUTING
CURRENT x86 ARCHITECTURE
DDR4 HBM2e
Evolving Architecture For New Workloads
INTEGRATED CPU-GPU ARCHITECTURE
LPDDR5x HBM2e
3 DAYS FROM 1 MONTH
Fine-Tune Training of 1T Model
REAL-TIME INFERENCE
ON 0.5T MODEL
Interactive Single Node NLP Inference
GPU
GPU
GPU
GPU
GRACE
GRACE
GRACE
GRACE
GPU
GPU
GPU
GPU
x86
Transfer 2TB in 30 secs Transfer 2TB in 1 secs
GPU 8,000 GB/sec
CPU 200 GB/sec
PCIE Gen4
(Effective Per GPU)
16 GB/sec
Mem-to-GPU 64 GB/sec
GPU 8,000 GB/sec
CPU 500 GB/sec
NVLink 500 GB/sec
Mem-to-GPU 2000 GB/sec
Bandwidth claims rounded to nearest hundred for illustration.
Performance results based on projections on these configurations Grace : 8xGrace and 8xA100 with 4th Gen NVIDIA NVLink Connection between CPU and GPU and x86: DGX A100.
Training: 1 Month of training is Fine-Tuning a 1T parameter model on a large custom data set on 64xGrace+64xA100 compared to 8xDGXA100 (16xX86+64xA100)
Inference: 530B Parameter model on 8xGrace+8xA100 compared to DGXA100.
15
ANNOUNCING
THE WORLD’S FASTEST
SUPERCOMPUTER FOR AI
20 Exaflops of AI
Accelerated w/ NVIDIA Grace CPU and NVIDIA GPU
HPC and AI For Scientific and Commercial Apps
Advance Weather, Climate, and Material Science
16
NETWORK
17
INTRODUCING NVIDIA BLUEFIELD-3 DPU
First 400Gb/s Data Processing Unit
Offloads and Accelerates Data Center Infrastructure
Isolates Application from Control and Management Plane
Powerful CPU – 16x Arm A78 Cores
Datapath Accelerator – 16x Cores, 256 Threads
Process Networking, Storage, and Security at 400 Gbps
18
INTRODUCING NVIDIA BLUEFIELD-3 DPU
First 400Gb/s Data Processing Unit
22 Billion Transistors
400Gb/s Ethernet & InfiniBand Connectivity
400Gb/s Crypto Acceleration
18M IOP/s Elastic Block Storage
300 Equivalent x86 Cores
CONNECTX-7
DATA PATH ACCELERATOR
PCIe GEN 5.0
DDR5 MEMORY INTERFACE
ARM CORES
ACCELERATION
ENGINES
19
BLUEFIELD DPU GENERATIONS
BlueField BlueField-2 BlueField-3
Port speed
2 x 100Gb/s
InfiniBand and Ethernet
2 x 100Gb/s, 1 x 200Gb/s
InfiniBand and Ethernet
1 x 400Gb/s, 2x200Gb/s
InfiniBand and Ethernet
Performance
Bandwidth: 200Gb/s
DPDK Max Msg Rate:150Mpps
RDMA max msg rate: 200Mpps
Bandwidth: 200Gb/s
DPDK Max Msg Rate: 215Mpps
RDMA max msg rate: 215Mpps
Bandwidth: 400Gb/s
DPDK max msg rate: 250Mpps
RDMA max msg rate: 330Mpps
Modulation NRZ NRZ & 50G PAM4 NRZ & 100G PAM4
DDR Channels DDR4-2400MT/s Dual channels DDR4-3200MT/s Single channel 2 x DDR5-5600 Interfaces
Max Arm Cores 16 x A72 Arm cores 8 x A72 Arm cores 16 x A78 Arm cores (Hercules)
Embedded ASIC ConnectX-5 ConnectX-6 Dx ConnectX-7
PCIe Gen3.0 x32 / Gen4.0 x16 Gen4.0 x16 Gen5.0 x32
20
NVIDIA DOCA
Enabling Broad BlueField Partner Ecosystem
Software Development Framework for BlueField DPUs
Offload, Accelerate, and Isolate Infrastructure Processing
Support for Hyperscale, Enterprise, Supercomputing and
Hyperconverged Infrastructure
Software Compatibility for Generations of BlueField DPUs
DOCA is for DPUs what CUDA is for GPUs
CYBER
SECURITY
EDGE
STORAGE
PLATFORM
INFRASTRUCTURE
ORCHESTRATION
MANAGEMENT
TELEMETRY
SECURITY NETWORKING STORAGE
ACCELERATION LIBRARIES
DOCA
21
BLUEFIELD-3 USE CASES
Unprecedented Innovation for Modern Data Centers
Cloud Computing
Bare-Metal I Virtualized I Containerized
Private I Public I Hybrid Cloud
Cyber Security
Distributed Security | NGFW I
Micro-segmentation
HPC & AI
Cloud-Native Supercomputing |
Accelerated DLRM
Telco & Edge
Telco Cloud | CloudRAN |
Edge Compute
Media Streaming
Visual High Quality I
8K Video I CDN
Data Storage
HCI I Elastic Block Storage I
Instance Storage
22
BLUEFIELD ENABLES
CLOUD-NATIVE
SUPERCOMPUTING
Collective offload with UCC accelerator
Smart MPI progression
User-defined algorithms
1.4X higher application performance
Multi-Tenancy with Zero-Trust Security
23
NVIDIA DPU ROADMAP
Exponential Growth in Data Center Infrastructure Processing
2020 2022
1X
10X
100X
BlueField-2
7B Transistors
9 SPECint
0.7 TOPS
200 Gbps
BlueField-3
22B Transistors
42 SPECint
1.5 TOPS
400 Gbps
BlueField-4
64B Transistors
160 SPECint
1000 TOPS
800 Gbps
2024
DOCA — ONE ARCHITECTURE
* BlueField-4 product to include opt-in GPU and non-GPU configurations
24
SUMMARY
25
3 CHIPS. YEARLY LEAPS. ONE ARCHITECTURE.
26
MEGATRON cuQUANTUM MORPHEUS
MERLIN
MAXINE CLARA METROPOLIS ISAAC DRIVE
AERIAL
APPLICATION FRAMEWORKS
CHIPS & SYSTEMS
PLATFORM SOFTWARE
NVIDIA ECOSYSTEM PLATFORM
AGX
DGX
RTX EGX
HGX
FLEET COMMAND
OMNIVERSE AI ENTERPRISE
VGPU
GPU CPU DPU
GEFORCE RIVA
27
SUMMARY
• Tegra SoC has a long history, and that experience has been applied to current Xavier, the next generation Orin, and
the next generation Atlan
• The future car is software defined, and NVIDIA provide whole ecosystem such as DRIVE Hyperion and DGX Systems
• Grace CPU is designed for giant-scale AI and HPC applications
• BlueField-3 DPU is the first 400 Gb/s data processing unit
• DOCA enables broad BlueField ecosystem
• GPU, CPU and DPU chips make a yearly leaps in one architecture
組み込みから HPC まで ARM コアで実現するエコシステム

Mais conteúdo relacionado

Mais procurados

AMD: Where Gaming Begins
AMD: Where Gaming BeginsAMD: Where Gaming Begins
AMD: Where Gaming BeginsAMD
 
Deep learning: Hardware Landscape
Deep learning: Hardware LandscapeDeep learning: Hardware Landscape
Deep learning: Hardware LandscapeGrigory Sapunov
 
3D V-Cache
3D V-Cache 3D V-Cache
3D V-Cache AMD
 
AI Hardware Landscape 2021
AI Hardware Landscape 2021AI Hardware Landscape 2021
AI Hardware Landscape 2021Grigory Sapunov
 
DPDK IPSec performance benchmark ~ Georgii Tkachuk
DPDK IPSec performance benchmark ~ Georgii TkachukDPDK IPSec performance benchmark ~ Georgii Tkachuk
DPDK IPSec performance benchmark ~ Georgii TkachukIntel
 
Deep Dive on Amazon EC2 instances
Deep Dive on Amazon EC2 instancesDeep Dive on Amazon EC2 instances
Deep Dive on Amazon EC2 instancesAmazon Web Services
 
"Embedded Vision Made Smart: Introduction to the HALCON Embedded Machine Visi...
"Embedded Vision Made Smart: Introduction to the HALCON Embedded Machine Visi..."Embedded Vision Made Smart: Introduction to the HALCON Embedded Machine Visi...
"Embedded Vision Made Smart: Introduction to the HALCON Embedded Machine Visi...Edge AI and Vision Alliance
 
BUD17-209: Reliability, Availability, and Serviceability (RAS) on ARM64
BUD17-209: Reliability, Availability, and Serviceability (RAS) on ARM64 BUD17-209: Reliability, Availability, and Serviceability (RAS) on ARM64
BUD17-209: Reliability, Availability, and Serviceability (RAS) on ARM64 Linaro
 
Building the World's Largest GPU
Building the World's Largest GPUBuilding the World's Largest GPU
Building the World's Largest GPURenee Yao
 
Revisit DCA, PCIe TPH and DDIO
Revisit DCA, PCIe TPH and DDIORevisit DCA, PCIe TPH and DDIO
Revisit DCA, PCIe TPH and DDIOHisaki Ohara
 
Snapdragon SoC and ARMv7 Architecture
Snapdragon SoC and ARMv7 ArchitectureSnapdragon SoC and ARMv7 Architecture
Snapdragon SoC and ARMv7 ArchitectureSantosh Verma
 
“Zen 3”: AMD 2nd Generation 7nm x86-64 Microprocessor Core
“Zen 3”: AMD 2nd Generation 7nm x86-64 Microprocessor Core“Zen 3”: AMD 2nd Generation 7nm x86-64 Microprocessor Core
“Zen 3”: AMD 2nd Generation 7nm x86-64 Microprocessor CoreAMD
 
AMD EPYC™ Microprocessor Architecture
AMD EPYC™ Microprocessor ArchitectureAMD EPYC™ Microprocessor Architecture
AMD EPYC™ Microprocessor ArchitectureAMD
 
GTC 2018 で発表された自動運転最新情報
GTC 2018 で発表された自動運転最新情報GTC 2018 で発表された自動運転最新情報
GTC 2018 で発表された自動運転最新情報NVIDIA Japan
 
Overview of standardisation status and 3GPP technology evolution trend
Overview of standardisation status and 3GPP technology evolution trendOverview of standardisation status and 3GPP technology evolution trend
Overview of standardisation status and 3GPP technology evolution trend3G4G
 
The Path to "Zen 2"
The Path to "Zen 2"The Path to "Zen 2"
The Path to "Zen 2"AMD
 
Machine Learning on Your Hand - Introduction to Tensorflow Lite Preview
Machine Learning on Your Hand - Introduction to Tensorflow Lite PreviewMachine Learning on Your Hand - Introduction to Tensorflow Lite Preview
Machine Learning on Your Hand - Introduction to Tensorflow Lite PreviewModulabs
 
Secure Boot on ARM systems – Building a complete Chain of Trust upon existing...
Secure Boot on ARM systems – Building a complete Chain of Trust upon existing...Secure Boot on ARM systems – Building a complete Chain of Trust upon existing...
Secure Boot on ARM systems – Building a complete Chain of Trust upon existing...Linaro
 

Mais procurados (20)

AMD: Where Gaming Begins
AMD: Where Gaming BeginsAMD: Where Gaming Begins
AMD: Where Gaming Begins
 
Deep learning: Hardware Landscape
Deep learning: Hardware LandscapeDeep learning: Hardware Landscape
Deep learning: Hardware Landscape
 
3D V-Cache
3D V-Cache 3D V-Cache
3D V-Cache
 
AI Hardware Landscape 2021
AI Hardware Landscape 2021AI Hardware Landscape 2021
AI Hardware Landscape 2021
 
DPDK IPSec performance benchmark ~ Georgii Tkachuk
DPDK IPSec performance benchmark ~ Georgii TkachukDPDK IPSec performance benchmark ~ Georgii Tkachuk
DPDK IPSec performance benchmark ~ Georgii Tkachuk
 
Deep Dive on Amazon EC2 instances
Deep Dive on Amazon EC2 instancesDeep Dive on Amazon EC2 instances
Deep Dive on Amazon EC2 instances
 
"Embedded Vision Made Smart: Introduction to the HALCON Embedded Machine Visi...
"Embedded Vision Made Smart: Introduction to the HALCON Embedded Machine Visi..."Embedded Vision Made Smart: Introduction to the HALCON Embedded Machine Visi...
"Embedded Vision Made Smart: Introduction to the HALCON Embedded Machine Visi...
 
BUD17-209: Reliability, Availability, and Serviceability (RAS) on ARM64
BUD17-209: Reliability, Availability, and Serviceability (RAS) on ARM64 BUD17-209: Reliability, Availability, and Serviceability (RAS) on ARM64
BUD17-209: Reliability, Availability, and Serviceability (RAS) on ARM64
 
Building the World's Largest GPU
Building the World's Largest GPUBuilding the World's Largest GPU
Building the World's Largest GPU
 
Revisit DCA, PCIe TPH and DDIO
Revisit DCA, PCIe TPH and DDIORevisit DCA, PCIe TPH and DDIO
Revisit DCA, PCIe TPH and DDIO
 
Snapdragon SoC and ARMv7 Architecture
Snapdragon SoC and ARMv7 ArchitectureSnapdragon SoC and ARMv7 Architecture
Snapdragon SoC and ARMv7 Architecture
 
“Zen 3”: AMD 2nd Generation 7nm x86-64 Microprocessor Core
“Zen 3”: AMD 2nd Generation 7nm x86-64 Microprocessor Core“Zen 3”: AMD 2nd Generation 7nm x86-64 Microprocessor Core
“Zen 3”: AMD 2nd Generation 7nm x86-64 Microprocessor Core
 
AMD EPYC™ Microprocessor Architecture
AMD EPYC™ Microprocessor ArchitectureAMD EPYC™ Microprocessor Architecture
AMD EPYC™ Microprocessor Architecture
 
GTC 2018 で発表された自動運転最新情報
GTC 2018 で発表された自動運転最新情報GTC 2018 で発表された自動運転最新情報
GTC 2018 で発表された自動運転最新情報
 
Overview of standardisation status and 3GPP technology evolution trend
Overview of standardisation status and 3GPP technology evolution trendOverview of standardisation status and 3GPP technology evolution trend
Overview of standardisation status and 3GPP technology evolution trend
 
The Path to "Zen 2"
The Path to "Zen 2"The Path to "Zen 2"
The Path to "Zen 2"
 
Understanding DPDK
Understanding DPDKUnderstanding DPDK
Understanding DPDK
 
Machine Learning on Your Hand - Introduction to Tensorflow Lite Preview
Machine Learning on Your Hand - Introduction to Tensorflow Lite PreviewMachine Learning on Your Hand - Introduction to Tensorflow Lite Preview
Machine Learning on Your Hand - Introduction to Tensorflow Lite Preview
 
Secure Boot on ARM systems – Building a complete Chain of Trust upon existing...
Secure Boot on ARM systems – Building a complete Chain of Trust upon existing...Secure Boot on ARM systems – Building a complete Chain of Trust upon existing...
Secure Boot on ARM systems – Building a complete Chain of Trust upon existing...
 
GTC 2022 Keynote
GTC 2022 KeynoteGTC 2022 Keynote
GTC 2022 Keynote
 

Semelhante a 組み込みから HPC まで ARM コアで実現するエコシステム

Harnessing the virtual realm for successful real world artificial intelligence
Harnessing the virtual realm for successful real world artificial intelligenceHarnessing the virtual realm for successful real world artificial intelligence
Harnessing the virtual realm for successful real world artificial intelligenceAlison B. Lowndes
 
NVIDIA DGX User Group 1st Meet Up_30 Apr 2021.pdf
NVIDIA DGX User Group 1st Meet Up_30 Apr 2021.pdfNVIDIA DGX User Group 1st Meet Up_30 Apr 2021.pdf
NVIDIA DGX User Group 1st Meet Up_30 Apr 2021.pdfMuhammadAbdullah311866
 
Backend.AI Technical Introduction (19.09 / 2019 Autumn)
Backend.AI Technical Introduction (19.09 / 2019 Autumn)Backend.AI Technical Introduction (19.09 / 2019 Autumn)
Backend.AI Technical Introduction (19.09 / 2019 Autumn)Lablup Inc.
 
Webinar: NVIDIA JETSON – A Inteligência Artificial na palma de sua mão
Webinar: NVIDIA JETSON – A Inteligência Artificial na palma de sua mãoWebinar: NVIDIA JETSON – A Inteligência Artificial na palma de sua mão
Webinar: NVIDIA JETSON – A Inteligência Artificial na palma de sua mãoEmbarcados
 
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019NVIDIA
 
Dell NVIDIA AI Powered Transformation Webinar
Dell NVIDIA AI Powered Transformation WebinarDell NVIDIA AI Powered Transformation Webinar
Dell NVIDIA AI Powered Transformation WebinarBill Wong
 
Hardware & Software Platforms for HPC, AI and ML
Hardware & Software Platforms for HPC, AI and MLHardware & Software Platforms for HPC, AI and ML
Hardware & Software Platforms for HPC, AI and MLinside-BigData.com
 
BladeCenter GPU Expansion Blade (BGE) - Client Presentation
BladeCenter GPU Expansion Blade (BGE) - Client PresentationBladeCenter GPU Expansion Blade (BGE) - Client Presentation
BladeCenter GPU Expansion Blade (BGE) - Client PresentationCliff Kinard
 
Tesla Accelerated Computing Platform
Tesla Accelerated Computing PlatformTesla Accelerated Computing Platform
Tesla Accelerated Computing Platforminside-BigData.com
 
VEDLIoT at FPL'23_Accelerators for Heterogenous Computing in AIoT
VEDLIoT at FPL'23_Accelerators for Heterogenous Computing in AIoTVEDLIoT at FPL'23_Accelerators for Heterogenous Computing in AIoT
VEDLIoT at FPL'23_Accelerators for Heterogenous Computing in AIoTVEDLIoT Project
 
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super AffordableSupermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super AffordableRebekah Rodriguez
 
Supermicro’s Universal GPU: Modular, Standards Based and Built for the Future
Supermicro’s Universal GPU: Modular, Standards Based and Built for the FutureSupermicro’s Universal GPU: Modular, Standards Based and Built for the Future
Supermicro’s Universal GPU: Modular, Standards Based and Built for the FutureRebekah Rodriguez
 
GTC 2016 Opening Keynote
GTC 2016 Opening KeynoteGTC 2016 Opening Keynote
GTC 2016 Opening KeynoteNVIDIA
 
Evolution of Supermicro GPU Server Solution
Evolution of Supermicro GPU Server SolutionEvolution of Supermicro GPU Server Solution
Evolution of Supermicro GPU Server SolutionNVIDIA Taiwan
 

Semelhante a 組み込みから HPC まで ARM コアで実現するエコシステム (20)

Harnessing the virtual realm for successful real world artificial intelligence
Harnessing the virtual realm for successful real world artificial intelligenceHarnessing the virtual realm for successful real world artificial intelligence
Harnessing the virtual realm for successful real world artificial intelligence
 
NVIDIA DGX User Group 1st Meet Up_30 Apr 2021.pdf
NVIDIA DGX User Group 1st Meet Up_30 Apr 2021.pdfNVIDIA DGX User Group 1st Meet Up_30 Apr 2021.pdf
NVIDIA DGX User Group 1st Meet Up_30 Apr 2021.pdf
 
Hardware in Space
Hardware in SpaceHardware in Space
Hardware in Space
 
Backend.AI Technical Introduction (19.09 / 2019 Autumn)
Backend.AI Technical Introduction (19.09 / 2019 Autumn)Backend.AI Technical Introduction (19.09 / 2019 Autumn)
Backend.AI Technical Introduction (19.09 / 2019 Autumn)
 
Webinar: NVIDIA JETSON – A Inteligência Artificial na palma de sua mão
Webinar: NVIDIA JETSON – A Inteligência Artificial na palma de sua mãoWebinar: NVIDIA JETSON – A Inteligência Artificial na palma de sua mão
Webinar: NVIDIA JETSON – A Inteligência Artificial na palma de sua mão
 
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
 
Dell NVIDIA AI Powered Transformation Webinar
Dell NVIDIA AI Powered Transformation WebinarDell NVIDIA AI Powered Transformation Webinar
Dell NVIDIA AI Powered Transformation Webinar
 
Hardware & Software Platforms for HPC, AI and ML
Hardware & Software Platforms for HPC, AI and MLHardware & Software Platforms for HPC, AI and ML
Hardware & Software Platforms for HPC, AI and ML
 
BladeCenter GPU Expansion Blade (BGE) - Client Presentation
BladeCenter GPU Expansion Blade (BGE) - Client PresentationBladeCenter GPU Expansion Blade (BGE) - Client Presentation
BladeCenter GPU Expansion Blade (BGE) - Client Presentation
 
Tesla Accelerated Computing Platform
Tesla Accelerated Computing PlatformTesla Accelerated Computing Platform
Tesla Accelerated Computing Platform
 
GIST AI-X Computing Cluster
GIST AI-X Computing ClusterGIST AI-X Computing Cluster
GIST AI-X Computing Cluster
 
Advances in GPU Computing
Advances in GPU ComputingAdvances in GPU Computing
Advances in GPU Computing
 
VEDLIoT at FPL'23_Accelerators for Heterogenous Computing in AIoT
VEDLIoT at FPL'23_Accelerators for Heterogenous Computing in AIoTVEDLIoT at FPL'23_Accelerators for Heterogenous Computing in AIoT
VEDLIoT at FPL'23_Accelerators for Heterogenous Computing in AIoT
 
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super AffordableSupermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable
 
NVIDIA Keynote #GTC21
NVIDIA Keynote #GTC21 NVIDIA Keynote #GTC21
NVIDIA Keynote #GTC21
 
Supermicro’s Universal GPU: Modular, Standards Based and Built for the Future
Supermicro’s Universal GPU: Modular, Standards Based and Built for the FutureSupermicro’s Universal GPU: Modular, Standards Based and Built for the Future
Supermicro’s Universal GPU: Modular, Standards Based and Built for the Future
 
GTC 2016 Opening Keynote
GTC 2016 Opening KeynoteGTC 2016 Opening Keynote
GTC 2016 Opening Keynote
 
NWU and HPC
NWU and HPCNWU and HPC
NWU and HPC
 
Nvidia at SEMICon, Munich
Nvidia at SEMICon, MunichNvidia at SEMICon, Munich
Nvidia at SEMICon, Munich
 
Evolution of Supermicro GPU Server Solution
Evolution of Supermicro GPU Server SolutionEvolution of Supermicro GPU Server Solution
Evolution of Supermicro GPU Server Solution
 

Último

UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduitsrknatarajan
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college projectTonystark477637
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 

Último (20)

★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 

組み込みから HPC まで ARM コアで実現するエコシステム

  • 1. Shinnosuke Furuya, Ph.D., HPC Developer Relations, NVIDIA 2021/08/26 組み込みから HPC まで ARM コアで実現するエコシステム
  • 2. 2 Automotive NVIDIA Drive HPC NVIDIA Grace CPU Network NVIDIA BlueField DPU AGENDA
  • 3. 3 “World’s Best Performing CEO” HARVARD BUSINESS REVIEW “100 Best Companies to Work For” FORTUNE “Best Places to Work in 2021” GLASSDOOR “World’s Best CEOs” BARRON’S “50 Smartest Companies” MIT TECH REVIEW “Most Innovative Companies” FAST COMPANY Founded in 1993 Jensen Huang, Founder & CEO 19,000 Employees $16.7B in FY21
  • 5. 5 TEGRA SOC GENERATIONS Tegra K1 Tegra X1 Parker CPU Arm Cortex A15 (4 cores) Arm Cortex A57 (4 cores) NVIDIA Denver 2 (2 cores) Arm Cortex A57 (4 cores) GPU Kepler (192 cores) Maxwell (256 cores) Pascal (256 cores) Products NVIDIA SHIELD Tablet NVIDIA Jetson TK1 Google Chromebook Nintendo Switch Nintendo Switch Lite NVIDIA SHIELD TV NVIDIA Jetson TX1 NVIDIA Jetson Nano Tesla Mercedes Benz Magic Leap 1 NVIDIA Jetson TX2 Xavier Orin Atlan CPU NVIDIA Carmel (8 cores) Arm Cortex A78AE (12 cores) new Arm GPU Volta (384 cores) next generation next generation Products Toyota NVIDIA Jetson AGX Xavier NVIDIA Jetson Xavier NX NVIDIA DRIVE Pegasus Mercedes Benz Volvo (many automotive companies)
  • 6. 6 NVIDIA ORIN 24.5 billion transistors 12 A78 (Hercules) ARM64 CPUs 254 INT8 TOPS - CUDA Tensor Core GPU + DLA 205 GB/s memory bandwidth 4 10Gbps ENET 8K 30 Dec | 4K 60 Enc – H264 / H265 / VP9 4 R52 Lock-step Pairs Integrated Safety Island ASIL-D Secure key storage FUSA ASIL-B Chip | ASIL-D Systematic Advanced, Software-defined Platform for Autonomous Machines
  • 7. 7 NVIDIA DRIVE ATLAN Fusing Next Generation AI and BlueField Industry’s First 1,000 TOPS SoC 400 Gbps Networking with Secure Gateway ASIL-D Safety Island TOPS is the New Horsepower
  • 8. 8 HYPERION 8 AV PLATFORM 2x Orin AV Computer 1x Orin IX Computer 4x Orin + 4x MLNX 3D GT Data Recorder Sensor Suite: 8 Cameras [8MP], 4 Fisheyes [3MP], 3 In- Cabin, 9 Radar, 2 Lidar State-of-the-Art Advances for Data Collection, Development and Testing
  • 9. 9 THE FUTURE CAR IS SOFTWARE DEFINED
  • 12. 12 GIANT MODELS PUSHING LIMITS OF EXISTING ARCHITECTURE Requires a New Architecture GPU 8,000 GB/sec CPU 200 GB/sec PCIE Gen4 (Effective Per GPU) 16 GB/sec Mem-to-GPU 64 GB/sec System Bandwidth Bottleneck DDR4 HBM2e GPU GPU GPU GPU x86 ELMo (94M) BERT-Large (340M) GPT-2 (1.5B) Megatron-LM (8.3B) T5 (11B) Turing-NLG (17.2B) GPT-3 (175B) 0.00001 0.0001 0.001 0.01 0.1 1 10 100 1000 2018 2019 2020 2021 2022 2023 Model Size (Trillions of Parameters) 100 TRILLION PARAMETER MODELS BY 2023
  • 13. 13 ANNOUNCING NVIDIA GRACE Breakthrough CPU Designed for Giant-Scale AI and HPC Applications FASTEST INTERCONNECTS >900 GB/s Cache Coherent NVLink CPU To GPU (14x) >600GB/s CPU To CPU (2x) NEXT GENERATION ARM NEOVERSE CORES >300 SPECrate2017_int_base est. Availability 2023 HIGHEST MEMORY BANDWIDTH >500GB/s LPDDR5x w/ ECC >2x Higher B/W 10x Higher Energy Efficiency
  • 14. 14 TURBOCHARGED TERABYTE SCALE ACCELERATED COMPUTING CURRENT x86 ARCHITECTURE DDR4 HBM2e Evolving Architecture For New Workloads INTEGRATED CPU-GPU ARCHITECTURE LPDDR5x HBM2e 3 DAYS FROM 1 MONTH Fine-Tune Training of 1T Model REAL-TIME INFERENCE ON 0.5T MODEL Interactive Single Node NLP Inference GPU GPU GPU GPU GRACE GRACE GRACE GRACE GPU GPU GPU GPU x86 Transfer 2TB in 30 secs Transfer 2TB in 1 secs GPU 8,000 GB/sec CPU 200 GB/sec PCIE Gen4 (Effective Per GPU) 16 GB/sec Mem-to-GPU 64 GB/sec GPU 8,000 GB/sec CPU 500 GB/sec NVLink 500 GB/sec Mem-to-GPU 2000 GB/sec Bandwidth claims rounded to nearest hundred for illustration. Performance results based on projections on these configurations Grace : 8xGrace and 8xA100 with 4th Gen NVIDIA NVLink Connection between CPU and GPU and x86: DGX A100. Training: 1 Month of training is Fine-Tuning a 1T parameter model on a large custom data set on 64xGrace+64xA100 compared to 8xDGXA100 (16xX86+64xA100) Inference: 530B Parameter model on 8xGrace+8xA100 compared to DGXA100.
  • 15. 15 ANNOUNCING THE WORLD’S FASTEST SUPERCOMPUTER FOR AI 20 Exaflops of AI Accelerated w/ NVIDIA Grace CPU and NVIDIA GPU HPC and AI For Scientific and Commercial Apps Advance Weather, Climate, and Material Science
  • 17. 17 INTRODUCING NVIDIA BLUEFIELD-3 DPU First 400Gb/s Data Processing Unit Offloads and Accelerates Data Center Infrastructure Isolates Application from Control and Management Plane Powerful CPU – 16x Arm A78 Cores Datapath Accelerator – 16x Cores, 256 Threads Process Networking, Storage, and Security at 400 Gbps
  • 18. 18 INTRODUCING NVIDIA BLUEFIELD-3 DPU First 400Gb/s Data Processing Unit 22 Billion Transistors 400Gb/s Ethernet & InfiniBand Connectivity 400Gb/s Crypto Acceleration 18M IOP/s Elastic Block Storage 300 Equivalent x86 Cores CONNECTX-7 DATA PATH ACCELERATOR PCIe GEN 5.0 DDR5 MEMORY INTERFACE ARM CORES ACCELERATION ENGINES
  • 19. 19 BLUEFIELD DPU GENERATIONS BlueField BlueField-2 BlueField-3 Port speed 2 x 100Gb/s InfiniBand and Ethernet 2 x 100Gb/s, 1 x 200Gb/s InfiniBand and Ethernet 1 x 400Gb/s, 2x200Gb/s InfiniBand and Ethernet Performance Bandwidth: 200Gb/s DPDK Max Msg Rate:150Mpps RDMA max msg rate: 200Mpps Bandwidth: 200Gb/s DPDK Max Msg Rate: 215Mpps RDMA max msg rate: 215Mpps Bandwidth: 400Gb/s DPDK max msg rate: 250Mpps RDMA max msg rate: 330Mpps Modulation NRZ NRZ & 50G PAM4 NRZ & 100G PAM4 DDR Channels DDR4-2400MT/s Dual channels DDR4-3200MT/s Single channel 2 x DDR5-5600 Interfaces Max Arm Cores 16 x A72 Arm cores 8 x A72 Arm cores 16 x A78 Arm cores (Hercules) Embedded ASIC ConnectX-5 ConnectX-6 Dx ConnectX-7 PCIe Gen3.0 x32 / Gen4.0 x16 Gen4.0 x16 Gen5.0 x32
  • 20. 20 NVIDIA DOCA Enabling Broad BlueField Partner Ecosystem Software Development Framework for BlueField DPUs Offload, Accelerate, and Isolate Infrastructure Processing Support for Hyperscale, Enterprise, Supercomputing and Hyperconverged Infrastructure Software Compatibility for Generations of BlueField DPUs DOCA is for DPUs what CUDA is for GPUs CYBER SECURITY EDGE STORAGE PLATFORM INFRASTRUCTURE ORCHESTRATION MANAGEMENT TELEMETRY SECURITY NETWORKING STORAGE ACCELERATION LIBRARIES DOCA
  • 21. 21 BLUEFIELD-3 USE CASES Unprecedented Innovation for Modern Data Centers Cloud Computing Bare-Metal I Virtualized I Containerized Private I Public I Hybrid Cloud Cyber Security Distributed Security | NGFW I Micro-segmentation HPC & AI Cloud-Native Supercomputing | Accelerated DLRM Telco & Edge Telco Cloud | CloudRAN | Edge Compute Media Streaming Visual High Quality I 8K Video I CDN Data Storage HCI I Elastic Block Storage I Instance Storage
  • 22. 22 BLUEFIELD ENABLES CLOUD-NATIVE SUPERCOMPUTING Collective offload with UCC accelerator Smart MPI progression User-defined algorithms 1.4X higher application performance Multi-Tenancy with Zero-Trust Security
  • 23. 23 NVIDIA DPU ROADMAP Exponential Growth in Data Center Infrastructure Processing 2020 2022 1X 10X 100X BlueField-2 7B Transistors 9 SPECint 0.7 TOPS 200 Gbps BlueField-3 22B Transistors 42 SPECint 1.5 TOPS 400 Gbps BlueField-4 64B Transistors 160 SPECint 1000 TOPS 800 Gbps 2024 DOCA — ONE ARCHITECTURE * BlueField-4 product to include opt-in GPU and non-GPU configurations
  • 25. 25 3 CHIPS. YEARLY LEAPS. ONE ARCHITECTURE.
  • 26. 26 MEGATRON cuQUANTUM MORPHEUS MERLIN MAXINE CLARA METROPOLIS ISAAC DRIVE AERIAL APPLICATION FRAMEWORKS CHIPS & SYSTEMS PLATFORM SOFTWARE NVIDIA ECOSYSTEM PLATFORM AGX DGX RTX EGX HGX FLEET COMMAND OMNIVERSE AI ENTERPRISE VGPU GPU CPU DPU GEFORCE RIVA
  • 27. 27 SUMMARY • Tegra SoC has a long history, and that experience has been applied to current Xavier, the next generation Orin, and the next generation Atlan • The future car is software defined, and NVIDIA provide whole ecosystem such as DRIVE Hyperion and DGX Systems • Grace CPU is designed for giant-scale AI and HPC applications • BlueField-3 DPU is the first 400 Gb/s data processing unit • DOCA enables broad BlueField ecosystem • GPU, CPU and DPU chips make a yearly leaps in one architecture