SlideShare uma empresa Scribd logo
1 de 26
Baixar para ler offline
Copyright © Khronos® Group Inc. 2018 - Page 1
Vision and
Inferencing
Update
September 2018
Neil Trevett | Khronos President
NVIDIA | VP Developer Ecosystem
ntrevett@nvidia.com | @neilt3d
www.khronos.org
Copyright © Khronos® Group Inc. 2018 - Page 2
Khronos Mission
Software
Silicon
Khronos is an International Industry Consortium creating royalty-free, open standards
to enable software to access hardware acceleration for
3D Graphics, Virtual and Augmented Reality, Parallel Computing,
Neural Networks and Vision Processing
Copyright © Khronos® Group Inc. 2018 - Page 3
Khronos Primary Standards
3D Graphics
VR and AR
Heterogenous Compute
(Parallel Processing)
Vision and Inferencing
APIs File Formats
Copyright © Khronos® Group Inc. 2018 - Page 4
Topics for Today
1 2 3
Landscape of
vision and
inferencing
acceleration
Where Khronos open
standards provide
non-proprietary
ecosystem choices
New updates on
Khronos vision
and inferencing
standards
Copyright © Khronos® Group Inc. 2018 - Page 5
Vision and Inferencing
Runtimes
Machine Learning Acceleration
Desktop and Cloud
GPU/TPU Acceleration
Diverse Inferencing
Acceleration Hardware
Training on Desktop and Cloud
Neural Net Training
Frameworks
Neural Net Training
Frameworks
Neural Net Training
Frameworks
Training
Frameworks
Deployment on Embedded Devices
Optimization
Trained
Networks
Vision and Inferencing
Applications
Training Data Sets
OR
Live
Data
Copyright © Khronos® Group Inc. 2018 - Page 6
Machine Learning Training
Desktop and
Cloud Hardware
cuDNN MIOpen clDNN
Neural Net Training
Frameworks
Neural Net Training
Frameworks
Neural Net Training
Frameworks
Neural Net Training
Frameworks
TPU
Authoring
Interchange
GPUs have well established
APIs and libraries for
compute acceleration
Copyright © Khronos® Group Inc. 2018 - Page 7
Vision and Inferencing
Runtimes
Machine Learning Acceleration
Desktop and Cloud
GPU/TPU Acceleration
Diverse Inferencing
Acceleration Hardware
Training on Desktop and Cloud
Neural Net Training
Frameworks
Neural Net Training
Frameworks
Neural Net Training
Frameworks
Training
Frameworks
Deployment on Embedded Devices
Optimization
Trained
Networks
Vision and Inferencing
Applications
Training Data Sets
OR
Live
Data
Copyright © Khronos® Group Inc. 2018 - Page 8
NNEF - Neural Network Exchange Format
NN Authoring Framework 1
NN Authoring Framework 2
NN Authoring Framework 3
Inference Engine 1
Inference Engine 2
Inference Engine 3
Every Tool Needs an Exporter to
Every Accelerator
With NNEF
Before NNEF
NN Authoring Framework 1
NN Authoring Framework 2
NN Authoring Framework 3
Inference Engine 1
Inference Engine 2
Inference Engine 3
Optimization and processing tools
Copyright © Khronos® Group Inc. 2018 - Page 9
Network Data File
Binary format contains parameter tensors
Supports float and quantized (integer) data
Flexible bit widths and quantization algorithms
Quantization algorithms expressed as extensible
compound operations
Quantization info provided as hints for execution
Network Data File
Binary format contains parameter tensors
Supports float and quantized (integer) data
Flexible bit widths and quantization algorithms
Quantization algorithms expressed as extensible
compound operations
Quantization info provided as hints for execution
NNEF Captures a Neural Network Description
Network Structure File
Distilled, platform independent network description
Human readable, syntactical elements from Python
Standardized Operations
Rigorously defined semantics
Linear, convolution, pooling, normalization, activation, unary/binary
Supports fully connected, convolutional, recurrent architectures
Two Levels of Expressiveness
Flat
Basic transfer of computation graphs with standardized operations
Simple to parse and translate to vendor specific formats
Compositional
Define custom compound operations
Higher-level graph descriptions
More complex to parse but offers more optimization hints
Network Data File
Binary format contains parameter tensors
Supports float and quantized (integer) data
Flexible bit widths and quantization algorithms
Quantization algorithms expressed as extensible
compound operations
Quantization info provided as hints for execution
Split Structure and Data files
Easy independent access to network structure or individual parameter data
Set of files can use a container such as tar or zip with optional compression and encryption
Can associate multiple Data Files with one Network
Structure File e.g. the same data in multiple formats
Copyright © Khronos® Group Inc. 2018 - Page 10
NNEF 1.0
Files
Syntax
Parser/
Validator
TensorFlow
and Caffe
Exporters
NNEF open source projects hosted on
Khronos NNEF GitHub repository
Apache 2.0 license
https://github.com/KhronosGroup/NNEF-Tools
TensorFlow
and Caffe2
Importer /
Exporters
Google
NNAPI
Convertor
OpenVX
Ingestion &
Execution
Live
Imminent
NNEF V1.0 released in August 2018
After positive industry feedback on Provisional
specification released in December 2017
NNEF Working Group Participants
Copyright © Khronos® Group Inc. 2018 - Page 11
NNEF and ONNX
Embedded Inferencing Import Training Interchange
Defined Specification Open Source Project
Stability for hardware deployment Software stack flexibility
Multi-company Governance Initiated by Facebook
Flexible Precision / Quantization 32-bit Floating Point only
Comparing Neural Network
Exchange Industry Initiatives
ONNX and NNEF are Complementary
- ONNX will HAVE to move fast to track authoring framework interchange
- NNEF provides a stable bridge from training into edge inferencing engines
Bidirectional translator
in open source
Initiating open source
bidirectional translator
Khronos tried to use LLVM as a hardware IR
BUT
LLVM evolves without needing to preserve
backwards compatibility.
Fine for software compilers – very difficult to
manage for hardware toolchains and roadmaps
SO
Khronos created hardware oriented SPIR-V
with bidirectional translation to LLVM
Same Industry Dynamics as
LLVM and SPIR-V
Copyright © Khronos® Group Inc. 2018 - Page 12
Vision and Inferencing
Runtimes
Machine Learning Acceleration
Desktop and Cloud
GPU/TPU Acceleration
Diverse Inferencing
Acceleration Hardware
Training on Desktop and Cloud
Neural Net Training
Frameworks
Neural Net Training
Frameworks
Neural Net Training
Frameworks
Training
Frameworks
Deployment on Embedded Devices
Optimization
Trained
Networks
Vision and Inferencing
Applications
Training Data Sets
OR
Live
Data
Copyright © Khronos® Group Inc. 2018 - Page 13
Three Broad Inferencing Choices
Run in
training
framework
Export to
inference
runtime
Compile to
optimized
code
Acceleration needs high-level programming
tools – typically large C++ applications
E.g. TensorFlow or TensorFlow Lite
The most popular industry choice today.
Runtime often uses underlying acceleration APIs
E.g. TensorRT, CoreML
Often used to merge custom or vision code
alongside inferencing runtime – generates
LLVM for CPU + accelerated API code
Copyright © Khronos® Group Inc. 2018 - Page 14
SYCL Single Source C++ Parallel Programming
• SYCL 1.2.1 Adopters Program released in July 2018 with open source conformance tests
- https://www.khronos.org/news/press/khronos-releases-conformance-test-suite-for-sycl-1.2.1
• Multiple Implémentations shipping: triSYCL, ComputeCpp
- http://sycl.tech
• Multiple SYCL libraries for vision and inferencing
- SYCL-BLAS, SYCL-DNN, SYCL-Eigen
C++ Kernel Fusion can gives better performance
on complex apps and libs than hand-coding
Single application source
file using STANDARD C++C++ templates and lambda
functions separate host &
device code
Accelerated code passed into
device OpenCL compilers
Copyright © Khronos® Group Inc. 2018 - Page 15
Python Client C++ Client
Optional C API
TensorFlow tensor Kernels
(> 800 kernels)
ConvolutionsMatrix multiply
Eigen Tensors SYCL-BLAS Library SYCL-DNN Library
TensorFlow on SYCL / OpenCL
State-of-the-art C++
compilers can fuse nodes
in vision and neural
network graphs to provide
optimized performance
often faster than hand-
coded applications
Copyright © Khronos® Group Inc. 2018 - Page 16
Three Broad Inferencing Choices
Run in
training
framework
Export to
inference
runtime
Compile to
optimized
code
Acceleration needs high-level programming
tools – typically large C++ applications
E.g. TensorFlow or TensorFlow Lite
The most popular industry choice today.
Runtime often uses underlying acceleration APIs
E.g. TensorRT, CoreML
Often used to merge custom or vision code
alongside inferencing runtime – generates
LLVM for CPU + accelerated API code
Copyright © Khronos® Group Inc. 2018 - Page 17
Platform Inferencing Stacks
Microsoft Windows
Windows Machine Learning (WinML)
Google Android
Neural Network API (NNAPI)
Apple MacOS and iOS
CoreML
https://docs.microsoft.com/en-us/windows/uwp/machine-learning/ https://developer.android.com/ndk/guides/neuralnetworks/ https://developer.apple.com/documentation/coreml
Core ML Model
Consistent Three Steps
1. Import trained NN model file
2. Build optimized version of graph
3. Run graph on accelerated runtime using
underlying low-level API
Copyright © Khronos® Group Inc. 2018 - Page 18
NNVM - Open Compiler for AI Inferencing
http://www.tvmlang.org/2017/08/17/tvm-release-announcement.html
SPIR-V IR for parallel accelerators
Backend in development
LLVM IR for CPUs
1.Import Trained
Network Description
2. Graph-level
Optimizations
3. Decompose to primitive
instructions and emit
programs for accelerated
run-times
Paul G. Allen School of Computer Science & Engineering, University of Washington
Facebook Glow Compiler
(Graph Lowering Optimizations)
https://facebook.ai/developers/tools/glow
Copyright © Khronos® Group Inc. 2018 - Page 19
OpenVX
PowerEfficiency
Computation Flexibility
Dedicated
Hardware
GPU
Compute
Multi-core
CPUX1
X10
X100
Vision
DSPs
Wide range of vision hardware architectures
OpenVX provides a high-level Graph-based abstraction
->
Enables Graph-level optimizations!
Can be implemented on almost any hardware or processor!
->
Portable, Efficient Vision Processing!
Vision
Node
Vision
Node
Vision
NodeVision
Node
Vision Processing Graph
Shipping Implementations
Copyright © Khronos® Group Inc. 2018 - Page 20
Extending OpenVX for Inferencing #1
Neural Network Extension
• OpenVX Nodes to represent common NN Layers
• 1D-4D Tensors to connect layers and common
• INT16, INT7.8, INT8, and U8 Tensor Ops
vxActivationLayer vxConvolutionLayer vxDeconvolutionLayer
vxFullyConnectedLayer vxNormalizationLayer vxPoolingLayer
vxSoftmaxLayer vxROIPoolingLayer …
Vision
Node
Vision
Node
Vision
Node
Downstream
Application
Processing
Native
Camera
Control CNN Nodes
An OpenVX graph mixing CNN
nodes with traditional vision nodes
NNEF Translator converts NNEF
representation into OpenVX Node Graphs
NNEF Translator
• Ingests NNEF File and builds OpenVX node graph
• Open source project in progress
Importing NNEF Neural Network Descriptions
Copyright © Khronos® Group Inc. 2018 - Page 21
Extending OpenVX for Inferencing #2
OpenVX/OpenCL Interop
• Provisional Extension
• Enables custom OpenCL acceleration to be
invoked from OpenVX User Kernels
• Memory objects can be mapped or copied
Kernel/Graph Import
• Provisional Extension
• Defines container for executable or IR code
• Enables arbitrary code to be inserted as a
OpenVX Node in a graph
OpenCL Command Queue
Application
cl_mem buffers
Fully asynchronous host-device
operations during data exchange
OpenVX data objects
Runtime
Runtime Map or copy OpenVX data
objects into cl_mem buffers
Copy or export
cl_mem buffers into OpenVX
data objects
OpenVX user-kernels can access command
queue and cl_mem objects to asynchronously
schedule OpenCL kernel execution
OpenVX/OpenCL Interop
Creating Custom User Nodes
Copyright © Khronos® Group Inc. 2018 - Page 22
NNEF and OpenVX for Inferencing
Compilation
Kernel
Import
Ingestion
Proprietary
Runtimes
Vision Nodes
User Nodes
NN
Extension
Translator
Executable
Code
To mix inferencing
with vision and other
custom processing
Acceleration
APIs
Many inferencing stacks end up
using OpenCL for hardware
acceleration
Compile to
executable
code
Execute
accelerated
OpenVX
Runtime
Compile to
IR/Binary
Copyright © Khronos® Group Inc. 2018 - Page 23
GPU
OpenCL – Unique Heterogeneous Runtime
FPGA DSP
Custom Hardware
GPU
CPUCPUCPUGPU
Growing number of optimized OpenCL
vision and inferencing libraries
Vision: OpenCV, Halide, Visioncpp
Machine Learning: Xiaomi MACE, Arm Compute Library
Linear Algebra: clDNN, clBlast, ViennaCL
Application or
Inferencing Run-time
Fragmented GPU
API Landscape
OpenCL is the only industry standard for low-level heterogeneous compute
Portable control over memory and parallel task execution
“The closest you can be to your embedded accelerator and still be portable”
Application or
Inferencing Run-time
Copyright © Khronos® Group Inc. 2018 - Page 24
OpenCL Ecosystem Roadmap
2011
OpenCL 1.2
OpenCL C Kernel
Language
OpenCL 2.1
SPIR-V in Core
2015
SYCL 1.2
C++11 Single source
programming
OpenCL 2.2
C++ Kernel Language
2017
SYCL 1.2.1
C++11 Single source
programming
Bringing Heterogeneous
compute to standard ISO C++
Khronos hosting C++17 Parallel STL
C++20 Parallel STL with Ranges Proposal
Processor Deployment
Flexibility
Parallel computation across diverse
processor architectures
Kernel Deployment
Flexibility
Execute OpenCL C kernels on
Vulkan GPU runtimes
OpenCL has an active
three track roadmap
Copyright © Khronos® Group Inc. 2018 - Page 25
OpenCL Next - Feature Set Flexibility
• Defining OpenCL features that become optional for enhanced deployment flexibility
- API and language features e.g. floating point precisions
• Feature Sets avoid fragmentation
- Defined to suit specific markets – e.g. desktop, embedded vision and inferencing
• Implementations are conformant if fully support feature set functionality
OpenCL 2.2 Functionality = queryable, optional feature
Khronos-defined
OpenCL 2.2 Full Profile
Feature Set
Khronos-defined
OpenCL 1.2 Full Profile
Feature Set
Industry-defined
Feature Set E.g.
Embedded Vision
and Inferencing
Copyright © Khronos® Group Inc. 2018 - Page 26
Universal Deployment Flexibility
Open source SPIRV-Cross converts
SPIR-V to MSL or HLSLClspv and clvk
open source tools
OpenCL
Programs
Native
Vulkan Drivers
UWP and
D3D based
Consoles
Open source shims
convert Vulkan to
Metal or D3D API calls
Open source tools enable
OpenCL and Vulkan apps
to be increasingly
deployed on any platform

Mais conteúdo relacionado

Mais procurados

P4/FPGA, Packet Acceleration
P4/FPGA, Packet AccelerationP4/FPGA, Packet Acceleration
P4/FPGA, Packet AccelerationLiz Warner
 
Lightweight Virtualized Containers For Open Platform for NFV* (OPNFV*)
Lightweight Virtualized Containers For Open Platform for NFV* (OPNFV*)Lightweight Virtualized Containers For Open Platform for NFV* (OPNFV*)
Lightweight Virtualized Containers For Open Platform for NFV* (OPNFV*)Michelle Holley
 
Automated ML Workflow for Distributed Big Data Using Analytics Zoo (CVPR2020 ...
Automated ML Workflow for Distributed Big Data Using Analytics Zoo (CVPR2020 ...Automated ML Workflow for Distributed Big Data Using Analytics Zoo (CVPR2020 ...
Automated ML Workflow for Distributed Big Data Using Analytics Zoo (CVPR2020 ...Jason Dai
 
AI for All: Biology is eating the world & AI is eating Biology
AI for All: Biology is eating the world & AI is eating Biology AI for All: Biology is eating the world & AI is eating Biology
AI for All: Biology is eating the world & AI is eating Biology Intel® Software
 
Open Network Edge Services Software for 5G and Edge
Open Network Edge Services Software for 5G and EdgeOpen Network Edge Services Software for 5G and Edge
Open Network Edge Services Software for 5G and EdgeLiz Warner
 
End-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics ZooEnd-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics ZooJason Dai
 
“Making Edge AI Inference Programming Easier and Flexible,” a Presentation fr...
“Making Edge AI Inference Programming Easier and Flexible,” a Presentation fr...“Making Edge AI Inference Programming Easier and Flexible,” a Presentation fr...
“Making Edge AI Inference Programming Easier and Flexible,” a Presentation fr...Edge AI and Vision Alliance
 
Edge and 5G: What is in it for the developers?
Edge and 5G: What is in it for the developers?Edge and 5G: What is in it for the developers?
Edge and 5G: What is in it for the developers?Michelle Holley
 
Introduction to container networking in K8s - SDN/NFV London meetup
Introduction to container networking in K8s - SDN/NFV  London meetupIntroduction to container networking in K8s - SDN/NFV  London meetup
Introduction to container networking in K8s - SDN/NFV London meetupHaidee McMahon
 
HKG18-301 - Dramatically Accelerate 96Board Software via an FPGA with Integra...
HKG18-301 - Dramatically Accelerate 96Board Software via an FPGA with Integra...HKG18-301 - Dramatically Accelerate 96Board Software via an FPGA with Integra...
HKG18-301 - Dramatically Accelerate 96Board Software via an FPGA with Integra...Linaro
 
Khronos Overview Japan Virtual Open House Feb 2021
Khronos Overview Japan Virtual Open House Feb 2021Khronos Overview Japan Virtual Open House Feb 2021
Khronos Overview Japan Virtual Open House Feb 2021The Khronos Group Inc.
 
Enabling MEC as a New Telco Business Opportunity
Enabling MEC as a New Telco Business OpportunityEnabling MEC as a New Telco Business Opportunity
Enabling MEC as a New Telco Business OpportunityMichelle Holley
 
AIDC NY: Applications of Intel AI by QuEST Global - 09.19.2019
AIDC NY: Applications of Intel AI by QuEST Global - 09.19.2019AIDC NY: Applications of Intel AI by QuEST Global - 09.19.2019
AIDC NY: Applications of Intel AI by QuEST Global - 09.19.2019Intel® Software
 
"Edge/Cloud Tradeoffs and Scaling a Consumer Computer Vision Product," a Pres...
"Edge/Cloud Tradeoffs and Scaling a Consumer Computer Vision Product," a Pres..."Edge/Cloud Tradeoffs and Scaling a Consumer Computer Vision Product," a Pres...
"Edge/Cloud Tradeoffs and Scaling a Consumer Computer Vision Product," a Pres...Edge AI and Vision Alliance
 
oneAPI: Industry Initiative & Intel Product
oneAPI: Industry Initiative & Intel ProductoneAPI: Industry Initiative & Intel Product
oneAPI: Industry Initiative & Intel ProductTyrone Systems
 
“Vitis and Vitis AI: Application Acceleration from Cloud to Edge,” a Presenta...
“Vitis and Vitis AI: Application Acceleration from Cloud to Edge,” a Presenta...“Vitis and Vitis AI: Application Acceleration from Cloud to Edge,” a Presenta...
“Vitis and Vitis AI: Application Acceleration from Cloud to Edge,” a Presenta...Edge AI and Vision Alliance
 
Edge Computing: A Unified Infrastructure for all the Different Pieces
Edge Computing: A Unified Infrastructure for all the Different PiecesEdge Computing: A Unified Infrastructure for all the Different Pieces
Edge Computing: A Unified Infrastructure for all the Different PiecesCloudify Community
 

Mais procurados (20)

P4/FPGA, Packet Acceleration
P4/FPGA, Packet AccelerationP4/FPGA, Packet Acceleration
P4/FPGA, Packet Acceleration
 
SYCL 2020 Specification
SYCL 2020 SpecificationSYCL 2020 Specification
SYCL 2020 Specification
 
Lightweight Virtualized Containers For Open Platform for NFV* (OPNFV*)
Lightweight Virtualized Containers For Open Platform for NFV* (OPNFV*)Lightweight Virtualized Containers For Open Platform for NFV* (OPNFV*)
Lightweight Virtualized Containers For Open Platform for NFV* (OPNFV*)
 
Automated ML Workflow for Distributed Big Data Using Analytics Zoo (CVPR2020 ...
Automated ML Workflow for Distributed Big Data Using Analytics Zoo (CVPR2020 ...Automated ML Workflow for Distributed Big Data Using Analytics Zoo (CVPR2020 ...
Automated ML Workflow for Distributed Big Data Using Analytics Zoo (CVPR2020 ...
 
AI for All: Biology is eating the world & AI is eating Biology
AI for All: Biology is eating the world & AI is eating Biology AI for All: Biology is eating the world & AI is eating Biology
AI for All: Biology is eating the world & AI is eating Biology
 
Open Network Edge Services Software for 5G and Edge
Open Network Edge Services Software for 5G and EdgeOpen Network Edge Services Software for 5G and Edge
Open Network Edge Services Software for 5G and Edge
 
End-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics ZooEnd-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics Zoo
 
“Making Edge AI Inference Programming Easier and Flexible,” a Presentation fr...
“Making Edge AI Inference Programming Easier and Flexible,” a Presentation fr...“Making Edge AI Inference Programming Easier and Flexible,” a Presentation fr...
“Making Edge AI Inference Programming Easier and Flexible,” a Presentation fr...
 
Edge and 5G: What is in it for the developers?
Edge and 5G: What is in it for the developers?Edge and 5G: What is in it for the developers?
Edge and 5G: What is in it for the developers?
 
Introduction to container networking in K8s - SDN/NFV London meetup
Introduction to container networking in K8s - SDN/NFV  London meetupIntroduction to container networking in K8s - SDN/NFV  London meetup
Introduction to container networking in K8s - SDN/NFV London meetup
 
Intel Developer Program
Intel Developer ProgramIntel Developer Program
Intel Developer Program
 
HKG18-301 - Dramatically Accelerate 96Board Software via an FPGA with Integra...
HKG18-301 - Dramatically Accelerate 96Board Software via an FPGA with Integra...HKG18-301 - Dramatically Accelerate 96Board Software via an FPGA with Integra...
HKG18-301 - Dramatically Accelerate 96Board Software via an FPGA with Integra...
 
Khronos Overview Japan Virtual Open House Feb 2021
Khronos Overview Japan Virtual Open House Feb 2021Khronos Overview Japan Virtual Open House Feb 2021
Khronos Overview Japan Virtual Open House Feb 2021
 
Enabling MEC as a New Telco Business Opportunity
Enabling MEC as a New Telco Business OpportunityEnabling MEC as a New Telco Business Opportunity
Enabling MEC as a New Telco Business Opportunity
 
AIDC NY: Applications of Intel AI by QuEST Global - 09.19.2019
AIDC NY: Applications of Intel AI by QuEST Global - 09.19.2019AIDC NY: Applications of Intel AI by QuEST Global - 09.19.2019
AIDC NY: Applications of Intel AI by QuEST Global - 09.19.2019
 
"Edge/Cloud Tradeoffs and Scaling a Consumer Computer Vision Product," a Pres...
"Edge/Cloud Tradeoffs and Scaling a Consumer Computer Vision Product," a Pres..."Edge/Cloud Tradeoffs and Scaling a Consumer Computer Vision Product," a Pres...
"Edge/Cloud Tradeoffs and Scaling a Consumer Computer Vision Product," a Pres...
 
oneAPI: Industry Initiative & Intel Product
oneAPI: Industry Initiative & Intel ProductoneAPI: Industry Initiative & Intel Product
oneAPI: Industry Initiative & Intel Product
 
“Vitis and Vitis AI: Application Acceleration from Cloud to Edge,” a Presenta...
“Vitis and Vitis AI: Application Acceleration from Cloud to Edge,” a Presenta...“Vitis and Vitis AI: Application Acceleration from Cloud to Edge,” a Presenta...
“Vitis and Vitis AI: Application Acceleration from Cloud to Edge,” a Presenta...
 
Edge Computing: A Unified Infrastructure for all the Different Pieces
Edge Computing: A Unified Infrastructure for all the Different PiecesEdge Computing: A Unified Infrastructure for all the Different Pieces
Edge Computing: A Unified Infrastructure for all the Different Pieces
 
"Vision for All?," a Presentation from Intel
"Vision for All?," a Presentation from Intel"Vision for All?," a Presentation from Intel
"Vision for All?," a Presentation from Intel
 

Semelhante a "Update on Khronos Standards for Vision and Machine Learning," a Presentation from the Khronos Group

"Current and Planned Standards for Computer Vision and Machine Learning," a P...
"Current and Planned Standards for Computer Vision and Machine Learning," a P..."Current and Planned Standards for Computer Vision and Machine Learning," a P...
"Current and Planned Standards for Computer Vision and Machine Learning," a P...Edge AI and Vision Alliance
 
“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...
“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...
“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...Edge AI and Vision Alliance
 
"APIs for Accelerating Vision and Inferencing: An Industry Overview of Option...
"APIs for Accelerating Vision and Inferencing: An Industry Overview of Option..."APIs for Accelerating Vision and Inferencing: An Industry Overview of Option...
"APIs for Accelerating Vision and Inferencing: An Industry Overview of Option...Edge AI and Vision Alliance
 
"APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre...
"APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre..."APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre...
"APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre...Edge AI and Vision Alliance
 
“Khronos Standard APIs for Accelerating Vision and Inferencing,” a Presentati...
“Khronos Standard APIs for Accelerating Vision and Inferencing,” a Presentati...“Khronos Standard APIs for Accelerating Vision and Inferencing,” a Presentati...
“Khronos Standard APIs for Accelerating Vision and Inferencing,” a Presentati...Edge AI and Vision Alliance
 
PyTorch vs TensorFlow: The Force Is Strong With Which One? | Which One You Sh...
PyTorch vs TensorFlow: The Force Is Strong With Which One? | Which One You Sh...PyTorch vs TensorFlow: The Force Is Strong With Which One? | Which One You Sh...
PyTorch vs TensorFlow: The Force Is Strong With Which One? | Which One You Sh...Edureka!
 
OpenCL Overview Japan Virtual Open House Feb 2021
OpenCL Overview Japan Virtual Open House Feb 2021OpenCL Overview Japan Virtual Open House Feb 2021
OpenCL Overview Japan Virtual Open House Feb 2021The Khronos Group Inc.
 
Innovation with ai at scale on the edge vt sept 2019 v0
Innovation with ai at scale  on the edge vt sept 2019 v0Innovation with ai at scale  on the edge vt sept 2019 v0
Innovation with ai at scale on the edge vt sept 2019 v0Ganesan Narayanasamy
 
Resume_Appaji
Resume_AppajiResume_Appaji
Resume_AppajiAppaji K
 
Austin,TX Meetup presentation tensorflow final oct 26 2017
Austin,TX Meetup presentation tensorflow final oct 26 2017Austin,TX Meetup presentation tensorflow final oct 26 2017
Austin,TX Meetup presentation tensorflow final oct 26 2017Clarisse Hedglin
 
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea GalloDeep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea GalloLinaro
 
HP CAST 2017 Frankfurt : HPE UberCloud boosting HPC as a Service
HP CAST 2017 Frankfurt : HPE UberCloud boosting HPC as a ServiceHP CAST 2017 Frankfurt : HPE UberCloud boosting HPC as a Service
HP CAST 2017 Frankfurt : HPE UberCloud boosting HPC as a ServiceThomas Francis
 
Webinar: Synergy turbinado com o SSP1.4: criptografia elíptica, vídeo pela US...
Webinar: Synergy turbinado com o SSP1.4: criptografia elíptica, vídeo pela US...Webinar: Synergy turbinado com o SSP1.4: criptografia elíptica, vídeo pela US...
Webinar: Synergy turbinado com o SSP1.4: criptografia elíptica, vídeo pela US...Embarcados
 
DCEU 18: From Legacy Mainframe to the Cloud: The Finnish Railways Evolution w...
DCEU 18: From Legacy Mainframe to the Cloud: The Finnish Railways Evolution w...DCEU 18: From Legacy Mainframe to the Cloud: The Finnish Railways Evolution w...
DCEU 18: From Legacy Mainframe to the Cloud: The Finnish Railways Evolution w...Docker, Inc.
 
"The Vision Acceleration API Landscape: Options and Trade-offs," a Presentati...
"The Vision Acceleration API Landscape: Options and Trade-offs," a Presentati..."The Vision Acceleration API Landscape: Options and Trade-offs," a Presentati...
"The Vision Acceleration API Landscape: Options and Trade-offs," a Presentati...Edge AI and Vision Alliance
 
TDC2019 Intel Software Day - Tecnicas de Programacao Paralela em Machine Lear...
TDC2019 Intel Software Day - Tecnicas de Programacao Paralela em Machine Lear...TDC2019 Intel Software Day - Tecnicas de Programacao Paralela em Machine Lear...
TDC2019 Intel Software Day - Tecnicas de Programacao Paralela em Machine Lear...tdc-globalcode
 
Achieving AI @scale on Mobile Devices
Achieving AI @scale on Mobile DevicesAchieving AI @scale on Mobile Devices
Achieving AI @scale on Mobile DevicesQualcomm Research
 
"Update on Khronos Standards for Vision and Machine Learning," a Presentation...
"Update on Khronos Standards for Vision and Machine Learning," a Presentation..."Update on Khronos Standards for Vision and Machine Learning," a Presentation...
"Update on Khronos Standards for Vision and Machine Learning," a Presentation...Edge AI and Vision Alliance
 
Red Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Red Hat® Ceph Storage and Network Solutions for Software Defined InfrastructureRed Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Red Hat® Ceph Storage and Network Solutions for Software Defined InfrastructureIntel® Software
 
HKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening KeynoteHKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening KeynoteLinaro
 

Semelhante a "Update on Khronos Standards for Vision and Machine Learning," a Presentation from the Khronos Group (20)

"Current and Planned Standards for Computer Vision and Machine Learning," a P...
"Current and Planned Standards for Computer Vision and Machine Learning," a P..."Current and Planned Standards for Computer Vision and Machine Learning," a P...
"Current and Planned Standards for Computer Vision and Machine Learning," a P...
 
“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...
“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...
“Khronos Group Standards: Powering the Future of Embedded Vision,” a Presenta...
 
"APIs for Accelerating Vision and Inferencing: An Industry Overview of Option...
"APIs for Accelerating Vision and Inferencing: An Industry Overview of Option..."APIs for Accelerating Vision and Inferencing: An Industry Overview of Option...
"APIs for Accelerating Vision and Inferencing: An Industry Overview of Option...
 
"APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre...
"APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre..."APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre...
"APIs for Accelerating Vision and Inferencing: Options and Trade-offs," a Pre...
 
“Khronos Standard APIs for Accelerating Vision and Inferencing,” a Presentati...
“Khronos Standard APIs for Accelerating Vision and Inferencing,” a Presentati...“Khronos Standard APIs for Accelerating Vision and Inferencing,” a Presentati...
“Khronos Standard APIs for Accelerating Vision and Inferencing,” a Presentati...
 
PyTorch vs TensorFlow: The Force Is Strong With Which One? | Which One You Sh...
PyTorch vs TensorFlow: The Force Is Strong With Which One? | Which One You Sh...PyTorch vs TensorFlow: The Force Is Strong With Which One? | Which One You Sh...
PyTorch vs TensorFlow: The Force Is Strong With Which One? | Which One You Sh...
 
OpenCL Overview Japan Virtual Open House Feb 2021
OpenCL Overview Japan Virtual Open House Feb 2021OpenCL Overview Japan Virtual Open House Feb 2021
OpenCL Overview Japan Virtual Open House Feb 2021
 
Innovation with ai at scale on the edge vt sept 2019 v0
Innovation with ai at scale  on the edge vt sept 2019 v0Innovation with ai at scale  on the edge vt sept 2019 v0
Innovation with ai at scale on the edge vt sept 2019 v0
 
Resume_Appaji
Resume_AppajiResume_Appaji
Resume_Appaji
 
Austin,TX Meetup presentation tensorflow final oct 26 2017
Austin,TX Meetup presentation tensorflow final oct 26 2017Austin,TX Meetup presentation tensorflow final oct 26 2017
Austin,TX Meetup presentation tensorflow final oct 26 2017
 
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea GalloDeep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
 
HP CAST 2017 Frankfurt : HPE UberCloud boosting HPC as a Service
HP CAST 2017 Frankfurt : HPE UberCloud boosting HPC as a ServiceHP CAST 2017 Frankfurt : HPE UberCloud boosting HPC as a Service
HP CAST 2017 Frankfurt : HPE UberCloud boosting HPC as a Service
 
Webinar: Synergy turbinado com o SSP1.4: criptografia elíptica, vídeo pela US...
Webinar: Synergy turbinado com o SSP1.4: criptografia elíptica, vídeo pela US...Webinar: Synergy turbinado com o SSP1.4: criptografia elíptica, vídeo pela US...
Webinar: Synergy turbinado com o SSP1.4: criptografia elíptica, vídeo pela US...
 
DCEU 18: From Legacy Mainframe to the Cloud: The Finnish Railways Evolution w...
DCEU 18: From Legacy Mainframe to the Cloud: The Finnish Railways Evolution w...DCEU 18: From Legacy Mainframe to the Cloud: The Finnish Railways Evolution w...
DCEU 18: From Legacy Mainframe to the Cloud: The Finnish Railways Evolution w...
 
"The Vision Acceleration API Landscape: Options and Trade-offs," a Presentati...
"The Vision Acceleration API Landscape: Options and Trade-offs," a Presentati..."The Vision Acceleration API Landscape: Options and Trade-offs," a Presentati...
"The Vision Acceleration API Landscape: Options and Trade-offs," a Presentati...
 
TDC2019 Intel Software Day - Tecnicas de Programacao Paralela em Machine Lear...
TDC2019 Intel Software Day - Tecnicas de Programacao Paralela em Machine Lear...TDC2019 Intel Software Day - Tecnicas de Programacao Paralela em Machine Lear...
TDC2019 Intel Software Day - Tecnicas de Programacao Paralela em Machine Lear...
 
Achieving AI @scale on Mobile Devices
Achieving AI @scale on Mobile DevicesAchieving AI @scale on Mobile Devices
Achieving AI @scale on Mobile Devices
 
"Update on Khronos Standards for Vision and Machine Learning," a Presentation...
"Update on Khronos Standards for Vision and Machine Learning," a Presentation..."Update on Khronos Standards for Vision and Machine Learning," a Presentation...
"Update on Khronos Standards for Vision and Machine Learning," a Presentation...
 
Red Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Red Hat® Ceph Storage and Network Solutions for Software Defined InfrastructureRed Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Red Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
 
HKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening KeynoteHKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening Keynote
 

Mais de Edge AI and Vision Alliance

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...Edge AI and Vision Alliance
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...Edge AI and Vision Alliance
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...Edge AI and Vision Alliance
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...Edge AI and Vision Alliance
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...Edge AI and Vision Alliance
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...Edge AI and Vision Alliance
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...Edge AI and Vision Alliance
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsightsEdge AI and Vision Alliance
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...Edge AI and Vision Alliance
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...Edge AI and Vision Alliance
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...Edge AI and Vision Alliance
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...Edge AI and Vision Alliance
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...Edge AI and Vision Alliance
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...Edge AI and Vision Alliance
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...Edge AI and Vision Alliance
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from SamsaraEdge AI and Vision Alliance
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...Edge AI and Vision Alliance
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...Edge AI and Vision Alliance
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...Edge AI and Vision Alliance
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...Edge AI and Vision Alliance
 

Mais de Edge AI and Vision Alliance (20)

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
 

Último

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 

Último (20)

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 

"Update on Khronos Standards for Vision and Machine Learning," a Presentation from the Khronos Group

  • 1. Copyright © Khronos® Group Inc. 2018 - Page 1 Vision and Inferencing Update September 2018 Neil Trevett | Khronos President NVIDIA | VP Developer Ecosystem ntrevett@nvidia.com | @neilt3d www.khronos.org
  • 2. Copyright © Khronos® Group Inc. 2018 - Page 2 Khronos Mission Software Silicon Khronos is an International Industry Consortium creating royalty-free, open standards to enable software to access hardware acceleration for 3D Graphics, Virtual and Augmented Reality, Parallel Computing, Neural Networks and Vision Processing
  • 3. Copyright © Khronos® Group Inc. 2018 - Page 3 Khronos Primary Standards 3D Graphics VR and AR Heterogenous Compute (Parallel Processing) Vision and Inferencing APIs File Formats
  • 4. Copyright © Khronos® Group Inc. 2018 - Page 4 Topics for Today 1 2 3 Landscape of vision and inferencing acceleration Where Khronos open standards provide non-proprietary ecosystem choices New updates on Khronos vision and inferencing standards
  • 5. Copyright © Khronos® Group Inc. 2018 - Page 5 Vision and Inferencing Runtimes Machine Learning Acceleration Desktop and Cloud GPU/TPU Acceleration Diverse Inferencing Acceleration Hardware Training on Desktop and Cloud Neural Net Training Frameworks Neural Net Training Frameworks Neural Net Training Frameworks Training Frameworks Deployment on Embedded Devices Optimization Trained Networks Vision and Inferencing Applications Training Data Sets OR Live Data
  • 6. Copyright © Khronos® Group Inc. 2018 - Page 6 Machine Learning Training Desktop and Cloud Hardware cuDNN MIOpen clDNN Neural Net Training Frameworks Neural Net Training Frameworks Neural Net Training Frameworks Neural Net Training Frameworks TPU Authoring Interchange GPUs have well established APIs and libraries for compute acceleration
  • 7. Copyright © Khronos® Group Inc. 2018 - Page 7 Vision and Inferencing Runtimes Machine Learning Acceleration Desktop and Cloud GPU/TPU Acceleration Diverse Inferencing Acceleration Hardware Training on Desktop and Cloud Neural Net Training Frameworks Neural Net Training Frameworks Neural Net Training Frameworks Training Frameworks Deployment on Embedded Devices Optimization Trained Networks Vision and Inferencing Applications Training Data Sets OR Live Data
  • 8. Copyright © Khronos® Group Inc. 2018 - Page 8 NNEF - Neural Network Exchange Format NN Authoring Framework 1 NN Authoring Framework 2 NN Authoring Framework 3 Inference Engine 1 Inference Engine 2 Inference Engine 3 Every Tool Needs an Exporter to Every Accelerator With NNEF Before NNEF NN Authoring Framework 1 NN Authoring Framework 2 NN Authoring Framework 3 Inference Engine 1 Inference Engine 2 Inference Engine 3 Optimization and processing tools
  • 9. Copyright © Khronos® Group Inc. 2018 - Page 9 Network Data File Binary format contains parameter tensors Supports float and quantized (integer) data Flexible bit widths and quantization algorithms Quantization algorithms expressed as extensible compound operations Quantization info provided as hints for execution Network Data File Binary format contains parameter tensors Supports float and quantized (integer) data Flexible bit widths and quantization algorithms Quantization algorithms expressed as extensible compound operations Quantization info provided as hints for execution NNEF Captures a Neural Network Description Network Structure File Distilled, platform independent network description Human readable, syntactical elements from Python Standardized Operations Rigorously defined semantics Linear, convolution, pooling, normalization, activation, unary/binary Supports fully connected, convolutional, recurrent architectures Two Levels of Expressiveness Flat Basic transfer of computation graphs with standardized operations Simple to parse and translate to vendor specific formats Compositional Define custom compound operations Higher-level graph descriptions More complex to parse but offers more optimization hints Network Data File Binary format contains parameter tensors Supports float and quantized (integer) data Flexible bit widths and quantization algorithms Quantization algorithms expressed as extensible compound operations Quantization info provided as hints for execution Split Structure and Data files Easy independent access to network structure or individual parameter data Set of files can use a container such as tar or zip with optional compression and encryption Can associate multiple Data Files with one Network Structure File e.g. the same data in multiple formats
  • 10. Copyright © Khronos® Group Inc. 2018 - Page 10 NNEF 1.0 Files Syntax Parser/ Validator TensorFlow and Caffe Exporters NNEF open source projects hosted on Khronos NNEF GitHub repository Apache 2.0 license https://github.com/KhronosGroup/NNEF-Tools TensorFlow and Caffe2 Importer / Exporters Google NNAPI Convertor OpenVX Ingestion & Execution Live Imminent NNEF V1.0 released in August 2018 After positive industry feedback on Provisional specification released in December 2017 NNEF Working Group Participants
  • 11. Copyright © Khronos® Group Inc. 2018 - Page 11 NNEF and ONNX Embedded Inferencing Import Training Interchange Defined Specification Open Source Project Stability for hardware deployment Software stack flexibility Multi-company Governance Initiated by Facebook Flexible Precision / Quantization 32-bit Floating Point only Comparing Neural Network Exchange Industry Initiatives ONNX and NNEF are Complementary - ONNX will HAVE to move fast to track authoring framework interchange - NNEF provides a stable bridge from training into edge inferencing engines Bidirectional translator in open source Initiating open source bidirectional translator Khronos tried to use LLVM as a hardware IR BUT LLVM evolves without needing to preserve backwards compatibility. Fine for software compilers – very difficult to manage for hardware toolchains and roadmaps SO Khronos created hardware oriented SPIR-V with bidirectional translation to LLVM Same Industry Dynamics as LLVM and SPIR-V
  • 12. Copyright © Khronos® Group Inc. 2018 - Page 12 Vision and Inferencing Runtimes Machine Learning Acceleration Desktop and Cloud GPU/TPU Acceleration Diverse Inferencing Acceleration Hardware Training on Desktop and Cloud Neural Net Training Frameworks Neural Net Training Frameworks Neural Net Training Frameworks Training Frameworks Deployment on Embedded Devices Optimization Trained Networks Vision and Inferencing Applications Training Data Sets OR Live Data
  • 13. Copyright © Khronos® Group Inc. 2018 - Page 13 Three Broad Inferencing Choices Run in training framework Export to inference runtime Compile to optimized code Acceleration needs high-level programming tools – typically large C++ applications E.g. TensorFlow or TensorFlow Lite The most popular industry choice today. Runtime often uses underlying acceleration APIs E.g. TensorRT, CoreML Often used to merge custom or vision code alongside inferencing runtime – generates LLVM for CPU + accelerated API code
  • 14. Copyright © Khronos® Group Inc. 2018 - Page 14 SYCL Single Source C++ Parallel Programming • SYCL 1.2.1 Adopters Program released in July 2018 with open source conformance tests - https://www.khronos.org/news/press/khronos-releases-conformance-test-suite-for-sycl-1.2.1 • Multiple Implémentations shipping: triSYCL, ComputeCpp - http://sycl.tech • Multiple SYCL libraries for vision and inferencing - SYCL-BLAS, SYCL-DNN, SYCL-Eigen C++ Kernel Fusion can gives better performance on complex apps and libs than hand-coding Single application source file using STANDARD C++C++ templates and lambda functions separate host & device code Accelerated code passed into device OpenCL compilers
  • 15. Copyright © Khronos® Group Inc. 2018 - Page 15 Python Client C++ Client Optional C API TensorFlow tensor Kernels (> 800 kernels) ConvolutionsMatrix multiply Eigen Tensors SYCL-BLAS Library SYCL-DNN Library TensorFlow on SYCL / OpenCL State-of-the-art C++ compilers can fuse nodes in vision and neural network graphs to provide optimized performance often faster than hand- coded applications
  • 16. Copyright © Khronos® Group Inc. 2018 - Page 16 Three Broad Inferencing Choices Run in training framework Export to inference runtime Compile to optimized code Acceleration needs high-level programming tools – typically large C++ applications E.g. TensorFlow or TensorFlow Lite The most popular industry choice today. Runtime often uses underlying acceleration APIs E.g. TensorRT, CoreML Often used to merge custom or vision code alongside inferencing runtime – generates LLVM for CPU + accelerated API code
  • 17. Copyright © Khronos® Group Inc. 2018 - Page 17 Platform Inferencing Stacks Microsoft Windows Windows Machine Learning (WinML) Google Android Neural Network API (NNAPI) Apple MacOS and iOS CoreML https://docs.microsoft.com/en-us/windows/uwp/machine-learning/ https://developer.android.com/ndk/guides/neuralnetworks/ https://developer.apple.com/documentation/coreml Core ML Model Consistent Three Steps 1. Import trained NN model file 2. Build optimized version of graph 3. Run graph on accelerated runtime using underlying low-level API
  • 18. Copyright © Khronos® Group Inc. 2018 - Page 18 NNVM - Open Compiler for AI Inferencing http://www.tvmlang.org/2017/08/17/tvm-release-announcement.html SPIR-V IR for parallel accelerators Backend in development LLVM IR for CPUs 1.Import Trained Network Description 2. Graph-level Optimizations 3. Decompose to primitive instructions and emit programs for accelerated run-times Paul G. Allen School of Computer Science & Engineering, University of Washington Facebook Glow Compiler (Graph Lowering Optimizations) https://facebook.ai/developers/tools/glow
  • 19. Copyright © Khronos® Group Inc. 2018 - Page 19 OpenVX PowerEfficiency Computation Flexibility Dedicated Hardware GPU Compute Multi-core CPUX1 X10 X100 Vision DSPs Wide range of vision hardware architectures OpenVX provides a high-level Graph-based abstraction -> Enables Graph-level optimizations! Can be implemented on almost any hardware or processor! -> Portable, Efficient Vision Processing! Vision Node Vision Node Vision NodeVision Node Vision Processing Graph Shipping Implementations
  • 20. Copyright © Khronos® Group Inc. 2018 - Page 20 Extending OpenVX for Inferencing #1 Neural Network Extension • OpenVX Nodes to represent common NN Layers • 1D-4D Tensors to connect layers and common • INT16, INT7.8, INT8, and U8 Tensor Ops vxActivationLayer vxConvolutionLayer vxDeconvolutionLayer vxFullyConnectedLayer vxNormalizationLayer vxPoolingLayer vxSoftmaxLayer vxROIPoolingLayer … Vision Node Vision Node Vision Node Downstream Application Processing Native Camera Control CNN Nodes An OpenVX graph mixing CNN nodes with traditional vision nodes NNEF Translator converts NNEF representation into OpenVX Node Graphs NNEF Translator • Ingests NNEF File and builds OpenVX node graph • Open source project in progress Importing NNEF Neural Network Descriptions
  • 21. Copyright © Khronos® Group Inc. 2018 - Page 21 Extending OpenVX for Inferencing #2 OpenVX/OpenCL Interop • Provisional Extension • Enables custom OpenCL acceleration to be invoked from OpenVX User Kernels • Memory objects can be mapped or copied Kernel/Graph Import • Provisional Extension • Defines container for executable or IR code • Enables arbitrary code to be inserted as a OpenVX Node in a graph OpenCL Command Queue Application cl_mem buffers Fully asynchronous host-device operations during data exchange OpenVX data objects Runtime Runtime Map or copy OpenVX data objects into cl_mem buffers Copy or export cl_mem buffers into OpenVX data objects OpenVX user-kernels can access command queue and cl_mem objects to asynchronously schedule OpenCL kernel execution OpenVX/OpenCL Interop Creating Custom User Nodes
  • 22. Copyright © Khronos® Group Inc. 2018 - Page 22 NNEF and OpenVX for Inferencing Compilation Kernel Import Ingestion Proprietary Runtimes Vision Nodes User Nodes NN Extension Translator Executable Code To mix inferencing with vision and other custom processing Acceleration APIs Many inferencing stacks end up using OpenCL for hardware acceleration Compile to executable code Execute accelerated OpenVX Runtime Compile to IR/Binary
  • 23. Copyright © Khronos® Group Inc. 2018 - Page 23 GPU OpenCL – Unique Heterogeneous Runtime FPGA DSP Custom Hardware GPU CPUCPUCPUGPU Growing number of optimized OpenCL vision and inferencing libraries Vision: OpenCV, Halide, Visioncpp Machine Learning: Xiaomi MACE, Arm Compute Library Linear Algebra: clDNN, clBlast, ViennaCL Application or Inferencing Run-time Fragmented GPU API Landscape OpenCL is the only industry standard for low-level heterogeneous compute Portable control over memory and parallel task execution “The closest you can be to your embedded accelerator and still be portable” Application or Inferencing Run-time
  • 24. Copyright © Khronos® Group Inc. 2018 - Page 24 OpenCL Ecosystem Roadmap 2011 OpenCL 1.2 OpenCL C Kernel Language OpenCL 2.1 SPIR-V in Core 2015 SYCL 1.2 C++11 Single source programming OpenCL 2.2 C++ Kernel Language 2017 SYCL 1.2.1 C++11 Single source programming Bringing Heterogeneous compute to standard ISO C++ Khronos hosting C++17 Parallel STL C++20 Parallel STL with Ranges Proposal Processor Deployment Flexibility Parallel computation across diverse processor architectures Kernel Deployment Flexibility Execute OpenCL C kernels on Vulkan GPU runtimes OpenCL has an active three track roadmap
  • 25. Copyright © Khronos® Group Inc. 2018 - Page 25 OpenCL Next - Feature Set Flexibility • Defining OpenCL features that become optional for enhanced deployment flexibility - API and language features e.g. floating point precisions • Feature Sets avoid fragmentation - Defined to suit specific markets – e.g. desktop, embedded vision and inferencing • Implementations are conformant if fully support feature set functionality OpenCL 2.2 Functionality = queryable, optional feature Khronos-defined OpenCL 2.2 Full Profile Feature Set Khronos-defined OpenCL 1.2 Full Profile Feature Set Industry-defined Feature Set E.g. Embedded Vision and Inferencing
  • 26. Copyright © Khronos® Group Inc. 2018 - Page 26 Universal Deployment Flexibility Open source SPIRV-Cross converts SPIR-V to MSL or HLSLClspv and clvk open source tools OpenCL Programs Native Vulkan Drivers UWP and D3D based Consoles Open source shims convert Vulkan to Metal or D3D API calls Open source tools enable OpenCL and Vulkan apps to be increasingly deployed on any platform