Enviar pesquisa
Carregar
Q4.11: NEON Intrinsics
•
4 gostaram
•
6,911 visualizações
Linaro
Seguir
Resource: Q4.11 Name: NEON Intrinsics Date: 28-11-2011 Speaker: Michael Hope
Leia menos
Leia mais
Tecnologia
Denunciar
Compartilhar
Denunciar
Compartilhar
1 de 26
Baixar agora
Baixar para ler offline
Recomendados
LLVM Register Allocation
LLVM Register Allocation
Wang Hsiangkai
Introduction to gdb
Introduction to gdb
Owen Hsu
LLVM Instruction Selection
LLVM Instruction Selection
Shiva Chen
QEMU - Binary Translation
QEMU - Binary Translation
Jiann-Fuh Liaw
Instruction Combine in LLVM
Instruction Combine in LLVM
Wang Hsiangkai
from Source to Binary: How GNU Toolchain Works
from Source to Binary: How GNU Toolchain Works
National Cheng Kung University
from Binary to Binary: How Qemu Works
from Binary to Binary: How Qemu Works
Zhen Wei
ARM LinuxのMMUはわかりにくい
ARM LinuxのMMUはわかりにくい
wata2ki
Mais conteúdo relacionado
Mais procurados
GCC LTO
GCC LTO
Wang Hsiangkai
How to write a TableGen backend
How to write a TableGen backend
Min-Yih Hsu
Qemu JIT Code Generator and System Emulation
Qemu JIT Code Generator and System Emulation
National Cheng Kung University
プログラムを高速化する話
プログラムを高速化する話
京大 マイコンクラブ
GDB Rocks!
GDB Rocks!
Kent Chen
Blazing Performance with Flame Graphs
Blazing Performance with Flame Graphs
Brendan Gregg
はじめての4,500円OS無しジャンクIntel MacbookにLinux,NetBSDインストール iin OSC 2020 Online/Kyoto...
はじめての4,500円OS無しジャンクIntel MacbookにLinux,NetBSDインストール iin OSC 2020 Online/Kyoto...
Netwalker lab kapper
Qemu device prototyping
Qemu device prototyping
Yan Vugenfirer
ret2dl resolve
ret2dl resolve
sounakano
LLVM Register Allocation (2nd Version)
LLVM Register Allocation (2nd Version)
Wang Hsiangkai
Dpdk applications
Dpdk applications
Vipin Varghese
[嵌入式系統] 嵌入式系統進階
[嵌入式系統] 嵌入式系統進階
Simen Li
中3女子でもわかる constexpr
中3女子でもわかる constexpr
Genya Murakami
CUDAのアセンブリ言語基礎のまとめ PTXとSASSの概説
CUDAのアセンブリ言語基礎のまとめ PTXとSASSの概説
Takateru Yamagishi
C++ マルチスレッド 入門
C++ マルチスレッド 入門
京大 マイコンクラブ
組み込み関数(intrinsic)によるSIMD入門
組み込み関数(intrinsic)によるSIMD入門
Norishige Fukushima
Lockfree Queue
Lockfree Queue
Kumazaki Hiroki
中3女子が狂える本当に気持ちのいい constexpr
中3女子が狂える本当に気持ちのいい constexpr
Genya Murakami
Glibc malloc internal
Glibc malloc internal
Motohiro KOSAKI
冬のLock free祭り safe
冬のLock free祭り safe
Kumazaki Hiroki
Mais procurados
(20)
GCC LTO
GCC LTO
How to write a TableGen backend
How to write a TableGen backend
Qemu JIT Code Generator and System Emulation
Qemu JIT Code Generator and System Emulation
プログラムを高速化する話
プログラムを高速化する話
GDB Rocks!
GDB Rocks!
Blazing Performance with Flame Graphs
Blazing Performance with Flame Graphs
はじめての4,500円OS無しジャンクIntel MacbookにLinux,NetBSDインストール iin OSC 2020 Online/Kyoto...
はじめての4,500円OS無しジャンクIntel MacbookにLinux,NetBSDインストール iin OSC 2020 Online/Kyoto...
Qemu device prototyping
Qemu device prototyping
ret2dl resolve
ret2dl resolve
LLVM Register Allocation (2nd Version)
LLVM Register Allocation (2nd Version)
Dpdk applications
Dpdk applications
[嵌入式系統] 嵌入式系統進階
[嵌入式系統] 嵌入式系統進階
中3女子でもわかる constexpr
中3女子でもわかる constexpr
CUDAのアセンブリ言語基礎のまとめ PTXとSASSの概説
CUDAのアセンブリ言語基礎のまとめ PTXとSASSの概説
C++ マルチスレッド 入門
C++ マルチスレッド 入門
組み込み関数(intrinsic)によるSIMD入門
組み込み関数(intrinsic)によるSIMD入門
Lockfree Queue
Lockfree Queue
中3女子が狂える本当に気持ちのいい constexpr
中3女子が狂える本当に気持ちのいい constexpr
Glibc malloc internal
Glibc malloc internal
冬のLock free祭り safe
冬のLock free祭り safe
Destaque
Q4.11: Using GCC Auto-Vectorizer
Q4.11: Using GCC Auto-Vectorizer
Linaro
Moving NEON to 64 bits
Moving NEON to 64 bits
Chiou-Nan Chen
GCC for ARMv8 Aarch64
GCC for ARMv8 Aarch64
Yi-Hsiu Hsu
COMPLETE DETAIL ABOUT ARM PART1
COMPLETE DETAIL ABOUT ARM PART1
NOWAY
中華チップ全盛時代のARM SoCの選び方_公開版
中華チップ全盛時代のARM SoCの選び方_公開版
kinneko
64-bit Android
64-bit Android
Chiou-Nan Chen
LAS16-406: Android Widevine on OP-TEE
LAS16-406: Android Widevine on OP-TEE
Linaro
Software, Over the Air (SOTA) for Automotive Grade Linux (AGL)
Software, Over the Air (SOTA) for Automotive Grade Linux (AGL)
Leon Anavi
EXAME-PARTE-II
EXAME-PARTE-II
Neon Online
LAS16-504: Secure Storage updates in OP-TEE
LAS16-504: Secure Storage updates in OP-TEE
Linaro
Introduction to Optee (26 may 2016)
Introduction to Optee (26 may 2016)
Yannick Gicquel
SFO15-503: Secure storage in OP-TEE
SFO15-503: Secure storage in OP-TEE
Linaro
Introduction to armv8 aarch64
Introduction to armv8 aarch64
Yi-Hsiu Hsu
BKK16-110 A Gentle Introduction to Trusted Execution and OP-TEE
BKK16-110 A Gentle Introduction to Trusted Execution and OP-TEE
Linaro
LCU14-103: How to create and run Trusted Applications on OP-TEE
LCU14-103: How to create and run Trusted Applications on OP-TEE
Linaro
HKG15-311: OP-TEE for Beginners and Porting Review
HKG15-311: OP-TEE for Beginners and Porting Review
Linaro
LAS16-111: Easing Access to ARM TrustZone – OP-TEE and Raspberry Pi 3
LAS16-111: Easing Access to ARM TrustZone – OP-TEE and Raspberry Pi 3
Linaro
Arm v8 instruction overview android 64 bit briefing
Arm v8 instruction overview android 64 bit briefing
Merck Hung
BUD17-DF15 - Optimized Android N MR1 + 4.9 Kernel
BUD17-DF15 - Optimized Android N MR1 + 4.9 Kernel
Linaro
XPDS16: Porting Xen on ARM to a new SOC - Julien Grall, ARM
XPDS16: Porting Xen on ARM to a new SOC - Julien Grall, ARM
The Linux Foundation
Destaque
(20)
Q4.11: Using GCC Auto-Vectorizer
Q4.11: Using GCC Auto-Vectorizer
Moving NEON to 64 bits
Moving NEON to 64 bits
GCC for ARMv8 Aarch64
GCC for ARMv8 Aarch64
COMPLETE DETAIL ABOUT ARM PART1
COMPLETE DETAIL ABOUT ARM PART1
中華チップ全盛時代のARM SoCの選び方_公開版
中華チップ全盛時代のARM SoCの選び方_公開版
64-bit Android
64-bit Android
LAS16-406: Android Widevine on OP-TEE
LAS16-406: Android Widevine on OP-TEE
Software, Over the Air (SOTA) for Automotive Grade Linux (AGL)
Software, Over the Air (SOTA) for Automotive Grade Linux (AGL)
EXAME-PARTE-II
EXAME-PARTE-II
LAS16-504: Secure Storage updates in OP-TEE
LAS16-504: Secure Storage updates in OP-TEE
Introduction to Optee (26 may 2016)
Introduction to Optee (26 may 2016)
SFO15-503: Secure storage in OP-TEE
SFO15-503: Secure storage in OP-TEE
Introduction to armv8 aarch64
Introduction to armv8 aarch64
BKK16-110 A Gentle Introduction to Trusted Execution and OP-TEE
BKK16-110 A Gentle Introduction to Trusted Execution and OP-TEE
LCU14-103: How to create and run Trusted Applications on OP-TEE
LCU14-103: How to create and run Trusted Applications on OP-TEE
HKG15-311: OP-TEE for Beginners and Porting Review
HKG15-311: OP-TEE for Beginners and Porting Review
LAS16-111: Easing Access to ARM TrustZone – OP-TEE and Raspberry Pi 3
LAS16-111: Easing Access to ARM TrustZone – OP-TEE and Raspberry Pi 3
Arm v8 instruction overview android 64 bit briefing
Arm v8 instruction overview android 64 bit briefing
BUD17-DF15 - Optimized Android N MR1 + 4.9 Kernel
BUD17-DF15 - Optimized Android N MR1 + 4.9 Kernel
XPDS16: Porting Xen on ARM to a new SOC - Julien Grall, ARM
XPDS16: Porting Xen on ARM to a new SOC - Julien Grall, ARM
Semelhante a Q4.11: NEON Intrinsics
AMP Kynetics - ELC 2018 Portland
AMP Kynetics - ELC 2018 Portland
Kynetics
Asymmetric Multiprocessing - Kynetics ELC 2018 portland
Asymmetric Multiprocessing - Kynetics ELC 2018 portland
Nicola La Gloria
Tiny ML for spark Fun Edge
Tiny ML for spark Fun Edge
艾鍗科技
Heterogeneous multiprocessing on androd and i.mx7
Heterogeneous multiprocessing on androd and i.mx7
Kynetics
Haskell Symposium 2010: An LLVM backend for GHC
Haskell Symposium 2010: An LLVM backend for GHC
dterei
The Past, Present, and Future of OpenACC
The Past, Present, and Future of OpenACC
inside-BigData.com
Challenges in GPU compilers
Challenges in GPU compilers
AnastasiaStulova
Introduction to Parallelization and performance optimization
Introduction to Parallelization and performance optimization
CSUC - Consorci de Serveis Universitaris de Catalunya
openmpfinal.pdf
openmpfinal.pdf
GopalPatidar13
OpenMP.pptx
OpenMP.pptx
MunimAkhtarChoudhury
Mirage: ML kernels in the cloud (ML Workshop 2010)
Mirage: ML kernels in the cloud (ML Workshop 2010)
Anil Madhavapeddy
SNAP MACHINE LEARNING
SNAP MACHINE LEARNING
Ganesan Narayanasamy
不深不淺,帶你認識 LLVM (Found LLVM in your life)
不深不淺,帶你認識 LLVM (Found LLVM in your life)
Douglas Chen
Understand and Harness the Capabilities of Intel® Xeon Phi™ Processors
Understand and Harness the Capabilities of Intel® Xeon Phi™ Processors
Intel® Software
Some experiences for porting application to Intel Xeon Phi
Some experiences for porting application to Intel Xeon Phi
Maho Nakata
Multicore
Multicore
Birgit Plötzeneder
CS4961-L9.ppt
CS4961-L9.ppt
MarlonMagtibay2
BKK16-302: Android Optimizing Compiler: New Member Assimilation Guide
BKK16-302: Android Optimizing Compiler: New Member Assimilation Guide
Linaro
Large-Scale Optimization Strategies for Typical HPC Workloads
Large-Scale Optimization Strategies for Typical HPC Workloads
inside-BigData.com
100Gbps OpenStack For Providing High-Performance NFV
100Gbps OpenStack For Providing High-Performance NFV
NTT Communications Technology Development
Semelhante a Q4.11: NEON Intrinsics
(20)
AMP Kynetics - ELC 2018 Portland
AMP Kynetics - ELC 2018 Portland
Asymmetric Multiprocessing - Kynetics ELC 2018 portland
Asymmetric Multiprocessing - Kynetics ELC 2018 portland
Tiny ML for spark Fun Edge
Tiny ML for spark Fun Edge
Heterogeneous multiprocessing on androd and i.mx7
Heterogeneous multiprocessing on androd and i.mx7
Haskell Symposium 2010: An LLVM backend for GHC
Haskell Symposium 2010: An LLVM backend for GHC
The Past, Present, and Future of OpenACC
The Past, Present, and Future of OpenACC
Challenges in GPU compilers
Challenges in GPU compilers
Introduction to Parallelization and performance optimization
Introduction to Parallelization and performance optimization
openmpfinal.pdf
openmpfinal.pdf
OpenMP.pptx
OpenMP.pptx
Mirage: ML kernels in the cloud (ML Workshop 2010)
Mirage: ML kernels in the cloud (ML Workshop 2010)
SNAP MACHINE LEARNING
SNAP MACHINE LEARNING
不深不淺,帶你認識 LLVM (Found LLVM in your life)
不深不淺,帶你認識 LLVM (Found LLVM in your life)
Understand and Harness the Capabilities of Intel® Xeon Phi™ Processors
Understand and Harness the Capabilities of Intel® Xeon Phi™ Processors
Some experiences for porting application to Intel Xeon Phi
Some experiences for porting application to Intel Xeon Phi
Multicore
Multicore
CS4961-L9.ppt
CS4961-L9.ppt
BKK16-302: Android Optimizing Compiler: New Member Assimilation Guide
BKK16-302: Android Optimizing Compiler: New Member Assimilation Guide
Large-Scale Optimization Strategies for Typical HPC Workloads
Large-Scale Optimization Strategies for Typical HPC Workloads
100Gbps OpenStack For Providing High-Performance NFV
100Gbps OpenStack For Providing High-Performance NFV
Mais de Linaro
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Linaro
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta Vekaria
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta Vekaria
Linaro
Huawei’s requirements for the ARM based HPC solution readiness - Joshua Mora
Huawei’s requirements for the ARM based HPC solution readiness - Joshua Mora
Linaro
Bud17 113: distribution ci using qemu and open qa
Bud17 113: distribution ci using qemu and open qa
Linaro
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018
Linaro
HPC network stack on ARM - Linaro HPC Workshop 2018
HPC network stack on ARM - Linaro HPC Workshop 2018
Linaro
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...
Linaro
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Linaro
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...
Linaro
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Linaro
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
Linaro
HKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening Keynote
Linaro
HKG18-318 - OpenAMP Workshop
HKG18-318 - OpenAMP Workshop
Linaro
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
Linaro
HKG18-315 - Why the ecosystem is a wonderful thing, warts and all
HKG18-315 - Why the ecosystem is a wonderful thing, warts and all
Linaro
HKG18- 115 - Partitioning ARM Systems with the Jailhouse Hypervisor
HKG18- 115 - Partitioning ARM Systems with the Jailhouse Hypervisor
Linaro
HKG18-TR08 - Upstreaming SVE in QEMU
HKG18-TR08 - Upstreaming SVE in QEMU
Linaro
HKG18-113- Secure Data Path work with i.MX8M
HKG18-113- Secure Data Path work with i.MX8M
Linaro
HKG18-120 - Devicetree Schema Documentation and Validation
HKG18-120 - Devicetree Schema Documentation and Validation
Linaro
HKG18-223 - Trusted FirmwareM: Trusted boot
HKG18-223 - Trusted FirmwareM: Trusted boot
Linaro
Mais de Linaro
(20)
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta Vekaria
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta Vekaria
Huawei’s requirements for the ARM based HPC solution readiness - Joshua Mora
Huawei’s requirements for the ARM based HPC solution readiness - Joshua Mora
Bud17 113: distribution ci using qemu and open qa
Bud17 113: distribution ci using qemu and open qa
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018
HPC network stack on ARM - Linaro HPC Workshop 2018
HPC network stack on ARM - Linaro HPC Workshop 2018
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening Keynote
HKG18-318 - OpenAMP Workshop
HKG18-318 - OpenAMP Workshop
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-315 - Why the ecosystem is a wonderful thing, warts and all
HKG18-315 - Why the ecosystem is a wonderful thing, warts and all
HKG18- 115 - Partitioning ARM Systems with the Jailhouse Hypervisor
HKG18- 115 - Partitioning ARM Systems with the Jailhouse Hypervisor
HKG18-TR08 - Upstreaming SVE in QEMU
HKG18-TR08 - Upstreaming SVE in QEMU
HKG18-113- Secure Data Path work with i.MX8M
HKG18-113- Secure Data Path work with i.MX8M
HKG18-120 - Devicetree Schema Documentation and Validation
HKG18-120 - Devicetree Schema Documentation and Validation
HKG18-223 - Trusted FirmwareM: Trusted boot
HKG18-223 - Trusted FirmwareM: Trusted boot
Último
Nanopower In Semiconductor Industry.pdf
Nanopower In Semiconductor Industry.pdf
Pedro Manuel
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
Matt Ray
9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding Team
Adam Moalla
NIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 Workshop
Bachir Benyammi
201610817 - edge part1
201610817 - edge part1
Jamie (Taka) Wang
AI Fame Rush Review – Virtual Influencer Creation In Just Minutes
AI Fame Rush Review – Virtual Influencer Creation In Just Minutes
Md Hossain Ali
How Accurate are Carbon Emissions Projections?
How Accurate are Carbon Emissions Projections?
IES VE
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
DianaGray10
UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6
DianaGray10
UiPath Community: AI for UiPath Automation Developers
UiPath Community: AI for UiPath Automation Developers
UiPathCommunity
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
IES VE
Bird eye's view on Camunda open source ecosystem
Bird eye's view on Camunda open source ecosystem
Asko Soukka
Comparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and Istio
Christian Posta
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
shyamraj55
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
IES VE
Cybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptx
GDSC PJATK
OpenShift Commons Paris - Choose Your Own Observability Adventure
OpenShift Commons Paris - Choose Your Own Observability Adventure
Eric D. Schabell
Secure your environment with UiPath and CyberArk technologies - Session 1
Secure your environment with UiPath and CyberArk technologies - Session 1
DianaGray10
Machine Learning Model Validation (Aijun Zhang 2024).pdf
Machine Learning Model Validation (Aijun Zhang 2024).pdf
Aijun Zhang
20150722 - AGV
20150722 - AGV
Jamie (Taka) Wang
Último
(20)
Nanopower In Semiconductor Industry.pdf
Nanopower In Semiconductor Industry.pdf
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding Team
NIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 Workshop
201610817 - edge part1
201610817 - edge part1
AI Fame Rush Review – Virtual Influencer Creation In Just Minutes
AI Fame Rush Review – Virtual Influencer Creation In Just Minutes
How Accurate are Carbon Emissions Projections?
How Accurate are Carbon Emissions Projections?
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6
UiPath Community: AI for UiPath Automation Developers
UiPath Community: AI for UiPath Automation Developers
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
Bird eye's view on Camunda open source ecosystem
Bird eye's view on Camunda open source ecosystem
Comparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and Istio
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
Cybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptx
OpenShift Commons Paris - Choose Your Own Observability Adventure
OpenShift Commons Paris - Choose Your Own Observability Adventure
Secure your environment with UiPath and CyberArk technologies - Session 1
Secure your environment with UiPath and CyberArk technologies - Session 1
Machine Learning Model Validation (Aijun Zhang 2024).pdf
Machine Learning Model Validation (Aijun Zhang 2024).pdf
20150722 - AGV
20150722 - AGV
Q4.11: NEON Intrinsics
1.
Michael Hope, Toolchain bzr
branch lp:~michaelh1/+junk/intrinsics-demo NEON Intrinsics
2.
What's NEON? ●Ch 19
'Introducting NEON' http://infocenter.arm.com/help/topic/com.arm.doc.den0013a/
3.
SIMD is... Same instruction,
many values Anything involving signals is great for SIMD
4.
Normalisation
5.
● Easier to
read and write ● Easier (better?) register allocation ● Compiler knows how to schedule ● ABI neutral Advantages
6.
Works across compilers >
gcc-mcpu=cortex-a9 -mfpu=neon -O3 -c test.c > armcc --cpu Cortex-A9 --c99 -O3 -c test.c > clang -mcpu=cortex-a9 -mfpu=neon -O3 -c test.c
7.
Tune for the
architecture -mtune=cortex-a9 -mtune=cortex-a8 -mtune=cortex-a5
8.
SMS, unrolling, profiling?
9.
Writing
10.
Environment #include <arm_neon.h> gcc -march=armv7-a
-mfpu=neon
11.
Data types <type>x<lanes>_t (uint8x4_t) <type>x<lanes>x<#
registers>_t (int16x2x4_t)
12.
Some Instructions
13.
Add uint16x4_t vadd_u16 ( uint16x4_t
left, uint16x4_t right )
14.
Multiply uint64x2_t vmlal_u32 (uint64x2_t, uint32x2_t, uint32x2_t) int32x4_t
vqdmlal_s16 (int32x4_t, int16x4_t, int16x4_t)
15.
Strided load uint8x8x2_t vld2_u8 (const
uint8_t *) Form of expected instruction(s): vld2.8 {d0, d1}, [r0]
16.
Documentation GCC http://gcc.gnu.org/onlinedocs/gcc/ARM-NEON-Intrinsics.html ARM http://infocenter.arm.com/help/topic/com.arm.doc.den0013a Blog posts Search for
“Coding with NEON” on http://blogs.arm.com
17.
Writing
18.
Colour space conversion Y
= 0.2126 R + 0.7152 G + 0.0722 B HD television (ITU BT.709)
19.
Versions
20.
Nils Pipenbrinck http://hilbert-space.de/?p=22
24.
Performance Plain C 48.481 s Assembly 8.727
s (5.55 x faster) Intrinsics 8.728 s (5.55 x faster)
25.
Bigger Routines “libpixelflinger: Add
ARM NEON optimized scanline_t32cb16” http://wiki.linaro.org/RichardSandiford/Sandbox/IntrinsicsPerformance Hand-written 2.831 s Intrinsics 2.637 s (7.4 % faster)
Baixar agora