SlideShare uma empresa Scribd logo
1 de 27
Baixar para ler offline
HSA FOR APPLICATION
PROGRAMMING
Wen-mei W. Hwu
CTO, MulticoreWare, Inc.
Professor University of Illinois, Urbana-Champaign
CURRENT GPU COMPUTING PAIN
POINTS


Kernel launch overhead



Limited virtual and physical memory space



Extra data movement in I/O and networking



Tedious host code



Multiple source code development
KERNEL LAUNCH OVERHEAD
DESIRED DATA TRANSFER BEHAVIOR
Main Memory
(DRAM)

Network I/O

CPU
Disk I/O

Device
Memory

DMA

GPU card
(or other Accelerator cards)
SAMOS 2013
ACTUAL DATA TRANSFER BEHAVIOR

Main Memory
(DRAM)

Each additional copy
diminishes applicationperceived bandwidth
Network I/O

CPU
Disk I/O

Device
Memory

DMA

GPU card
(or other Accelerator cards)
SAMOS 2013
HSA COMMON ADDRESS SPACE
STANDARDIZED USER-LEVEL QUEUES
LOW OVERHEAD KERNEL LAUNCH
DYNAMIC COMPILATION AND BINARY
COMPATIBLITY
FAMILIAR COMPUTATION ORGANIZATION
EXAMPLE: COMPUTER VISION
EXAMPLE EXECUTION TIME
TOOLS WORKING GROUP


Tools Group will be looking at foundation for Developer Tools (compilation,
debugging and performance analysis)




Working with the working group to insure the right interface are in place Compilation,
Debugging, and Performance Analysis initially
We act as stewards for



HSA functional simulator with GDB support (AMD/MCW)



HSA performance simulator (AMD)





LLVM backend for generating HSAIL (AMD/MCW)

Loader Library for Simulator - BRIG object loading support ( AMD)

Longer Terms will spawn sub-group that will drive requirement for language
support for HSA


C, C++, C++ AMP



Java (JVM, Dalvik)



Python



JavaScript



DSL…



…
Hwu 2013
CURRENT TIME TABLE


Kickoff Conference and Refined Charter –
November 2013



Tools Roadmap – December 2013



Initial Tools – December 2013



Initial Language Tool Chains – January 2013

Please join us and contribute!
Hwu 2013
HAS BOOK IN THE MAKING

Heterogeneous System Architecture
For Application Programming


Audience
Performance application developers
 System architects
 Component architects
 Software stack developers


Hwu 2013
EDITORIAL BOARD


Gaster, Benedict – Qualcomm



Hegde Manju - AMD



Hwu, Wen-mei - MulticoreWare/UIUC



Jablin, Thomas – MultcoreWare



Lokhmotov, Anton – ARM



Lu, Chien-Ping – MediaTek



Whitecotton, Bob - AMD

Hwu 2013
CURRENT TABLE OF CONTENTS


HSA Overview (50 pages)



Chapter 1: What is HSA? (Hwu/Jablin)



Chapter 2: The HSA Architecture (Hwu/Jablin/Others)



Chapter 3: A Programmer's View of HSA (Hwu/Jablin)





Chapter 4: The Tools Framework for HAS,
(Hwu/Jablin/Others)
Chapter 5: Mapping of Classical algorithms onto the
HSA Architecture (Hwu/Jablin)
Hwu 2013
CURRENT TABLE OF CONTENTS (CONT.)



Representative Workloads:
Chapter 6: Photography, Minh Do, Professor UIUC, CTO
Personify



Chapter 7: Video-Audio Search, Ren Wu, Baidu



Chapter 8: Augmented Reality Chein-ping Lu, MeidaTek





Chapter 9: Biometrics (Face Detection) Mike Jones, MERL
& Harris Gasparakis, AMD
Chapter 10: Audio-Video processing Bill Herz, Sr. Fellow
AMD
Hwu 2013
CURRENT TABLE OF CONTENTS (CONT.)











Chapter 11: Ray-Tracing, TBD
Chapter 12: Natural user Interfaces, Navneett
CEO Flutter

Dallal,

Chapter 13: Physics Processing, Ronald Fedkiw,
Professor Stanford U. and Eftychios Sifakis
Chapter 14: Graphics enhancement, Ignacio Vargas, CTO,
Nextlimit (Interested)
Chapter 15: Communications and networking, Don BanksCisco (Interested)
Chapter 16: Data/Business Analytics , Zubin Dowlaty,
CTO Mu Sigma
Hwu 2013
CURRENT TABLE OF CONTENTS (CONT.)









Chapter 17: Hadoop, Memcached and cloud frameworks,
Bharath Mundlapudi, CTO Orzota

Chapter 18: Bioinformatics Wu Feng, VirginiaTech
Chapter 19: Scientific Computations, Byunghyun Jang,
Univ Miss.
Chapter 20: Computer Aided Design and Engineering,
Martin Wong, Professor UIUC

Chapter 21: Oil and Gas, Nacho Navarro, UPC/BSC
(interested)

Hwu 2013
CURRENT TABLE OF CONTENTS (CONT.)








Chapter 22: Financial Services & Analysis, Surra
Yanamadala, CA VP
Chapter 23: Computer Vision facial expression,
Elnar Hajiyev, Realeye

Chapter 24: Legacy Code Interoperability, Wenmei and Tom Jablin, MCW
Chapter 25: Java for HSA, Gary Frost, AMD

Hwu 2013
MAJOR UPCOMING MILESTONES


Author Algorithm and Code Due – February 2014



Author Chapters Due – April 2014



Review Due Back to Authors – May 2014



Chapters to Elsevier – June 2014



Books available – October 2014

Hwu 2013
THANK YOU!
ANY MORE QUESTIONS?

Hwu 2013

Mais conteúdo relacionado

Destaque

SWFs and Sustainable Infrastructure
SWFs and Sustainable InfrastructureSWFs and Sustainable Infrastructure
SWFs and Sustainable InfrastructureJ_Schmitz
 
Buy Nevada Holiday Gift Guide 2014-15
Buy Nevada Holiday Gift Guide 2014-15Buy Nevada Holiday Gift Guide 2014-15
Buy Nevada Holiday Gift Guide 2014-15Nevada Agriculture
 
2014.05.14 cofely fabricom maintenance mechanical workshops brochure
2014.05.14   cofely fabricom maintenance mechanical workshops brochure2014.05.14   cofely fabricom maintenance mechanical workshops brochure
2014.05.14 cofely fabricom maintenance mechanical workshops brochureStefanieDeWilde
 
Redex formato encuesta
Redex formato encuestaRedex formato encuesta
Redex formato encuestagohan_go
 
Science Update - No 259 - May 2015
Science Update - No 259 - May 2015Science Update - No 259 - May 2015
Science Update - No 259 - May 2015DOILibrary1151
 
Diplomado Liderazgo Sindical Femenino
Diplomado Liderazgo Sindical FemeninoDiplomado Liderazgo Sindical Femenino
Diplomado Liderazgo Sindical FemeninoComunidadMujer Cm
 
Kultura zientifikoa
Kultura zientifikoaKultura zientifikoa
Kultura zientifikoaiazpiro1
 
Los proyectos de los filósofos
Los proyectos de los filósofosLos proyectos de los filósofos
Los proyectos de los filósofosROUSEBARBER
 
Como Guardar y Consultar Presentaciones Zip, y Pdf
Como Guardar y Consultar Presentaciones  Zip, y  PdfComo Guardar y Consultar Presentaciones  Zip, y  Pdf
Como Guardar y Consultar Presentaciones Zip, y Pdfgrodriguezcu
 
Proyecto Social: Cuidado y Protección Social
Proyecto Social: Cuidado y Protección SocialProyecto Social: Cuidado y Protección Social
Proyecto Social: Cuidado y Protección SocialkRyss
 
Tipos de motores eléctricos
Tipos de motores eléctricosTipos de motores eléctricos
Tipos de motores eléctricosEIYSC
 
PEG y RCIU
PEG y RCIUPEG y RCIU
PEG y RCIUlablih
 
Sociedad anonima exposicion
Sociedad anonima exposicionSociedad anonima exposicion
Sociedad anonima exposicionkathom
 

Destaque (17)

SWFs and Sustainable Infrastructure
SWFs and Sustainable InfrastructureSWFs and Sustainable Infrastructure
SWFs and Sustainable Infrastructure
 
Buy Nevada Holiday Gift Guide 2014-15
Buy Nevada Holiday Gift Guide 2014-15Buy Nevada Holiday Gift Guide 2014-15
Buy Nevada Holiday Gift Guide 2014-15
 
Boletín Oficial 2013 08-16
Boletín Oficial 2013 08-16Boletín Oficial 2013 08-16
Boletín Oficial 2013 08-16
 
Analisis reforma
Analisis reforma Analisis reforma
Analisis reforma
 
2014.05.14 cofely fabricom maintenance mechanical workshops brochure
2014.05.14   cofely fabricom maintenance mechanical workshops brochure2014.05.14   cofely fabricom maintenance mechanical workshops brochure
2014.05.14 cofely fabricom maintenance mechanical workshops brochure
 
Redex formato encuesta
Redex formato encuestaRedex formato encuesta
Redex formato encuesta
 
Science Update - No 259 - May 2015
Science Update - No 259 - May 2015Science Update - No 259 - May 2015
Science Update - No 259 - May 2015
 
Diplomado Liderazgo Sindical Femenino
Diplomado Liderazgo Sindical FemeninoDiplomado Liderazgo Sindical Femenino
Diplomado Liderazgo Sindical Femenino
 
Kultura zientifikoa
Kultura zientifikoaKultura zientifikoa
Kultura zientifikoa
 
Los proyectos de los filósofos
Los proyectos de los filósofosLos proyectos de los filósofos
Los proyectos de los filósofos
 
Como Guardar y Consultar Presentaciones Zip, y Pdf
Como Guardar y Consultar Presentaciones  Zip, y  PdfComo Guardar y Consultar Presentaciones  Zip, y  Pdf
Como Guardar y Consultar Presentaciones Zip, y Pdf
 
Proyecto Social: Cuidado y Protección Social
Proyecto Social: Cuidado y Protección SocialProyecto Social: Cuidado y Protección Social
Proyecto Social: Cuidado y Protección Social
 
Tipos de motores eléctricos
Tipos de motores eléctricosTipos de motores eléctricos
Tipos de motores eléctricos
 
PEG y RCIU
PEG y RCIUPEG y RCIU
PEG y RCIU
 
GRUPO DE A TERCERA FASE: PROFOCOM
GRUPO DE A TERCERA FASE: PROFOCOMGRUPO DE A TERCERA FASE: PROFOCOM
GRUPO DE A TERCERA FASE: PROFOCOM
 
Sociedad anonima exposicion
Sociedad anonima exposicionSociedad anonima exposicion
Sociedad anonima exposicion
 
Whatsup
WhatsupWhatsup
Whatsup
 

Semelhante a HSA-4130, HSA for Application Programming, by Wen Mei Hwu

assignment3
assignment3assignment3
assignment3Kirti J
 
What it takes to run Hadoop at Scale: Yahoo! Perspectives
What it takes to run Hadoop at Scale: Yahoo! PerspectivesWhat it takes to run Hadoop at Scale: Yahoo! Perspectives
What it takes to run Hadoop at Scale: Yahoo! PerspectivesDataWorks Summit
 
Presentation
PresentationPresentation
Presentationbutest
 
Hive @ Hadoop day seattle_2010
Hive @ Hadoop day seattle_2010Hive @ Hadoop day seattle_2010
Hive @ Hadoop day seattle_2010nzhang
 
Big Data, Simple and Fast: Addressing the Shortcomings of Hadoop
Big Data, Simple and Fast: Addressing the Shortcomings of HadoopBig Data, Simple and Fast: Addressing the Shortcomings of Hadoop
Big Data, Simple and Fast: Addressing the Shortcomings of HadoopHazelcast
 
Is Multicore Hardware For General-Purpose Parallel Processing Broken? : Notes
Is Multicore Hardware For General-Purpose Parallel Processing Broken? : NotesIs Multicore Hardware For General-Purpose Parallel Processing Broken? : Notes
Is Multicore Hardware For General-Purpose Parallel Processing Broken? : NotesSubhajit Sahu
 
Worldranking universities final documentation
Worldranking universities final documentationWorldranking universities final documentation
Worldranking universities final documentationBhadra Gowdra
 
"Big Data" Bioinformatics
"Big Data" Bioinformatics"Big Data" Bioinformatics
"Big Data" BioinformaticsBrian Repko
 
A Collaborative Research Proposal To The NSF Research Accelerator For Multip...
A Collaborative Research Proposal To The NSF  Research Accelerator For Multip...A Collaborative Research Proposal To The NSF  Research Accelerator For Multip...
A Collaborative Research Proposal To The NSF Research Accelerator For Multip...Scott Donald
 
Hadoop @ Sara & BiG Grid
Hadoop @ Sara & BiG GridHadoop @ Sara & BiG Grid
Hadoop @ Sara & BiG GridEvert Lammerts
 
Empowering Transformational Science
Empowering Transformational ScienceEmpowering Transformational Science
Empowering Transformational ScienceChelle Gentemann
 
Cloud Computing Ambiance using Secluded Access Control Method
Cloud Computing Ambiance using Secluded Access Control MethodCloud Computing Ambiance using Secluded Access Control Method
Cloud Computing Ambiance using Secluded Access Control MethodIRJET Journal
 
Sjug #26 ml is in java but is dl too - ver1.04 - tomasz sikora 2018-03-23
Sjug #26   ml is in java but is dl too - ver1.04 - tomasz sikora 2018-03-23Sjug #26   ml is in java but is dl too - ver1.04 - tomasz sikora 2018-03-23
Sjug #26 ml is in java but is dl too - ver1.04 - tomasz sikora 2018-03-23Tomasz Sikora
 
Overview of big data & hadoop v1
Overview of big data & hadoop   v1Overview of big data & hadoop   v1
Overview of big data & hadoop v1Thanh Nguyen
 
HSA Introduction Hot Chips 2013
HSA Introduction  Hot Chips 2013HSA Introduction  Hot Chips 2013
HSA Introduction Hot Chips 2013HSA Foundation
 
Hadoop at Yahoo! -- University Talks
Hadoop at Yahoo! -- University TalksHadoop at Yahoo! -- University Talks
Hadoop at Yahoo! -- University Talksyhadoop
 

Semelhante a HSA-4130, HSA for Application Programming, by Wen Mei Hwu (20)

assignment3
assignment3assignment3
assignment3
 
What it takes to run Hadoop at Scale: Yahoo! Perspectives
What it takes to run Hadoop at Scale: Yahoo! PerspectivesWhat it takes to run Hadoop at Scale: Yahoo! Perspectives
What it takes to run Hadoop at Scale: Yahoo! Perspectives
 
Overview of the Data Processing Error Analysis System (DPEAS)
Overview of the Data Processing Error Analysis System (DPEAS)Overview of the Data Processing Error Analysis System (DPEAS)
Overview of the Data Processing Error Analysis System (DPEAS)
 
Presentation
PresentationPresentation
Presentation
 
Hive @ Hadoop day seattle_2010
Hive @ Hadoop day seattle_2010Hive @ Hadoop day seattle_2010
Hive @ Hadoop day seattle_2010
 
Big Data, Simple and Fast: Addressing the Shortcomings of Hadoop
Big Data, Simple and Fast: Addressing the Shortcomings of HadoopBig Data, Simple and Fast: Addressing the Shortcomings of Hadoop
Big Data, Simple and Fast: Addressing the Shortcomings of Hadoop
 
Is Multicore Hardware For General-Purpose Parallel Processing Broken? : Notes
Is Multicore Hardware For General-Purpose Parallel Processing Broken? : NotesIs Multicore Hardware For General-Purpose Parallel Processing Broken? : Notes
Is Multicore Hardware For General-Purpose Parallel Processing Broken? : Notes
 
Hadoop
HadoopHadoop
Hadoop
 
Worldranking universities final documentation
Worldranking universities final documentationWorldranking universities final documentation
Worldranking universities final documentation
 
"Big Data" Bioinformatics
"Big Data" Bioinformatics"Big Data" Bioinformatics
"Big Data" Bioinformatics
 
A Collaborative Research Proposal To The NSF Research Accelerator For Multip...
A Collaborative Research Proposal To The NSF  Research Accelerator For Multip...A Collaborative Research Proposal To The NSF  Research Accelerator For Multip...
A Collaborative Research Proposal To The NSF Research Accelerator For Multip...
 
Hadoop @ Sara & BiG Grid
Hadoop @ Sara & BiG GridHadoop @ Sara & BiG Grid
Hadoop @ Sara & BiG Grid
 
Empowering Transformational Science
Empowering Transformational ScienceEmpowering Transformational Science
Empowering Transformational Science
 
Cloud Computing Ambiance using Secluded Access Control Method
Cloud Computing Ambiance using Secluded Access Control MethodCloud Computing Ambiance using Secluded Access Control Method
Cloud Computing Ambiance using Secluded Access Control Method
 
MY_PRE.pptx
MY_PRE.pptxMY_PRE.pptx
MY_PRE.pptx
 
Sjug #26 ml is in java but is dl too - ver1.04 - tomasz sikora 2018-03-23
Sjug #26   ml is in java but is dl too - ver1.04 - tomasz sikora 2018-03-23Sjug #26   ml is in java but is dl too - ver1.04 - tomasz sikora 2018-03-23
Sjug #26 ml is in java but is dl too - ver1.04 - tomasz sikora 2018-03-23
 
HSA Introduction
HSA IntroductionHSA Introduction
HSA Introduction
 
Overview of big data & hadoop v1
Overview of big data & hadoop   v1Overview of big data & hadoop   v1
Overview of big data & hadoop v1
 
HSA Introduction Hot Chips 2013
HSA Introduction  Hot Chips 2013HSA Introduction  Hot Chips 2013
HSA Introduction Hot Chips 2013
 
Hadoop at Yahoo! -- University Talks
Hadoop at Yahoo! -- University TalksHadoop at Yahoo! -- University Talks
Hadoop at Yahoo! -- University Talks
 

Mais de AMD Developer Central

DX12 & Vulkan: Dawn of a New Generation of Graphics APIs
DX12 & Vulkan: Dawn of a New Generation of Graphics APIsDX12 & Vulkan: Dawn of a New Generation of Graphics APIs
DX12 & Vulkan: Dawn of a New Generation of Graphics APIsAMD Developer Central
 
Leverage the Speed of OpenCL™ with AMD Math Libraries
Leverage the Speed of OpenCL™ with AMD Math LibrariesLeverage the Speed of OpenCL™ with AMD Math Libraries
Leverage the Speed of OpenCL™ with AMD Math LibrariesAMD Developer Central
 
An Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware Webinar
An Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware WebinarAn Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware Webinar
An Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware WebinarAMD Developer Central
 
Webinar: Whats New in Java 8 with Develop Intelligence
Webinar: Whats New in Java 8 with Develop IntelligenceWebinar: Whats New in Java 8 with Develop Intelligence
Webinar: Whats New in Java 8 with Develop IntelligenceAMD Developer Central
 
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...AMD Developer Central
 
TressFX The Fast and The Furry by Nicolas Thibieroz
TressFX The Fast and The Furry by Nicolas ThibierozTressFX The Fast and The Furry by Nicolas Thibieroz
TressFX The Fast and The Furry by Nicolas ThibierozAMD Developer Central
 
Rendering Battlefield 4 with Mantle by Yuriy ODonnell
Rendering Battlefield 4 with Mantle by Yuriy ODonnellRendering Battlefield 4 with Mantle by Yuriy ODonnell
Rendering Battlefield 4 with Mantle by Yuriy ODonnellAMD Developer Central
 
Low-level Shader Optimization for Next-Gen and DX11 by Emil Persson
Low-level Shader Optimization for Next-Gen and DX11 by Emil PerssonLow-level Shader Optimization for Next-Gen and DX11 by Emil Persson
Low-level Shader Optimization for Next-Gen and DX11 by Emil PerssonAMD Developer Central
 
Direct3D12 and the Future of Graphics APIs by Dave Oldcorn
Direct3D12 and the Future of Graphics APIs by Dave OldcornDirect3D12 and the Future of Graphics APIs by Dave Oldcorn
Direct3D12 and the Future of Graphics APIs by Dave OldcornAMD Developer Central
 
Introduction to Direct 3D 12 by Ivan Nevraev
Introduction to Direct 3D 12 by Ivan NevraevIntroduction to Direct 3D 12 by Ivan Nevraev
Introduction to Direct 3D 12 by Ivan NevraevAMD Developer Central
 
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth Thomas
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth ThomasHoly smoke! Faster Particle Rendering using Direct Compute by Gareth Thomas
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth ThomasAMD Developer Central
 
Computer Vision Powered by Heterogeneous System Architecture (HSA) by Dr. Ha...
Computer Vision Powered by Heterogeneous System Architecture (HSA) by  Dr. Ha...Computer Vision Powered by Heterogeneous System Architecture (HSA) by  Dr. Ha...
Computer Vision Powered by Heterogeneous System Architecture (HSA) by Dr. Ha...AMD Developer Central
 
Productive OpenCL Programming An Introduction to OpenCL Libraries with Array...
Productive OpenCL Programming An Introduction to OpenCL Libraries  with Array...Productive OpenCL Programming An Introduction to OpenCL Libraries  with Array...
Productive OpenCL Programming An Introduction to OpenCL Libraries with Array...AMD Developer Central
 
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14AMD Developer Central
 

Mais de AMD Developer Central (20)

DX12 & Vulkan: Dawn of a New Generation of Graphics APIs
DX12 & Vulkan: Dawn of a New Generation of Graphics APIsDX12 & Vulkan: Dawn of a New Generation of Graphics APIs
DX12 & Vulkan: Dawn of a New Generation of Graphics APIs
 
Leverage the Speed of OpenCL™ with AMD Math Libraries
Leverage the Speed of OpenCL™ with AMD Math LibrariesLeverage the Speed of OpenCL™ with AMD Math Libraries
Leverage the Speed of OpenCL™ with AMD Math Libraries
 
Introduction to Node.js
Introduction to Node.jsIntroduction to Node.js
Introduction to Node.js
 
Media SDK Webinar 2014
Media SDK Webinar 2014Media SDK Webinar 2014
Media SDK Webinar 2014
 
An Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware Webinar
An Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware WebinarAn Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware Webinar
An Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware Webinar
 
DirectGMA on AMD’S FirePro™ GPUS
DirectGMA on AMD’S  FirePro™ GPUSDirectGMA on AMD’S  FirePro™ GPUS
DirectGMA on AMD’S FirePro™ GPUS
 
Webinar: Whats New in Java 8 with Develop Intelligence
Webinar: Whats New in Java 8 with Develop IntelligenceWebinar: Whats New in Java 8 with Develop Intelligence
Webinar: Whats New in Java 8 with Develop Intelligence
 
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
 
Inside XBox- One, by Martin Fuller
Inside XBox- One, by Martin FullerInside XBox- One, by Martin Fuller
Inside XBox- One, by Martin Fuller
 
TressFX The Fast and The Furry by Nicolas Thibieroz
TressFX The Fast and The Furry by Nicolas ThibierozTressFX The Fast and The Furry by Nicolas Thibieroz
TressFX The Fast and The Furry by Nicolas Thibieroz
 
Rendering Battlefield 4 with Mantle by Yuriy ODonnell
Rendering Battlefield 4 with Mantle by Yuriy ODonnellRendering Battlefield 4 with Mantle by Yuriy ODonnell
Rendering Battlefield 4 with Mantle by Yuriy ODonnell
 
Low-level Shader Optimization for Next-Gen and DX11 by Emil Persson
Low-level Shader Optimization for Next-Gen and DX11 by Emil PerssonLow-level Shader Optimization for Next-Gen and DX11 by Emil Persson
Low-level Shader Optimization for Next-Gen and DX11 by Emil Persson
 
Gcn performance ftw by stephan hodes
Gcn performance ftw by stephan hodesGcn performance ftw by stephan hodes
Gcn performance ftw by stephan hodes
 
Inside XBOX ONE by Martin Fuller
Inside XBOX ONE by Martin FullerInside XBOX ONE by Martin Fuller
Inside XBOX ONE by Martin Fuller
 
Direct3D12 and the Future of Graphics APIs by Dave Oldcorn
Direct3D12 and the Future of Graphics APIs by Dave OldcornDirect3D12 and the Future of Graphics APIs by Dave Oldcorn
Direct3D12 and the Future of Graphics APIs by Dave Oldcorn
 
Introduction to Direct 3D 12 by Ivan Nevraev
Introduction to Direct 3D 12 by Ivan NevraevIntroduction to Direct 3D 12 by Ivan Nevraev
Introduction to Direct 3D 12 by Ivan Nevraev
 
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth Thomas
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth ThomasHoly smoke! Faster Particle Rendering using Direct Compute by Gareth Thomas
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth Thomas
 
Computer Vision Powered by Heterogeneous System Architecture (HSA) by Dr. Ha...
Computer Vision Powered by Heterogeneous System Architecture (HSA) by  Dr. Ha...Computer Vision Powered by Heterogeneous System Architecture (HSA) by  Dr. Ha...
Computer Vision Powered by Heterogeneous System Architecture (HSA) by Dr. Ha...
 
Productive OpenCL Programming An Introduction to OpenCL Libraries with Array...
Productive OpenCL Programming An Introduction to OpenCL Libraries  with Array...Productive OpenCL Programming An Introduction to OpenCL Libraries  with Array...
Productive OpenCL Programming An Introduction to OpenCL Libraries with Array...
 
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14
 

Último

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 

Último (20)

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 

HSA-4130, HSA for Application Programming, by Wen Mei Hwu

  • 1. HSA FOR APPLICATION PROGRAMMING Wen-mei W. Hwu CTO, MulticoreWare, Inc. Professor University of Illinois, Urbana-Champaign
  • 2. CURRENT GPU COMPUTING PAIN POINTS  Kernel launch overhead  Limited virtual and physical memory space  Extra data movement in I/O and networking  Tedious host code  Multiple source code development
  • 4. DESIRED DATA TRANSFER BEHAVIOR Main Memory (DRAM) Network I/O CPU Disk I/O Device Memory DMA GPU card (or other Accelerator cards) SAMOS 2013
  • 5. ACTUAL DATA TRANSFER BEHAVIOR Main Memory (DRAM) Each additional copy diminishes applicationperceived bandwidth Network I/O CPU Disk I/O Device Memory DMA GPU card (or other Accelerator cards) SAMOS 2013
  • 9. DYNAMIC COMPILATION AND BINARY COMPATIBLITY
  • 12.
  • 13.
  • 14.
  • 15.
  • 17. TOOLS WORKING GROUP  Tools Group will be looking at foundation for Developer Tools (compilation, debugging and performance analysis)   Working with the working group to insure the right interface are in place Compilation, Debugging, and Performance Analysis initially We act as stewards for   HSA functional simulator with GDB support (AMD/MCW)  HSA performance simulator (AMD)   LLVM backend for generating HSAIL (AMD/MCW) Loader Library for Simulator - BRIG object loading support ( AMD) Longer Terms will spawn sub-group that will drive requirement for language support for HSA  C, C++, C++ AMP  Java (JVM, Dalvik)  Python  JavaScript  DSL…  … Hwu 2013
  • 18. CURRENT TIME TABLE  Kickoff Conference and Refined Charter – November 2013  Tools Roadmap – December 2013  Initial Tools – December 2013  Initial Language Tool Chains – January 2013 Please join us and contribute! Hwu 2013
  • 19. HAS BOOK IN THE MAKING Heterogeneous System Architecture For Application Programming  Audience Performance application developers  System architects  Component architects  Software stack developers  Hwu 2013
  • 20. EDITORIAL BOARD  Gaster, Benedict – Qualcomm  Hegde Manju - AMD  Hwu, Wen-mei - MulticoreWare/UIUC  Jablin, Thomas – MultcoreWare  Lokhmotov, Anton – ARM  Lu, Chien-Ping – MediaTek  Whitecotton, Bob - AMD Hwu 2013
  • 21. CURRENT TABLE OF CONTENTS  HSA Overview (50 pages)  Chapter 1: What is HSA? (Hwu/Jablin)  Chapter 2: The HSA Architecture (Hwu/Jablin/Others)  Chapter 3: A Programmer's View of HSA (Hwu/Jablin)   Chapter 4: The Tools Framework for HAS, (Hwu/Jablin/Others) Chapter 5: Mapping of Classical algorithms onto the HSA Architecture (Hwu/Jablin) Hwu 2013
  • 22. CURRENT TABLE OF CONTENTS (CONT.)   Representative Workloads: Chapter 6: Photography, Minh Do, Professor UIUC, CTO Personify  Chapter 7: Video-Audio Search, Ren Wu, Baidu  Chapter 8: Augmented Reality Chein-ping Lu, MeidaTek   Chapter 9: Biometrics (Face Detection) Mike Jones, MERL & Harris Gasparakis, AMD Chapter 10: Audio-Video processing Bill Herz, Sr. Fellow AMD Hwu 2013
  • 23. CURRENT TABLE OF CONTENTS (CONT.)       Chapter 11: Ray-Tracing, TBD Chapter 12: Natural user Interfaces, Navneett CEO Flutter Dallal, Chapter 13: Physics Processing, Ronald Fedkiw, Professor Stanford U. and Eftychios Sifakis Chapter 14: Graphics enhancement, Ignacio Vargas, CTO, Nextlimit (Interested) Chapter 15: Communications and networking, Don BanksCisco (Interested) Chapter 16: Data/Business Analytics , Zubin Dowlaty, CTO Mu Sigma Hwu 2013
  • 24. CURRENT TABLE OF CONTENTS (CONT.)      Chapter 17: Hadoop, Memcached and cloud frameworks, Bharath Mundlapudi, CTO Orzota Chapter 18: Bioinformatics Wu Feng, VirginiaTech Chapter 19: Scientific Computations, Byunghyun Jang, Univ Miss. Chapter 20: Computer Aided Design and Engineering, Martin Wong, Professor UIUC Chapter 21: Oil and Gas, Nacho Navarro, UPC/BSC (interested) Hwu 2013
  • 25. CURRENT TABLE OF CONTENTS (CONT.)     Chapter 22: Financial Services & Analysis, Surra Yanamadala, CA VP Chapter 23: Computer Vision facial expression, Elnar Hajiyev, Realeye Chapter 24: Legacy Code Interoperability, Wenmei and Tom Jablin, MCW Chapter 25: Java for HSA, Gary Frost, AMD Hwu 2013
  • 26. MAJOR UPCOMING MILESTONES  Author Algorithm and Code Due – February 2014  Author Chapters Due – April 2014  Review Due Back to Authors – May 2014  Chapters to Elsevier – June 2014  Books available – October 2014 Hwu 2013
  • 27. THANK YOU! ANY MORE QUESTIONS? Hwu 2013