SlideShare uma empresa Scribd logo
1 de 22
Compilation guiding and adjusting to hardware changes in Embedded Reconfigurable Architecture (  )  May 4, 2011 Ayal Zaks IBM Haifa Research Lab E A R
Motivation ,[object Object],[object Object],[object Object]
Challenges of  (EU FP7 STREP) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],The adaptive ERA platform will be able to meet these challenges!! E A R
Memory component Network component Processing component Monitoring Hardware scheduler LIBRARIES Applications OS (or software scheduler) C/C++/ Java compiler Power vs. Performance ARM, VEX, DSP, accelerators, etc. Crossbar, bus, NoC, etc. Multi-level caches, controllers, etc. Abstract overview of the  platform E A R
Partners of  Participant no.  Participant organisation name Short name Country 1 (Coordinator) Technische Universiteit Delft TUD NL 2 Industrial Systems Institute ISI GR 3 Universita' degli Studi di Siena UNISI IT 4 Chalmers University CHALMERS SE 5 University of Edinburgh UEDIN UK 6 Evidence EVI IT 7 ST Microelectronics ST IT 8 IBM IBM IL 9 Universidade do Rio Grande do Sul UFRGS BR 10 Uppsala University UPP SE E A R
Key elements of the ERA platform ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Work packages and leaders ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Goals of the ERA project ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
... compilers to deal with dynamically parameterizable hardware ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
The architecture:   -VEX ,[object Object],[object Object],[object Object],[object Object]
WP4 1 st  Year Achievements ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],pipe:: c0  shl  $r0.3 = $r0.3,16 c0  shl  $r0.4 = $r0.4,16 c0  shl  $r0.5 = $r0.5,16 c0  shl  $r0.6 = $r0.6,16 ;; ;; ;; c0  shr  $r0.4 = $r0.4,16 c0  shr  $r0.6 = $r0.6,16 c0  shr  $r0.3 = $r0.3,16 c0  shr  $r0.5 = $r0.5,16 ;; ;; ;; c0  mpyll  $r0.3 = $r0.3,$r0.4 c0  mpyll  $r0.5 = $r0.5,$r0.6 ;; ;;;; c0  mpyll  $r0.3 = $r0.3,$r0.5 ;; ;; ;; c0  add  $r0.3 = $r0.3,3 ;;;; ;; c0  sxth  $r0.3 = $r0.3 ;;;; ;; c0  return  $r0.1 = $r0.1,(0x0),$l0.0 ;; ;; ;; ;; __attribute__ ((noinline)) short pipe (short a, short b, short c, short d) { short f, g , t; t = a * b; f = c * d; g = t * f; return g+3; } Source code  -VEX Assembly code generated by GCC
[object Object],[object Object],[object Object],[object Object],[object Object],WP4 1 st  Year Achievements (cont.)
Characterizing ILP of ERA benchmarks ,[object Object],Original program 128 = b + 119  119 = 119 + 4 127= a + 119 129 = MEM[128] 130 = 129 + 1 MEM[127] = 130 [0] [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] 128 = b + 119  119 = 119 + 4 127= a + 119 129 = MEM[128] 130 = 129 + 1 MEM[127] = 130 [0] [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] void foo (unsigned char  *dst ,  unsigned char *src ) { int x ; for( x = 0; x < 100; x+=1 ) dst[x] = ( src [x] + 1 ); } I nitiation  I nterval prologue   epilogue   Source code  -VEX Assembly (transcribed) modulo scheduled by GCC  128 = b + 119  119 = 119 + 4 127= a + 119 129 = MEM[128] 130 = 129 + 1 MEM[127] = 130 [0] [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] 128 = b + 119  119 = 119 + 4 127= a + 119 129 = MEM[128] 130 = 129 + 1 MEM[127] = 130 [0] [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11]
Example - X264 List of hot functions:
Poster at
Recent Developments ,[object Object],[object Object],[object Object],[object Object],Issue-width Slice Registers Slice LUTs BRAMs 2-issue 586 (0%) 6375 (4%) 4  (1%) 4-issue 1046 (0%) 12899 (8%) 16  (4%) 8-issue 1868 (0%) 26252 (17%) 64  (15%)
TLP vs. ILP ,[object Object],[object Object],[object Object],[object Object]
Core vs. Cache GCC EDP Different configurations, same EDP! Copyright © Keramidis & Kaxiras, ERA project
Conclusions ,[object Object],[object Object],[object Object]
Thanks! To you and: ,[object Object],[object Object],[object Object],E A R
Contact information Visit  http://www.era-project.eu  for more information Coordinator: Stephan Wong  (Delft University of Techology) [email_address] http://ce.et.tudelft.nl/~stephan/   IBM representative , Work Package 4 leader: Ayal Zaks  (IBM Haifa Research Lab) [email_address] https://www.research.ibm.com/haifa/dept/svt/code_compiler.html
Strengths of ERA partners ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Mais conteúdo relacionado

Mais procurados

Automated Design Space Exploration and Roofline Analysis for FPGA-based HLS A...
Automated Design Space Exploration and Roofline Analysis for FPGA-based HLS A...Automated Design Space Exploration and Roofline Analysis for FPGA-based HLS A...
Automated Design Space Exploration and Roofline Analysis for FPGA-based HLS A...
NECST Lab @ Politecnico di Milano
 
LLVM-based Communication Optimizations for PGAS Programs
LLVM-based Communication Optimizations for PGAS ProgramsLLVM-based Communication Optimizations for PGAS Programs
LLVM-based Communication Optimizations for PGAS Programs
Akihiro Hayashi
 
Compiler optimization
Compiler optimizationCompiler optimization
Compiler optimization
liu_ming50
 
186 devlin p-poster(2)
186 devlin p-poster(2)186 devlin p-poster(2)
186 devlin p-poster(2)
vaidehi87
 
Compiler Optimization-Space Exploration
Compiler Optimization-Space ExplorationCompiler Optimization-Space Exploration
Compiler Optimization-Space Exploration
tmusabbir
 

Mais procurados (20)

Ch1
Ch1Ch1
Ch1
 
The CAOS framework: Democratize the acceleration of compute intensive applica...
The CAOS framework: Democratize the acceleration of compute intensive applica...The CAOS framework: Democratize the acceleration of compute intensive applica...
The CAOS framework: Democratize the acceleration of compute intensive applica...
 
Implementing AI: High Performance Architectures: Arm SVE and Supercomputer Fu...
Implementing AI: High Performance Architectures: Arm SVE and Supercomputer Fu...Implementing AI: High Performance Architectures: Arm SVE and Supercomputer Fu...
Implementing AI: High Performance Architectures: Arm SVE and Supercomputer Fu...
 
Improve Vectorization Efficiency
Improve Vectorization EfficiencyImprove Vectorization Efficiency
Improve Vectorization Efficiency
 
Madeo - a CAD Tool for reconfigurable Hardware
Madeo - a CAD Tool for reconfigurable HardwareMadeo - a CAD Tool for reconfigurable Hardware
Madeo - a CAD Tool for reconfigurable Hardware
 
Unleash performance through parallelism - Intel® Math Kernel Library
Unleash performance through parallelism - Intel® Math Kernel LibraryUnleash performance through parallelism - Intel® Math Kernel Library
Unleash performance through parallelism - Intel® Math Kernel Library
 
P4 Introduction
P4 Introduction P4 Introduction
P4 Introduction
 
Automated Design Space Exploration and Roofline Analysis for FPGA-based HLS A...
Automated Design Space Exploration and Roofline Analysis for FPGA-based HLS A...Automated Design Space Exploration and Roofline Analysis for FPGA-based HLS A...
Automated Design Space Exploration and Roofline Analysis for FPGA-based HLS A...
 
LLVM-based Communication Optimizations for PGAS Programs
LLVM-based Communication Optimizations for PGAS ProgramsLLVM-based Communication Optimizations for PGAS Programs
LLVM-based Communication Optimizations for PGAS Programs
 
HPCLib & Excel : An efficient way to compute with Xeon PHI
HPCLib & Excel : An efficient way to compute with Xeon PHIHPCLib & Excel : An efficient way to compute with Xeon PHI
HPCLib & Excel : An efficient way to compute with Xeon PHI
 
Compiler optimization
Compiler optimizationCompiler optimization
Compiler optimization
 
Hadoop Internals (2.3.0 or later)
Hadoop Internals (2.3.0 or later)Hadoop Internals (2.3.0 or later)
Hadoop Internals (2.3.0 or later)
 
Compiler optimizations based on call-graph flattening
Compiler optimizations based on call-graph flatteningCompiler optimizations based on call-graph flattening
Compiler optimizations based on call-graph flattening
 
Compiler optimization
Compiler optimizationCompiler optimization
Compiler optimization
 
Peephole optimization techniques in compiler design
Peephole optimization techniques in compiler designPeephole optimization techniques in compiler design
Peephole optimization techniques in compiler design
 
186 devlin p-poster(2)
186 devlin p-poster(2)186 devlin p-poster(2)
186 devlin p-poster(2)
 
Compiler Optimization-Space Exploration
Compiler Optimization-Space ExplorationCompiler Optimization-Space Exploration
Compiler Optimization-Space Exploration
 
Compiler optimization techniques
Compiler optimization techniquesCompiler optimization techniques
Compiler optimization techniques
 
Peephole Optimization
Peephole OptimizationPeephole Optimization
Peephole Optimization
 
CC-4007, Large-Scale Machine Learning on Graphs, by Yucheng Low, Joseph Gonza...
CC-4007, Large-Scale Machine Learning on Graphs, by Yucheng Low, Joseph Gonza...CC-4007, Large-Scale Machine Learning on Graphs, by Yucheng Low, Joseph Gonza...
CC-4007, Large-Scale Machine Learning on Graphs, by Yucheng Low, Joseph Gonza...
 

Semelhante a Track A-Compilation guiding and adjusting - IBM

Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
inside-BigData.com
 
Michael_Joshua_Validation
Michael_Joshua_ValidationMichael_Joshua_Validation
Michael_Joshua_Validation
MichaelJoshua
 

Semelhante a Track A-Compilation guiding and adjusting - IBM (20)

Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
 
Exploring Emerging Technologies in the Extreme Scale HPC Co-Design Space with...
Exploring Emerging Technologies in the Extreme Scale HPC Co-Design Space with...Exploring Emerging Technologies in the Extreme Scale HPC Co-Design Space with...
Exploring Emerging Technologies in the Extreme Scale HPC Co-Design Space with...
 
Performance Verification for ESL Design Methodology from AADL Models
Performance Verification for ESL Design Methodology from AADL ModelsPerformance Verification for ESL Design Methodology from AADL Models
Performance Verification for ESL Design Methodology from AADL Models
 
OS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of MLOS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of ML
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
 
Embedded system
Embedded systemEmbedded system
Embedded system
 
Performance and Power Profiling on Intel Android Devices
Performance and Power Profiling on Intel Android DevicesPerformance and Power Profiling on Intel Android Devices
Performance and Power Profiling on Intel Android Devices
 
Accelerating Spark MLlib and DataFrame with Vector Processor “SX-Aurora TSUBASA”
Accelerating Spark MLlib and DataFrame with Vector Processor “SX-Aurora TSUBASA”Accelerating Spark MLlib and DataFrame with Vector Processor “SX-Aurora TSUBASA”
Accelerating Spark MLlib and DataFrame with Vector Processor “SX-Aurora TSUBASA”
 
openCL Paper
openCL PaperopenCL Paper
openCL Paper
 
Michael_Joshua_Validation
Michael_Joshua_ValidationMichael_Joshua_Validation
Michael_Joshua_Validation
 
The Best Programming Practice for Cell/B.E.
The Best Programming Practice for Cell/B.E.The Best Programming Practice for Cell/B.E.
The Best Programming Practice for Cell/B.E.
 
resume
resumeresume
resume
 
Daya_DBA
Daya_DBADaya_DBA
Daya_DBA
 
DhevendranResume
DhevendranResumeDhevendranResume
DhevendranResume
 
Petapath HP Cast 12 - Programming for High Performance Accelerated Systems
Petapath HP Cast 12 - Programming for High Performance Accelerated SystemsPetapath HP Cast 12 - Programming for High Performance Accelerated Systems
Petapath HP Cast 12 - Programming for High Performance Accelerated Systems
 
Software Abstractions for Parallel Hardware
Software Abstractions for Parallel HardwareSoftware Abstractions for Parallel Hardware
Software Abstractions for Parallel Hardware
 
(1) c sharp introduction_basics_dot_net
(1) c sharp introduction_basics_dot_net(1) c sharp introduction_basics_dot_net
(1) c sharp introduction_basics_dot_net
 
Enterprise PL1 - Peter Elderon
Enterprise PL1 - Peter ElderonEnterprise PL1 - Peter Elderon
Enterprise PL1 - Peter Elderon
 
ElixirでFPGAを設計する
ElixirでFPGAを設計するElixirでFPGAを設計する
ElixirでFPGAを設計する
 
NWU and HPC
NWU and HPCNWU and HPC
NWU and HPC
 

Mais de chiportal

Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
chiportal
 

Mais de chiportal (20)

Prof. Zhihua Wang, Tsinghua University, Beijing, China
Prof. Zhihua Wang, Tsinghua University, Beijing, China Prof. Zhihua Wang, Tsinghua University, Beijing, China
Prof. Zhihua Wang, Tsinghua University, Beijing, China
 
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
 
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
 
Prof. Uri Weiser,Technion
Prof. Uri Weiser,TechnionProf. Uri Weiser,Technion
Prof. Uri Weiser,Technion
 
Ken Liao, Senior Associate VP, Faraday
Ken Liao, Senior Associate VP, FaradayKen Liao, Senior Associate VP, Faraday
Ken Liao, Senior Associate VP, Faraday
 
Prof. Danny Raz, Director, Bell Labs Israel, Nokia
 Prof. Danny Raz, Director, Bell Labs Israel, Nokia  Prof. Danny Raz, Director, Bell Labs Israel, Nokia
Prof. Danny Raz, Director, Bell Labs Israel, Nokia
 
Marco Casale-Rossi, Product Mktg. Manager, Synopsys
Marco Casale-Rossi, Product Mktg. Manager, SynopsysMarco Casale-Rossi, Product Mktg. Manager, Synopsys
Marco Casale-Rossi, Product Mktg. Manager, Synopsys
 
Dr.Efraim Aharoni, ESD Leader, TowerJazz
Dr.Efraim Aharoni, ESD Leader, TowerJazzDr.Efraim Aharoni, ESD Leader, TowerJazz
Dr.Efraim Aharoni, ESD Leader, TowerJazz
 
Eddy Kvetny, System Engineering Group Leader, Intel
Eddy Kvetny, System Engineering Group Leader, IntelEddy Kvetny, System Engineering Group Leader, Intel
Eddy Kvetny, System Engineering Group Leader, Intel
 
Dr. John Bainbridge, Principal Application Architect, NetSpeed
 Dr. John Bainbridge, Principal Application Architect, NetSpeed  Dr. John Bainbridge, Principal Application Architect, NetSpeed
Dr. John Bainbridge, Principal Application Architect, NetSpeed
 
Xavier van Ruymbeke, App. Engineer, Arteris
Xavier van Ruymbeke, App. Engineer, ArterisXavier van Ruymbeke, App. Engineer, Arteris
Xavier van Ruymbeke, App. Engineer, Arteris
 
Asi Lifshitz, VP R&D, Vtool
Asi Lifshitz, VP R&D, VtoolAsi Lifshitz, VP R&D, Vtool
Asi Lifshitz, VP R&D, Vtool
 
Zvika Rozenshein,General Manager, EngineeringIQ
Zvika Rozenshein,General Manager, EngineeringIQZvika Rozenshein,General Manager, EngineeringIQ
Zvika Rozenshein,General Manager, EngineeringIQ
 
Lewis Chu,Marketing Director,GUC
Lewis Chu,Marketing Director,GUC Lewis Chu,Marketing Director,GUC
Lewis Chu,Marketing Director,GUC
 
Kunal Varshney, VLSI Engineer, Open-Silicon
Kunal Varshney, VLSI Engineer, Open-SiliconKunal Varshney, VLSI Engineer, Open-Silicon
Kunal Varshney, VLSI Engineer, Open-Silicon
 
Gert Goossens,Sen. Director, ASIP Tools, Synopsys
Gert Goossens,Sen. Director, ASIP Tools, SynopsysGert Goossens,Sen. Director, ASIP Tools, Synopsys
Gert Goossens,Sen. Director, ASIP Tools, Synopsys
 
Tuvia Liran, Director of VLSI, Nano Retina
Tuvia Liran, Director of VLSI, Nano RetinaTuvia Liran, Director of VLSI, Nano Retina
Tuvia Liran, Director of VLSI, Nano Retina
 
Sagar Kadam, Lead Software Engineer, Open-Silicon
Sagar Kadam, Lead Software Engineer, Open-SiliconSagar Kadam, Lead Software Engineer, Open-Silicon
Sagar Kadam, Lead Software Engineer, Open-Silicon
 
Ronen Shtayer,Director of ASG Operations & PMO, NXP Semiconductor
Ronen Shtayer,Director of ASG Operations & PMO, NXP SemiconductorRonen Shtayer,Director of ASG Operations & PMO, NXP Semiconductor
Ronen Shtayer,Director of ASG Operations & PMO, NXP Semiconductor
 
Prof. Emanuel Cohen, Technion
Prof. Emanuel Cohen, TechnionProf. Emanuel Cohen, Technion
Prof. Emanuel Cohen, Technion
 

Último

The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdf
SanaAli374401
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 

Último (20)

Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdf
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 

Track A-Compilation guiding and adjusting - IBM

  • 1. Compilation guiding and adjusting to hardware changes in Embedded Reconfigurable Architecture ( ) May 4, 2011 Ayal Zaks IBM Haifa Research Lab E A R
  • 2.
  • 3.
  • 4. Memory component Network component Processing component Monitoring Hardware scheduler LIBRARIES Applications OS (or software scheduler) C/C++/ Java compiler Power vs. Performance ARM, VEX, DSP, accelerators, etc. Crossbar, bus, NoC, etc. Multi-level caches, controllers, etc. Abstract overview of the platform E A R
  • 5. Partners of Participant no. Participant organisation name Short name Country 1 (Coordinator) Technische Universiteit Delft TUD NL 2 Industrial Systems Institute ISI GR 3 Universita' degli Studi di Siena UNISI IT 4 Chalmers University CHALMERS SE 5 University of Edinburgh UEDIN UK 6 Evidence EVI IT 7 ST Microelectronics ST IT 8 IBM IBM IL 9 Universidade do Rio Grande do Sul UFRGS BR 10 Uppsala University UPP SE E A R
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14. Example - X264 List of hot functions:
  • 16.
  • 17.
  • 18. Core vs. Cache GCC EDP Different configurations, same EDP! Copyright © Keramidis & Kaxiras, ERA project
  • 19.
  • 20.
  • 21. Contact information Visit http://www.era-project.eu for more information Coordinator: Stephan Wong (Delft University of Techology) [email_address] http://ce.et.tudelft.nl/~stephan/ IBM representative , Work Package 4 leader: Ayal Zaks (IBM Haifa Research Lab) [email_address] https://www.research.ibm.com/haifa/dept/svt/code_compiler.html
  • 22.

Notas do Editor

  1. Proebting was talking about performance; what about power/energy?? How can compilers help improve power? Tell me if you know..
  2. This slide is quite self-explanatory
  3. This slide shows the general overview of the ERA platform. Basically, there are different components for “processing”, “networking”, and “memories” that we can choose from in order to build the platform. On top, we want to be able to adapt to different applications by choosing from libraries of these components – an additional advantage is that we want to do this dynamically. For this, we need a hardware scheduler or an OS/software scheduler that works in tandem with the hardware scheduler. The monitoring block monitors for example the power and performance of the system and this information can be fed into the schedulers. Finally, we need a smarter compiler is better aware of the dynamic behavior of the platform.
  4. This slides shows all the partners within the project.
  5. This slide summarizes the slide with the figure of the ERA platform.
  6. -mcpu: architecture (ISA); -mtune: micro-architecture Several PowerPC versions; code size, flexibility, switch versions at specific places in code
  7. Memory params – static analysis of memory access patterns, temporal and spatial reuse Partition code into sections representing phases of distinct ILP/MEM
  8. In the table, you can highlight the fact that we can parameterize the issue width of the roVEX processor and that different instantiations have different resource utilizations.
  9. On this slide, we can see that with the same resources, we can instantiate different cores. 2 smaller ones to handle TLP or combine it into a big to exploit ILP. The idea in the ERA project is to be able to do this on-the-fly in a dynamic way manner.
  10. This slide shows results on EDP (energy-delay product) measurements by varying the instruction window size (this has a clear relation with the parallelism of an application - ILP) and cache sizes. We see in this slide that when we increase the cache size, the EDP decreases. However, more interesting is the fact that the EDP product is similar (almost the same) with varying configurations – see the arrows pointing to different ILP-cache configurations. This means that we can optimize our design by changing the parameters and still achieve the same EDP. Please note that the information on this slide has not been published yet, so it is copyrighted!!