SlideShare a Scribd company logo
1 of 23
Download to read offline
HLR


 Core allocation and relocation
management for self dynamically
  reconfigurable architectures
     Reconfigurable Computing Italian Meeting
                     19 December 2008
         Room S01, Politecnico di Milano - Milan (Italy)



     Massimo Morandi: massimo.morandi@dresd.org
        Marco Novati: marco.novati@dresd.org
Outline

    Aims

    Introduction
    Basic Concepts
    Rationale


    Relocation solutions

    Core Allocation manager

    Concluding Remarks


2
Aims

    Provide support for self partially and dynamically
    reconfigurable systems:

    Relocation support:
      1D and 2D solutions
      SW and HW solutions

    Runtime Core Placement support with:
      Low overhead
      High versatility
      Efficient use of resources


3
Reconfigurable architecture
    A basic reconfigurable architecture consists of:
      a Static area: a basic Harward architecture
      a Reconfigurable area: an device area composed by
      several reconfigurable regions




4
Basic Definitions
    Core: a specific representation of a functionality. It is
    Core
    possible, for example, to have a core described in
    VHDL, in C or in an intermediate representation (e.g. a
    DFG)

    IP-Core: a core described using a HD Language
       Core
    combined with its communication infrastructure (i.e.
    the bus interface)

    Reconfigurable Functional Unit: an IP-Core that can be
                              Unit
    plugged and/or unplugged at runtime in an already
    working architecture

    Reconfigurable Region: a portion of the device area
                   Region
    used to implement a reconfigurable core
5
Relocation: The Problem
         Set of Available                           RFU
         Functionalities                       Implementations
                                                                       B
                      B 1/2            A
         F      2/2
                                           RR1 RR2 RR3   RR1 RR2 RR3
                                C
                         2/2
          2/1    A
                               E 1/1
          D 1/1
                                                                       D
                                       C
        Legenda:
                         Fi
             Area/Time                     RR1 RR2 RR3   RR1 RR2 RR3




                                       E                               F
                                           RR1 RR2 RR3   RR1 RR2 RR3




6
Relocation: Motivation
                                               Area                       Area
           Demanded Tasks


                                                                    A
                                                 A
         2/1 A         B 1/2
                                                             B             B
                                               Rec. C             Rec. C
         2/2 C         D 1/1
                                                                           R2 D


                                                 C                  C      D
    Legenda:
                       E 1/1
                  Ti
      Area/Time
      Request                                     Rec. D
                                                                   R2 F
      Sequnce
                       F   2/2
                                                     D
                                                  Rec. E
                                                                    F
                                                     E
                                                     Rec. F

                                                         F


                                                                 Time
7
Relocation: Rationale
    Bitstreams relocation technique to:
        speedup the overall system execution
        reduce the amount of memory used to store partial bitstreams
        achieve a core preemptive execution
        assign at runtime the bitstreams placement




8
Proposed Relocation Solutions


    Architectural support for relocation:



      Create an integrated HW/SW system to manage online
      relocation (1D and 2D) in reconfigurable architecture

      Create efficient bitstream relocation solutions suitable
      for the target system:
         1D (BiRF) – 2D (BiRF Square)
         HW (BiRF, BiRF Square) – SW (BAnMaT Lite)




9
Xilinx FPGAs and Configuration Memory




10
CRC Calculation
     Particular CRC value, used by Xilinx tools

     Two version of BiRF and BiRF Square:
        By using the “predefined” values
        With actual CRC calculation


     X16 + X15 + X2 + 1 [1D]

     X32 + X28 + X27 + X26 + X25 + X23 + X22 + X20 + X19 + X18 + X14 +
     X13 + X11 + X10 + X9 + X8 + X6 + 1 [2D]




11
Synthesis Results: Area


                           BiRF               BiRF Square
      FPGA       Generic      Optimized   Generic   Optimized
                 Version       Version    Version    Version
                                            −           −
      xc2vp7     11.6 %           3.6 %
                                            −           −
     Xc2vp20      5.8 %           1.8 %
                                            −           −
     xc2vp30      4.2 %           1.3 %
                   −               −
     xc4vlx40                              2.2 %       0.9 %
                   −               −
     xc4vlx60                              1.5 %       0.6 %
                   −               −
     xc4vlx100                             0.8 %       0.3 %
                   −               −
     xc5vlx50                              1.1 %       0.8 %
                   −               −
     xc5vlx85                              0.6 %       0.4 %
                   −               −
     xc5vlx110                             0.5 %       0.3 %



12
Relocation Solutions Results (1/2)
     BiRF, BiRF Square, BAnMaT Lite
       Permit to support relocation in a self partially and
       dynamically 1D or 2D reconfigurable system
       The occupation ratio is relatively small
       Frequency more than acceptable
       Reduction of internal memory requirements



     Throughput:
       BiRF: 6 MB/s
       BiRF Square: 7.3 MB/s
       BAnMaT Lite: 2.6 MB/s


13
Relocation Solutions Results (2/2)


     A total configuration file size is about 1 MB
     Considering an architecture:
        1/3 of the area as fixed part
        2/3 as reconfigurable part with 6 slots

     With such hypothesis
        Size of a partial bitstream will be about 110 KB
        Relocation time of about:
           18 ms with BiRF
           15 ms with BiRF Square
           42 ms with BAnMaT Lite


14
Outline

     Aims

     Introduction
     Basic Concepts
     Rationale


     Relocation solutions

     Core Allocation manager

     Concluding Remarks


15
Runtime Core Allocation Management
     Choose where to place Cores to achieve:
      Low Core Rejection Rate (CRR)
      Fast application completion time
      Small management overhead
      Other policy driven goal
     Choose how to maintain information on empty space
      Keep all information (Expensive but more accurate)
      Heuristically prune information (Cheaper)




16
Evaluation and Proposed Approach




     Choice driven by:
        Need for low complexity solution to reduce overhead at runtime
        Desire to keep high flexibility, to best suit user needs
     We propose heuristic (KNER-like) empty space manager:
        Support both general and focused policy (in particular FF, BF, RA)
        Suitable for dynamic schedule and blind schedule
        Exploiting multiple RFUs per Core, to improve quality
17
The Online Placement Algorithm




     The whole processing of a Core
     is completed in linear time

18
Experiment 1: Routing Aware

     Comparison against literature solutions
       dynamic schedule scenario, RA placement policy
       Measuring CRR, routing costs and overhead

     Benchmark of 100 randomly generated Cores:
       Size (5% to 20% of FPGA), randomly interconnected




19
Experiment 2: Application Completion Time
     Benchmark applications composed of cores taken from
     opencores.org like JPEG, AES, 3DES …
     Blind Schedule, measure the time instants needed to complete
     the applications with different amounts of resources




     Infinite resources is shown, to compare against lower bound
20
Experiment 3: Multiple Shapes
     Similar benchmark, but Cores have deadlines (for CRR)
     Shapes defined with the heuristics described previously




     Difference in runtime on average 30% more for 3
     shapes and 40% more for 5 shapes w.r.t. 1 shape
     CRR more than halved, often reduced to one third
21
Concluding Remarks
     Original goals met:

     Create efficient bitstream relocation solution suitable
     for target systems:
        1D - 2D
        HW – SW

     Create a Core allocation manager with:
        Low overhead
        High efficiency (CRR, application completion time,
        routing costs …)
        High versatility



22
Questions




23

More Related Content

What's hot

LTE Physical layer aspects
LTE Physical layer aspectsLTE Physical layer aspects
LTE Physical layer aspectsBP Tiwari
 
DIC_video_coding_standards_07
DIC_video_coding_standards_07DIC_video_coding_standards_07
DIC_video_coding_standards_07aniruddh Tyagi
 
Analysis of High-Order Residual-Based Dissipation for Unsteady Compressible F...
Analysis of High-Order Residual-Based Dissipation for Unsteady Compressible F...Analysis of High-Order Residual-Based Dissipation for Unsteady Compressible F...
Analysis of High-Order Residual-Based Dissipation for Unsteady Compressible F...kgrimich
 
Iii Data Transmission Fundamentals
Iii  Data Transmission FundamentalsIii  Data Transmission Fundamentals
Iii Data Transmission FundamentalsDon Bosco BSIT
 
RF and Mixed Signal Circuits for a DVB-H Receiver
RF and Mixed Signal Circuits for a DVB-H ReceiverRF and Mixed Signal Circuits for a DVB-H Receiver
RF and Mixed Signal Circuits for a DVB-H ReceiverRFIC-IUMA
 
Fast Factorized Backprojection Algorithm for UWB Bistatic.pdf
Fast Factorized Backprojection Algorithm for UWB Bistatic.pdfFast Factorized Backprojection Algorithm for UWB Bistatic.pdf
Fast Factorized Backprojection Algorithm for UWB Bistatic.pdfgrssieee
 
CoolStreaming
CoolStreamingCoolStreaming
CoolStreamingAriel K
 

What's hot (7)

LTE Physical layer aspects
LTE Physical layer aspectsLTE Physical layer aspects
LTE Physical layer aspects
 
DIC_video_coding_standards_07
DIC_video_coding_standards_07DIC_video_coding_standards_07
DIC_video_coding_standards_07
 
Analysis of High-Order Residual-Based Dissipation for Unsteady Compressible F...
Analysis of High-Order Residual-Based Dissipation for Unsteady Compressible F...Analysis of High-Order Residual-Based Dissipation for Unsteady Compressible F...
Analysis of High-Order Residual-Based Dissipation for Unsteady Compressible F...
 
Iii Data Transmission Fundamentals
Iii  Data Transmission FundamentalsIii  Data Transmission Fundamentals
Iii Data Transmission Fundamentals
 
RF and Mixed Signal Circuits for a DVB-H Receiver
RF and Mixed Signal Circuits for a DVB-H ReceiverRF and Mixed Signal Circuits for a DVB-H Receiver
RF and Mixed Signal Circuits for a DVB-H Receiver
 
Fast Factorized Backprojection Algorithm for UWB Bistatic.pdf
Fast Factorized Backprojection Algorithm for UWB Bistatic.pdfFast Factorized Backprojection Algorithm for UWB Bistatic.pdf
Fast Factorized Backprojection Algorithm for UWB Bistatic.pdf
 
CoolStreaming
CoolStreamingCoolStreaming
CoolStreaming
 

Viewers also liked (7)

UIC Thesis Morandi
UIC Thesis MorandiUIC Thesis Morandi
UIC Thesis Morandi
 
RCIM 2008 - Intro
RCIM 2008 - IntroRCIM 2008 - Intro
RCIM 2008 - Intro
 
3rd 3DDRESD: VGA Core
3rd 3DDRESD: VGA Core3rd 3DDRESD: VGA Core
3rd 3DDRESD: VGA Core
 
RCIM 2008 -- EHW
RCIM 2008 -- EHWRCIM 2008 -- EHW
RCIM 2008 -- EHW
 
UIC Thesis Beretta
UIC Thesis BerettaUIC Thesis Beretta
UIC Thesis Beretta
 
UIC Thesis Candiloro
UIC Thesis CandiloroUIC Thesis Candiloro
UIC Thesis Candiloro
 
3rd 3DDRESD: DRESD Future Plan 0809
3rd 3DDRESD: DRESD Future Plan 08093rd 3DDRESD: DRESD Future Plan 0809
3rd 3DDRESD: DRESD Future Plan 0809
 

Similar to RCIM 2008 - Allocation Relocation

Parallelization Techniques for the 2D Fourier Matched Filtering and Interpola...
Parallelization Techniques for the 2D Fourier Matched Filtering and Interpola...Parallelization Techniques for the 2D Fourier Matched Filtering and Interpola...
Parallelization Techniques for the 2D Fourier Matched Filtering and Interpola...Fisnik Kraja
 
Ilfak Guilfanov - Decompiler internals: Microcode [rooted2018]
Ilfak Guilfanov - Decompiler internals: Microcode [rooted2018]Ilfak Guilfanov - Decompiler internals: Microcode [rooted2018]
Ilfak Guilfanov - Decompiler internals: Microcode [rooted2018]RootedCON
 
Control Flow Analysis
Control Flow AnalysisControl Flow Analysis
Control Flow AnalysisEdgar Barbosa
 
The H.264 Integer Transform
The H.264 Integer TransformThe H.264 Integer Transform
The H.264 Integer TransformIain Richardson
 
Low Power Architecture for JPEG2000
Low Power Architecture for JPEG2000Low Power Architecture for JPEG2000
Low Power Architecture for JPEG2000Rahul Jain
 
Multi-bundle Scoping in OSGi
Multi-bundle Scoping in OSGiMulti-bundle Scoping in OSGi
Multi-bundle Scoping in OSGiglynnormington
 
Conole vilnius 3_nov
Conole vilnius 3_novConole vilnius 3_nov
Conole vilnius 3_novgrainne
 
laptop repairing course
laptop repairing courselaptop repairing course
laptop repairing coursechiptroniks
 
At c51ism
At c51ismAt c51ism
At c51ism1015905
 
Tap lenh ho_8051 (1)
Tap lenh ho_8051 (1)Tap lenh ho_8051 (1)
Tap lenh ho_8051 (1)akprovip
 
SDR channelizer by sooraj
SDR channelizer by soorajSDR channelizer by sooraj
SDR channelizer by soorajsooraj yadav
 
Practical Spherical Harmonics Based PRT Methods
Practical Spherical Harmonics Based PRT MethodsPractical Spherical Harmonics Based PRT Methods
Practical Spherical Harmonics Based PRT MethodsNaughty Dog
 
Executable Specifications in Action
Executable Specifications in ActionExecutable Specifications in Action
Executable Specifications in ActionVagif Abilov
 
Executable Specifications in Action
Executable Specifications in ActionExecutable Specifications in Action
Executable Specifications in ActionVagif Abilov
 

Similar to RCIM 2008 - Allocation Relocation (20)

RCIM 2008 - HLR
RCIM 2008 - HLRRCIM 2008 - HLR
RCIM 2008 - HLR
 
Ig2616051609
Ig2616051609Ig2616051609
Ig2616051609
 
Slide11 icc2015
Slide11 icc2015Slide11 icc2015
Slide11 icc2015
 
Parallelization Techniques for the 2D Fourier Matched Filtering and Interpola...
Parallelization Techniques for the 2D Fourier Matched Filtering and Interpola...Parallelization Techniques for the 2D Fourier Matched Filtering and Interpola...
Parallelization Techniques for the 2D Fourier Matched Filtering and Interpola...
 
Three Band EQ Design Report
Three Band EQ Design ReportThree Band EQ Design Report
Three Band EQ Design Report
 
Ilfak Guilfanov - Decompiler internals: Microcode [rooted2018]
Ilfak Guilfanov - Decompiler internals: Microcode [rooted2018]Ilfak Guilfanov - Decompiler internals: Microcode [rooted2018]
Ilfak Guilfanov - Decompiler internals: Microcode [rooted2018]
 
Control Flow Analysis
Control Flow AnalysisControl Flow Analysis
Control Flow Analysis
 
The H.264 Integer Transform
The H.264 Integer TransformThe H.264 Integer Transform
The H.264 Integer Transform
 
Low Power Architecture for JPEG2000
Low Power Architecture for JPEG2000Low Power Architecture for JPEG2000
Low Power Architecture for JPEG2000
 
UIC Thesis Novati
UIC Thesis NovatiUIC Thesis Novati
UIC Thesis Novati
 
Multi-bundle Scoping in OSGi
Multi-bundle Scoping in OSGiMulti-bundle Scoping in OSGi
Multi-bundle Scoping in OSGi
 
Conole vilnius 3_nov
Conole vilnius 3_novConole vilnius 3_nov
Conole vilnius 3_nov
 
laptop repairing course
laptop repairing courselaptop repairing course
laptop repairing course
 
At c51ism
At c51ismAt c51ism
At c51ism
 
Tap lenh ho_8051 (1)
Tap lenh ho_8051 (1)Tap lenh ho_8051 (1)
Tap lenh ho_8051 (1)
 
Tap lenh ho 8051
Tap lenh ho 8051Tap lenh ho 8051
Tap lenh ho 8051
 
SDR channelizer by sooraj
SDR channelizer by soorajSDR channelizer by sooraj
SDR channelizer by sooraj
 
Practical Spherical Harmonics Based PRT Methods
Practical Spherical Harmonics Based PRT MethodsPractical Spherical Harmonics Based PRT Methods
Practical Spherical Harmonics Based PRT Methods
 
Executable Specifications in Action
Executable Specifications in ActionExecutable Specifications in Action
Executable Specifications in Action
 
Executable Specifications in Action
Executable Specifications in ActionExecutable Specifications in Action
Executable Specifications in Action
 

More from Marco Santambrogio (20)

RCIM 2008 - - hArtes Atmel
RCIM 2008 - - hArtes AtmelRCIM 2008 - - hArtes Atmel
RCIM 2008 - - hArtes Atmel
 
RCIM 2008 - - UniCal
RCIM 2008 - - UniCalRCIM 2008 - - UniCal
RCIM 2008 - - UniCal
 
RCIM 2008 - - ALTERA
RCIM 2008 - - ALTERARCIM 2008 - - ALTERA
RCIM 2008 - - ALTERA
 
DHow2 - L6 VHDL
DHow2 - L6 VHDLDHow2 - L6 VHDL
DHow2 - L6 VHDL
 
DHow2 - L6 Ant
DHow2 - L6 AntDHow2 - L6 Ant
DHow2 - L6 Ant
 
DHow2 - L5
DHow2 - L5DHow2 - L5
DHow2 - L5
 
RCIM 2008 - - ALaRI
RCIM 2008 - - ALaRIRCIM 2008 - - ALaRI
RCIM 2008 - - ALaRI
 
RCIM 2008 - Modello Scheduling
RCIM 2008 - Modello SchedulingRCIM 2008 - Modello Scheduling
RCIM 2008 - Modello Scheduling
 
RCIM 2008 - Modello Generale
RCIM 2008 - Modello GeneraleRCIM 2008 - Modello Generale
RCIM 2008 - Modello Generale
 
RCIM 2008 - - hArtes_Ferrara
RCIM 2008 - - hArtes_FerraraRCIM 2008 - - hArtes_Ferrara
RCIM 2008 - - hArtes_Ferrara
 
RCIM 2008 - Janus
RCIM 2008 - JanusRCIM 2008 - Janus
RCIM 2008 - Janus
 
DHow2 - L2
DHow2 - L2DHow2 - L2
DHow2 - L2
 
DHow2 - L4
DHow2 - L4DHow2 - L4
DHow2 - L4
 
DHow2 - L1
DHow2 - L1DHow2 - L1
DHow2 - L1
 
RCW@DEI - Treasure hunt
RCW@DEI - Treasure huntRCW@DEI - Treasure hunt
RCW@DEI - Treasure hunt
 
RCW@DEI - ADL
RCW@DEI - ADLRCW@DEI - ADL
RCW@DEI - ADL
 
RCW@DEI - Design Flow 4 SoPc
RCW@DEI - Design Flow 4 SoPcRCW@DEI - Design Flow 4 SoPc
RCW@DEI - Design Flow 4 SoPc
 
RCW@DEI - Real Needs And Limits
RCW@DEI - Real Needs And LimitsRCW@DEI - Real Needs And Limits
RCW@DEI - Real Needs And Limits
 
RCW@DEI - Basic Concepts
RCW@DEI - Basic ConceptsRCW@DEI - Basic Concepts
RCW@DEI - Basic Concepts
 
RCW@DEI - Reconf Comp
RCW@DEI - Reconf CompRCW@DEI - Reconf Comp
RCW@DEI - Reconf Comp
 

Recently uploaded

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Principled Technologies
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 

Recently uploaded (20)

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 

RCIM 2008 - Allocation Relocation

  • 1. HLR Core allocation and relocation management for self dynamically reconfigurable architectures Reconfigurable Computing Italian Meeting 19 December 2008 Room S01, Politecnico di Milano - Milan (Italy) Massimo Morandi: massimo.morandi@dresd.org Marco Novati: marco.novati@dresd.org
  • 2. Outline Aims Introduction Basic Concepts Rationale Relocation solutions Core Allocation manager Concluding Remarks 2
  • 3. Aims Provide support for self partially and dynamically reconfigurable systems: Relocation support: 1D and 2D solutions SW and HW solutions Runtime Core Placement support with: Low overhead High versatility Efficient use of resources 3
  • 4. Reconfigurable architecture A basic reconfigurable architecture consists of: a Static area: a basic Harward architecture a Reconfigurable area: an device area composed by several reconfigurable regions 4
  • 5. Basic Definitions Core: a specific representation of a functionality. It is Core possible, for example, to have a core described in VHDL, in C or in an intermediate representation (e.g. a DFG) IP-Core: a core described using a HD Language Core combined with its communication infrastructure (i.e. the bus interface) Reconfigurable Functional Unit: an IP-Core that can be Unit plugged and/or unplugged at runtime in an already working architecture Reconfigurable Region: a portion of the device area Region used to implement a reconfigurable core 5
  • 6. Relocation: The Problem Set of Available RFU Functionalities Implementations B B 1/2 A F 2/2 RR1 RR2 RR3 RR1 RR2 RR3 C 2/2 2/1 A E 1/1 D 1/1 D C Legenda: Fi Area/Time RR1 RR2 RR3 RR1 RR2 RR3 E F RR1 RR2 RR3 RR1 RR2 RR3 6
  • 7. Relocation: Motivation Area Area Demanded Tasks A A 2/1 A B 1/2 B B Rec. C Rec. C 2/2 C D 1/1 R2 D C C D Legenda: E 1/1 Ti Area/Time Request Rec. D R2 F Sequnce F 2/2 D Rec. E F E Rec. F F Time 7
  • 8. Relocation: Rationale Bitstreams relocation technique to: speedup the overall system execution reduce the amount of memory used to store partial bitstreams achieve a core preemptive execution assign at runtime the bitstreams placement 8
  • 9. Proposed Relocation Solutions Architectural support for relocation: Create an integrated HW/SW system to manage online relocation (1D and 2D) in reconfigurable architecture Create efficient bitstream relocation solutions suitable for the target system: 1D (BiRF) – 2D (BiRF Square) HW (BiRF, BiRF Square) – SW (BAnMaT Lite) 9
  • 10. Xilinx FPGAs and Configuration Memory 10
  • 11. CRC Calculation Particular CRC value, used by Xilinx tools Two version of BiRF and BiRF Square: By using the “predefined” values With actual CRC calculation X16 + X15 + X2 + 1 [1D] X32 + X28 + X27 + X26 + X25 + X23 + X22 + X20 + X19 + X18 + X14 + X13 + X11 + X10 + X9 + X8 + X6 + 1 [2D] 11
  • 12. Synthesis Results: Area BiRF BiRF Square FPGA Generic Optimized Generic Optimized Version Version Version Version − − xc2vp7 11.6 % 3.6 % − − Xc2vp20 5.8 % 1.8 % − − xc2vp30 4.2 % 1.3 % − − xc4vlx40 2.2 % 0.9 % − − xc4vlx60 1.5 % 0.6 % − − xc4vlx100 0.8 % 0.3 % − − xc5vlx50 1.1 % 0.8 % − − xc5vlx85 0.6 % 0.4 % − − xc5vlx110 0.5 % 0.3 % 12
  • 13. Relocation Solutions Results (1/2) BiRF, BiRF Square, BAnMaT Lite Permit to support relocation in a self partially and dynamically 1D or 2D reconfigurable system The occupation ratio is relatively small Frequency more than acceptable Reduction of internal memory requirements Throughput: BiRF: 6 MB/s BiRF Square: 7.3 MB/s BAnMaT Lite: 2.6 MB/s 13
  • 14. Relocation Solutions Results (2/2) A total configuration file size is about 1 MB Considering an architecture: 1/3 of the area as fixed part 2/3 as reconfigurable part with 6 slots With such hypothesis Size of a partial bitstream will be about 110 KB Relocation time of about: 18 ms with BiRF 15 ms with BiRF Square 42 ms with BAnMaT Lite 14
  • 15. Outline Aims Introduction Basic Concepts Rationale Relocation solutions Core Allocation manager Concluding Remarks 15
  • 16. Runtime Core Allocation Management Choose where to place Cores to achieve: Low Core Rejection Rate (CRR) Fast application completion time Small management overhead Other policy driven goal Choose how to maintain information on empty space Keep all information (Expensive but more accurate) Heuristically prune information (Cheaper) 16
  • 17. Evaluation and Proposed Approach Choice driven by: Need for low complexity solution to reduce overhead at runtime Desire to keep high flexibility, to best suit user needs We propose heuristic (KNER-like) empty space manager: Support both general and focused policy (in particular FF, BF, RA) Suitable for dynamic schedule and blind schedule Exploiting multiple RFUs per Core, to improve quality 17
  • 18. The Online Placement Algorithm The whole processing of a Core is completed in linear time 18
  • 19. Experiment 1: Routing Aware Comparison against literature solutions dynamic schedule scenario, RA placement policy Measuring CRR, routing costs and overhead Benchmark of 100 randomly generated Cores: Size (5% to 20% of FPGA), randomly interconnected 19
  • 20. Experiment 2: Application Completion Time Benchmark applications composed of cores taken from opencores.org like JPEG, AES, 3DES … Blind Schedule, measure the time instants needed to complete the applications with different amounts of resources Infinite resources is shown, to compare against lower bound 20
  • 21. Experiment 3: Multiple Shapes Similar benchmark, but Cores have deadlines (for CRR) Shapes defined with the heuristics described previously Difference in runtime on average 30% more for 3 shapes and 40% more for 5 shapes w.r.t. 1 shape CRR more than halved, often reduced to one third 21
  • 22. Concluding Remarks Original goals met: Create efficient bitstream relocation solution suitable for target systems: 1D - 2D HW – SW Create a Core allocation manager with: Low overhead High efficiency (CRR, application completion time, routing costs …) High versatility 22