SlideShare uma empresa Scribd logo
1 de 12
Data-Triggered ThreadsEliminating Redundant Computation (HPCA 2011) Hung-Wei Tseng and Dean M. Tullsen Department of Computer Science and Engineering  University of California, San Diego Seminar by: Naman Kumar  for http://carg.uwaterloo.ca
Eliminating Redundant Computation Silent Store: Amemory store operation that does not change the contents at that location 20-68% of all stores are silent [Lepakand Lipasti] How about eliminating the entire stream of computation surrounding a silent store!
Eliminating Redundant Computation Redundant loads:  	silent stores result in redundant loads 	(last time this load loaded this address, it fetched the same value) 	SPEC2000 C: 78% of all loads are redundant 50% of all instructions depend on redundant loads
Data-Triggered Threads
DTT: Implementation The Programming Model Place redundant computation in a separate thread: Thread is restartable Thread may be aborted/restarted multiple times Thread management is through architectural changes. Easy to verify data races as thread life is between time between triggering store and main thread join point.
DTT: Implementation The Programming Model Trigger is placed in data section, not code section
DTT: Implementation The Programming Model Main Thread
DTT: Implementation The Programming Model Support thread
DTT: Implementation Architectural Support Following tables are all implemented in hardware Thread registry (table)  Thread Queue (table) Thread Status Table (table) PC
DTT: Implementation Architectural Support ISA modifications tstore – generate thread when mem modified is not 			silent tspawn – spawn the thread using thread registry treturn– finish execution of the current thread tcancel – terminate a running thread
Computer Architecture Seminar
Computer Architecture Seminar

Mais conteúdo relacionado

Mais procurados

Microarchitecture of a coarse grain out-of-order superscalar processor
Microarchitecture of a coarse grain out-of-order superscalar processorMicroarchitecture of a coarse grain out-of-order superscalar processor
Microarchitecture of a coarse grain out-of-order superscalar processor
ecway
 
Dotnet microarchitecture of a coarse-grain out-of-order superscalar processor
Dotnet  microarchitecture of a coarse-grain out-of-order superscalar processorDotnet  microarchitecture of a coarse-grain out-of-order superscalar processor
Dotnet microarchitecture of a coarse-grain out-of-order superscalar processor
Ecwaytech
 

Mais procurados (17)

Reactive Programming In Java Using: Project Reactor
Reactive Programming In Java Using: Project ReactorReactive Programming In Java Using: Project Reactor
Reactive Programming In Java Using: Project Reactor
 
Matlab Thesis for Phd Students
Matlab Thesis for Phd StudentsMatlab Thesis for Phd Students
Matlab Thesis for Phd Students
 
DESIGN AND SIMULATION OF DIFFERENT 8-BIT MULTIPLIERS USING VERILOG CODE BY SA...
DESIGN AND SIMULATION OF DIFFERENT 8-BIT MULTIPLIERS USING VERILOG CODE BY SA...DESIGN AND SIMULATION OF DIFFERENT 8-BIT MULTIPLIERS USING VERILOG CODE BY SA...
DESIGN AND SIMULATION OF DIFFERENT 8-BIT MULTIPLIERS USING VERILOG CODE BY SA...
 
Matlab Projects for Electrical Students
Matlab Projects for Electrical StudentsMatlab Projects for Electrical Students
Matlab Projects for Electrical Students
 
Matlab Electrical Master Thesis
Matlab Electrical Master ThesisMatlab Electrical Master Thesis
Matlab Electrical Master Thesis
 
Matlab Simulink Electrical Projects
Matlab Simulink Electrical ProjectsMatlab Simulink Electrical Projects
Matlab Simulink Electrical Projects
 
Secure optimization computation outsourcing in cloud computing a case study o...
Secure optimization computation outsourcing in cloud computing a case study o...Secure optimization computation outsourcing in cloud computing a case study o...
Secure optimization computation outsourcing in cloud computing a case study o...
 
Андрій Рева, "How to build reactive java application"
Андрій Рева, "How to build reactive java application"Андрій Рева, "How to build reactive java application"
Андрій Рева, "How to build reactive java application"
 
Code for MATLAB Projects Research Ideas
Code for MATLAB Projects Research IdeasCode for MATLAB Projects Research Ideas
Code for MATLAB Projects Research Ideas
 
Ecursive approach to the design of a parallel self timed adder
Ecursive approach to the design of a parallel self timed adderEcursive approach to the design of a parallel self timed adder
Ecursive approach to the design of a parallel self timed adder
 
A data and task co scheduling algorithm for scientific cloud workflows
A data and task co scheduling algorithm for scientific cloud workflowsA data and task co scheduling algorithm for scientific cloud workflows
A data and task co scheduling algorithm for scientific cloud workflows
 
Dsp Matlab Thesis Topics
Dsp Matlab Thesis TopicsDsp Matlab Thesis Topics
Dsp Matlab Thesis Topics
 
Sistemas Distribuidos
Sistemas DistribuidosSistemas Distribuidos
Sistemas Distribuidos
 
MATLAB Thesis Projects
MATLAB Thesis ProjectsMATLAB Thesis Projects
MATLAB Thesis Projects
 
Microarchitecture of a coarse grain out-of-order superscalar processor
Microarchitecture of a coarse grain out-of-order superscalar processorMicroarchitecture of a coarse grain out-of-order superscalar processor
Microarchitecture of a coarse grain out-of-order superscalar processor
 
Dotnet microarchitecture of a coarse-grain out-of-order superscalar processor
Dotnet  microarchitecture of a coarse-grain out-of-order superscalar processorDotnet  microarchitecture of a coarse-grain out-of-order superscalar processor
Dotnet microarchitecture of a coarse-grain out-of-order superscalar processor
 
MATLAB Major Projects
MATLAB Major ProjectsMATLAB Major Projects
MATLAB Major Projects
 

Destaque

8 Tips that get you more Results from Email Marketing
8 Tips that get you more Results from Email Marketing8 Tips that get you more Results from Email Marketing
8 Tips that get you more Results from Email Marketing
Camelia Bulea
 
Cis et marked du kan nå
Cis   et marked du kan nåCis   et marked du kan nå
Cis et marked du kan nå
Morten Munk
 
Vesten forstår ikke rusland
Vesten forstår ikke ruslandVesten forstår ikke rusland
Vesten forstår ikke rusland
Morten Munk
 
SHS '91 reunion slide show 2011
SHS '91 reunion slide show 2011SHS '91 reunion slide show 2011
SHS '91 reunion slide show 2011
Sherri Wipperman
 
Het iPad paradigma - Joris van Lierop
Het iPad paradigma - Joris van LieropHet iPad paradigma - Joris van Lierop
Het iPad paradigma - Joris van Lierop
NUV-UVW
 
Fresh Business Thinking - Social Media Event - Jemima presentation
Fresh Business Thinking - Social Media Event - Jemima presentationFresh Business Thinking - Social Media Event - Jemima presentation
Fresh Business Thinking - Social Media Event - Jemima presentation
fbtslides
 
Brandweerkazerne Oud Nico
Brandweerkazerne Oud NicoBrandweerkazerne Oud Nico
Brandweerkazerne Oud Nico
kcrabbendam
 

Destaque (18)

8 Tips that get you more Results from Email Marketing
8 Tips that get you more Results from Email Marketing8 Tips that get you more Results from Email Marketing
8 Tips that get you more Results from Email Marketing
 
Cis et marked du kan nå
Cis   et marked du kan nåCis   et marked du kan nå
Cis et marked du kan nå
 
Vesten forstår ikke rusland
Vesten forstår ikke ruslandVesten forstår ikke rusland
Vesten forstår ikke rusland
 
SHS '91 reunion slide show 2011
SHS '91 reunion slide show 2011SHS '91 reunion slide show 2011
SHS '91 reunion slide show 2011
 
達人出版会からPHPの本を出版しよう
達人出版会からPHPの本を出版しよう達人出版会からPHPの本を出版しよう
達人出版会からPHPの本を出版しよう
 
Het iPad paradigma - Joris van Lierop
Het iPad paradigma - Joris van LieropHet iPad paradigma - Joris van Lierop
Het iPad paradigma - Joris van Lierop
 
Essence of Retail e-Commerce and its Optimization Webinar
Essence of Retail e-Commerce and its Optimization WebinarEssence of Retail e-Commerce and its Optimization Webinar
Essence of Retail e-Commerce and its Optimization Webinar
 
FOREIGN NATIONALS
FOREIGN NATIONALSFOREIGN NATIONALS
FOREIGN NATIONALS
 
บุหรี่
บุหรี่บุหรี่
บุหรี่
 
Microsoft Project workshop in Pune 6th & 7th August
Microsoft Project workshop in Pune 6th & 7th AugustMicrosoft Project workshop in Pune 6th & 7th August
Microsoft Project workshop in Pune 6th & 7th August
 
Attack toolkit webinar 9-7-11
Attack toolkit webinar 9-7-11Attack toolkit webinar 9-7-11
Attack toolkit webinar 9-7-11
 
JUMBO Loans
JUMBO LoansJUMBO Loans
JUMBO Loans
 
Disciplinar
DisciplinarDisciplinar
Disciplinar
 
Sportweek
SportweekSportweek
Sportweek
 
Fresh Business Thinking - Social Media Event - Jemima presentation
Fresh Business Thinking - Social Media Event - Jemima presentationFresh Business Thinking - Social Media Event - Jemima presentation
Fresh Business Thinking - Social Media Event - Jemima presentation
 
Safe Learning Environment_HEAL2013
Safe Learning Environment_HEAL2013Safe Learning Environment_HEAL2013
Safe Learning Environment_HEAL2013
 
Brandweerkazerne Oud Nico
Brandweerkazerne Oud NicoBrandweerkazerne Oud Nico
Brandweerkazerne Oud Nico
 
Communication issues TMT 2013
Communication issues TMT 2013Communication issues TMT 2013
Communication issues TMT 2013
 

Semelhante a Computer Architecture Seminar

Different Approaches in Energy Efficient Cache Memory
Different Approaches in Energy Efficient Cache MemoryDifferent Approaches in Energy Efficient Cache Memory
Different Approaches in Energy Efficient Cache Memory
Dhritiman Halder
 
Multicore programmingandtpl(.net day)
Multicore programmingandtpl(.net day)Multicore programmingandtpl(.net day)
Multicore programmingandtpl(.net day)
Yan Drugalya
 
Transforming a traditional home gateway into a hardwareaccelerated SDN switch
Transforming a traditional home gateway into a hardwareaccelerated SDN switchTransforming a traditional home gateway into a hardwareaccelerated SDN switch
Transforming a traditional home gateway into a hardwareaccelerated SDN switch
IJECEIAES
 
System on Chip Based RTC in Power Electronics
System on Chip Based RTC in Power ElectronicsSystem on Chip Based RTC in Power Electronics
System on Chip Based RTC in Power Electronics
journalBEEI
 
Low-Power Design and Verification
Low-Power Design and VerificationLow-Power Design and Verification
Low-Power Design and Verification
DVClub
 
Multicore programmingandtpl
Multicore programmingandtplMulticore programmingandtpl
Multicore programmingandtpl
Yan Drugalya
 

Semelhante a Computer Architecture Seminar (20)

Automated LiveMigration of VMs
Automated LiveMigration of VMs Automated LiveMigration of VMs
Automated LiveMigration of VMs
 
Automating the Hunt for Non-Obvious Sources of Latency Spreads
Automating the Hunt for Non-Obvious Sources of Latency SpreadsAutomating the Hunt for Non-Obvious Sources of Latency Spreads
Automating the Hunt for Non-Obvious Sources of Latency Spreads
 
Solution manual for modern processor design by john paul shen and mikko h. li...
Solution manual for modern processor design by john paul shen and mikko h. li...Solution manual for modern processor design by john paul shen and mikko h. li...
Solution manual for modern processor design by john paul shen and mikko h. li...
 
Full solution manual for modern processor design by john paul shen and mikko ...
Full solution manual for modern processor design by john paul shen and mikko ...Full solution manual for modern processor design by john paul shen and mikko ...
Full solution manual for modern processor design by john paul shen and mikko ...
 
Chap 2 classification of parralel architecture and introduction to parllel p...
Chap 2  classification of parralel architecture and introduction to parllel p...Chap 2  classification of parralel architecture and introduction to parllel p...
Chap 2 classification of parralel architecture and introduction to parllel p...
 
Modern processors
Modern processorsModern processors
Modern processors
 
Tutotial 2 answer
Tutotial 2 answerTutotial 2 answer
Tutotial 2 answer
 
Different Approaches in Energy Efficient Cache Memory
Different Approaches in Energy Efficient Cache MemoryDifferent Approaches in Energy Efficient Cache Memory
Different Approaches in Energy Efficient Cache Memory
 
Modern INTEL Microprocessors' Architecture and Sneak Peak at NVIDIA TEGRA GPU
Modern INTEL Microprocessors' Architecture and Sneak Peak at NVIDIA TEGRA GPUModern INTEL Microprocessors' Architecture and Sneak Peak at NVIDIA TEGRA GPU
Modern INTEL Microprocessors' Architecture and Sneak Peak at NVIDIA TEGRA GPU
 
Improving the Efficiency of Cloud Infrastructures with Elastic Tandem Machine...
Improving the Efficiency of Cloud Infrastructures with Elastic Tandem Machine...Improving the Efficiency of Cloud Infrastructures with Elastic Tandem Machine...
Improving the Efficiency of Cloud Infrastructures with Elastic Tandem Machine...
 
Simulation of Heterogeneous Cloud Infrastructures
Simulation of Heterogeneous Cloud InfrastructuresSimulation of Heterogeneous Cloud Infrastructures
Simulation of Heterogeneous Cloud Infrastructures
 
Chap 1(one) general introduction
Chap 1(one)  general introductionChap 1(one)  general introduction
Chap 1(one) general introduction
 
Multicore programmingandtpl(.net day)
Multicore programmingandtpl(.net day)Multicore programmingandtpl(.net day)
Multicore programmingandtpl(.net day)
 
Programmable Exascale Supercomputer
Programmable Exascale SupercomputerProgrammable Exascale Supercomputer
Programmable Exascale Supercomputer
 
Transforming a traditional home gateway into a hardwareaccelerated SDN switch
Transforming a traditional home gateway into a hardwareaccelerated SDN switchTransforming a traditional home gateway into a hardwareaccelerated SDN switch
Transforming a traditional home gateway into a hardwareaccelerated SDN switch
 
System on Chip Based RTC in Power Electronics
System on Chip Based RTC in Power ElectronicsSystem on Chip Based RTC in Power Electronics
System on Chip Based RTC in Power Electronics
 
Low-Power Design and Verification
Low-Power Design and VerificationLow-Power Design and Verification
Low-Power Design and Verification
 
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
 
Multicore programmingandtpl
Multicore programmingandtplMulticore programmingandtpl
Multicore programmingandtpl
 
Scolari's ICCD17 Talk
Scolari's ICCD17 TalkScolari's ICCD17 Talk
Scolari's ICCD17 Talk
 

Último

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Último (20)

Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 

Computer Architecture Seminar

  • 1. Data-Triggered ThreadsEliminating Redundant Computation (HPCA 2011) Hung-Wei Tseng and Dean M. Tullsen Department of Computer Science and Engineering University of California, San Diego Seminar by: Naman Kumar for http://carg.uwaterloo.ca
  • 2. Eliminating Redundant Computation Silent Store: Amemory store operation that does not change the contents at that location 20-68% of all stores are silent [Lepakand Lipasti] How about eliminating the entire stream of computation surrounding a silent store!
  • 3. Eliminating Redundant Computation Redundant loads: silent stores result in redundant loads (last time this load loaded this address, it fetched the same value) SPEC2000 C: 78% of all loads are redundant 50% of all instructions depend on redundant loads
  • 5. DTT: Implementation The Programming Model Place redundant computation in a separate thread: Thread is restartable Thread may be aborted/restarted multiple times Thread management is through architectural changes. Easy to verify data races as thread life is between time between triggering store and main thread join point.
  • 6. DTT: Implementation The Programming Model Trigger is placed in data section, not code section
  • 7. DTT: Implementation The Programming Model Main Thread
  • 8. DTT: Implementation The Programming Model Support thread
  • 9. DTT: Implementation Architectural Support Following tables are all implemented in hardware Thread registry (table) Thread Queue (table) Thread Status Table (table) PC
  • 10. DTT: Implementation Architectural Support ISA modifications tstore – generate thread when mem modified is not silent tspawn – spawn the thread using thread registry treturn– finish execution of the current thread tcancel – terminate a running thread

Notas do Editor

  1. Memoization and other techniques save on memory access. This technique proposes a solution to save on accesses and the computation involving the data from these access.
  2. eg: sum of all nodes in a 100 node linked-list. Each node has to be accessed when say, only 2 have changed. That’s 98 redundant loads.
  3. If value of SP calculated is diff from what is in memory, then a support thread (S) will be spawned to calculate and B. Main thread will skip code section B since data has already been calculated. Instructions for B will be left as is because support thread may have failed to spawn; skipping the thread, code will be executed by the main thread.
  4. Programmers implement this with C pragma constructs
  5. Every time the variable is WRITTEN to, the associated DTThread is executed
  6. If programmer has a reason to suspect that the thread may crash/be aborted, he can place the #cancel pragma. This will ensure that only the main thread executes this block. Support thread will not be registered.
  7. This function is triggered in a new thread (support thread) when control reaches “#block xxx”
  8. Start PC: PC of the skippable code in the main thread.Destination PC denotes the end of the skippable region.Post skip PC is address after the region is skipped.
  9. CMP only