SlideShare uma empresa Scribd logo
1 de 27
Hyper-Threading Technology
Presented By
KARUNAKAR THAKUR
To Enhance Performance-
Increase in clock rate
o Involves reducing clock cycle time
o Can increase the performance by increasing number of
instructions finishing per second
o H/w limitations limit this feature
Cache hierarchies
o Having frequently used data on the processor caches
reduces average accesses time
Introduction
Pipelining
o Implementation Technique whereby multiple instructions
are overlapped in execution
o Limited by the dependencies between instructions
o Effected by stalls and effective CPI is greater than 1
Instruction Level Parallelism
o It refers to techniques to increase the number of
instructions executed in each clock cycle.
o Exists whenever the machine instructions that make up a
program are insensitive to the order in which they are
executed if dependencies does not exist, they may be
executed.
 Thread level parallelism
Chip Multi Processing
o Two processors, each with full set of execution and
architectural resources, reside on a single die.
Time Slice Multi Threading
o single processor to execute multiple threads by
switching between them
Switch on Event Multi Threading
o switch threads on long latency events such as cache
misses
Simultaneous Multi Threading
o Multiple threads can execute on a single processor
without switching.
oThe threads execute simultaneously and make much
better use of the resources.
oIt maximizes the performance vs. transistor count and
power consumption.
Thread level parallelism (cont..)
Hyper-Threading Technology
Hyper-Threading Technology brings the simultaneous
multi-threading approach to the Intel architecture.
 Hyper-Threading Technology makes a single physical
processor appear as two or more logical processors
 Hyper-Threading Technology first invented by Intel Corp.
 Hyper-Threading Technology provides thread-level-
parallelism (TLP) on each processor resulting in increased
utilization of processor and execution resources.
Each logical processor maintain one copy of the architecture
state
Processor Execution
Resources
Processor Execution
Resources
Arch State Arch State Arch State
Processor with out Hyper-
Threading Technology
Processor with Hyper-
Threading Technology
Ref: Intel Technology Journal, Volume 06 Issue 01, February 14, 2002
Hyper-Threading Technology Architecture
 Register Alias Tables
 Next-Instruction Pointer
 Instruction Streaming Buffers and Trace Cache Fill
Buffers
 Instruction Translation Look-aside Buffer
Following resources are duplicated to support Hyper-
Threading Technology
Figure: Intel Xeon processor pipeline
Sharing of Resources
 Major Sharing Schemes are-
o Partition
o Threshold
o Full Sharing
Partition
 Each logical processor uses half the resources
 Simple and low in complexity
 Ensures fairness and progress
 Good for major pipeline queues
Partitioned Queue Example
• Yellow thread – It is faster thread
• Green thread – It is slower thread
Partitioned Queue Example
• Partitioning resource ensures fairness and
ensures progress for both logical processors.
Threshold
 Puts a threshold on number of resource entries a logical
processor can use.
 Limits maximum resource usage
 For small structures where resource utilization in burst and
time of utilization is short, uniform and predictable
 Eg- Processor Scheduler
Full Sharing
 Most flexible mechanism for resource sharing, do not
limit the maximum uses for resource usage for a logical
processor
 Good for large structures in which working set sizes are
variable and there is no fear of starvation
 Eg: All Processor caches are shared
o Some applications benefit from a shared cache
because they share code and data, minimizing
redundant data in the caches
Netburst Microarchitecture’s execution pipeline
• Two modes of operations
– single-task (ST)
– multi-task (MT).
• MT-mode- There are two active logical processors and
some of the resources are partitioned.
• There are two flavors of ST-mode: single-task logical
processor 0 (ST0) and single-task logical processor 1
(ST1).
• In ST0- or ST1-mode, only one logical processor is active,
and resources that were partitioned in MT-mode are re-
combined to give the single active logical processor use of
all of the resources
SINGLE-TASK AND MULTI-TASK MODES
SINGLE-TASK AND MULTI-TASK
MODES
• HALT instruction that stops processor execution.
• On a processor with Hyper-Threading Technology,
executing HALT transition the processor from MT-mode
to ST0- or ST1-mode, depending on which logical
processor executed the HALT.
• In ST0- or ST1-modes, an interrupt sent to the halted
logical processor would cause a transition to MT-mode.
OPERATING SYSTEM
• For best performance, the operating system should
implement two optimizations.
– The first is to use the HALT instruction if one logical
processor is active and the other is not. HALT will
allow the processor to transition MT mode to either the
ST0- or ST1-mode.
– The second optimization is in scheduling software
threads to logical processors. The operating system
should schedule threads to logical processors on
different physical processors before scheduling two
threads to the same physical processor.
Business Benefits of Hyper-Threading
Technology
• Higher transaction rates for e-Businesses
• Improved reaction and response times for end-users and
customers.
• Increased number of users that a server system can support
• Handle increased server workloads
• Compatibility with existing server applications and
operating systems
Performance increases from
Hyper-Threading Technology on
an OLTP workload
Web server benchmark
performance
Conclusion
•Intel’s Hyper-Threading Technology brings the concept of
simultaneous multi-threading to the Intel Architecture.
•It will become increasingly important going forward as it adds a
new technique for obtaining additional performance for lower
transistor and power costs.
•The goal was to implement the technology at minimum cost
while ensuring forward progress on logical processors, even if
the other is stalled, and to deliver full performance even when
there is only one active logical processor.
References
• “HYPER-THREADING TECHNOLOGY
ARCHITECTURE AND MICROARCHITECTURE” by
Deborah T. Marr, Frank Binns, David L. Hill, Glenn
Hinton,David A. Koufaty, J. Alan Miller, Michael Upton,
intel Technology Journal, Volume 06 Issue 01, Published
February 14, 2002. Pages: 4 –15.
• “:HYPERTHREADING TECHNOLOGY IN THE
NETBURST MICROARCHITECTURE” by David
Koufaty,Deborah T. Marr, IEEE Micro, Vol. 23, Issue 2,
March–April 2003. Pages: 56 – 65.
• http://cache-
www.intel.com/cd/00/00/22/09/220943_220943.pdf
• http://www.cs.washington.edu/research/smt/papers/tlp2ilp.fin
al.pdf
• http://mos.stanford.edu/papers/mj_thesis.pdf
Thank youThank you

Mais conteúdo relacionado

Mais procurados

Hyper Threading Technology
Hyper Threading TechnologyHyper Threading Technology
Hyper Threading Technology
nayakslideshare
 
Fault Tolerance (Distributed computing)
Fault Tolerance (Distributed computing)Fault Tolerance (Distributed computing)
Fault Tolerance (Distributed computing)
Sri Prasanna
 
Cache coherence
Cache coherenceCache coherence
Cache coherence
Employee
 

Mais procurados (20)

Intel core i7 processor
Intel core i7 processorIntel core i7 processor
Intel core i7 processor
 
Hyper Threading Technology
Hyper Threading TechnologyHyper Threading Technology
Hyper Threading Technology
 
Multicore processor by Ankit Raj and Akash Prajapati
Multicore processor by Ankit Raj and Akash PrajapatiMulticore processor by Ankit Raj and Akash Prajapati
Multicore processor by Ankit Raj and Akash Prajapati
 
Multicore computers
Multicore computersMulticore computers
Multicore computers
 
Parallel Processing Concepts
Parallel Processing Concepts Parallel Processing Concepts
Parallel Processing Concepts
 
Multi core processors
Multi core processorsMulti core processors
Multi core processors
 
Pipelining , structural hazards
Pipelining , structural hazardsPipelining , structural hazards
Pipelining , structural hazards
 
Fault Tolerance (Distributed computing)
Fault Tolerance (Distributed computing)Fault Tolerance (Distributed computing)
Fault Tolerance (Distributed computing)
 
Vector Supercomputers and Scientific Array Processors
Vector Supercomputers and Scientific Array ProcessorsVector Supercomputers and Scientific Array Processors
Vector Supercomputers and Scientific Array Processors
 
Computer architecture multi core processor
Computer architecture multi core processorComputer architecture multi core processor
Computer architecture multi core processor
 
Cache coherence
Cache coherenceCache coherence
Cache coherence
 
NUMA
NUMANUMA
NUMA
 
Unit 1
Unit 1Unit 1
Unit 1
 
Process scheduling (CPU Scheduling)
Process scheduling (CPU Scheduling)Process scheduling (CPU Scheduling)
Process scheduling (CPU Scheduling)
 
GPU Programming
GPU ProgrammingGPU Programming
GPU Programming
 
Scheduling
SchedulingScheduling
Scheduling
 
intel core i7
intel core i7 intel core i7
intel core i7
 
Intel Core i7 Processors
Intel Core i7 ProcessorsIntel Core i7 Processors
Intel Core i7 Processors
 
Exception handling in Pipelining in COA
Exception handling in Pipelining in COAException handling in Pipelining in COA
Exception handling in Pipelining in COA
 
Multicore Processor Technology
Multicore Processor TechnologyMulticore Processor Technology
Multicore Processor Technology
 

Destaque

General Director Of Procurement -updated-MODIFIED COVERING LETTER
General Director Of Procurement -updated-MODIFIED COVERING LETTERGeneral Director Of Procurement -updated-MODIFIED COVERING LETTER
General Director Of Procurement -updated-MODIFIED COVERING LETTER
mohsen hussain
 
Government Publications and Research_What You Need to Know 2015 (7)
Government Publications and Research_What You Need to Know 2015  (7)Government Publications and Research_What You Need to Know 2015  (7)
Government Publications and Research_What You Need to Know 2015 (7)
Mary Howrey
 
Personas - Season in the Life
Personas - Season in the LifePersonas - Season in the Life
Personas - Season in the Life
Noel Hatch
 
Day ın the Lıfe - Example
Day ın the Lıfe - ExampleDay ın the Lıfe - Example
Day ın the Lıfe - Example
Noel Hatch
 
Government Publications August 2015 Library Guide (4)
Government Publications August 2015 Library Guide (4)Government Publications August 2015 Library Guide (4)
Government Publications August 2015 Library Guide (4)
Mary Howrey
 
Nation Report: Brazil
Nation Report: BrazilNation Report: Brazil
Nation Report: Brazil
meggss24
 

Destaque (20)

H T T1
H T T1H T T1
H T T1
 
General Director Of Procurement -updated-MODIFIED COVERING LETTER
General Director Of Procurement -updated-MODIFIED COVERING LETTERGeneral Director Of Procurement -updated-MODIFIED COVERING LETTER
General Director Of Procurement -updated-MODIFIED COVERING LETTER
 
Fahad surahio
Fahad surahioFahad surahio
Fahad surahio
 
Hyper thread technology
Hyper thread technologyHyper thread technology
Hyper thread technology
 
GIFI
GIFI GIFI
GIFI
 
Sixth Sence Technology
Sixth Sence TechnologySixth Sence Technology
Sixth Sence Technology
 
Hawk eye Technology
Hawk eye TechnologyHawk eye Technology
Hawk eye Technology
 
My ppt..priya
My ppt..priyaMy ppt..priya
My ppt..priya
 
I P S P O O F I N G
I P  S P O O F I N GI P  S P O O F I N G
I P S P O O F I N G
 
Buffer overflow attacks
Buffer overflow attacksBuffer overflow attacks
Buffer overflow attacks
 
Sixth sense by kuntal ppt
Sixth sense by kuntal pptSixth sense by kuntal ppt
Sixth sense by kuntal ppt
 
Buffer overflow attacks
Buffer overflow attacksBuffer overflow attacks
Buffer overflow attacks
 
Checkdesk @ IPI News Innovation Platform, London (Sep 13, 2013)
Checkdesk @ IPI News Innovation Platform, London (Sep 13, 2013)Checkdesk @ IPI News Innovation Platform, London (Sep 13, 2013)
Checkdesk @ IPI News Innovation Platform, London (Sep 13, 2013)
 
Government Publications and Research_What You Need to Know 2015 (7)
Government Publications and Research_What You Need to Know 2015  (7)Government Publications and Research_What You Need to Know 2015  (7)
Government Publications and Research_What You Need to Know 2015 (7)
 
Cross Border Ediscovery vs. EU Data Protection at LegalTech West Coast
 Cross Border Ediscovery vs. EU Data Protection at LegalTech West Coast Cross Border Ediscovery vs. EU Data Protection at LegalTech West Coast
Cross Border Ediscovery vs. EU Data Protection at LegalTech West Coast
 
Personas - Season in the Life
Personas - Season in the LifePersonas - Season in the Life
Personas - Season in the Life
 
Cafeblog.hu webdesign workshop
Cafeblog.hu webdesign workshop Cafeblog.hu webdesign workshop
Cafeblog.hu webdesign workshop
 
Day ın the Lıfe - Example
Day ın the Lıfe - ExampleDay ın the Lıfe - Example
Day ın the Lıfe - Example
 
Government Publications August 2015 Library Guide (4)
Government Publications August 2015 Library Guide (4)Government Publications August 2015 Library Guide (4)
Government Publications August 2015 Library Guide (4)
 
Nation Report: Brazil
Nation Report: BrazilNation Report: Brazil
Nation Report: Brazil
 

Semelhante a Hyper Threading technology

Four Ways to Improve Linux Performance IEEE Webinar, R2.0
Four Ways to Improve Linux Performance IEEE Webinar, R2.0Four Ways to Improve Linux Performance IEEE Webinar, R2.0
Four Ways to Improve Linux Performance IEEE Webinar, R2.0
Michael Christofferson
 
Maxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorialMaxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorial
madhuinturi
 
Modern processor art
Modern processor artModern processor art
Modern processor art
waqasjadoon11
 
Intel i7 Technologies
Intel i7 TechnologiesIntel i7 Technologies
Intel i7 Technologies
Bibhu Biswal
 
Multithreading computer architecture
 Multithreading computer architecture  Multithreading computer architecture
Multithreading computer architecture
Haris456
 

Semelhante a Hyper Threading technology (20)

Hyper-Theading
Hyper-TheadingHyper-Theading
Hyper-Theading
 
Four Ways to Improve Linux Performance IEEE Webinar, R2.0
Four Ways to Improve Linux Performance IEEE Webinar, R2.0Four Ways to Improve Linux Performance IEEE Webinar, R2.0
Four Ways to Improve Linux Performance IEEE Webinar, R2.0
 
Intel hyper threading presentation
Intel hyper threading presentationIntel hyper threading presentation
Intel hyper threading presentation
 
Maxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorialMaxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorial
 
Chip Multithreading Systems Need a New Operating System Scheduler
Chip Multithreading Systems Need a New Operating System Scheduler Chip Multithreading Systems Need a New Operating System Scheduler
Chip Multithreading Systems Need a New Operating System Scheduler
 
Study of various factors affecting performance of multi core processors
Study of various factors affecting performance of multi core processorsStudy of various factors affecting performance of multi core processors
Study of various factors affecting performance of multi core processors
 
OpenPOWER Acceleration of HPCC Systems
OpenPOWER Acceleration of HPCC SystemsOpenPOWER Acceleration of HPCC Systems
OpenPOWER Acceleration of HPCC Systems
 
Modern processor art
Modern processor artModern processor art
Modern processor art
 
processor struct
processor structprocessor struct
processor struct
 
Modern processor art
Modern processor artModern processor art
Modern processor art
 
Danish presentation
Danish presentationDanish presentation
Danish presentation
 
Intel i7 Technologies
Intel i7 TechnologiesIntel i7 Technologies
Intel i7 Technologies
 
Multithreading computer architecture
 Multithreading computer architecture  Multithreading computer architecture
Multithreading computer architecture
 
Intel new processors
Intel new processorsIntel new processors
Intel new processors
 
UNIT 2.pptx
UNIT 2.pptxUNIT 2.pptx
UNIT 2.pptx
 
Multicore Computers
Multicore ComputersMulticore Computers
Multicore Computers
 
Multithreaded processors ppt
Multithreaded processors pptMultithreaded processors ppt
Multithreaded processors ppt
 
A Parallel Computing-a Paradigm to achieve High Performance
A Parallel Computing-a Paradigm to achieve High PerformanceA Parallel Computing-a Paradigm to achieve High Performance
A Parallel Computing-a Paradigm to achieve High Performance
 
Htt1
Htt1Htt1
Htt1
 
Crusoe processor
Crusoe processorCrusoe processor
Crusoe processor
 

Mais de Karunakar Singh Thakur

genetic algorithms-artificial intelligence
 genetic algorithms-artificial intelligence genetic algorithms-artificial intelligence
genetic algorithms-artificial intelligence
Karunakar Singh Thakur
 

Mais de Karunakar Singh Thakur (14)

Rational Team Concert (RTC) installation and setup guide
Rational Team Concert (RTC) installation and setup guideRational Team Concert (RTC) installation and setup guide
Rational Team Concert (RTC) installation and setup guide
 
All About Jazz Team Server Technology
All About Jazz Team Server TechnologyAll About Jazz Team Server Technology
All About Jazz Team Server Technology
 
Android Firewall project
Android Firewall projectAndroid Firewall project
Android Firewall project
 
Advanced sql injection 2
Advanced sql injection 2Advanced sql injection 2
Advanced sql injection 2
 
Advanced sql injection 1
Advanced sql injection 1Advanced sql injection 1
Advanced sql injection 1
 
Plsql programs(encrypted)
Plsql programs(encrypted)Plsql programs(encrypted)
Plsql programs(encrypted)
 
Complete placement guide(non technical)
Complete placement guide(non technical)Complete placement guide(non technical)
Complete placement guide(non technical)
 
Complete placement guide(technical)
Complete placement guide(technical)Complete placement guide(technical)
Complete placement guide(technical)
 
How to answer the 64 toughest interview questions
How to answer the 64 toughest interview questionsHow to answer the 64 toughest interview questions
How to answer the 64 toughest interview questions
 
genetic algorithms-artificial intelligence
 genetic algorithms-artificial intelligence genetic algorithms-artificial intelligence
genetic algorithms-artificial intelligence
 
Hadoop
HadoopHadoop
Hadoop
 
Prepare for aptitude test
Prepare for aptitude testPrepare for aptitude test
Prepare for aptitude test
 
Thesis of SNS
Thesis of SNSThesis of SNS
Thesis of SNS
 
Network survivability karunakar
Network survivability karunakarNetwork survivability karunakar
Network survivability karunakar
 

Último

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Último (20)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 

Hyper Threading technology

  • 2. To Enhance Performance- Increase in clock rate o Involves reducing clock cycle time o Can increase the performance by increasing number of instructions finishing per second o H/w limitations limit this feature Cache hierarchies o Having frequently used data on the processor caches reduces average accesses time Introduction
  • 3. Pipelining o Implementation Technique whereby multiple instructions are overlapped in execution o Limited by the dependencies between instructions o Effected by stalls and effective CPI is greater than 1 Instruction Level Parallelism o It refers to techniques to increase the number of instructions executed in each clock cycle. o Exists whenever the machine instructions that make up a program are insensitive to the order in which they are executed if dependencies does not exist, they may be executed.
  • 4.  Thread level parallelism Chip Multi Processing o Two processors, each with full set of execution and architectural resources, reside on a single die. Time Slice Multi Threading o single processor to execute multiple threads by switching between them Switch on Event Multi Threading o switch threads on long latency events such as cache misses
  • 5. Simultaneous Multi Threading o Multiple threads can execute on a single processor without switching. oThe threads execute simultaneously and make much better use of the resources. oIt maximizes the performance vs. transistor count and power consumption. Thread level parallelism (cont..)
  • 6. Hyper-Threading Technology Hyper-Threading Technology brings the simultaneous multi-threading approach to the Intel architecture.  Hyper-Threading Technology makes a single physical processor appear as two or more logical processors  Hyper-Threading Technology first invented by Intel Corp.  Hyper-Threading Technology provides thread-level- parallelism (TLP) on each processor resulting in increased utilization of processor and execution resources. Each logical processor maintain one copy of the architecture state
  • 7. Processor Execution Resources Processor Execution Resources Arch State Arch State Arch State Processor with out Hyper- Threading Technology Processor with Hyper- Threading Technology Ref: Intel Technology Journal, Volume 06 Issue 01, February 14, 2002 Hyper-Threading Technology Architecture
  • 8.  Register Alias Tables  Next-Instruction Pointer  Instruction Streaming Buffers and Trace Cache Fill Buffers  Instruction Translation Look-aside Buffer Following resources are duplicated to support Hyper- Threading Technology
  • 9. Figure: Intel Xeon processor pipeline
  • 10. Sharing of Resources  Major Sharing Schemes are- o Partition o Threshold o Full Sharing Partition  Each logical processor uses half the resources  Simple and low in complexity  Ensures fairness and progress  Good for major pipeline queues
  • 11. Partitioned Queue Example • Yellow thread – It is faster thread • Green thread – It is slower thread
  • 12.
  • 13.
  • 14.
  • 15. Partitioned Queue Example • Partitioning resource ensures fairness and ensures progress for both logical processors.
  • 16. Threshold  Puts a threshold on number of resource entries a logical processor can use.  Limits maximum resource usage  For small structures where resource utilization in burst and time of utilization is short, uniform and predictable  Eg- Processor Scheduler
  • 17. Full Sharing  Most flexible mechanism for resource sharing, do not limit the maximum uses for resource usage for a logical processor  Good for large structures in which working set sizes are variable and there is no fear of starvation  Eg: All Processor caches are shared o Some applications benefit from a shared cache because they share code and data, minimizing redundant data in the caches
  • 19. • Two modes of operations – single-task (ST) – multi-task (MT). • MT-mode- There are two active logical processors and some of the resources are partitioned. • There are two flavors of ST-mode: single-task logical processor 0 (ST0) and single-task logical processor 1 (ST1). • In ST0- or ST1-mode, only one logical processor is active, and resources that were partitioned in MT-mode are re- combined to give the single active logical processor use of all of the resources SINGLE-TASK AND MULTI-TASK MODES
  • 21. • HALT instruction that stops processor execution. • On a processor with Hyper-Threading Technology, executing HALT transition the processor from MT-mode to ST0- or ST1-mode, depending on which logical processor executed the HALT. • In ST0- or ST1-modes, an interrupt sent to the halted logical processor would cause a transition to MT-mode.
  • 22. OPERATING SYSTEM • For best performance, the operating system should implement two optimizations. – The first is to use the HALT instruction if one logical processor is active and the other is not. HALT will allow the processor to transition MT mode to either the ST0- or ST1-mode. – The second optimization is in scheduling software threads to logical processors. The operating system should schedule threads to logical processors on different physical processors before scheduling two threads to the same physical processor.
  • 23. Business Benefits of Hyper-Threading Technology • Higher transaction rates for e-Businesses • Improved reaction and response times for end-users and customers. • Increased number of users that a server system can support • Handle increased server workloads • Compatibility with existing server applications and operating systems
  • 24. Performance increases from Hyper-Threading Technology on an OLTP workload Web server benchmark performance
  • 25. Conclusion •Intel’s Hyper-Threading Technology brings the concept of simultaneous multi-threading to the Intel Architecture. •It will become increasingly important going forward as it adds a new technique for obtaining additional performance for lower transistor and power costs. •The goal was to implement the technology at minimum cost while ensuring forward progress on logical processors, even if the other is stalled, and to deliver full performance even when there is only one active logical processor.
  • 26. References • “HYPER-THREADING TECHNOLOGY ARCHITECTURE AND MICROARCHITECTURE” by Deborah T. Marr, Frank Binns, David L. Hill, Glenn Hinton,David A. Koufaty, J. Alan Miller, Michael Upton, intel Technology Journal, Volume 06 Issue 01, Published February 14, 2002. Pages: 4 –15. • “:HYPERTHREADING TECHNOLOGY IN THE NETBURST MICROARCHITECTURE” by David Koufaty,Deborah T. Marr, IEEE Micro, Vol. 23, Issue 2, March–April 2003. Pages: 56 – 65. • http://cache- www.intel.com/cd/00/00/22/09/220943_220943.pdf • http://www.cs.washington.edu/research/smt/papers/tlp2ilp.fin al.pdf • http://mos.stanford.edu/papers/mj_thesis.pdf