SlideShare a Scribd company logo
1 of 20
BLUE GENE




      Sunitha M. Jenarius
What is Blue Gene

A  massively parallel supercomputer using
  tens of thousands of embedded PowerPC
  processors supporting a large memory space
 With standard compilers

  and message passing
  environment
Why the name “Blue Gene”?

 “Blue”: The corporate color of IBM
 “Gene”: The intended use of the Blue Gene
  clusters – Computational biology, specifically,
  protein folding
History

   Dec’99, IBM Research announced $100M US effort
    to build a Petaflop scale supercomputer.
   Two goals of The Blue Gene project :
    –   Massively parallel machine architecture and software
    –   Bio-Molecular Simulation – advance orders of magnitude
   November 2001, Partnership with Lawrence
    Livermore National Laboratory (LLNL)

                                        and this resulted in …
Results

 Linpack   Top 500 Supercomputers
Blue Gene Projects

 Four   Blue Gene projects :
  –   BlueGene/L
  –   BlueGene/C
  –   BlueGene/P
  –   BlueGene/Q
Blue Gene/L

 The  first computer in the Blue Gene
  series
 IBM first announced the Blue Gene/L
  project, Sept. 29, 2004
 Final configuration was launched in
  October 2005
Blue Gene/L - Unsurpassed
Performance

 Designed  to deliver the most performance
  per kilowatt of power consumed
 Theoretical peak performance of 360
  TFLOPS
 Final Configuration (Oct. ‘05) scores over
  280 TFLOPS sustained on the Linpack
  benchmark.
 Nov 14, ‘06, at Supercomputing 2006, Blue
  Gene/L was awarded the winning prize in all
  HPC Challenge Classes of awards.
Blue Gene/L Architecture

 Can be scaled up to 65,536 compute or I/O
  nodes, with 131,072 processors
 Each node is a single ASIC with associated
  DRAM memory chips
 Each ASIC has 2 700 MHz IBM PowerPC
  processors
 PowerPC processors
  –   Low-frequency, low-power embedded processors,
      superior to today's high-frequency, high-power
      microprocessors by a factor of 2 or more
Blue Gene/L Architecture contd…

    –   Double-pipeline-double-precision Floating Point Unit
    –   A cache sub-system with built-in DRAM controller
   Node CPUs are not cache coherent with one another
   FPUs and CPUs are designed for low power
    consumption
    –   Using transistors with low leakage current
    –   Local clock gating
    –   Putting the FPU or CPU/FPU pair to sleep
Blue Gene/L Architecture contd…




            1024 nodes




                         System Overview
Blue Gene/L Architecture contd…

1 rack holds 1024 nodes or 2048 processors
 Nodes optimized for low power consumption
 ASIC based on System-on-a-chip technology
  –   Large numbers of low-power system-on-a-chip technology
      allows it to outperform commodity clusters while saving on
      power
  –   Aggressive packaging of processors, memory and
      interconnect
  –   Power Efficient & Space Efficient
  –   Allows for latencies and bandwidths that are significantly
      better than those for nodes typically used in ASC scale
      supercomputers
Blue Gene/L Networks

 Each
     node is attached to 3 main parallel
 communication networks
  –   3D Torus network - peer-2-peer between compute
      nodes
  –   Collective network – collective & global
      communication
  –   Ethernet network - I/O and management (such as
      access to any node for configuration, booting and
      diagnostics )
Blue Gene/L System Software

   System software supports efficient execution of
    parallel applications
   Compiler support for DFPU (C, C++, Fortran)
   Compute nodes use a minimal operating system
    called “BlueGene/L compute node kernel”
    –   A lightweight, single-user operating system
    –   Supports execution of a single dual-threaded application
        compute process
    –   Kernel provides a single and static virtual address space to
        one running compute process
    –   Because of single-process nature, no context switching
        required
Blue Gene/L System Software contd…

   To allow multiple programs to run concurrently
    –   Blue Gene/L system can be partitioned into electronically
        isolated sets of nodes
    –   The number of nodes in a partition must be a positive
        integer power of 2
    –   To run program – reserve this partition
    –   No other program can use till partition is done with current
        program
    –   With so many nodes, component failures are inevitable. The
        system is able to electrically isolate faulty hardware to allow
        the machine to continue to run
Blue Gene/L System Software contd…

 Parallel   Programming model
  –   Message Passing – supported through an
      implementation of MPI
  –   Only a subset of POSIX calls are supported
  –   Green threads are also used to simulate local
      concurrency
Blue Gene/C

 Sister-project  to BlueGene/L
 Renamed to Cyclops64
 Massively parallel, supercomputer-on-a-chip
  cellular architecture
 Cellular architecture gives the programmer
  the ability to run large numbers of concurrent
  threads within a single processor.
Blue Gene/P

 Architecturally
                similar to BlueGene/L
 Expected to operate around one petaflop
 Expected around 2008
Blue Gene/Q

 Last known supercomputer in the Blue Gene
  series
 Expected to reach 3-10 petaflops
Resources

 Wikipedia.org
 IBM   website
  –   (
      www.03.ibm.com/servers/deepcomputing/bluegene.htm
      )
 www.supercomp.org/sc2002/paperpdfs/pap.p
  ap207.pdf

More Related Content

What's hot (20)

Blue brain project ppt
Blue brain project pptBlue brain project ppt
Blue brain project ppt
 
Cluster computing
Cluster computingCluster computing
Cluster computing
 
Green computing
Green computingGreen computing
Green computing
 
Edge Computing
Edge ComputingEdge Computing
Edge Computing
 
Green computing
Green computingGreen computing
Green computing
 
High performance computing
High performance computingHigh performance computing
High performance computing
 
Green Computing
Green ComputingGreen Computing
Green Computing
 
Dna digital data storage
Dna digital data storageDna digital data storage
Dna digital data storage
 
Internship - Python - AI ML.pptx
Internship - Python - AI ML.pptxInternship - Python - AI ML.pptx
Internship - Python - AI ML.pptx
 
Bio-Molecular computers
Bio-Molecular computersBio-Molecular computers
Bio-Molecular computers
 
Open Source Cloud
Open Source CloudOpen Source Cloud
Open Source Cloud
 
Green cloud computing
Green cloud computingGreen cloud computing
Green cloud computing
 
GPU
GPUGPU
GPU
 
Prediction of cardiovascular disease with machine learning
Prediction of cardiovascular disease with machine learningPrediction of cardiovascular disease with machine learning
Prediction of cardiovascular disease with machine learning
 
Smart quill
Smart quillSmart quill
Smart quill
 
Neuromorphic computing
Neuromorphic computingNeuromorphic computing
Neuromorphic computing
 
Cluster Computing Seminar.
Cluster Computing Seminar.Cluster Computing Seminar.
Cluster Computing Seminar.
 
Green cloud computing India
Green cloud computing IndiaGreen cloud computing India
Green cloud computing India
 
Deep learning ppt
Deep learning pptDeep learning ppt
Deep learning ppt
 
Green cloud computing
Green cloud computingGreen cloud computing
Green cloud computing
 

Similar to Blue gene

Parallel_and_Cluster_Computing.ppt
Parallel_and_Cluster_Computing.pptParallel_and_Cluster_Computing.ppt
Parallel_and_Cluster_Computing.pptMohmdUmer
 
Blue gene detail journal
Blue gene detail journalBlue gene detail journal
Blue gene detail journalVivek Jha
 
Clustering by AKASHMSHAH
Clustering by AKASHMSHAHClustering by AKASHMSHAH
Clustering by AKASHMSHAHAkash M Shah
 
Systems Support for Many Task Computing
Systems Support for Many Task ComputingSystems Support for Many Task Computing
Systems Support for Many Task ComputingEric Van Hensbergen
 
Cluster Computers
Cluster ComputersCluster Computers
Cluster Computersshopnil786
 
Maxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorialMaxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorialmadhuinturi
 
Copy of Parallel_and_Cluster_Computing.pptx
Copy of Parallel_and_Cluster_Computing.pptxCopy of Parallel_and_Cluster_Computing.pptx
Copy of Parallel_and_Cluster_Computing.pptxgopikahari7
 
HPC_June2011
HPC_June2011HPC_June2011
HPC_June2011cfloare
 
Modern processor art
Modern processor artModern processor art
Modern processor artwaqasjadoon11
 
From Rack scale computers to Warehouse scale computers
From Rack scale computers to Warehouse scale computersFrom Rack scale computers to Warehouse scale computers
From Rack scale computers to Warehouse scale computersRyousei Takano
 
Cluster Computing
Cluster ComputingCluster Computing
Cluster ComputingNIKHIL NAIR
 
Modern processor art
Modern processor artModern processor art
Modern processor artwaqasjadoon11
 

Similar to Blue gene (20)

blue-gene-ppt.pptx
blue-gene-ppt.pptxblue-gene-ppt.pptx
blue-gene-ppt.pptx
 
Parallel_and_Cluster_Computing.ppt
Parallel_and_Cluster_Computing.pptParallel_and_Cluster_Computing.ppt
Parallel_and_Cluster_Computing.ppt
 
Blue gene detail journal
Blue gene detail journalBlue gene detail journal
Blue gene detail journal
 
Clustering by AKASHMSHAH
Clustering by AKASHMSHAHClustering by AKASHMSHAH
Clustering by AKASHMSHAH
 
Systems Support for Many Task Computing
Systems Support for Many Task ComputingSystems Support for Many Task Computing
Systems Support for Many Task Computing
 
Cluster Computers
Cluster ComputersCluster Computers
Cluster Computers
 
Maxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorialMaxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorial
 
Copy of Parallel_and_Cluster_Computing.pptx
Copy of Parallel_and_Cluster_Computing.pptxCopy of Parallel_and_Cluster_Computing.pptx
Copy of Parallel_and_Cluster_Computing.pptx
 
Cluster computer
Cluster  computerCluster  computer
Cluster computer
 
Presentation
PresentationPresentation
Presentation
 
Cluster Computing
Cluster ComputingCluster Computing
Cluster Computing
 
BLUE GENE/L
BLUE GENE/LBLUE GENE/L
BLUE GENE/L
 
chameleon chip
chameleon chipchameleon chip
chameleon chip
 
HPC_June2011
HPC_June2011HPC_June2011
HPC_June2011
 
Modern processor art
Modern processor artModern processor art
Modern processor art
 
processor struct
processor structprocessor struct
processor struct
 
From Rack scale computers to Warehouse scale computers
From Rack scale computers to Warehouse scale computersFrom Rack scale computers to Warehouse scale computers
From Rack scale computers to Warehouse scale computers
 
Cluster Computing
Cluster ComputingCluster Computing
Cluster Computing
 
Modern processor art
Modern processor artModern processor art
Modern processor art
 
Danish presentation
Danish presentationDanish presentation
Danish presentation
 

More from ch samaram

Syllabus r10-ecm-42-network security and cryptography
 Syllabus r10-ecm-42-network security and cryptography Syllabus r10-ecm-42-network security and cryptography
Syllabus r10-ecm-42-network security and cryptographych samaram
 
Restaurant billing application
Restaurant billing applicationRestaurant billing application
Restaurant billing applicationch samaram
 
Spintronics (1)
Spintronics (1)Spintronics (1)
Spintronics (1)ch samaram
 
Computer forensics law and privacy
Computer forensics   law and privacyComputer forensics   law and privacy
Computer forensics law and privacych samaram
 
Javascript sivasoft
Javascript sivasoftJavascript sivasoft
Javascript sivasoftch samaram
 
Basics java scripts
Basics java scriptsBasics java scripts
Basics java scriptsch samaram
 

More from ch samaram (18)

Syllabus r10-ecm-42-network security and cryptography
 Syllabus r10-ecm-42-network security and cryptography Syllabus r10-ecm-42-network security and cryptography
Syllabus r10-ecm-42-network security and cryptography
 
Restaurant billing application
Restaurant billing applicationRestaurant billing application
Restaurant billing application
 
Spintronics
SpintronicsSpintronics
Spintronics
 
Spintronics (1)
Spintronics (1)Spintronics (1)
Spintronics (1)
 
Shore
ShoreShore
Shore
 
Presentation
PresentationPresentation
Presentation
 
Open gl
Open glOpen gl
Open gl
 
Opengl (1)
Opengl (1)Opengl (1)
Opengl (1)
 
Computer forensics law and privacy
Computer forensics   law and privacyComputer forensics   law and privacy
Computer forensics law and privacy
 
Blue gene
Blue geneBlue gene
Blue gene
 
Wearable (1)
Wearable (1)Wearable (1)
Wearable (1)
 
Javascript sivasoft
Javascript sivasoftJavascript sivasoft
Javascript sivasoft
 
Html siva
Html sivaHtml siva
Html siva
 
Css siva
Css sivaCss siva
Css siva
 
Basics java scripts
Basics java scriptsBasics java scripts
Basics java scripts
 
Ajax
AjaxAjax
Ajax
 
Html 5
Html 5Html 5
Html 5
 
Css siva
Css sivaCss siva
Css siva
 

Recently uploaded

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 

Recently uploaded (20)

DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 

Blue gene

  • 1. BLUE GENE Sunitha M. Jenarius
  • 2. What is Blue Gene A massively parallel supercomputer using tens of thousands of embedded PowerPC processors supporting a large memory space  With standard compilers and message passing environment
  • 3. Why the name “Blue Gene”?  “Blue”: The corporate color of IBM  “Gene”: The intended use of the Blue Gene clusters – Computational biology, specifically, protein folding
  • 4. History  Dec’99, IBM Research announced $100M US effort to build a Petaflop scale supercomputer.  Two goals of The Blue Gene project : – Massively parallel machine architecture and software – Bio-Molecular Simulation – advance orders of magnitude  November 2001, Partnership with Lawrence Livermore National Laboratory (LLNL) and this resulted in …
  • 5. Results  Linpack Top 500 Supercomputers
  • 6. Blue Gene Projects  Four Blue Gene projects : – BlueGene/L – BlueGene/C – BlueGene/P – BlueGene/Q
  • 7. Blue Gene/L  The first computer in the Blue Gene series  IBM first announced the Blue Gene/L project, Sept. 29, 2004  Final configuration was launched in October 2005
  • 8. Blue Gene/L - Unsurpassed Performance  Designed to deliver the most performance per kilowatt of power consumed  Theoretical peak performance of 360 TFLOPS  Final Configuration (Oct. ‘05) scores over 280 TFLOPS sustained on the Linpack benchmark.  Nov 14, ‘06, at Supercomputing 2006, Blue Gene/L was awarded the winning prize in all HPC Challenge Classes of awards.
  • 9. Blue Gene/L Architecture  Can be scaled up to 65,536 compute or I/O nodes, with 131,072 processors  Each node is a single ASIC with associated DRAM memory chips  Each ASIC has 2 700 MHz IBM PowerPC processors  PowerPC processors – Low-frequency, low-power embedded processors, superior to today's high-frequency, high-power microprocessors by a factor of 2 or more
  • 10. Blue Gene/L Architecture contd… – Double-pipeline-double-precision Floating Point Unit – A cache sub-system with built-in DRAM controller  Node CPUs are not cache coherent with one another  FPUs and CPUs are designed for low power consumption – Using transistors with low leakage current – Local clock gating – Putting the FPU or CPU/FPU pair to sleep
  • 11. Blue Gene/L Architecture contd… 1024 nodes System Overview
  • 12. Blue Gene/L Architecture contd… 1 rack holds 1024 nodes or 2048 processors  Nodes optimized for low power consumption  ASIC based on System-on-a-chip technology – Large numbers of low-power system-on-a-chip technology allows it to outperform commodity clusters while saving on power – Aggressive packaging of processors, memory and interconnect – Power Efficient & Space Efficient – Allows for latencies and bandwidths that are significantly better than those for nodes typically used in ASC scale supercomputers
  • 13. Blue Gene/L Networks  Each node is attached to 3 main parallel communication networks – 3D Torus network - peer-2-peer between compute nodes – Collective network – collective & global communication – Ethernet network - I/O and management (such as access to any node for configuration, booting and diagnostics )
  • 14. Blue Gene/L System Software  System software supports efficient execution of parallel applications  Compiler support for DFPU (C, C++, Fortran)  Compute nodes use a minimal operating system called “BlueGene/L compute node kernel” – A lightweight, single-user operating system – Supports execution of a single dual-threaded application compute process – Kernel provides a single and static virtual address space to one running compute process – Because of single-process nature, no context switching required
  • 15. Blue Gene/L System Software contd…  To allow multiple programs to run concurrently – Blue Gene/L system can be partitioned into electronically isolated sets of nodes – The number of nodes in a partition must be a positive integer power of 2 – To run program – reserve this partition – No other program can use till partition is done with current program – With so many nodes, component failures are inevitable. The system is able to electrically isolate faulty hardware to allow the machine to continue to run
  • 16. Blue Gene/L System Software contd…  Parallel Programming model – Message Passing – supported through an implementation of MPI – Only a subset of POSIX calls are supported – Green threads are also used to simulate local concurrency
  • 17. Blue Gene/C  Sister-project to BlueGene/L  Renamed to Cyclops64  Massively parallel, supercomputer-on-a-chip cellular architecture  Cellular architecture gives the programmer the ability to run large numbers of concurrent threads within a single processor.
  • 18. Blue Gene/P  Architecturally similar to BlueGene/L  Expected to operate around one petaflop  Expected around 2008
  • 19. Blue Gene/Q  Last known supercomputer in the Blue Gene series  Expected to reach 3-10 petaflops
  • 20. Resources  Wikipedia.org  IBM website – ( www.03.ibm.com/servers/deepcomputing/bluegene.htm )  www.supercomp.org/sc2002/paperpdfs/pap.p ap207.pdf