SlideShare uma empresa Scribd logo
1 de 1
Baixar para ler offline
3
                                                                                                                                        XCPU
                                         Workload Distribution & Aggregation
                                                                     Pravin Shinde & Eric Van Hensbergen
Problem                                                                                                                                                             Related Work
                                                                                            work                                                       node

       • Workload distribution hasn’t
         evolved much from when we
         were batch scheduling tasks to
         single machines

       • Today’s Cluster Based
         Schedulers:
                                                                                                                                                                               System V UNIX               Plan 9 from Bell Labs                           XCPU (LANL)
            •    Not interactive.
            •    Not resilient to failure.
                                                                                                                                                                                                       Extended basic procfs
                                                                                                                                                                         Provided synthetic file                                                  Built an application-layer
            •    Difficult for existing tasks to
                 dynamically grow or shrink resources                                                                                                                    system access to process      concepts by also enabling                  provided file system for
                 allocated to it.                                                                                                                                        information which was         control and debug                          UNIX systems using the
            •    Difficult to deploy & administer.                                                                                                                       later extended to a           interfaces. The nature of                  Plan 9 model.          XCPU
            •    Based on middleware instead of                                                                                                                          hierarchy in Linux procfs.    the Plan 9 distributed                     extended previous work by
                 integrated with underlying operating                                                                                                                                                  namespace also made                        allowing process creation
                 system.
                                                                                                                                                                                                       these process interfaces                   to occur via the file system
            •    In many cases tightly bound to the
                                                                                                                                                                                                       available over the network.                and allowed for execution
                 underlying runtime or language.
            •    Unlikely to function at exascale.                                                                                                                                                                                                and coordination of groups
                                                                                                                                                                                                                                                  of processes on remote
                                                                                                                                                                                                                                                  systems.

Our Approach
                                                                                                                                                                   Environment Syntax            Name Space File Syntax

   •    Establish hierarchical namespace of cluster services                                                                                                                                          • mount[–abcC] new old: Bind new on old. servename on old.
                                                                                                                                                                                                                [–abcC] servename old [spec]: Mount

   •    Automount remote servers based on reference (ie. cd /csrv/criswell)
                                                                                                                               c3
                                                                                                                                                                         • key=value                  • bind [–abc] host [remotepath] mountpoint: Import remotepath from machine
   •    Export local services for use elsewhere within the network /csrv                                                                                                 • OBJTYPE=386                • import attach it to mountpoint.
                                                                                                                                                                                                        server and
                                                                                                                                                                         • SYSTYPE=Linux              • cd dir: Changeold: Unmountdirectory toold, or everything mounted on old if new is
                                                      /csrv
                                                                                                              /local
                                                                                                                                                                         • etc.                       • unmount [new]
                                                                                                                                                                                                                         the working
                                                                                                                                                                                                                                     new from
                                                                                                                                                                                                                                                dir.

                                                               /local                                                                                                                                   missing.
                                                                                                              /l2
                    t                                          /L                                                                                                                                     • clear: Clear the name space withfile path. Note that path must be present in the
                                                                                                                                                                                                                                          rfork(RFCNAMEG).

                                                                  /local
                                                                                                                 /local
                                                                                                                 /c4
                                                                                                                                                                                                      • . path:space being built.
                                                                                                                                                                                                        name
                                                                                                                                                                                                                Execute the namespace

                                                                  /l1
                                                                     /local
                                                                                                                    /local                                      /local
                    L                                                /c1                                         /L
                                                                        /local                                      /local
                                                                     /c2                                            /l1                                                  arch                   - architecture & platform (ie. Linux i386)
       I1                    I2
                                                                        /local                                          /local                                           env                    - default environment variables for host
                                                                                                                        /c1
                                                               /l2                                                                                                       ns                     - default name space for host
                                                                                                                           /local
  c1        c2          c3        c4
                                                                  /local
                                                                  /c3                                                   /c2                                              fs                     - access to host file system
                                                                     /local                                                /local                                        net                    - access to host network (i.e. Plan 9 devip)
                                                                  /c4                                                /t                                                  status                 - load average, running jobs, available memory
                                                                     /local                                              /local                                          clone                  - open to establish new session
                                                                                                                                                                         /0
                                                                                                                                                                         /1                     - session subdirectories
        Control File Syntax
                                                                                                                                                                         /n
                 • reserve [n] [os] [arch] - reserve a (number of) resources with os and arch
                   specification                                                                                                                                           ctl                  - reservation and task control
                 • dir [wdir] - set theargs ... - spawn a hostthe task to run the command with
                                         working directory for
                                                                                                                                                                           env                  - environment variables for task
                 • exec commands                                 process
                   arguments as given
                                                                                                                                                                           ns                   - name space for task
                 • kill - kill the-hostthe device to kill the host command when the ctl file is closed
                                        command immediately
                 • killonclose set scheduling priority of the host command                                                                                                 args                 - task arguments
                 • nice [n][path] -the standard output to [path] (on executing host)
                               - set
                 • splice            splice                                                                                                                                wait                 - blocks until all threads complete
                                                                                                                                                                           status               - current task status (reserved, running, etc.)
                                                                                                                                     !"#$%&'
                                                                                                                                      !"#(#
                                                                                                                                                                           stdin                - aggregate standard input for task
                                                                                                                            !"#$%&
                                                                                                                                     !"#$%&'
                                                                                                                                      !"#(#
                                                                                                                                               !"#(#
                                                                                                                                                                           stdout               - aggregate standard output for task
                                                                                                                                     !"#$%&'
                                                                                                                                      !"#(#
                                                                                                                                                                           stdio                - combined standard I/O for task
                                                                                                                                                                                                - component thread session subdirectories
                                                                                                                                     !"#$%&'
                                                                                                                                      !"#(#



                                                                                                                   !"#$%&   !"#$%&
                                                                                                                                     !"#$%&'
                                                                                                                                               !"#(#    !"#(#
                                                                                                                                                                              /0
                                                                                                                                                                              /n
                                                                                                                                      !"#(#




                                                                                                                                                                                ctl             - thread control
                                                                                                                                     !"#$%&'
                                                                                                                                      !"#(#


                                                     local service   proxy service        aggregate service                          !"#$%&'




                                                                                                                            !"#$%&
                                                                                                                                      !"#(#


                                                                                                                                     !"#$%&'
                                                                                                                                               !"#(#
                                                                                                                                                                                env             - environment variables for thread
                                                                                                                                                                                ns              - name space for thread
                                                                                                                                      !"#(#



                                                     local service                                                                   !"#$%&'
                                                                                                                                      !"#(#


                                                                                                                                                                                args            - thread arguments
        Desktop Extension                                                                                           PUSH Pipeline Model                                         wait            - blocks until thread completes
                                                                                     remote services
                                                                                                                                                                                status          - current thread status (reserved, running, etc.)
                                                             Aggregation Via                                                                                                    stdin           - standard input for thread
                                                           Dynamic Namespace                                                                                                    stdout          - standard output for thread
                   Scaling                                                                                                     Reliability
                                                                    and                                                                                                         stdio           - standard I/O for thread
                                                            Distributed Service
                                                                   Model




                                       This project is supported in part by the
                                       U.S. Department of Energy under
                                       Award Number DE-FG02- 08ER25851                                                                                                                                                                                 http://www.research.ibm.com/austin




                                       For More Information: http://www.research.ibm.com/hare

Mais conteúdo relacionado

Semelhante a XCPU3: Workload Distribution and Aggregation

Research Issues in P2P Netwroks
Research Issues in P2P NetwroksResearch Issues in P2P Netwroks
Research Issues in P2P Netwrokssabumt
 
20120524 cern data centre evolution v2
20120524 cern data centre evolution v220120524 cern data centre evolution v2
20120524 cern data centre evolution v2Tim Bell
 
Sun Microsystems Puppet Case Study
Sun Microsystems Puppet Case StudySun Microsystems Puppet Case Study
Sun Microsystems Puppet Case StudyPuppet
 
Erlang Cache
Erlang CacheErlang Cache
Erlang Cacheice j
 
Oracle hard and soft parsing
Oracle hard and soft parsingOracle hard and soft parsing
Oracle hard and soft parsingIshaan Guliani
 
Sun Microsystems Puppet Case Study
Sun Microsystems Puppet Case StudySun Microsystems Puppet Case Study
Sun Microsystems Puppet Case StudyPuppet
 
NICS Puppet Case Study
NICS Puppet Case StudyNICS Puppet Case Study
NICS Puppet Case StudyPuppet
 
Novedades Denali Integration Services
Novedades Denali Integration ServicesNovedades Denali Integration Services
Novedades Denali Integration ServicesSolidQ
 
Tajo: A Distributed Data Warehouse System for Hadoop
Tajo: A Distributed Data Warehouse System for HadoopTajo: A Distributed Data Warehouse System for Hadoop
Tajo: A Distributed Data Warehouse System for HadoopHyunsik Choi
 
Ogce Workflow Suite Tg09
Ogce Workflow Suite Tg09Ogce Workflow Suite Tg09
Ogce Workflow Suite Tg09smarru
 
A Performance Comparison of Container-based Virtualization Systems for MapRed...
A Performance Comparison of Container-based Virtualization Systems for MapRed...A Performance Comparison of Container-based Virtualization Systems for MapRed...
A Performance Comparison of Container-based Virtualization Systems for MapRed...Miguel Xavier
 
Dynamic Provisioning of Data Intensive Computing Middleware Frameworks
Dynamic Provisioning of Data Intensive Computing Middleware FrameworksDynamic Provisioning of Data Intensive Computing Middleware Frameworks
Dynamic Provisioning of Data Intensive Computing Middleware FrameworksLinh Ngo
 
Взгляд на облака с точки зрения HPC
Взгляд на облака с точки зрения HPCВзгляд на облака с точки зрения HPC
Взгляд на облака с точки зрения HPCOlga Lavrentieva
 
High Performance Computing - Cloud Point of View
High Performance Computing - Cloud Point of ViewHigh Performance Computing - Cloud Point of View
High Performance Computing - Cloud Point of Viewaragozin
 

Semelhante a XCPU3: Workload Distribution and Aggregation (20)

Google Compute and MapR
Google Compute and MapRGoogle Compute and MapR
Google Compute and MapR
 
Research Issues in P2P Netwroks
Research Issues in P2P NetwroksResearch Issues in P2P Netwroks
Research Issues in P2P Netwroks
 
20120524 cern data centre evolution v2
20120524 cern data centre evolution v220120524 cern data centre evolution v2
20120524 cern data centre evolution v2
 
Sun Microsystems Puppet Case Study
Sun Microsystems Puppet Case StudySun Microsystems Puppet Case Study
Sun Microsystems Puppet Case Study
 
Erlang Cache
Erlang CacheErlang Cache
Erlang Cache
 
Construction of a Disaster Recovery Plan with Business Only Broadband
Construction of a Disaster Recovery Plan with Business Only BroadbandConstruction of a Disaster Recovery Plan with Business Only Broadband
Construction of a Disaster Recovery Plan with Business Only Broadband
 
Construction of a Disaster Recovery Plan Webinar
Construction of a Disaster Recovery Plan WebinarConstruction of a Disaster Recovery Plan Webinar
Construction of a Disaster Recovery Plan Webinar
 
Oracle hard and soft parsing
Oracle hard and soft parsingOracle hard and soft parsing
Oracle hard and soft parsing
 
Sun Microsystems Puppet Case Study
Sun Microsystems Puppet Case StudySun Microsystems Puppet Case Study
Sun Microsystems Puppet Case Study
 
NICS Puppet Case Study
NICS Puppet Case StudyNICS Puppet Case Study
NICS Puppet Case Study
 
SLPAY: Distributed Systems Made Simple
SLPAY: Distributed Systems Made SimpleSLPAY: Distributed Systems Made Simple
SLPAY: Distributed Systems Made Simple
 
Hadoop availability
Hadoop availabilityHadoop availability
Hadoop availability
 
Novedades Denali Integration Services
Novedades Denali Integration ServicesNovedades Denali Integration Services
Novedades Denali Integration Services
 
Tajo: A Distributed Data Warehouse System for Hadoop
Tajo: A Distributed Data Warehouse System for HadoopTajo: A Distributed Data Warehouse System for Hadoop
Tajo: A Distributed Data Warehouse System for Hadoop
 
Ogce Workflow Suite Tg09
Ogce Workflow Suite Tg09Ogce Workflow Suite Tg09
Ogce Workflow Suite Tg09
 
A Performance Comparison of Container-based Virtualization Systems for MapRed...
A Performance Comparison of Container-based Virtualization Systems for MapRed...A Performance Comparison of Container-based Virtualization Systems for MapRed...
A Performance Comparison of Container-based Virtualization Systems for MapRed...
 
Dynamic Provisioning of Data Intensive Computing Middleware Frameworks
Dynamic Provisioning of Data Intensive Computing Middleware FrameworksDynamic Provisioning of Data Intensive Computing Middleware Frameworks
Dynamic Provisioning of Data Intensive Computing Middleware Frameworks
 
Взгляд на облака с точки зрения HPC
Взгляд на облака с точки зрения HPCВзгляд на облака с точки зрения HPC
Взгляд на облака с точки зрения HPC
 
High Performance Computing - Cloud Point of View
High Performance Computing - Cloud Point of ViewHigh Performance Computing - Cloud Point of View
High Performance Computing - Cloud Point of View
 
Philly DB MapR Overview
Philly DB MapR OverviewPhilly DB MapR Overview
Philly DB MapR Overview
 

Mais de Eric Van Hensbergen

Scaling Arm from One to One Trillion
Scaling Arm from One to One TrillionScaling Arm from One to One Trillion
Scaling Arm from One to One TrillionEric Van Hensbergen
 
Balance, Flexibility, and Partnership: An ARM Approach to Future HPC Node Arc...
Balance, Flexibility, and Partnership: An ARM Approach to Future HPC Node Arc...Balance, Flexibility, and Partnership: An ARM Approach to Future HPC Node Arc...
Balance, Flexibility, and Partnership: An ARM Approach to Future HPC Node Arc...Eric Van Hensbergen
 
ISC14 Embedded HPC BoF Panel Presentation
ISC14 Embedded HPC BoF Panel PresentationISC14 Embedded HPC BoF Panel Presentation
ISC14 Embedded HPC BoF Panel PresentationEric Van Hensbergen
 
Simulation Directed Co-Design from Smartphones to Supercomputers
Simulation Directed Co-Design from Smartphones to SupercomputersSimulation Directed Co-Design from Smartphones to Supercomputers
Simulation Directed Co-Design from Smartphones to SupercomputersEric Van Hensbergen
 
Scalable Elastic Systems Architecture (SESA)
Scalable Elastic Systems Architecture (SESA)Scalable Elastic Systems Architecture (SESA)
Scalable Elastic Systems Architecture (SESA)Eric Van Hensbergen
 
Effect of Virtualization on OS Interference
Effect of Virtualization on OS InterferenceEffect of Virtualization on OS Interference
Effect of Virtualization on OS InterferenceEric Van Hensbergen
 
Systems Support for Many Task Computing
Systems Support for Many Task ComputingSystems Support for Many Task Computing
Systems Support for Many Task ComputingEric Van Hensbergen
 
Holistic Aggregate Resource Environment
Holistic Aggregate Resource EnvironmentHolistic Aggregate Resource Environment
Holistic Aggregate Resource EnvironmentEric Van Hensbergen
 

Mais de Eric Van Hensbergen (20)

Scaling Arm from One to One Trillion
Scaling Arm from One to One TrillionScaling Arm from One to One Trillion
Scaling Arm from One to One Trillion
 
Balance, Flexibility, and Partnership: An ARM Approach to Future HPC Node Arc...
Balance, Flexibility, and Partnership: An ARM Approach to Future HPC Node Arc...Balance, Flexibility, and Partnership: An ARM Approach to Future HPC Node Arc...
Balance, Flexibility, and Partnership: An ARM Approach to Future HPC Node Arc...
 
ISC14 Embedded HPC BoF Panel Presentation
ISC14 Embedded HPC BoF Panel PresentationISC14 Embedded HPC BoF Panel Presentation
ISC14 Embedded HPC BoF Panel Presentation
 
Simulation Directed Co-Design from Smartphones to Supercomputers
Simulation Directed Co-Design from Smartphones to SupercomputersSimulation Directed Co-Design from Smartphones to Supercomputers
Simulation Directed Co-Design from Smartphones to Supercomputers
 
Brasil Ross 2011
Brasil Ross 2011Brasil Ross 2011
Brasil Ross 2011
 
Scalable Elastic Systems Architecture (SESA)
Scalable Elastic Systems Architecture (SESA)Scalable Elastic Systems Architecture (SESA)
Scalable Elastic Systems Architecture (SESA)
 
Multipipes
MultipipesMultipipes
Multipipes
 
Multi-pipes
Multi-pipesMulti-pipes
Multi-pipes
 
VirtFS
VirtFSVirtFS
VirtFS
 
HARE 2010 Review
HARE 2010 ReviewHARE 2010 Review
HARE 2010 Review
 
PUSH-- a Dataflow Shell
PUSH-- a Dataflow ShellPUSH-- a Dataflow Shell
PUSH-- a Dataflow Shell
 
9P Code Walkthrough
9P Code Walkthrough9P Code Walkthrough
9P Code Walkthrough
 
9P Overview
9P Overview9P Overview
9P Overview
 
Push Podc09
Push Podc09Push Podc09
Push Podc09
 
Libra: a Library OS for a JVM
Libra: a Library OS for a JVMLibra: a Library OS for a JVM
Libra: a Library OS for a JVM
 
Effect of Virtualization on OS Interference
Effect of Virtualization on OS InterferenceEffect of Virtualization on OS Interference
Effect of Virtualization on OS Interference
 
PROSE
PROSEPROSE
PROSE
 
Libra Library OS
Libra Library OSLibra Library OS
Libra Library OS
 
Systems Support for Many Task Computing
Systems Support for Many Task ComputingSystems Support for Many Task Computing
Systems Support for Many Task Computing
 
Holistic Aggregate Resource Environment
Holistic Aggregate Resource EnvironmentHolistic Aggregate Resource Environment
Holistic Aggregate Resource Environment
 

Último

A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 

Último (20)

A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 

XCPU3: Workload Distribution and Aggregation

  • 1. 3 XCPU Workload Distribution & Aggregation Pravin Shinde & Eric Van Hensbergen Problem Related Work work node • Workload distribution hasn’t evolved much from when we were batch scheduling tasks to single machines • Today’s Cluster Based Schedulers: System V UNIX Plan 9 from Bell Labs XCPU (LANL) • Not interactive. • Not resilient to failure. Extended basic procfs Provided synthetic file Built an application-layer • Difficult for existing tasks to dynamically grow or shrink resources system access to process concepts by also enabling provided file system for allocated to it. information which was control and debug UNIX systems using the • Difficult to deploy & administer. later extended to a interfaces. The nature of Plan 9 model. XCPU • Based on middleware instead of hierarchy in Linux procfs. the Plan 9 distributed extended previous work by integrated with underlying operating namespace also made allowing process creation system. these process interfaces to occur via the file system • In many cases tightly bound to the available over the network. and allowed for execution underlying runtime or language. • Unlikely to function at exascale. and coordination of groups of processes on remote systems. Our Approach Environment Syntax Name Space File Syntax • Establish hierarchical namespace of cluster services • mount[–abcC] new old: Bind new on old. servename on old. [–abcC] servename old [spec]: Mount • Automount remote servers based on reference (ie. cd /csrv/criswell) c3 • key=value • bind [–abc] host [remotepath] mountpoint: Import remotepath from machine • Export local services for use elsewhere within the network /csrv • OBJTYPE=386 • import attach it to mountpoint. server and • SYSTYPE=Linux • cd dir: Changeold: Unmountdirectory toold, or everything mounted on old if new is /csrv /local • etc. • unmount [new] the working new from dir. /local missing. /l2 t /L • clear: Clear the name space withfile path. Note that path must be present in the rfork(RFCNAMEG). /local /local /c4 • . path:space being built. name Execute the namespace /l1 /local /local /local L /c1 /L /local /local /c2 /l1 arch - architecture & platform (ie. Linux i386) I1 I2 /local /local env - default environment variables for host /c1 /l2 ns - default name space for host /local c1 c2 c3 c4 /local /c3 /c2 fs - access to host file system /local /local net - access to host network (i.e. Plan 9 devip) /c4 /t status - load average, running jobs, available memory /local /local clone - open to establish new session /0 /1 - session subdirectories Control File Syntax /n • reserve [n] [os] [arch] - reserve a (number of) resources with os and arch specification ctl - reservation and task control • dir [wdir] - set theargs ... - spawn a hostthe task to run the command with working directory for env - environment variables for task • exec commands process arguments as given ns - name space for task • kill - kill the-hostthe device to kill the host command when the ctl file is closed command immediately • killonclose set scheduling priority of the host command args - task arguments • nice [n][path] -the standard output to [path] (on executing host) - set • splice splice wait - blocks until all threads complete status - current task status (reserved, running, etc.) !"#$%&' !"#(# stdin - aggregate standard input for task !"#$%& !"#$%&' !"#(# !"#(# stdout - aggregate standard output for task !"#$%&' !"#(# stdio - combined standard I/O for task - component thread session subdirectories !"#$%&' !"#(# !"#$%& !"#$%& !"#$%&' !"#(# !"#(# /0 /n !"#(# ctl - thread control !"#$%&' !"#(# local service proxy service aggregate service !"#$%&' !"#$%& !"#(# !"#$%&' !"#(# env - environment variables for thread ns - name space for thread !"#(# local service !"#$%&' !"#(# args - thread arguments Desktop Extension PUSH Pipeline Model wait - blocks until thread completes remote services status - current thread status (reserved, running, etc.) Aggregation Via stdin - standard input for thread Dynamic Namespace stdout - standard output for thread Scaling Reliability and stdio - standard I/O for thread Distributed Service Model This project is supported in part by the U.S. Department of Energy under Award Number DE-FG02- 08ER25851 http://www.research.ibm.com/austin For More Information: http://www.research.ibm.com/hare