Scheduling In Distributed Systems
          Candidacy exam


                              Andrii Vozniuk
                              EPFL
                              July 4, 2012
Big Data
       Data explosion
       Processing gets more complicated




          Generates: 25 TB/day       Generates: 40 TB/day
          Stores:    10 PB/year      Stores:    20 PB/year

            Resources of many computers should be used
    2
Typical Data Processing Pipeline


    [Diagram: pipeline fed by Log data and Sensor data]
        Clean data: ETL-like batch processing
        Analyze data: using resources of many organizations ("Particle found!")
        Query data: efficient query execution (User model)

           No one-size-fits-all system currently exists
 3
Outline
    Ɣ Gamma - parallel database
        MapReduce - data-intensive system

        Condor - compute-intensive system

 Conclusions
 Future Research




4
Scheduling In Distributed Systems
       Scheduling
           Policy: setting an ordering of tasks
           Assigning resources to tasks

       How to match resources and tasks?




              Scheduling is challenging in distributed systems
    5
Matching Tasks With Resources
       Perspectives
           Data model
           Execution model


             System / Perspective   Data model      Execution model
             Gamma                  Relational      Multioperator
             MapReduce              Unconstrained   MapReduce
             Condor                 Unconstrained   Unconstrained




            How is scheduling influenced by the data and execution models?
    6
Gamma                                                Ɣ
       Pioneering parallel database
       Data model: constrained
           Relational data model
           Relations are horizontally partitioned
       Execution model: constrained
           Multioperator queries
           Operators employ hash-based algorithms




    7
Gamma: Scheduler                                                         Ɣ
    [Diagram: query "SELECT r FROM R WHERE r < 'k'" enters the Query Manager on the Host Machine]
        Query Manager (with Catalog): optimizes query, compiles plan
        Scheduler Process (Gamma Database): schedules operators
        Operator Processes: execution on relevant nodes
            Node 1 stores a-m, Node 2 stores n-z



          Scheduling is done at the operator level
 8
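
A minimal sketch of "execution on relevant nodes", assuming relation R is range-partitioned on r as in the diagram (a-m on Node 1, n-z on Node 2). The names PARTITIONS and relevant_nodes are hypothetical and are not Gamma's interface; the sketch only illustrates how a scheduler could start operator processes on just the nodes whose key range can satisfy r < 'k'.

```python
# Hypothetical range partitioning of relation R on attribute r,
# mirroring the slide: Node 1 stores keys a-m, Node 2 stores keys n-z.
PARTITIONS = {
    "Node 1": ("a", "m"),
    "Node 2": ("n", "z"),
}


def relevant_nodes(upper_bound, partitions=PARTITIONS):
    """Return the nodes whose key range can contain values r < upper_bound."""
    # A node is relevant if the smallest key it stores is below the bound.
    return [node for node, (low, _high) in partitions.items() if low < upper_bound]


# For SELECT r FROM R WHERE r < 'k', only Node 1 needs an operator process.
print(relevant_nodes("k"))
```

With a bound such as 'p', both nodes would be selected, because Node 2's range starts at 'n'.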
Gamma: Batch Scheduling                                           Ɣ
       Exploit sharing by scheduling in a batch
       Example of selection sharing


       [Diagram: without sharing, σ1 and σ2 each scan A; with a shared scan, A is read once and feeds both σ1 and σ2]



       Reads of A can be shared applying predicates in turn
       Shared relation A is scanned only once


              Batch scheduling trades latency for throughput
    9
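
The shared scan can be sketched as follows. This is an illustration under simplifying assumptions (an in-memory list stands in for relation A, and shared_scan, sigma1 and sigma2 are hypothetical names), not Gamma's implementation: the relation is read once and each query's predicate is applied to every tuple in turn.

```python
def shared_scan(relation, predicates):
    """Scan `relation` once and apply every query's predicate to each tuple."""
    results = [[] for _ in predicates]
    for row in relation:  # relation A is read only once
        for i, predicate in enumerate(predicates):  # predicates applied in turn
            if predicate(row):
                results[i].append(row)
    return results


A = [3, 8, 12, 17, 21]


def sigma1(x):
    """Selection of the first query in the batch."""
    return x < 10


def sigma2(x):
    """Selection of the second query in the batch."""
    return x % 2 == 1


out1, out2 = shared_scan(A, [sigma1, sigma2])
print(out1, out2)
```

The first query cannot return before the batch has been formed, which is the latency cost that the slide trades for throughput.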
Gamma: Batch Scheduling Joins                                           Ɣ
    Several hash-joins in a batch of queries
    Hash table for the same relation can be shared
    Example assumes 100% selectivity of σ
    [Diagram: two queries, σ(A) ⋈ σ(B) and σ(A) ⋈ σ(C); in the batch, both joins use a shared hash table for A]


    Sharing reduces I/O and memory usage

             Sharing among joins reduces total execution time
    10
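
A sketch of hash-table sharing between two joins in a batch, assuming both queries join on the same attribute of A and, as on the slide, that the selections keep every tuple. The relations, the "id" join key and the helper names are made up for illustration.

```python
from collections import defaultdict


def build_hash_table(relation, key):
    """Build phase: hash every tuple of `relation` on `key`."""
    table = defaultdict(list)
    for row in relation:
        table[row[key]].append(row)
    return table


def probe(table, relation, key):
    """Probe phase: emit one joined tuple per matching pair."""
    return [
        {**build_row, **probe_row}
        for probe_row in relation
        for build_row in table.get(probe_row[key], [])
    ]


A = [{"id": 1, "a": "x"}, {"id": 2, "a": "y"}]
B = [{"id": 1, "b": "p"}, {"id": 3, "b": "q"}]
C = [{"id": 2, "c": "u"}, {"id": 2, "c": "v"}]

shared = build_hash_table(A, "id")  # built once for the whole batch
a_join_b = probe(shared, B, "id")   # query 1: A join B
a_join_c = probe(shared, C, "id")   # query 2: A join C
print(a_join_b, a_join_c)
```

A is read and hashed once instead of twice, which is where the I/O and memory savings come from.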
Limitations Of Gamma                                           Ɣ
    Gamma offers
        Efficient query execution
        Sharing in a batch of queries
    Gamma operates on structured data
    Gamma is not suitable for
        Unstructured data processing
        ETL type of workload
        Running at large scale




             A different system for ETL processing is needed
    11
MapReduce
    System for data-intensive applications
    Execution model: constrained
        Job is a set of map and reduce tasks
        Tasks are independent
    Data model: unconstrained
        Arbitrary data format
        Files are partitioned into chunks
        Each chunk is replicated several times




    12
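
To make the execution model concrete, here is a single-process word-count sketch. It is illustrative only: run_job, map_task and reduce_task are hypothetical names, and a real system would run the tasks on different machines. Each map task sees one chunk and shares no state with the others, which is what makes the tasks independently schedulable.

```python
from collections import Counter, defaultdict


def map_task(chunk):
    """Map task: processes one chunk, independent of all other map tasks."""
    return Counter(chunk.split())


def reduce_task(key, values):
    """Reduce task: combines all intermediate values for one key."""
    return key, sum(values)


def run_job(chunks):
    # Map phase: one task per chunk; tasks share no state.
    intermediate = [map_task(chunk) for chunk in chunks]
    # Shuffle: group intermediate values by key.
    grouped = defaultdict(list)
    for partial in intermediate:
        for word, count in partial.items():
            grouped[word].append(count)
    # Reduce phase: one call per key.
    return dict(reduce_task(word, counts) for word, counts in grouped.items())


chunks = ["a b a", "b c", "a c c"]
print(run_job(chunks))
```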
MapReduce: Scheduling
    Example: MapReduce job with 4 Map tasks and 2 Reduce tasks
        [Diagram: Map 1 to Map 4 run next to Chunk1 to Chunk4 and produce Temp1 to Temp4; the two Reduce tasks run alongside Map 1 and Map 4 and produce Result1 and Result2]
    Tasks are scheduled close to data
    Execution is scalable and fault-tolerant
    Execution is elastic
           Fine-grained scheduling improves fault tolerance and elasticity
    13
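
"Tasks are scheduled close to data" can be sketched as a greedy assignment. This is a simplification under stated assumptions, not the scheduler of any particular MapReduce implementation: the node names, slot counts and the assign_map_tasks helper are hypothetical. A map task goes to a free node that holds a replica of its chunk, and falls back to any free node otherwise.

```python
def assign_map_tasks(chunk_replicas, free_slots):
    """Greedy sketch: prefer a free node that stores a replica of the chunk.

    chunk_replicas: dict chunk -> list of nodes holding a replica
    free_slots: dict node -> number of free task slots
    Returns dict chunk -> (node, is_local).
    """
    slots = dict(free_slots)  # work on a copy, leave the caller's dict untouched
    assignment = {}
    for chunk, replicas in chunk_replicas.items():
        local = [node for node in replicas if slots.get(node, 0) > 0]
        if local:
            node, is_local = local[0], True
        else:
            # No replica holder is free: run remotely, data crosses the network.
            remote = [node for node, free in slots.items() if free > 0]
            if not remote:
                break  # no capacity left; remaining tasks wait
            node, is_local = remote[0], False
        slots[node] -= 1
        assignment[chunk] = (node, is_local)
    return assignment


replicas = {
    "Chunk1": ["n1", "n2"],
    "Chunk2": ["n2", "n3"],
    "Chunk3": ["n1", "n2"],
    "Chunk4": ["n1"],
}
plan = assign_map_tasks(replicas, {"n1": 1, "n2": 1, "n3": 2})
print(plan)
```

Because each task is small and independent, a failed or newly added node only affects the tasks assigned to it, which is the link to fault tolerance and elasticity.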
MapReduce: Speculative Execution
    Nodes may become slow
    Speculative execution minimizes job’s response time
    Launch if progress is 20% less than average
    [Diagram: a normal node and a temporarily slow node; the straggler on the slow node gets a backup task]




         Speculative execution works well in a homogeneous environment
    14
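
A sketch of the launch rule, assuming "20% less than average" means a progress score more than 0.2 below the average score of the running tasks. That reading, the stragglers helper and the sample scores are assumptions made for illustration.

```python
def stragglers(progress, threshold=0.2):
    """Return tasks whose progress score is below the average minus `threshold`."""
    average = sum(progress.values()) / len(progress)
    return [task for task, score in progress.items() if score < average - threshold]


# Progress scores in [0, 1] for three running tasks (made-up values).
progress = {"t1": 0.9, "t2": 0.8, "t3": 0.4}
print(stragglers(progress))
```

The rule compares every task against one cluster-wide average, so it implicitly assumes that all nodes make progress at a similar rate.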
Emerging Heterogeneous Infrastructures
    Replacement of failed components
    Extending existing cluster with new machines
    Virtualized data centers of cloud providers
        CPU and RAM are isolated
        Contention for disk and network
    [Chart: IO performance per VM (MB/s) vs. number of VMs on a physical host (1 to 7)]

In many real-life cases the infrastructure is heterogeneous
    15
MapReduce: Heterogeneous Cluster
    [Diagram: fast node and slow node]



    Performance degrades on heterogeneous cluster
        Slow nodes are wasted
        Backup tasks on slow nodes
        All straggling tasks are treated equally
        Thrashing due to excessive speculative execution

     Speculative execution should be improved for heterogeneous clusters
    16
MapReduce: LATE Scheduler
    Idea: back up the task with the largest estimated finish
     time (Longest Approximate Time to End)
        progress rate = progress score / execution time
        estimated time left = (1 - progress score) / progress rate
    Thresholds
        Limit the number of backup tasks
        Launch backup tasks on fast nodes
        Backup only sufficiently slow tasks
         LATE looks forward to prioritize tasks to speculate
    17
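
The two formulas and the three thresholds can be combined into a small selection routine. This is a sketch under assumptions, not the LATE implementation: the parameter names (backup_cap, slow_rate_cutoff, node_is_fast) and the sample tasks are hypothetical, and a task with zero progress is treated as having unbounded time left.

```python
def estimated_time_left(progress_score, execution_time):
    """estimated time left = (1 - progress score) / progress rate."""
    if progress_score <= 0:
        return float("inf")  # assumption: no progress yet means unknown, treated as worst
    progress_rate = progress_score / execution_time
    return (1.0 - progress_score) / progress_rate


def pick_backup(tasks, running_backups, backup_cap, node_is_fast, slow_rate_cutoff):
    """Pick the task to back up, or None if a threshold forbids speculation.

    tasks: dict name -> (progress_score, execution_time)
    """
    # Threshold 1: limit the number of backup tasks.
    # Threshold 2: launch backups only on fast nodes.
    if running_backups >= backup_cap or not node_is_fast:
        return None
    # Threshold 3: only sufficiently slow tasks are candidates.
    candidates = {
        name: (score, elapsed)
        for name, (score, elapsed) in tasks.items()
        if score / elapsed < slow_rate_cutoff
    }
    if not candidates:
        return None
    # Back up the candidate with the longest approximate time to end.
    return max(candidates, key=lambda name: estimated_time_left(*candidates[name]))


tasks = {"a": (0.8, 4.0), "b": (0.2, 2.0), "c": (0.9, 3.0)}
choice = pick_backup(tasks, running_backups=0, backup_cap=1,
                     node_is_fast=True, slow_rate_cutoff=0.25)
print(choice)
```

Task "c" is excluded because its progress rate is above the cutoff, and "b" is chosen over "a" because it has far more estimated time left.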
MapReduce: LATE Example
   Back up the task with Longest Approximate Time to End
   [Timeline at 2 min; axis: Time (min); improvement marked]
       Node 1: 1 task/min
       Node 2: 3x slower, Progress = 66%
           Estimated time left: (1-0.66) / (1/3) = 1
       Node 3: 1.9x slower, Progress = 5.3%
           Estimated time left: (1-0.05) / (1/1.9) = 1.8

LATE correctly identifies the task that hurts the response time the most
18
Limitations Of MapReduce
    MapReduce offers
        High scalability
        Good fault tolerance
        Handling of unstructured data
    MapReduce is not suitable for
        Running on multi-organization infrastructure
        Harvesting idle resources in an organization

     A different system for multi-organization infrastructure is needed
    19
Condor
    Compute-intensive system harvesting idle resources
    Data model: arbitrary
    Execution model: arbitrary
    How to increase utilization and respect the owners?

       Increase resource utilization by scheduling jobs on idle machines
    20
Condor Scheduler: Centralized?
    [Diagram: a single Scheduler places all jobs]
     Efficient but not reliable, possible bottleneck
21
Condor Scheduler: Distributed?
    [Diagram: several Schedulers place jobs]
                 Reliable but inefficient
22
Condor Scheduler: Hybrid!

    [Diagram: a central Matchmaker receives information about tasks and information about nodes from the Schedulers; the interaction steps are numbered 1 to 4]
            Hybrid approach has the best of both worlds
 23
ClassAds: Describing Jobs and Resources
    Job Description
        [MyType = "Job"
         TargetType = "Machine"
         Department = "CompSci"
         Requirements = (other.OpSys == "LINUX" && other.Disk > 10000000)
         Rank = Memory]
    Machine Description
        [MyType = "Machine"
         TargetType = "Job"
         Machine = "nostos.cs.wisc.edu"
         OpSys = "LINUX"
         Disk = 3076077
         Requirements = (LoadAvg <= 0.3) && (KeyboardIdle > (15*60))
         Rank = other.Department == self.Department]
    Requirements should be satisfied
    Candidate with the highest rank is returned
         Matchmaker is suitable for heterogeneous shared clusters
    24
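
The two matching rules (requirements satisfied, highest rank returned) can be sketched with Python dictionaries standing in for ClassAds. This is not the ClassAd language or the Condor API: the lambdas, the machine names m1 to m4 and their attribute values are hypothetical, and only the job's Rank is used to order candidates.

```python
def make_machine(name, disk, memory, load_avg, keyboard_idle):
    """Build a machine ad; all attribute values are made up for illustration."""
    return {
        "MyType": "Machine",
        "Machine": name,
        "OpSys": "LINUX",
        "Disk": disk,
        "Memory": memory,
        "LoadAvg": load_avg,
        "KeyboardIdle": keyboard_idle,
        # Owner policy: accept jobs only when lightly loaded and idle for 15 minutes.
        "Requirements": lambda self, other: self["LoadAvg"] <= 0.3
        and self["KeyboardIdle"] > 15 * 60,
    }


JOB = {
    "MyType": "Job",
    "Department": "CompSci",
    "Requirements": lambda self, other: other["OpSys"] == "LINUX"
    and other["Disk"] > 10_000_000,
    "Rank": lambda self, other: other["Memory"],  # prefer more memory
}


def matchmake(job, machines):
    """Return the highest-ranked machine among mutually acceptable ones, or None."""
    candidates = [
        machine for machine in machines
        if job["Requirements"](job, machine) and machine["Requirements"](machine, job)
    ]
    return max(candidates, key=lambda machine: job["Rank"](job, machine), default=None)


MACHINES = [
    make_machine("m1", disk=20_000_000, memory=64, load_avg=0.1, keyboard_idle=1800),
    make_machine("m2", disk=50_000_000, memory=128, load_avg=0.9, keyboard_idle=3600),
    make_machine("m3", disk=30_000_000, memory=96, load_avg=0.2, keyboard_idle=1200),
    make_machine("m4", disk=3_076_077, memory=256, load_avg=0.0, keyboard_idle=7200),
]
best = matchmake(JOB, MACHINES)
print(best["Machine"])
```

Here m2 is rejected by its owner's policy (load too high) and m4 by the job's disk requirement, so the job is matched with m3, the acceptable machine with the most memory.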
Conclusions
    Scheduling is done at different levels
        Gamma: operator-level scheduling enables sharing
        MapReduce and Condor: arbitrary code => sharing is hard
        Condor: matchmaking gives control over job placement

    Hybrid approaches are promising for big data processing
    Scheduling in heterogeneous deployments is challenging




    25
Thank you for your attention!

        Feedback & Questions?
        Andrii.Vozniuk@epfl.ch




26
References
    Matchmaking: Distributed Resource Management for
     High Throughput Computing by Rajesh Raman, Miron
     Livny and Marvin Solomon.
    Batch Scheduling in Parallel Database Systems by Manish
     Mehta, Valery Soloviev and David J. DeWitt.
    Improving MapReduce performance in heterogeneous
     environments by Matei Zaharia, Andy Konwinski, Anthony
     D. Joseph, Randy Katz and Ion Stoica.
    Slides 14 and 18 exploit presentation ideas from the LATE
     slides for OSDI 2008 by Matei Zaharia.


    27

Mais conteúdo relacionado

Mais procurados

Dichotomy of parallel computing platforms
Dichotomy of parallel computing platformsDichotomy of parallel computing platforms
Dichotomy of parallel computing platformsSyed Zaid Irshad
 
Introduction to Parallel Computing
Introduction to Parallel ComputingIntroduction to Parallel Computing
Introduction to Parallel ComputingAkhila Prabhakaran
 
Database , 8 Query Optimization
Database , 8 Query OptimizationDatabase , 8 Query Optimization
Database , 8 Query OptimizationAli Usman
 
Process scheduling (CPU Scheduling)
Process scheduling (CPU Scheduling)Process scheduling (CPU Scheduling)
Process scheduling (CPU Scheduling)Mukesh Chinta
 
Algorithm And analysis Lecture 03& 04-time complexity.
 Algorithm And analysis Lecture 03& 04-time complexity. Algorithm And analysis Lecture 03& 04-time complexity.
Algorithm And analysis Lecture 03& 04-time complexity.Tariq Khan
 
Instruction level parallelism
Instruction level parallelismInstruction level parallelism
Instruction level parallelismdeviyasharwin
 
Lecture 2 more about parallel computing
Lecture 2   more about parallel computingLecture 2   more about parallel computing
Lecture 2 more about parallel computingVajira Thambawita
 
Finite Automata in compiler design
Finite Automata in compiler designFinite Automata in compiler design
Finite Automata in compiler designRiazul Islam
 
Memory management
Memory managementMemory management
Memory managementcpjcollege
 
Peephole optimization techniques in compiler design
Peephole optimization techniques in compiler designPeephole optimization techniques in compiler design
Peephole optimization techniques in compiler designAnul Chaudhary
 
Cpu scheduling in operating System.
Cpu scheduling in operating System.Cpu scheduling in operating System.
Cpu scheduling in operating System.Ravi Kumar Patel
 
Artificial Intelligence -- Search Algorithms
Artificial Intelligence-- Search Algorithms Artificial Intelligence-- Search Algorithms
Artificial Intelligence -- Search Algorithms Syed Ahmed
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)Yuriy Guts
 
Heterogeneous computing
Heterogeneous computingHeterogeneous computing
Heterogeneous computingRashid Ansari
 
RPC: Remote procedure call
RPC: Remote procedure callRPC: Remote procedure call
RPC: Remote procedure callSunita Sahu
 

Mais procurados (20)

Message passing in Distributed Computing Systems
Message passing in Distributed Computing SystemsMessage passing in Distributed Computing Systems
Message passing in Distributed Computing Systems
 
Dichotomy of parallel computing platforms
Dichotomy of parallel computing platformsDichotomy of parallel computing platforms
Dichotomy of parallel computing platforms
 
Introduction to Parallel Computing
Introduction to Parallel ComputingIntroduction to Parallel Computing
Introduction to Parallel Computing
 
Database , 8 Query Optimization
Database , 8 Query OptimizationDatabase , 8 Query Optimization
Database , 8 Query Optimization
 
Process scheduling (CPU Scheduling)
Process scheduling (CPU Scheduling)Process scheduling (CPU Scheduling)
Process scheduling (CPU Scheduling)
 
Algorithm And analysis Lecture 03& 04-time complexity.
 Algorithm And analysis Lecture 03& 04-time complexity. Algorithm And analysis Lecture 03& 04-time complexity.
Algorithm And analysis Lecture 03& 04-time complexity.
 
Instruction level parallelism
Instruction level parallelismInstruction level parallelism
Instruction level parallelism
 
Lecture 2 more about parallel computing
Lecture 2   more about parallel computingLecture 2   more about parallel computing
Lecture 2 more about parallel computing
 
Daa notes 3
Daa notes 3Daa notes 3
Daa notes 3
 
Finite Automata in compiler design
Finite Automata in compiler designFinite Automata in compiler design
Finite Automata in compiler design
 
Parallel Algorithms
Parallel AlgorithmsParallel Algorithms
Parallel Algorithms
 
Scheduling algorithms
Scheduling algorithmsScheduling algorithms
Scheduling algorithms
 
Memory management
Memory managementMemory management
Memory management
 
Peephole optimization techniques in compiler design
Peephole optimization techniques in compiler designPeephole optimization techniques in compiler design
Peephole optimization techniques in compiler design
 
Cpu scheduling in operating System.
Cpu scheduling in operating System.Cpu scheduling in operating System.
Cpu scheduling in operating System.
 
Artificial Intelligence -- Search Algorithms
Artificial Intelligence-- Search Algorithms Artificial Intelligence-- Search Algorithms
Artificial Intelligence -- Search Algorithms
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
 
Distributed Operating System_4
Distributed Operating System_4Distributed Operating System_4
Distributed Operating System_4
 
Heterogeneous computing
Heterogeneous computingHeterogeneous computing
Heterogeneous computing
 
RPC: Remote procedure call
RPC: Remote procedure callRPC: Remote procedure call
RPC: Remote procedure call
 

Semelhante a Scheduling in distributed systems - Andrii Vozniuk

Hadoop Network Performance profile
Hadoop Network Performance profileHadoop Network Performance profile
Hadoop Network Performance profilepramodbiligiri
 
The Performance of MapReduce: An In-depth Study
The Performance of MapReduce: An In-depth StudyThe Performance of MapReduce: An In-depth Study
The Performance of MapReduce: An In-depth StudyKevin Tong
 
Next generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph labNext generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph labImpetus Technologies
 
Architecting and productionising data science applications at scale
Architecting and productionising data science applications at scaleArchitecting and productionising data science applications at scale
Architecting and productionising data science applications at scalesamthemonad
 
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...Reynold Xin
 
Hanborq Optimizations on Hadoop MapReduce
Hanborq Optimizations on Hadoop MapReduceHanborq Optimizations on Hadoop MapReduce
Hanborq Optimizations on Hadoop MapReduceHanborq Inc.
 
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...Yahoo Developer Network
 
Hanborq optimizations on hadoop map reduce 20120221a
Hanborq optimizations on hadoop map reduce 20120221aHanborq optimizations on hadoop map reduce 20120221a
Hanborq optimizations on hadoop map reduce 20120221aSchubert Zhang
 
MapReduce Paradigm
MapReduce ParadigmMapReduce Paradigm
MapReduce ParadigmDilip Reddy
 
MapReduce Paradigm
MapReduce ParadigmMapReduce Paradigm
MapReduce ParadigmDilip Reddy
 
Взгляд на облака с точки зрения HPC
Взгляд на облака с точки зрения HPCВзгляд на облака с точки зрения HPC
Взгляд на облака с точки зрения HPCOlga Lavrentieva
 
Strata + Hadoop World 2012: Knitting Boar
Strata + Hadoop World 2012: Knitting BoarStrata + Hadoop World 2012: Knitting Boar
Strata + Hadoop World 2012: Knitting BoarCloudera, Inc.
 
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARNHadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARNJosh Patterson
 
High Performance Computing - Cloud Point of View
High Performance Computing - Cloud Point of ViewHigh Performance Computing - Cloud Point of View
High Performance Computing - Cloud Point of Viewaragozin
 
Parallel Data Processing with MapReduce: A Survey
Parallel Data Processing with MapReduce: A SurveyParallel Data Processing with MapReduce: A Survey
Parallel Data Processing with MapReduce: A SurveyKyong-Ha Lee
 
Ling liu part 02:big graph processing
Ling liu part 02:big graph processingLing liu part 02:big graph processing
Ling liu part 02:big graph processingjins0618
 
MapReduce:Simplified Data Processing on Large Cluster Presented by Areej Qas...
MapReduce:Simplified Data Processing on Large Cluster  Presented by Areej Qas...MapReduce:Simplified Data Processing on Large Cluster  Presented by Areej Qas...
MapReduce:Simplified Data Processing on Large Cluster Presented by Areej Qas...areej qasrawi
 
Seminar Presentation Hadoop
Seminar Presentation HadoopSeminar Presentation Hadoop
Seminar Presentation HadoopVarun Narang
 

Semelhante a Scheduling in distributed systems - Andrii Vozniuk (20)

Hadoop Network Performance profile
Hadoop Network Performance profileHadoop Network Performance profile
Hadoop Network Performance profile
 
The Performance of MapReduce: An In-depth Study
The Performance of MapReduce: An In-depth StudyThe Performance of MapReduce: An In-depth Study
The Performance of MapReduce: An In-depth Study
 
Spark
SparkSpark
Spark
 
Next generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph labNext generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph lab
 
Architecting and productionising data science applications at scale
Architecting and productionising data science applications at scaleArchitecting and productionising data science applications at scale
Architecting and productionising data science applications at scale
 
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
 
Hanborq Optimizations on Hadoop MapReduce
Hanborq Optimizations on Hadoop MapReduceHanborq Optimizations on Hadoop MapReduce
Hanborq Optimizations on Hadoop MapReduce
 
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
 
Hanborq optimizations on hadoop map reduce 20120221a
Hanborq optimizations on hadoop map reduce 20120221aHanborq optimizations on hadoop map reduce 20120221a
Hanborq optimizations on hadoop map reduce 20120221a
 
MapReduce Paradigm
MapReduce ParadigmMapReduce Paradigm
MapReduce Paradigm
 
MapReduce Paradigm
MapReduce ParadigmMapReduce Paradigm
MapReduce Paradigm
 
Взгляд на облака с точки зрения HPC
Взгляд на облака с точки зрения HPCВзгляд на облака с точки зрения HPC
Взгляд на облака с точки зрения HPC
 
Hadoop at JavaZone 2010
Hadoop at JavaZone 2010Hadoop at JavaZone 2010
Hadoop at JavaZone 2010
 
Strata + Hadoop World 2012: Knitting Boar
Strata + Hadoop World 2012: Knitting BoarStrata + Hadoop World 2012: Knitting Boar
Strata + Hadoop World 2012: Knitting Boar
 
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARNHadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
 
High Performance Computing - Cloud Point of View
High Performance Computing - Cloud Point of ViewHigh Performance Computing - Cloud Point of View
High Performance Computing - Cloud Point of View
 
Parallel Data Processing with MapReduce: A Survey
Parallel Data Processing with MapReduce: A SurveyParallel Data Processing with MapReduce: A Survey
Parallel Data Processing with MapReduce: A Survey
 
Ling liu part 02:big graph processing
Ling liu part 02:big graph processingLing liu part 02:big graph processing
Ling liu part 02:big graph processing
 
MapReduce:Simplified Data Processing on Large Cluster Presented by Areej Qas...
MapReduce:Simplified Data Processing on Large Cluster  Presented by Areej Qas...MapReduce:Simplified Data Processing on Large Cluster  Presented by Areej Qas...
MapReduce:Simplified Data Processing on Large Cluster Presented by Areej Qas...
 
Seminar Presentation Hadoop
Seminar Presentation HadoopSeminar Presentation Hadoop
Seminar Presentation Hadoop
 

Mais de Andrii Vozniuk

Enhancing Social Media Platforms for Educational and Humanitarian Knowledge S...
Enhancing Social Media Platforms for Educational and Humanitarian Knowledge S...Enhancing Social Media Platforms for Educational and Humanitarian Knowledge S...
Enhancing Social Media Platforms for Educational and Humanitarian Knowledge S...Andrii Vozniuk
 
Embedded interactive learning analytics dashboards with Elasticsearch and Kib...
Embedded interactive learning analytics dashboards with Elasticsearch and Kib...Embedded interactive learning analytics dashboards with Elasticsearch and Kib...
Embedded interactive learning analytics dashboards with Elasticsearch and Kib...Andrii Vozniuk
 
Interactive learning analytics dashboards with ELK (Elasticsearch Logstash Ki...
Interactive learning analytics dashboards with ELK (Elasticsearch Logstash Ki...Interactive learning analytics dashboards with ELK (Elasticsearch Logstash Ki...
Interactive learning analytics dashboards with ELK (Elasticsearch Logstash Ki...Andrii Vozniuk
 
Combining content analytics and activity tracking to mine user interests and ...
Combining content analytics and activity tracking to mine user interests and ...Combining content analytics and activity tracking to mine user interests and ...
Combining content analytics and activity tracking to mine user interests and ...Andrii Vozniuk
 
TPC-DS performance evaluation for JAQL and PIG queries - Andrii Vozniuk, Serg...
TPC-DS performance evaluation for JAQL and PIG queries - Andrii Vozniuk, Serg...TPC-DS performance evaluation for JAQL and PIG queries - Andrii Vozniuk, Serg...
TPC-DS performance evaluation for JAQL and PIG queries - Andrii Vozniuk, Serg...Andrii Vozniuk
 
Contextual learning analytics apps to create awareness in blended inquiry lea...
Contextual learning analytics apps to create awareness in blended inquiry lea...Contextual learning analytics apps to create awareness in blended inquiry lea...
Contextual learning analytics apps to create awareness in blended inquiry lea...Andrii Vozniuk
 
Graspeo: a Social Media Platform for Knowledge Management in NGOs - Andrii Vo...
Graspeo: a Social Media Platform for Knowledge Management in NGOs - Andrii Vo...Graspeo: a Social Media Platform for Knowledge Management in NGOs - Andrii Vo...
Graspeo: a Social Media Platform for Knowledge Management in NGOs - Andrii Vo...Andrii Vozniuk
 
Towards portable learning analytics dashboards - Andrii Vozniuk, Sten Govaert...
Towards portable learning analytics dashboards - Andrii Vozniuk, Sten Govaert...Towards portable learning analytics dashboards - Andrii Vozniuk, Sten Govaert...
Towards portable learning analytics dashboards - Andrii Vozniuk, Sten Govaert...Andrii Vozniuk
 
AngeLA: Putting the teacher in control of student privacy in the online class...
AngeLA: Putting the teacher in control of student privacy in the online class...AngeLA: Putting the teacher in control of student privacy in the online class...
AngeLA: Putting the teacher in control of student privacy in the online class...Andrii Vozniuk
 
Cloud infrastructure. Google File System and MapReduce - Andrii Vozniuk
Cloud infrastructure. Google File System and MapReduce - Andrii VozniukCloud infrastructure. Google File System and MapReduce - Andrii Vozniuk
Cloud infrastructure. Google File System and MapReduce - Andrii VozniukAndrii Vozniuk
 
Symbolic Reasoning and Concrete Execution - Andrii Vozniuk
Symbolic Reasoning and Concrete Execution - Andrii Vozniuk Symbolic Reasoning and Concrete Execution - Andrii Vozniuk
Symbolic Reasoning and Concrete Execution - Andrii Vozniuk Andrii Vozniuk
 

Mais de Andrii Vozniuk (11)

Enhancing Social Media Platforms for Educational and Humanitarian Knowledge S...
Enhancing Social Media Platforms for Educational and Humanitarian Knowledge S...Enhancing Social Media Platforms for Educational and Humanitarian Knowledge S...
Enhancing Social Media Platforms for Educational and Humanitarian Knowledge S...
 
Embedded interactive learning analytics dashboards with Elasticsearch and Kib...
Scheduling in distributed systems - Andrii Vozniuk

  • 1. Scheduling In Distributed Systems
      Candidacy exam
      Andrii Vozniuk, EPFL, July 4, 2012
  • 2. Big Data
      Data explosion
      Processing gets more complicated
      Two example data sources (logos not preserved):
          Generates: 25 TB/day; stores: 10 PB/year
          Generates: 40 TB/day; stores: 20 PB/year
      Resources of many computers should be used
  • 3. Typical Data Processing Pipeline
      Log data -> Clean data (ETL-like batch processing) -> Query data (efficient query execution) -> User model
      Sensor data -> Analyze data (using resources of many organizations) -> Particle found!
      No one-size-fits-all system currently exists
  • 4. Outline
      Gamma - parallel database
      MapReduce - data-intensive system
      Condor - compute-intensive system
      Conclusions
      Future Research
  • 5. Scheduling In Distributed Systems
      Scheduling
          Policy: setting an ordering of tasks
          Assigning resources to tasks
      How to match resources and tasks?
      Scheduling is challenging in distributed systems
  • 6. Matching Tasks With Resources
      Perspectives
          Data model
          Execution model

      System/Perspective   Data model      Execution model
      Gamma                Relational      Multioperator
      MapReduce            Unconstrained   MapReduce
      Condor               Unconstrained   Unconstrained

      How is scheduling influenced by the data and execution models?
  • 7. Gamma
      Pioneering parallel database
      Data model: constrained
          Relational data model
          Relations are horizontally partitioned
      Execution model: constrained
          Multioperator queries
          Operators employ hash-based algorithms
  • 8. Gamma: Scheduler
      Example query: SELECT r FROM R WHERE r < 'k'
      [Diagram: the query arrives at the Query Manager on the host machine, which consults the Catalog, optimizes the query and compiles the plan; the Scheduler Process schedules operators; Operator Processes run on Node 1 (partition a-m) and Node 2 (partition n-z) of the Gamma database]
      Execution on relevant nodes
      Scheduling is done at the operator level
  • 9. Gamma: Batch Scheduling
      Exploit sharing by scheduling in a batch
      Example of selection sharing
          [Diagram: selections σ1 and σ2 each scanning A separately, versus σ1 and σ2 over one shared scan of A]
      Reads of A can be shared by applying the predicates in turn
      Shared relation A is scanned only once
      Batch scheduling trades latency for throughput
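The shared scan on slide 9 can be illustrated with a minimal Python sketch. This is not Gamma's implementation; the function name `shared_scan`, the row format, and the sample predicates are assumptions made for illustration only.

```python
from typing import Callable, Iterable

Row = dict
Predicate = Callable[[Row], bool]


def shared_scan(relation: Iterable[Row], predicates: list[Predicate]) -> list[list[Row]]:
    """Scan the relation once, applying every query's predicate to each row in turn."""
    results: list[list[Row]] = [[] for _ in predicates]
    for row in relation:                      # single pass over relation A
        for i, pred in enumerate(predicates):
            if pred(row):
                results[i].append(row)
    return results


# Hypothetical relation A and two selections batched together.
A = [{"r": "apple"}, {"r": "kiwi"}, {"r": "zebra"}]
sigma1 = lambda row: row["r"] < "k"
sigma2 = lambda row: row["r"] >= "k"
out1, out2 = shared_scan(A, [sigma1, sigma2])
```

Both queries wait for the same pass to finish, which is the latency-for-throughput trade the slide mentions.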
  • 10. Gamma: Batch Scheduling Joins
      Several hash-joins in a batch of queries
      Hash table for the same relation can be shared
      Example assumes 100% selectivity of σ
          [Diagram: separate joins A ⋈ B and A ⋈ C over selections σ, versus both joins probing one shared hash table for A]
      Sharing reduces I/O and memory usage
      Sharing among joins reduces total execution time
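A minimal Python sketch of the shared hash table on slide 10: the table for A is built once and probed by two joins. The helper names `build_hash_table` and `probe`, the join key `id`, and the sample rows are hypothetical; Gamma's partitioned hash-join algorithms are considerably more involved.

```python
from collections import defaultdict


def build_hash_table(relation, key):
    """Build phase: hash every row of the shared relation on the join key."""
    table = defaultdict(list)
    for row in relation:
        table[row[key]].append(row)
    return table


def probe(table, relation, key):
    """Probe phase: look up each row of the other relation in the shared table."""
    return [{**a, **b} for b in relation for a in table.get(b[key], [])]


# Hypothetical relations; A is read and hashed only once.
A = [{"id": 1, "a": "x"}, {"id": 2, "a": "y"}]
B = [{"id": 1, "b": "p"}]
C = [{"id": 2, "c": "q"}, {"id": 3, "c": "r"}]

hash_a = build_hash_table(A, "id")
a_join_b = probe(hash_a, B, "id")
a_join_c = probe(hash_a, C, "id")
```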
  • 11. Limitations Of Gamma
      Gamma offers
          Efficient query execution
          Sharing in a batch of queries
      Gamma operates on structured data
      Gamma is not suitable for
          Unstructured data processing
          ETL type of workload
          Running on large scale
      A different system for ETL processing is needed
  • 12. MapReduce
      System for data-intensive applications
      Execution model: constrained
          Job is a set of map and reduce tasks
          Tasks are independent
      Data model: unconstrained
          Arbitrary data format
          Files are partitioned into chunks
          Each chunk is replicated several times
  • 13. MapReduce: Scheduling
      Example MapReduce job: 4 map tasks, 2 reduce tasks
          [Diagram: Map 1 to Map 4 read Chunk1 to Chunk4 and write Temp1 to Temp4; the two Reduce tasks produce Result1 and Result2]
      Tasks are scheduled close to data
      Execution is scalable and fault-tolerant
      Execution is elastic
      Fine-grained scheduling improves fault tolerance and elasticity
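"Tasks are scheduled close to data" can be sketched as a simple locality preference: when a node asks for work, prefer a pending map task whose chunk has a replica on that node. The function `pick_task`, the node names, and the replica map below are illustrative assumptions, not the actual MapReduce scheduler.

```python
def pick_task(node, pending, replicas):
    """Return a pending chunk for `node`, preferring one stored locally."""
    for chunk in pending:
        if node in replicas[chunk]:   # data-local: a replica lives on this node
            return chunk
    # No local chunk available: fall back to any pending task (remote read).
    return pending[0] if pending else None


# Hypothetical placement of chunk replicas on nodes.
replicas = {"chunk1": {"n1", "n2"}, "chunk2": {"n2", "n3"}}
pending = ["chunk1", "chunk2"]

local_choice = pick_task("n3", pending, replicas)
remote_choice = pick_task("n4", pending, replicas)
```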
  • 14. MapReduce: Speculative Execution
      Nodes may become slow
      Speculative execution minimizes the job's response time
      Launch a backup task if progress is 20% less than average
          [Diagram: a normal node, a temporarily slow node running a straggler, and a backup task]
      Speculative execution works well in a homogeneous environment
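A minimal sketch of the backup rule on slide 14. It assumes that "20% less than average" means a progress score more than 0.2 below the mean on a 0 to 1 scale; the function name `stragglers` and the sample scores are hypothetical.

```python
def stragglers(progress: dict[str, float], gap: float = 0.2) -> list[str]:
    """Return the tasks whose progress score is more than `gap` below the average."""
    if not progress:
        return []
    average = sum(progress.values()) / len(progress)
    return [task for task, score in progress.items() if score < average - gap]


# Hypothetical progress scores: t3 runs on a temporarily slow node.
scores = {"t1": 0.9, "t2": 0.85, "t3": 0.3}
to_back_up = stragglers(scores)
```

Note that the rule compares every task against one global average, which is why it implicitly assumes nodes of similar speed.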
  • 15. Emerging Heterogeneous Infrastructures
      Replacement of failed components
      Extending existing cluster with new machines
      Virtualized data centers of cloud providers
          CPU and RAM are isolated
          Contention for disk and network IO
          [Chart: performance per VM (MB/s) versus number of VMs on a physical host, 1 to 7]
      In many real-life cases the infrastructure is heterogeneous
  • 16. MapReduce: Heterogeneous Cluster
      [Diagram: a fast node and a slow node]
      Performance degrades on a heterogeneous cluster
      Slow nodes are wasted
      Backup tasks on slow nodes
      All straggling tasks are treated equally
      Thrashing due to excessive speculative execution
      Speculative execution should be improved for heterogeneous clusters
  • 17. MapReduce: LATE Scheduler
      Idea: back up the task with the largest estimated finish time (Longest Approximate Time to End)
          progress rate = progress score / execution time
          estimated time left = (1 - progress score) / progress rate
      Thresholds
          Limit the number of backup tasks
          Launch backup tasks on fast nodes
          Back up only sufficiently slow tasks
      LATE looks forward to prioritize tasks to speculate
  • 18. MapReduce: LATE Example
      Back up the task with the Longest Approximate Time to End
      Node 1: 1 task/min
      Node 2: 3x slower; progress = 66%; estimated time left: (1-0.66) / (1/3) = 1
      Node 3: 1.9x slower; progress = 5.3%; estimated time left: (1-0.05) / (1/1.9) = 1.8
      [Timeline: time in minutes, with the 2 min mark and the improvement from the backup task]
      LATE correctly identifies the task which hurts the response time the most
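The estimate from slides 17 and 18 can be reproduced with a short Python sketch. Only the core selection rule is shown; the thresholds from slide 17 are omitted. The `Task` class, the function names, and the elapsed times (derived here from the stated progress and slowdown figures) are assumptions for illustration.

```python
from dataclasses import dataclass


@dataclass
class Task:
    name: str
    progress_score: float   # fraction of work done, between 0 and 1
    elapsed_min: float      # running time so far, in minutes


def progress_rate(task: Task) -> float:
    return task.progress_score / task.elapsed_min


def estimated_time_left(task: Task) -> float:
    return (1.0 - task.progress_score) / progress_rate(task)


def pick_backup(tasks: list[Task]):
    """Choose the running task with the longest approximate time to end."""
    running = [t for t in tasks if 0 < t.progress_score < 1 and t.elapsed_min > 0]
    return max(running, key=estimated_time_left, default=None)


# Node 2 is 3x slower: 66% done after 2 min.
# Node 3 is 1.9x slower: 5.3% done after about 0.1 min (0.053 * 1.9).
on_node2 = Task("task on node 2", 0.66, 2.0)
on_node3 = Task("task on node 3", 0.053, 0.053 * 1.9)
chosen = pick_backup([on_node2, on_node3])
```

A progress-based rule would back up the task on node 3 simply because it is far behind; here it is chosen because its estimated finish is the latest, which is the quantity that determines the job's response time.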
  • 19. Limitations Of MapReduce
      MapReduce offers
          High scalability
          Good fault tolerance
          Handling of unstructured data
      MapReduce is not suitable for
          Running on multi-organization infrastructure
          Harvesting idle resources in an organization
      A different system for multi-organization infrastructure is needed
  • 20. Condor
      Compute-intensive system harvesting idle resources
      Data model: arbitrary
      Execution model: arbitrary
      How to increase utilization and respect the owners?
      Increase resource utilization by scheduling jobs on idle machines
  • 21. Condor Scheduler: Centralized?
      [Diagram: one scheduler assigning all jobs]
      Efficient but not reliable, possible bottleneck
  • 22. Condor Scheduler: Distributed?
      [Diagram: several schedulers, each assigning its own jobs]
      Reliable but inefficient
  • 23. Condor Scheduler: Hybrid!
      [Diagram: schedulers send information about tasks and information about nodes to a central Matchmaker; the interaction is drawn as numbered steps 1 to 4]
      Hybrid approach has the best of both worlds
  • 24. ClassAds: Describing Jobs and Resources
      Job description:
          [MyType = "Job"
           TargetType = "Machine"
           Department = "CompSci"
           Requirements = (other.OpSys==LINUX && other.Disk > 10000000)
           Rank = Memory]
      Machine description:
          [MyType = "Machine"
           TargetType = "Job"
           Machine = "nostos.cs.wisc.edu"
           OpSys = "LINUX"
           Disk = 3076077
           Requirements = (LoadAvg <= 0.3) && (KeyboardIdle > (15*60))
           Rank = other.Department==self.Department]
      Requirements should be satisfied
      Candidate with the highest rank is returned
      Matchmaker is suitable for heterogeneous shared clusters
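The two matchmaking rules on slide 24 (both sides' requirements must hold, then the highest-ranked candidate is returned) can be sketched in Python. This is not the ClassAd language or Condor's evaluator: ads are modelled as plain dictionaries with Python callables, and the machine names and attribute values are hypothetical.

```python
def matches(job, machine):
    """A match needs the requirements of both the job and the machine to hold."""
    return (job["requirements"](job, machine)
            and machine["requirements"](machine, job))


def best_match(job, machines):
    """Return the compatible machine that the job ranks highest, or None."""
    candidates = [m for m in machines if matches(job, m)]
    return max(candidates, key=lambda m: job["rank"](job, m), default=None)


def machine_ad(name, disk, memory, load_avg, keyboard_idle):
    return {
        "Machine": name, "OpSys": "LINUX", "Disk": disk, "Memory": memory,
        "LoadAvg": load_avg, "KeyboardIdle": keyboard_idle,
        # The owner's policy: only accept jobs while the machine is idle.
        "requirements": lambda me, other: (me["LoadAvg"] <= 0.3
                                           and me["KeyboardIdle"] > 15 * 60),
    }


job_ad = {
    "Department": "CompSci",
    "requirements": lambda me, other: (other["OpSys"] == "LINUX"
                                       and other["Disk"] > 10_000_000),
    "rank": lambda me, other: other["Memory"],   # prefer more memory
}

machines = [
    machine_ad("m1.example.org", 20_000_000, 4096, 0.1, 3600),
    machine_ad("m2.example.org", 20_000_000, 8192, 0.9, 3600),  # busy: owner refuses
    machine_ad("m3.example.org", 3_076_077, 8192, 0.1, 3600),   # disk too small
    machine_ad("m4.example.org", 50_000_000, 2048, 0.1, 3600),
]
chosen_machine = best_match(job_ad, machines)
```

Because each side states its own requirements, owners keep control over when their machines are used while jobs still constrain where they run.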
  • 25. Conclusions
      Scheduling is done at different levels
      Gamma: operator-level scheduling enables sharing
      MapReduce and Condor: arbitrary code => sharing is hard
      Condor: matchmaking gives control over job placement
      Hybrid approaches are promising for big data processing
      Scheduling in heterogeneous deployments is challenging
  • 26. Thank you for your attention!
      Feedback and questions?
      Andrii.Vozniuk@epfl.ch
  • 27. References
      Rajesh Raman, Miron Livny and Marvin Solomon. Matchmaking: Distributed Resource Management for High Throughput Computing.
      Manish Mehta, Valery Soloviev and David J. DeWitt. Batch Scheduling in Parallel Database Systems.
      Matei Zaharia, Andy Konwinski, Anthony D. Joseph, Randy Katz and Ion Stoica. Improving MapReduce Performance in Heterogeneous Environments.
      Slides 14 and 18 exploit presentation ideas from the LATE slides for OSDI 2008 by Matei Zaharia.