SlideShare uma empresa Scribd logo
1 de 47
Baixar para ler offline
glideinWMS Training @ UCSD




                     Condor overview
                        by Igor Sfiligoi (UCSD)




UCSD Jan 17th 2012               Condor           1
Acknowledgement
 ●   These slides are heavily based on the
     presentation Todd Tannenbaum gave at
     CERN in Feb 2011
     https://indico.cern.ch/conferenceTimeTable.py?confId=124982#20110214.detailed




UCSD Jan 17th 2012                                Condor                             2
Outline
                     ●   What is Condor
                     ●   Condor principles
                     ●   Condor daemons
                     ●   Condor protocol overview




UCSD Jan 17th 2012                  Condor          3
What is Condor




UCSD Jan 17th 2012         Condor     4
What is Condor
 ●   Condor is a Workload Management System
      ●   i.e. a batch system
 ●   Strong points
      ●   Fault tolerant
      ●   Robust feature set
      ●   Flexible
 ●   Development team dedicated to working closely
     w/ scientific community as priority #1


UCSD Jan 17th 2012              Condor           5
How can Condor be used
 ●   Managing local processes (local)
                                             Only vanilla
 ●   Managing local cluster (~vanilla)       in this talk
 ●   Connecting clusters (flocking)
 ●   Handling resource overlays (glideins)
 ●   Swiss-knife for accessing other WMS
     (Condor-G)
      ●   e.g. Grid, Cloud, pbs, etc.



UCSD Jan 17th 2012              Condor                  6
(Vanilla)
                     Condor principles




UCSD Jan 17th 2012          Condor       7
(Vanilla) Condor principles
 ●   Two parts of the equation
      ●   Jobs
      ●   Machines/Resources
 ●   Jobs
      ●   Condor’s quanta of work
      ●   Like a UNIX process
      ●   Can be an element of a workflow
 ●   Machines
      ●   Represent available resources
      ●   Mostly CPU, but indirectly memory and disk as well

UCSD Jan 17th 2012             Condor                          8
Jobs Have Wants & Needs
 ●   Jobs state their requirements and preferences:
      ●   Requirements:
           –   I require a Linux/x86 platform
      ●   Preferences ("Rank"):
           –   I prefer a machine owned by CMS
 ●   Jobs describe themselves via attributes:
      ●   Standard, i.e. defined by Condor:
           –   I am owned by Albert
      ●   Custom, i.e. specified by the user (or the administrator):
           –   I am a Monte Carlo job
           –   I will be done within 12h

UCSD Jan 17th 2012                    Condor                           9
Machines Do Too!
 ●   Machine requirements and preferences:
      ●   Requirements:
           –   I require that jobs declare a runtime shorter than 18h
      ●   Preferences ("Rank"):
           –   I prefer Monte Carlo jobs
 ●   Machine attributes:
      ●   Standard, i.e. defined by Condor:
           –   I am a Linux node
           –   I control 2GB of memory
      ●   Custom, i.e. specified by the administrator:
           –   I have been paid with CMS money

UCSD Jan 17th 2012                   Condor                             10
Condor brings them together




                         Central manager

                            Condor

        Job repository
                                           Machine
            Condor                         Condor




UCSD Jan 17th 2012              Condor               11
Condor ClassAds




                           Classified Ads




UCSD Jan 17th 2012          Condor          12
What are Condor ClassAds?
 ●   ClassAds is a language for objects
     (jobs and machines) to
      ●   Express attributes about themselves
      ●   Express what they require/desire in a match
          (similar to personal classified ads)
 ●   Structure
      ●   Set of attribute name/value pairs
      ●   Value : Literals (string, bool, int, float)
                  or an expression


UCSD Jan 17th 2012                 Condor               13
Example ClassAd

                     MyType = "Machine"
                      MyType = "Machine"
                     TargetType = "Job"
                      TargetType = "Job"
                     Name = "glidein_999@cabinet-2-2-1.t2.ucsd.edu"
                      Name = "glidein_999@cabinet-2-2-1.t2.ucsd.edu"
                     Machine = "cabinet-2-2-1.t2.ucsd.edu"
                      Machine = "cabinet-2-2-1.t2.ucsd.edu"
                     StartdIpAddr = "<169.228.131.179:56787>"
                      StartdIpAddr = "<169.228.131.179:56787>"
                     State = "Claimed"
                      State = "Claimed"
                     Activity = "Busy"
                      Activity = "Busy"
                     Cpus = 1
                      Cpus = 1
                     Memory = 36170
                      Memory = 36170
                     Disk = 231463800
                      Disk = 231463800
                     OpSys = "LINUX"
                      OpSys = "LINUX"
                     Arch = "X86_64"
                      Arch = "X86_64"
                     Requirements = JOB_Is_ITB != true
                      Requirements = JOB_Is_ITB != true
                     Rank = 1
                      Rank = 1
                     KFlops = 972989
                      KFlops = 972989
                     Mips = 3499
                      Mips = 3499
                     HasFileTransfer = true
                      HasFileTransfer = true
                     IS_GLIDEIN = true
                      IS_GLIDEIN = true
                     GLIDEIN_SEs = "bsrm-1.t2.ucsd.edu"
                      GLIDEIN_SEs = "bsrm-1.t2.ucsd.edu"
                     DaemonStartTime = 1324784426
                      DaemonStartTime = 1324784426


UCSD Jan 17th 2012                           Condor                    14
ClassAd Expressions
 ●   Similar look to C : operators, references, functions
 ●   Operators: +, -, *, /, <, <=,>, >=, ==, !=, &&, and ||
     all work as expected
      ●   Type checking ops: =?=, =!=
 ●   Functions: if/then/else, string manipulation,
     list operations, dates, randomization, …
 ●   References: to other attributes in the same ad, or
     attributes in an ad that is a candidate for a match
 ●   True==1 and False==0 (guaranteed)
      ●   e.g. (3 == (2+True)) is identical to True
 ●   Explicit UNDEFINED
                     http://www.cs.wisc.edu/condor/manual/v7.6/4_1Condor_s_ClassAd.html#SECTION00512300000000000000

UCSD Jan 17th 2012                                   Condor                                                     15
Example Expression


   ifthenelse(LastVacateTime=?=UNDEFINED,
     ifthenelse(LastVacateTime=?=UNDEFINED,
               ifthenelse(NormMaxMins=!=UNDEFINED,
                 ifthenelse(NormMaxMins=!=UNDEFINED,
                         (NormMaxMins*60)<(ToRetire+JobMaxTime-MyCurrentTime),
                           (NormMaxMins*60)<(ToRetire+JobMaxTime-MyCurrentTime),
                          (8*3600)<(ToRetire+JobMaxTime-MyCurrentTime)),
                            (8*3600)<(ToRetire+JobMaxTime-MyCurrentTime)),
               ifthenelse(MaxMins=!=UNDEFINED,
                 ifthenelse(MaxMins=!=UNDEFINED,
                          (MaxMins*60)<(ToRetire+JobMaxTime-MyCurrentTime),
                            (MaxMins*60)<(ToRetire+JobMaxTime-MyCurrentTime),
                          (16*3600)<(ToRetire+JobMaxTime-MyCurrentTime)))&&
                            (16*3600)<(ToRetire+JobMaxTime-MyCurrentTime)))&&
   (ImageSize<(MaxMemMBs*1024))&&
     (ImageSize<(MaxMemMBs*1024))&&
   (stringListMember(GLIDEIN_SEs,DESIRED_SEs,",")=?=True)&&
     (stringListMember(GLIDEIN_SEs,DESIRED_SEs,",")=?=True)&&
   (JOB_Is_ITB =!= TRUE)
     (JOB_Is_ITB =!= TRUE)




UCSD Jan 17th 2012                    Condor                                  16
ClassAd Types
 ●   Condor has many types of ClassAds
      ●   A "Job Ad" represents a job to Condor
      ●   A "Machine Ad" represents a computing resource
      ●   Others types of ads represent instances of
          other services, users, licenses, etc

                             glideinWMS defines some




UCSD Jan 17th 2012              Condor                     17
Central Manager holds them all



                                                              Machine

                                  Central manager             Machine
        Job repository
                                                              Machine
                                     Condor
        Job repository
                                                              Machine
        Job repository
                         Job Ad                     Machine   Machine
            Condor                                  Ad
                                                              Condor




UCSD Jan 17th 2012                       Condor                         18
Match & start



                                            Machine

                          Central manager   Machine
        Job repository
                                            Machine
                             Condor
        Job repository
                                            Machine
        Job repository
                                            Machine
            Condor                Job
                                            Condor

                                              Job


UCSD Jan 17th 2012               Condor               19
The Magic of Matchmaking
 ●   Two ads match if both their Requirements expressions
     evaluate to True
     ●   If more than one match, the match with
         the highest Rank is preferred (float)
 ●   Condor evaluates job ads in the context
     of a candidate machine ad looking for a match
     ●   MY.name – Value for attribute “name” in local ClassAd
     ●   TARGET.name – Value for attribute “name” in match candidate
         ClassAd
     ●   Name – Looks for “name” in the local ClassAd, then the
         candidate ClassAd



UCSD Jan 17th 2012                  Condor                             20
Example Fancy Match

   Pet Ad                       Buyer Ad
  MyType = “Pet”                MyType = “Buyer”
  TargetType = “Buyer”          TargetType = “Pet”
  Requirements =                Requirements =
      DogLover =?= True          (PetType == “Dog”) &&
  Rank = 0                       (TARGET.Price <= MY.AcctBalance) &&
  PetType = “Dog”                (Size == "Large"||Size == "Very Large")
  Color = “Brown”               Rank = (Breed == "Saint Bernard")
  Price = 75                    AcctBalance = 100
  Breed = "Saint Bernard"       DogLover = True
  Size = "Very Large"           ...
  ...
   Dog == Resource ~= Machine                               Buyer ~= Job

UCSD Jan 17th 2012               Condor                                21
(Vanilla)
                     Condor Daemons




UCSD Jan 17th 2012         Condor     22
Condor Daemons –
                Mix’n Match Components
                              Negotiator


                              Collector

                     Master                  Shadow
                               Schedd
                                             Procd

                                Startd
                                             Starter


                                                       Job



UCSD Jan 17th 2012                  Condor                   23
Condor Daemons –
                Mix’n Match Components
                Central
                                  Negotiator
                Manager


                                      Collector
                                                                 Submit
                     Master                         Shadow         Node
                                                              (job repo)
                                      Schedd
                                                    Procd

                                       Startd
                                                    Starter


                                                              Job
             Execute node (Machine)



UCSD Jan 17th 2012                         Condor                          24
Central manager
                                                         Submit node
                      condor_master                     Execute node


 ●   You start it, it starts up the other Condor daemons
     ●   If a daemon exits unexpectedly, restarts deamon and
         emails administrator
     ●   If a daemon binary is updated (timestamp changed),
         restarts the daemon
 ●   Provides access to many remote administration
     commands:
     ●   condor_reconfig, condor_restart,
         condor_off, condor_on, etc.
 ●   Default server for many other commands:
     ●   condor_config_val, etc.


UCSD Jan 17th 2012                 Condor                        25
Submit node
                      condor_procd                 Execute node


 ●   Monitors all other processes on the node
      ●   Information then used by the other daemons
 ●   Builds process tree
      ●   Tracks birth and death of processes
      ●   Monitors resource consumption (memory, CPU)




UCSD Jan 17th 2012             Condor                          26
Submit node
                      condor_schedd
 ●   Represents jobs to the Condor pool
 ●   Maintains persistent queue of jobs
      ●   Queue is not strictly first-in-first-out (priority based)
      ●   Each machine running condor_schedd maintains
          its own independent queue
 ●   Responsible for contacting available machines
     and spawning waiting jobs
      ●   When told to by condor_negotiator
 ●   Services most user commands:
      ●   condor_submit, condor_rm, condor_q

UCSD Jan 17th 2012                Condor                              27
Submit node
                     condor_shadow
 ●   Spawned by condor_schedd
 ●   Represents a running job on the
     submit machine
      ●   Yes, one per running job
 ●   Handles file transfers
 ●   Enforces Periodic_* expressions
      ●   Hold, release, remove, ...




UCSD Jan 17th 2012              Condor           28
condor_startd            Execute node


 ●   Represents a machine willing to run jobs to
     the Condor pool
 ●   Run on any machine you want to run jobs on
 ●   Enforces the wishes of the machine owner
     (the owner’s “policy”)
 ●   Starts, stops, suspends jobs
 ●   Provides other administrative commands
      ●   for example, condor_vacate


UCSD Jan 17th 2012            Condor                   29
condor_starter                    Execute node


 ●   Spawned by the condor_startd
 ●   Handles all the details of
     starting and managing the job
      ●   Transfer job’s binary to execute machine
      ●   Send back exit status
      ●   Etc.
 ●   One per running job
      ●   The default configuration is willing to run
          one condor_starter per CPU


UCSD Jan 17th 2012               Condor                          30
Central manager


                     condor_collector
 ●   Collects information from all other Condor
     daemons in the pool
 ●   Each daemon sends a periodic update called
     a ClassAd to the collector
      ●   Old ClassAds removed after a timeout (~15 mins)
 ●   Services queries for information:
      ●   Queries from other Condor daemons
      ●   Queries from users (condor_status)


UCSD Jan 17th 2012            Condor                      31
Central manager


                     condor_negotiator
 ●   Performs matchmaking in Condor
      ●   Pulls list of available machines from
          condor_collector, gets jobs from condor_schedds
      ●   Matches jobs with available machines
      ●   Both the job and the machine must satisfy each
          other’s requirements (2-way matching)
 ●   Handles user priorities and accounting




UCSD Jan 17th 2012            Condor                         32
Sample Condor pool




UCSD Jan 17th 2012           Condor       33
Sample Condor pool

                              Central manager
                              Master               Machine

                                  Collector        Machine
        Job repository
                                                   Machine
        Job repository           Negotiator
                                                   Machine
        Job repository
                                                   Machine
        Master                                       Master

             Schedd                                Startd

                     Shadow                     Shadow

                                                         Job


UCSD Jan 17th 2012                     Condor                  34
CCB: Condor Connection Broker
 ●   Condor wants two-way p2p connectivity
 ●   With CCB, one-way is good enough
      ●   Collector requests reversed connections for clients
                                                                  Execute
                                  CCB                             Node
      Job Submit Point
                           Call me back

                               I want to connect
                             to the execute node


                                  transfer files
          reversed connection                      CCB_ADDRESS=ccb.host.name

UCSD Jan 17th 2012                  Condor                                  35
Limitations of CCB
 ●   Collector (CCB Broker) needs to be accessible by everyone
 ●   Requires outgoing connectivity
 ●   Can’t have BOTH submit and execute points behind
     different firewalls
                                                           Execute
                                                           Node
      Job Submit Point

                                     no go!




                                              CCB_ADDRESS=ccb2.host
     CCB_ADDRESS=ccb1.host

UCSD Jan 17th 2012              Condor                               36
(Vanilla)
                     Condor protocol




UCSD Jan 17th 2012         Condor      37
Claiming Protocol
                                  Central Manager


                              Negotiator            Collector
                                                                   S




              Submit Machine                            Execute Machine

                                                                          S
              J
                     Schedd                                     Startd


    J

        Submit

UCSD Jan 17th 2012                         Condor                             38
Claiming Protocol
                                  Central Manager                 Q


                      J   S   Negotiator            Collector
                                                                   S




              Submit Machine                            Execute Machine

                                                                          S
              J
                     Schedd                                     Startd




UCSD Jan 17th 2012                         Condor                             39
Claiming Protocol
                                  Central Manager                 Q


                      J   S   Negotiator            Collector
                                                                   S




              Submit Machine                            Execute Machine

                                      CLAIM                              J   S
        Q     J
                     Schedd                                     Startd




UCSD Jan 17th 2012                         Condor                                40
Claim Activation
                              Central Manager


                         Negotiator               Collector




             Submit Machine                           Execute Machine

                                 CLAIMED
                Schedd                                        Startd
                              Activate
                              Claim
                                              Starter                  Job

                     Shadow


UCSD Jan 17th 2012                       Condor                              41
Repeat until Claim released
                              Central Manager


                         Negotiator               Collector




             Submit Machine                           Execute Machine

                                 CLAIMED
                Schedd                                        Startd
                              Activate
                              Claim
                                              Starter                  Job

                     Shadow


UCSD Jan 17th 2012                       Condor                              42
When is claim released?
 ●   When relinquished by one of the following
     ●   lease on the claim is not renewed
          –   Why? Machine powered off, disappeared, etc
     ●   schedd
          –   Why? Out of jobs, shutting down, schedd didn’t “like” the
              machine, etc
     ●   startd
          –   Why? Policy re claim lifetime, prefers a different match (via Rank),
              non-dedicated desktop, etc
     ●   negotiator                                      Defining Rank is dangerous!
          –   Why? User priority inversion policy                (preemption)
     ●   explicitly via a command-line tool
          –   E.g. condor_vacate

UCSD Jan 17th 2012                      Condor                                    43
The end




UCSD Jan 17th 2012     Condor   44
The Condor Project (Established ‘85)
 ●   Research and Development in the Distributed High
     Throughput Computing field
 ●   Team of ~35 faculty, full time staff and students
     ●   Face software engineering challenges in a distributed
         UNIX/Linux/NT environment
     ●   Are involved in national and international grid
         collaborations
     ●   Actively interact with academic and commercial entities
         and users
     ●   Maintain and support large distributed production
         environments
     ●   Educate and train students

UCSD Jan 17th 2012               Condor                            45
The Condor Team




UCSD Jan 17th 2012         Condor      46
Pointers
 ●   Condor Home Page
     http://www.cs.wisc.edu/condor/
 ●   Condor Manual
     http://www.cs.wisc.edu/condor/manual/v7.6/
 ●   Support
     condor-user@cs.wisc.edu
     condor-admin@cs.wisc.edu




UCSD Jan 17th 2012       Condor                   47

Mais conteúdo relacionado

Destaque

The Django Book - Chapter 6 the django admin site
The Django Book - Chapter 6  the django admin siteThe Django Book - Chapter 6  the django admin site
The Django Book - Chapter 6 the django admin siteVincent Chien
 
Weblogic Server
Weblogic ServerWeblogic Server
Weblogic Serveracsvianabr
 
Securing the e health cloud
Securing the e health cloudSecuring the e health cloud
Securing the e health cloudBong Young Sung
 
Cookies
CookiesCookies
Cookiesepo273
 
East Algarve Magazine - NOVEMBER 2010
East Algarve Magazine - NOVEMBER 2010East Algarve Magazine - NOVEMBER 2010
East Algarve Magazine - NOVEMBER 2010Nick Eamag
 
Uhy global directory-2013
Uhy global directory-2013Uhy global directory-2013
Uhy global directory-2013Dawgen Global
 
Be2Awards and Be2Talks 2013 - event slides
Be2Awards and Be2Talks 2013 - event slidesBe2Awards and Be2Talks 2013 - event slides
Be2Awards and Be2Talks 2013 - event slidesBe2camp Admin
 
2345014 unix-linux-bsd-cheat-sheets-i
2345014 unix-linux-bsd-cheat-sheets-i2345014 unix-linux-bsd-cheat-sheets-i
2345014 unix-linux-bsd-cheat-sheets-iLogesh Kumar Anandhan
 
ISC West 2014 Korea Pavilion Directory
ISC West 2014 Korea Pavilion DirectoryISC West 2014 Korea Pavilion Directory
ISC West 2014 Korea Pavilion DirectoryCindy Moon
 
Adobe Marketing Cloud Integration with Adobe AEM
Adobe Marketing Cloud Integration with Adobe AEMAdobe Marketing Cloud Integration with Adobe AEM
Adobe Marketing Cloud Integration with Adobe AEMDeepak Narisety
 
Dedo talk-2014-flat
Dedo talk-2014-flatDedo talk-2014-flat
Dedo talk-2014-flat23rd & 5th
 
120000 trang edu urls
120000 trang edu urls120000 trang edu urls
120000 trang edu urlssieuthi68
 
Data Mining With R
Data Mining With RData Mining With R
Data Mining With RAjay Ohri
 
Kony - End-to-End Proof of Technology
Kony - End-to-End Proof of TechnologyKony - End-to-End Proof of Technology
Kony - End-to-End Proof of TechnologyDipesh Mukerji
 
RailsAdmin - Overview and Best practices
RailsAdmin - Overview and Best practicesRailsAdmin - Overview and Best practices
RailsAdmin - Overview and Best practicesBenoit Bénézech
 

Destaque (20)

The Django Book - Chapter 6 the django admin site
The Django Book - Chapter 6  the django admin siteThe Django Book - Chapter 6  the django admin site
The Django Book - Chapter 6 the django admin site
 
EdCamp News & UpDates
EdCamp News & UpDatesEdCamp News & UpDates
EdCamp News & UpDates
 
Weblogic Server
Weblogic ServerWeblogic Server
Weblogic Server
 
Securing the e health cloud
Securing the e health cloudSecuring the e health cloud
Securing the e health cloud
 
ITIL v3 story
ITIL v3 storyITIL v3 story
ITIL v3 story
 
Cookies
CookiesCookies
Cookies
 
Discover the Baltic states for studies
Discover the Baltic states for studiesDiscover the Baltic states for studies
Discover the Baltic states for studies
 
East Algarve Magazine - NOVEMBER 2010
East Algarve Magazine - NOVEMBER 2010East Algarve Magazine - NOVEMBER 2010
East Algarve Magazine - NOVEMBER 2010
 
Uhy global directory-2013
Uhy global directory-2013Uhy global directory-2013
Uhy global directory-2013
 
Be2Awards and Be2Talks 2013 - event slides
Be2Awards and Be2Talks 2013 - event slidesBe2Awards and Be2Talks 2013 - event slides
Be2Awards and Be2Talks 2013 - event slides
 
2345014 unix-linux-bsd-cheat-sheets-i
2345014 unix-linux-bsd-cheat-sheets-i2345014 unix-linux-bsd-cheat-sheets-i
2345014 unix-linux-bsd-cheat-sheets-i
 
ISC West 2014 Korea Pavilion Directory
ISC West 2014 Korea Pavilion DirectoryISC West 2014 Korea Pavilion Directory
ISC West 2014 Korea Pavilion Directory
 
Uk norway ib directory
Uk norway ib directoryUk norway ib directory
Uk norway ib directory
 
Adobe Marketing Cloud Integration with Adobe AEM
Adobe Marketing Cloud Integration with Adobe AEMAdobe Marketing Cloud Integration with Adobe AEM
Adobe Marketing Cloud Integration with Adobe AEM
 
Dedo talk-2014-flat
Dedo talk-2014-flatDedo talk-2014-flat
Dedo talk-2014-flat
 
120000 trang edu urls
120000 trang edu urls120000 trang edu urls
120000 trang edu urls
 
Data Mining With R
Data Mining With RData Mining With R
Data Mining With R
 
Kony - End-to-End Proof of Technology
Kony - End-to-End Proof of TechnologyKony - End-to-End Proof of Technology
Kony - End-to-End Proof of Technology
 
RailsAdmin - Overview and Best practices
RailsAdmin - Overview and Best practicesRailsAdmin - Overview and Best practices
RailsAdmin - Overview and Best practices
 
2009 04.s10-admin-topics1
2009 04.s10-admin-topics12009 04.s10-admin-topics1
2009 04.s10-admin-topics1
 

Semelhante a Condor overview - glideinWMS Training Jan 2012

Condor from the user point of view - glideinWMS Training Jan 2012
Condor from the user point of view - glideinWMS Training Jan 2012Condor from the user point of view - glideinWMS Training Jan 2012
Condor from the user point of view - glideinWMS Training Jan 2012Igor Sfiligoi
 
Monitoring and troubleshooting a glideinWMS-based HTCondor pool
Monitoring and troubleshooting a glideinWMS-based HTCondor poolMonitoring and troubleshooting a glideinWMS-based HTCondor pool
Monitoring and troubleshooting a glideinWMS-based HTCondor poolIgor Sfiligoi
 
glideinWMS validation scirpts - glideinWMS Training Jan 2012
glideinWMS validation scirpts - glideinWMS Training Jan 2012glideinWMS validation scirpts - glideinWMS Training Jan 2012
glideinWMS validation scirpts - glideinWMS Training Jan 2012Igor Sfiligoi
 
glideinWMS Training Jan 2012 - Condor tuning
glideinWMS Training Jan 2012 - Condor tuningglideinWMS Training Jan 2012 - Condor tuning
glideinWMS Training Jan 2012 - Condor tuningIgor Sfiligoi
 
glideinWMS Frontend Monitoring - glideinWMS Training Jan 2012
glideinWMS Frontend Monitoring - glideinWMS Training Jan 2012glideinWMS Frontend Monitoring - glideinWMS Training Jan 2012
glideinWMS Frontend Monitoring - glideinWMS Training Jan 2012Igor Sfiligoi
 
Truemotion Adventures in Containerization
Truemotion Adventures in ContainerizationTruemotion Adventures in Containerization
Truemotion Adventures in ContainerizationRyan Hunter
 
Eric Lafortune - ProGuard: Optimizer and obfuscator in the Android SDK
Eric Lafortune - ProGuard: Optimizer and obfuscator in the Android SDKEric Lafortune - ProGuard: Optimizer and obfuscator in the Android SDK
Eric Lafortune - ProGuard: Optimizer and obfuscator in the Android SDKGuardSquare
 
Droidcon2013 pro guard, optimizer and obfuscator in the android sdk_eric lafo...
Droidcon2013 pro guard, optimizer and obfuscator in the android sdk_eric lafo...Droidcon2013 pro guard, optimizer and obfuscator in the android sdk_eric lafo...
Droidcon2013 pro guard, optimizer and obfuscator in the android sdk_eric lafo...Droidcon Berlin
 
Swarm: Native Docker Clustering
Swarm: Native Docker ClusteringSwarm: Native Docker Clustering
Swarm: Native Docker ClusteringRoyee Tager
 
Dart the Better JavaScript
Dart the Better JavaScriptDart the Better JavaScript
Dart the Better JavaScriptJorg Janke
 
Javascript Update May 2013
Javascript Update May 2013Javascript Update May 2013
Javascript Update May 2013Ramesh Nair
 
Docker and Your Path to a Better Staging Environment - webinar by Gil Tayar
Docker and Your Path to a Better Staging Environment - webinar by Gil TayarDocker and Your Path to a Better Staging Environment - webinar by Gil Tayar
Docker and Your Path to a Better Staging Environment - webinar by Gil TayarApplitools
 
Tuning and Debugging in Apache Spark
Tuning and Debugging in Apache SparkTuning and Debugging in Apache Spark
Tuning and Debugging in Apache SparkPatrick Wendell
 
Doctrine with Symfony - SymfonyCon 2019
Doctrine with Symfony - SymfonyCon 2019Doctrine with Symfony - SymfonyCon 2019
Doctrine with Symfony - SymfonyCon 2019julien pauli
 
Code Cleanup: A Data Scientist's Guide to Sparkling Code
Code Cleanup: A Data Scientist's Guide to Sparkling CodeCode Cleanup: A Data Scientist's Guide to Sparkling Code
Code Cleanup: A Data Scientist's Guide to Sparkling CodeCorrie Bartelheimer
 
Tuning and Debugging in Apache Spark
Tuning and Debugging in Apache SparkTuning and Debugging in Apache Spark
Tuning and Debugging in Apache SparkDatabricks
 
Beyond unit tests: Testing for Spark/Hadoop Workflows with Shankar Manian Ana...
Beyond unit tests: Testing for Spark/Hadoop Workflows with Shankar Manian Ana...Beyond unit tests: Testing for Spark/Hadoop Workflows with Shankar Manian Ana...
Beyond unit tests: Testing for Spark/Hadoop Workflows with Shankar Manian Ana...Spark Summit
 
Hashicorp-Terraform-Deep-Dive-with-no-Fear-Victor-Turbinsky-Texuna.pdf
Hashicorp-Terraform-Deep-Dive-with-no-Fear-Victor-Turbinsky-Texuna.pdfHashicorp-Terraform-Deep-Dive-with-no-Fear-Victor-Turbinsky-Texuna.pdf
Hashicorp-Terraform-Deep-Dive-with-no-Fear-Victor-Turbinsky-Texuna.pdfssuser705051
 

Semelhante a Condor overview - glideinWMS Training Jan 2012 (20)

Condor from the user point of view - glideinWMS Training Jan 2012
Condor from the user point of view - glideinWMS Training Jan 2012Condor from the user point of view - glideinWMS Training Jan 2012
Condor from the user point of view - glideinWMS Training Jan 2012
 
Monitoring and troubleshooting a glideinWMS-based HTCondor pool
Monitoring and troubleshooting a glideinWMS-based HTCondor poolMonitoring and troubleshooting a glideinWMS-based HTCondor pool
Monitoring and troubleshooting a glideinWMS-based HTCondor pool
 
glideinWMS validation scirpts - glideinWMS Training Jan 2012
glideinWMS validation scirpts - glideinWMS Training Jan 2012glideinWMS validation scirpts - glideinWMS Training Jan 2012
glideinWMS validation scirpts - glideinWMS Training Jan 2012
 
glideinWMS Training Jan 2012 - Condor tuning
glideinWMS Training Jan 2012 - Condor tuningglideinWMS Training Jan 2012 - Condor tuning
glideinWMS Training Jan 2012 - Condor tuning
 
glideinWMS Frontend Monitoring - glideinWMS Training Jan 2012
glideinWMS Frontend Monitoring - glideinWMS Training Jan 2012glideinWMS Frontend Monitoring - glideinWMS Training Jan 2012
glideinWMS Frontend Monitoring - glideinWMS Training Jan 2012
 
Truemotion Adventures in Containerization
Truemotion Adventures in ContainerizationTruemotion Adventures in Containerization
Truemotion Adventures in Containerization
 
Eric Lafortune - ProGuard: Optimizer and obfuscator in the Android SDK
Eric Lafortune - ProGuard: Optimizer and obfuscator in the Android SDKEric Lafortune - ProGuard: Optimizer and obfuscator in the Android SDK
Eric Lafortune - ProGuard: Optimizer and obfuscator in the Android SDK
 
Droidcon2013 pro guard, optimizer and obfuscator in the android sdk_eric lafo...
Droidcon2013 pro guard, optimizer and obfuscator in the android sdk_eric lafo...Droidcon2013 pro guard, optimizer and obfuscator in the android sdk_eric lafo...
Droidcon2013 pro guard, optimizer and obfuscator in the android sdk_eric lafo...
 
Swarm: Native Docker Clustering
Swarm: Native Docker ClusteringSwarm: Native Docker Clustering
Swarm: Native Docker Clustering
 
Dart the Better JavaScript
Dart the Better JavaScriptDart the Better JavaScript
Dart the Better JavaScript
 
Javascript Update May 2013
Javascript Update May 2013Javascript Update May 2013
Javascript Update May 2013
 
Docker and Your Path to a Better Staging Environment - webinar by Gil Tayar
Docker and Your Path to a Better Staging Environment - webinar by Gil TayarDocker and Your Path to a Better Staging Environment - webinar by Gil Tayar
Docker and Your Path to a Better Staging Environment - webinar by Gil Tayar
 
Tuning and Debugging in Apache Spark
Tuning and Debugging in Apache SparkTuning and Debugging in Apache Spark
Tuning and Debugging in Apache Spark
 
Doctrine with Symfony - SymfonyCon 2019
Doctrine with Symfony - SymfonyCon 2019Doctrine with Symfony - SymfonyCon 2019
Doctrine with Symfony - SymfonyCon 2019
 
Code Cleanup: A Data Scientist's Guide to Sparkling Code
Code Cleanup: A Data Scientist's Guide to Sparkling CodeCode Cleanup: A Data Scientist's Guide to Sparkling Code
Code Cleanup: A Data Scientist's Guide to Sparkling Code
 
Tuning and Debugging in Apache Spark
Tuning and Debugging in Apache SparkTuning and Debugging in Apache Spark
Tuning and Debugging in Apache Spark
 
Introduction solidity
Introduction solidityIntroduction solidity
Introduction solidity
 
Beyond unit tests: Testing for Spark/Hadoop Workflows with Shankar Manian Ana...
Beyond unit tests: Testing for Spark/Hadoop Workflows with Shankar Manian Ana...Beyond unit tests: Testing for Spark/Hadoop Workflows with Shankar Manian Ana...
Beyond unit tests: Testing for Spark/Hadoop Workflows with Shankar Manian Ana...
 
Hashicorp-Terraform-Deep-Dive-with-no-Fear-Victor-Turbinsky-Texuna.pdf
Hashicorp-Terraform-Deep-Dive-with-no-Fear-Victor-Turbinsky-Texuna.pdfHashicorp-Terraform-Deep-Dive-with-no-Fear-Victor-Turbinsky-Texuna.pdf
Hashicorp-Terraform-Deep-Dive-with-no-Fear-Victor-Turbinsky-Texuna.pdf
 
Terraform-2.pdf
Terraform-2.pdfTerraform-2.pdf
Terraform-2.pdf
 

Mais de Igor Sfiligoi

Preparing Fusion codes for Perlmutter - CGYRO
Preparing Fusion codes for Perlmutter - CGYROPreparing Fusion codes for Perlmutter - CGYRO
Preparing Fusion codes for Perlmutter - CGYROIgor Sfiligoi
 
O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...
O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...
O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...Igor Sfiligoi
 
Comparing single-node and multi-node performance of an important fusion HPC c...
Comparing single-node and multi-node performance of an important fusion HPC c...Comparing single-node and multi-node performance of an important fusion HPC c...
Comparing single-node and multi-node performance of an important fusion HPC c...Igor Sfiligoi
 
The anachronism of whole-GPU accounting
The anachronism of whole-GPU accountingThe anachronism of whole-GPU accounting
The anachronism of whole-GPU accountingIgor Sfiligoi
 
Auto-scaling HTCondor pools using Kubernetes compute resources
Auto-scaling HTCondor pools using Kubernetes compute resourcesAuto-scaling HTCondor pools using Kubernetes compute resources
Auto-scaling HTCondor pools using Kubernetes compute resourcesIgor Sfiligoi
 
Speeding up bowtie2 by improving cache-hit rate
Speeding up bowtie2 by improving cache-hit rateSpeeding up bowtie2 by improving cache-hit rate
Speeding up bowtie2 by improving cache-hit rateIgor Sfiligoi
 
Performance Optimization of CGYRO for Multiscale Turbulence Simulations
Performance Optimization of CGYRO for Multiscale Turbulence SimulationsPerformance Optimization of CGYRO for Multiscale Turbulence Simulations
Performance Optimization of CGYRO for Multiscale Turbulence SimulationsIgor Sfiligoi
 
Comparing GPU effectiveness for Unifrac distance compute
Comparing GPU effectiveness for Unifrac distance computeComparing GPU effectiveness for Unifrac distance compute
Comparing GPU effectiveness for Unifrac distance computeIgor Sfiligoi
 
Managing Cloud networking costs for data-intensive applications by provisioni...
Managing Cloud networking costs for data-intensive applications by provisioni...Managing Cloud networking costs for data-intensive applications by provisioni...
Managing Cloud networking costs for data-intensive applications by provisioni...Igor Sfiligoi
 
Accelerating Key Bioinformatics Tasks 100-fold by Improving Memory Access
Accelerating Key Bioinformatics Tasks 100-fold by Improving Memory AccessAccelerating Key Bioinformatics Tasks 100-fold by Improving Memory Access
Accelerating Key Bioinformatics Tasks 100-fold by Improving Memory AccessIgor Sfiligoi
 
Using A100 MIG to Scale Astronomy Scientific Output
Using A100 MIG to Scale Astronomy Scientific OutputUsing A100 MIG to Scale Astronomy Scientific Output
Using A100 MIG to Scale Astronomy Scientific OutputIgor Sfiligoi
 
Using commercial Clouds to process IceCube jobs
Using commercial Clouds to process IceCube jobsUsing commercial Clouds to process IceCube jobs
Using commercial Clouds to process IceCube jobsIgor Sfiligoi
 
Modest scale HPC on Azure using CGYRO
Modest scale HPC on Azure using CGYROModest scale HPC on Azure using CGYRO
Modest scale HPC on Azure using CGYROIgor Sfiligoi
 
Data-intensive IceCube Cloud Burst
Data-intensive IceCube Cloud BurstData-intensive IceCube Cloud Burst
Data-intensive IceCube Cloud BurstIgor Sfiligoi
 
Scheduling a Kubernetes Federation with Admiralty
Scheduling a Kubernetes Federation with AdmiraltyScheduling a Kubernetes Federation with Admiralty
Scheduling a Kubernetes Federation with AdmiraltyIgor Sfiligoi
 
Accelerating microbiome research with OpenACC
Accelerating microbiome research with OpenACCAccelerating microbiome research with OpenACC
Accelerating microbiome research with OpenACCIgor Sfiligoi
 
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...Igor Sfiligoi
 
Porting and optimizing UniFrac for GPUs
Porting and optimizing UniFrac for GPUsPorting and optimizing UniFrac for GPUs
Porting and optimizing UniFrac for GPUsIgor Sfiligoi
 
Demonstrating 100 Gbps in and out of the public Clouds
Demonstrating 100 Gbps in and out of the public CloudsDemonstrating 100 Gbps in and out of the public Clouds
Demonstrating 100 Gbps in and out of the public CloudsIgor Sfiligoi
 
TransAtlantic Networking using Cloud links
TransAtlantic Networking using Cloud linksTransAtlantic Networking using Cloud links
TransAtlantic Networking using Cloud linksIgor Sfiligoi
 

Mais de Igor Sfiligoi (20)

Preparing Fusion codes for Perlmutter - CGYRO
Preparing Fusion codes for Perlmutter - CGYROPreparing Fusion codes for Perlmutter - CGYRO
Preparing Fusion codes for Perlmutter - CGYRO
 
O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...
O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...
O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...
 
Comparing single-node and multi-node performance of an important fusion HPC c...
Comparing single-node and multi-node performance of an important fusion HPC c...Comparing single-node and multi-node performance of an important fusion HPC c...
Comparing single-node and multi-node performance of an important fusion HPC c...
 
The anachronism of whole-GPU accounting
The anachronism of whole-GPU accountingThe anachronism of whole-GPU accounting
The anachronism of whole-GPU accounting
 
Auto-scaling HTCondor pools using Kubernetes compute resources
Auto-scaling HTCondor pools using Kubernetes compute resourcesAuto-scaling HTCondor pools using Kubernetes compute resources
Auto-scaling HTCondor pools using Kubernetes compute resources
 
Speeding up bowtie2 by improving cache-hit rate
Speeding up bowtie2 by improving cache-hit rateSpeeding up bowtie2 by improving cache-hit rate
Speeding up bowtie2 by improving cache-hit rate
 
Performance Optimization of CGYRO for Multiscale Turbulence Simulations
Performance Optimization of CGYRO for Multiscale Turbulence SimulationsPerformance Optimization of CGYRO for Multiscale Turbulence Simulations
Performance Optimization of CGYRO for Multiscale Turbulence Simulations
 
Comparing GPU effectiveness for Unifrac distance compute
Comparing GPU effectiveness for Unifrac distance computeComparing GPU effectiveness for Unifrac distance compute
Comparing GPU effectiveness for Unifrac distance compute
 
Managing Cloud networking costs for data-intensive applications by provisioni...
Managing Cloud networking costs for data-intensive applications by provisioni...Managing Cloud networking costs for data-intensive applications by provisioni...
Managing Cloud networking costs for data-intensive applications by provisioni...
 
Accelerating Key Bioinformatics Tasks 100-fold by Improving Memory Access
Accelerating Key Bioinformatics Tasks 100-fold by Improving Memory AccessAccelerating Key Bioinformatics Tasks 100-fold by Improving Memory Access
Accelerating Key Bioinformatics Tasks 100-fold by Improving Memory Access
 
Using A100 MIG to Scale Astronomy Scientific Output
Using A100 MIG to Scale Astronomy Scientific OutputUsing A100 MIG to Scale Astronomy Scientific Output
Using A100 MIG to Scale Astronomy Scientific Output
 
Using commercial Clouds to process IceCube jobs
Using commercial Clouds to process IceCube jobsUsing commercial Clouds to process IceCube jobs
Using commercial Clouds to process IceCube jobs
 
Modest scale HPC on Azure using CGYRO
Modest scale HPC on Azure using CGYROModest scale HPC on Azure using CGYRO
Modest scale HPC on Azure using CGYRO
 
Data-intensive IceCube Cloud Burst
Data-intensive IceCube Cloud BurstData-intensive IceCube Cloud Burst
Data-intensive IceCube Cloud Burst
 
Scheduling a Kubernetes Federation with Admiralty
Scheduling a Kubernetes Federation with AdmiraltyScheduling a Kubernetes Federation with Admiralty
Scheduling a Kubernetes Federation with Admiralty
 
Accelerating microbiome research with OpenACC
Accelerating microbiome research with OpenACCAccelerating microbiome research with OpenACC
Accelerating microbiome research with OpenACC
 
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...
 
Porting and optimizing UniFrac for GPUs
Porting and optimizing UniFrac for GPUsPorting and optimizing UniFrac for GPUs
Porting and optimizing UniFrac for GPUs
 
Demonstrating 100 Gbps in and out of the public Clouds
Demonstrating 100 Gbps in and out of the public CloudsDemonstrating 100 Gbps in and out of the public Clouds
Demonstrating 100 Gbps in and out of the public Clouds
 
TransAtlantic Networking using Cloud links
TransAtlantic Networking using Cloud linksTransAtlantic Networking using Cloud links
TransAtlantic Networking using Cloud links
 

Último

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 

Último (20)

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 

Condor overview - glideinWMS Training Jan 2012

  • 1. glideinWMS Training @ UCSD Condor overview by Igor Sfiligoi (UCSD) UCSD Jan 17th 2012 Condor 1
  • 2. Acknowledgement ● These slides are heavily based on the presentation Todd Tannenbaum gave at CERN in Feb 2011 https://indico.cern.ch/conferenceTimeTable.py?confId=124982#20110214.detailed UCSD Jan 17th 2012 Condor 2
  • 3. Outline ● What is Condor ● Condor principles ● Condor daemons ● Condor protocol overview UCSD Jan 17th 2012 Condor 3
  • 4. What is Condor UCSD Jan 17th 2012 Condor 4
  • 5. What is Condor ● Condor is a Workload Management System ● i.e. a batch system ● Strong points ● Fault tolerant ● Robust feature set ● Flexible ● Development team dedicated to working closely w/ scientific community as priority #1 UCSD Jan 17th 2012 Condor 5
  • 6. How can Condor be used ● Managing local processes (local) Only vanilla ● Managing local cluster (~vanilla) in this talk ● Connecting clusters (flocking) ● Handling resource overlays (glideins) ● Swiss-knife for accessing other WMS (Condor-G) ● e.g. Grid, Cloud, pbs, etc. UCSD Jan 17th 2012 Condor 6
  • 7. (Vanilla) Condor principles UCSD Jan 17th 2012 Condor 7
  • 8. (Vanilla) Condor principles ● Two parts of the equation ● Jobs ● Machines/Resources ● Jobs ● Condor’s quanta of work ● Like a UNIX process ● Can be an element of a workflow ● Machines ● Represent available resources ● Mostly CPU, but indirectly memory and disk as well UCSD Jan 17th 2012 Condor 8
  • 9. Jobs Have Wants & Needs ● Jobs state their requirements and preferences: ● Requirements: – I require a Linux/x86 platform ● Preferences ("Rank"): – I prefer a machine owned by CMS ● Jobs describe themselves via attributes: ● Standard, i.e. defined by Condor: – I am owned by Albert ● Custom, i.e. specified by the user (or the administrator): – I am a Monte Carlo job – I will be done within 12h UCSD Jan 17th 2012 Condor 9
  • 10. Machines Do Too! ● Machine requirements and preferences: ● Requirements: – I require that jobs declare a runtime shorter than 18h ● Preferences ("Rank"): – I prefer Monte Carlo jobs ● Machine attributes: ● Standard, i.e. defined by Condor: – I am a Linux node – I control 2GB of memory ● Custom, i.e. specified by the administrator: – I have been paid with CMS money UCSD Jan 17th 2012 Condor 10
  • 11. Condor brings them together Central manager Condor Job repository Machine Condor Condor UCSD Jan 17th 2012 Condor 11
  • 12. Condor ClassAds Classified Ads UCSD Jan 17th 2012 Condor 12
  • 13. What are Condor ClassAds? ● ClassAds is a language for objects (jobs and machines) to ● Express attributes about themselves ● Express what they require/desire in a match (similar to personal classified ads) ● Structure ● Set of attribute name/value pairs ● Value : Literals (string, bool, int, float) or an expression UCSD Jan 17th 2012 Condor 13
  • 14. Example ClassAd MyType = "Machine" MyType = "Machine" TargetType = "Job" TargetType = "Job" Name = "glidein_999@cabinet-2-2-1.t2.ucsd.edu" Name = "glidein_999@cabinet-2-2-1.t2.ucsd.edu" Machine = "cabinet-2-2-1.t2.ucsd.edu" Machine = "cabinet-2-2-1.t2.ucsd.edu" StartdIpAddr = "<169.228.131.179:56787>" StartdIpAddr = "<169.228.131.179:56787>" State = "Claimed" State = "Claimed" Activity = "Busy" Activity = "Busy" Cpus = 1 Cpus = 1 Memory = 36170 Memory = 36170 Disk = 231463800 Disk = 231463800 OpSys = "LINUX" OpSys = "LINUX" Arch = "X86_64" Arch = "X86_64" Requirements = JOB_Is_ITB != true Requirements = JOB_Is_ITB != true Rank = 1 Rank = 1 KFlops = 972989 KFlops = 972989 Mips = 3499 Mips = 3499 HasFileTransfer = true HasFileTransfer = true IS_GLIDEIN = true IS_GLIDEIN = true GLIDEIN_SEs = "bsrm-1.t2.ucsd.edu" GLIDEIN_SEs = "bsrm-1.t2.ucsd.edu" DaemonStartTime = 1324784426 DaemonStartTime = 1324784426 UCSD Jan 17th 2012 Condor 14
  • 15. ClassAd Expressions ● Similar look to C : operators, references, functions ● Operators: +, -, *, /, <, <=,>, >=, ==, !=, &&, and || all work as expected ● Type checking ops: =?=, =!= ● Functions: if/then/else, string manipulation, list operations, dates, randomization, … ● References: to other attributes in the same ad, or attributes in an ad that is a candidate for a match ● True==1 and False==0 (guaranteed) ● e.g. (3 == (2+True)) is identical to True ● Explicit UNDEFINED http://www.cs.wisc.edu/condor/manual/v7.6/4_1Condor_s_ClassAd.html#SECTION00512300000000000000 UCSD Jan 17th 2012 Condor 15
  • 16. Example Expression ifthenelse(LastVacateTime=?=UNDEFINED, ifthenelse(LastVacateTime=?=UNDEFINED, ifthenelse(NormMaxMins=!=UNDEFINED, ifthenelse(NormMaxMins=!=UNDEFINED, (NormMaxMins*60)<(ToRetire+JobMaxTime-MyCurrentTime), (NormMaxMins*60)<(ToRetire+JobMaxTime-MyCurrentTime), (8*3600)<(ToRetire+JobMaxTime-MyCurrentTime)), (8*3600)<(ToRetire+JobMaxTime-MyCurrentTime)), ifthenelse(MaxMins=!=UNDEFINED, ifthenelse(MaxMins=!=UNDEFINED, (MaxMins*60)<(ToRetire+JobMaxTime-MyCurrentTime), (MaxMins*60)<(ToRetire+JobMaxTime-MyCurrentTime), (16*3600)<(ToRetire+JobMaxTime-MyCurrentTime)))&& (16*3600)<(ToRetire+JobMaxTime-MyCurrentTime)))&& (ImageSize<(MaxMemMBs*1024))&& (ImageSize<(MaxMemMBs*1024))&& (stringListMember(GLIDEIN_SEs,DESIRED_SEs,",")=?=True)&& (stringListMember(GLIDEIN_SEs,DESIRED_SEs,",")=?=True)&& (JOB_Is_ITB =!= TRUE) (JOB_Is_ITB =!= TRUE) UCSD Jan 17th 2012 Condor 16
  • 17. ClassAd Types ● Condor has many types of ClassAds ● A "Job Ad" represents a job to Condor ● A "Machine Ad" represents a computing resource ● Others types of ads represent instances of other services, users, licenses, etc glideinWMS defines some UCSD Jan 17th 2012 Condor 17
  • 18. Central Manager holds them all Machine Central manager Machine Job repository Machine Condor Job repository Machine Job repository Job Ad Machine Machine Condor Ad Condor UCSD Jan 17th 2012 Condor 18
  • 19. Match & start Machine Central manager Machine Job repository Machine Condor Job repository Machine Job repository Machine Condor Job Condor Job UCSD Jan 17th 2012 Condor 19
  • 20. The Magic of Matchmaking ● Two ads match if both their Requirements expressions evaluate to True ● If more than one match, the match with the highest Rank is preferred (float) ● Condor evaluates job ads in the context of a candidate machine ad looking for a match ● MY.name – Value for attribute “name” in local ClassAd ● TARGET.name – Value for attribute “name” in match candidate ClassAd ● Name – Looks for “name” in the local ClassAd, then the candidate ClassAd UCSD Jan 17th 2012 Condor 20
  • 21. Example Fancy Match Pet Ad Buyer Ad MyType = “Pet” MyType = “Buyer” TargetType = “Buyer” TargetType = “Pet” Requirements = Requirements = DogLover =?= True (PetType == “Dog”) && Rank = 0 (TARGET.Price <= MY.AcctBalance) && PetType = “Dog” (Size == "Large"||Size == "Very Large") Color = “Brown” Rank = (Breed == "Saint Bernard") Price = 75 AcctBalance = 100 Breed = "Saint Bernard" DogLover = True Size = "Very Large" ... ... Dog == Resource ~= Machine Buyer ~= Job UCSD Jan 17th 2012 Condor 21
  • 22. (Vanilla) Condor Daemons UCSD Jan 17th 2012 Condor 22
  • 23. Condor Daemons – Mix’n Match Components Negotiator Collector Master Shadow Schedd Procd Startd Starter Job UCSD Jan 17th 2012 Condor 23
  • 24. Condor Daemons – Mix’n Match Components Central Negotiator Manager Collector Submit Master Shadow Node (job repo) Schedd Procd Startd Starter Job Execute node (Machine) UCSD Jan 17th 2012 Condor 24
  • 25. Central manager Submit node condor_master Execute node ● You start it, it starts up the other Condor daemons ● If a daemon exits unexpectedly, restarts deamon and emails administrator ● If a daemon binary is updated (timestamp changed), restarts the daemon ● Provides access to many remote administration commands: ● condor_reconfig, condor_restart, condor_off, condor_on, etc. ● Default server for many other commands: ● condor_config_val, etc. UCSD Jan 17th 2012 Condor 25
  • 26. Submit node condor_procd Execute node ● Monitors all other processes on the node ● Information then used by the other daemons ● Builds process tree ● Tracks birth and death of processes ● Monitors resource consumption (memory, CPU) UCSD Jan 17th 2012 Condor 26
  • 27. Submit node condor_schedd ● Represents jobs to the Condor pool ● Maintains persistent queue of jobs ● Queue is not strictly first-in-first-out (priority based) ● Each machine running condor_schedd maintains its own independent queue ● Responsible for contacting available machines and spawning waiting jobs ● When told to by condor_negotiator ● Services most user commands: ● condor_submit, condor_rm, condor_q UCSD Jan 17th 2012 Condor 27
  • 28. Submit node condor_shadow ● Spawned by condor_schedd ● Represents a running job on the submit machine ● Yes, one per running job ● Handles file transfers ● Enforces Periodic_* expressions ● Hold, release, remove, ... UCSD Jan 17th 2012 Condor 28
  • 29. condor_startd Execute node ● Represents a machine willing to run jobs to the Condor pool ● Run on any machine you want to run jobs on ● Enforces the wishes of the machine owner (the owner’s “policy”) ● Starts, stops, suspends jobs ● Provides other administrative commands ● for example, condor_vacate UCSD Jan 17th 2012 Condor 29
  • 30. condor_starter Execute node ● Spawned by the condor_startd ● Handles all the details of starting and managing the job ● Transfer job’s binary to execute machine ● Send back exit status ● Etc. ● One per running job ● The default configuration is willing to run one condor_starter per CPU UCSD Jan 17th 2012 Condor 30
  • 31. Central manager condor_collector ● Collects information from all other Condor daemons in the pool ● Each daemon sends a periodic update called a ClassAd to the collector ● Old ClassAds removed after a timeout (~15 mins) ● Services queries for information: ● Queries from other Condor daemons ● Queries from users (condor_status) UCSD Jan 17th 2012 Condor 31
  • 32. Central manager condor_negotiator ● Performs matchmaking in Condor ● Pulls list of available machines from condor_collector, gets jobs from condor_schedds ● Matches jobs with available machines ● Both the job and the machine must satisfy each other’s requirements (2-way matching) ● Handles user priorities and accounting UCSD Jan 17th 2012 Condor 32
  • 33. Sample Condor pool UCSD Jan 17th 2012 Condor 33
  • 34. Sample Condor pool Central manager Master Machine Collector Machine Job repository Machine Job repository Negotiator Machine Job repository Machine Master Master Schedd Startd Shadow Shadow Job UCSD Jan 17th 2012 Condor 34
  • 35. CCB: Condor Connection Broker ● Condor wants two-way p2p connectivity ● With CCB, one-way is good enough ● Collector requests reversed connections for clients Execute CCB Node Job Submit Point Call me back I want to connect to the execute node transfer files reversed connection CCB_ADDRESS=ccb.host.name UCSD Jan 17th 2012 Condor 35
  • 36. Limitations of CCB ● Collector (CCB Broker) needs to be accessible by everyone ● Requires outgoing connectivity ● Can’t have BOTH submit and execute points behind different firewalls Execute Node Job Submit Point no go! CCB_ADDRESS=ccb2.host CCB_ADDRESS=ccb1.host UCSD Jan 17th 2012 Condor 36
  • 37. (Vanilla) Condor protocol UCSD Jan 17th 2012 Condor 37
  • 38. Claiming Protocol Central Manager Negotiator Collector S Submit Machine Execute Machine S J Schedd Startd J Submit UCSD Jan 17th 2012 Condor 38
  • 39. Claiming Protocol Central Manager Q J S Negotiator Collector S Submit Machine Execute Machine S J Schedd Startd UCSD Jan 17th 2012 Condor 39
  • 40. Claiming Protocol Central Manager Q J S Negotiator Collector S Submit Machine Execute Machine CLAIM J S Q J Schedd Startd UCSD Jan 17th 2012 Condor 40
  • 41. Claim Activation Central Manager Negotiator Collector Submit Machine Execute Machine CLAIMED Schedd Startd Activate Claim Starter Job Shadow UCSD Jan 17th 2012 Condor 41
  • 42. Repeat until Claim released Central Manager Negotiator Collector Submit Machine Execute Machine CLAIMED Schedd Startd Activate Claim Starter Job Shadow UCSD Jan 17th 2012 Condor 42
  • 43. When is claim released? ● When relinquished by one of the following ● lease on the claim is not renewed – Why? Machine powered off, disappeared, etc ● schedd – Why? Out of jobs, shutting down, schedd didn’t “like” the machine, etc ● startd – Why? Policy re claim lifetime, prefers a different match (via Rank), non-dedicated desktop, etc ● negotiator Defining Rank is dangerous! – Why? User priority inversion policy (preemption) ● explicitly via a command-line tool – E.g. condor_vacate UCSD Jan 17th 2012 Condor 43
  • 44. The end UCSD Jan 17th 2012 Condor 44
  • 45. The Condor Project (Established ‘85) ● Research and Development in the Distributed High Throughput Computing field ● Team of ~35 faculty, full time staff and students ● Face software engineering challenges in a distributed UNIX/Linux/NT environment ● Are involved in national and international grid collaborations ● Actively interact with academic and commercial entities and users ● Maintain and support large distributed production environments ● Educate and train students UCSD Jan 17th 2012 Condor 45
  • 46. The Condor Team UCSD Jan 17th 2012 Condor 46
  • 47. Pointers ● Condor Home Page http://www.cs.wisc.edu/condor/ ● Condor Manual http://www.cs.wisc.edu/condor/manual/v7.6/ ● Support condor-user@cs.wisc.edu condor-admin@cs.wisc.edu UCSD Jan 17th 2012 Condor 47