Beruflich Dokumente
Kultur Dokumente
SGE
v1.4
Author: Anand Vaidya
anand@vsa-services.com
Why use SGE?
● Maintain order in a shared resource – like queing
up at a movie ticket counter rather than mobbing
the counter
● Apply different usage policies – PhDs and Profs
get better treatment than first year grads
● Everyone gets a fair (!) share of the computing
resource.
What is SGE?
● SGE is a distributed resource management
software
● Provides users the means to submit
simulations to run.
● Can we just use these machines to run
those computations?
● Who will manage this herd? SGE will...
SGE Overview
execution hosts
● Records of each jobs progress through the
DRMAA client
(applications)
Supported OS
● Linux 32 and 64 bit
● Solaris (Sparc and x64)
● Windows (exec only)
● OSX
● AIX
● HPUX/IRIX etc
SGE Components
● Hosts
➢ Master (coordinate activities, hold queues)
➢ Shadow Master
➢ Execution (workers)
➢ Administration (sets up system, queues etc)
➢ Submit (users can submit jobs from these)
SGE Components
● Usually the master and admin host are the same
machines
● Queues (defined by the administrator)
● User and Administrator Commands
● Daemons:
● sge_qmaster (Master Daemon),
● sge_schedd (Scheduler Daemon),
● sge_execd (Execution Daemon)
● sge_commd (Communication Daemon)
4 Job Types
● Interactive jobs - user gets back a shell window
● Batch jobs – just run once and store output for
review later
● Array jobs (aka parametric – eg image rendering )
● Parallel (MPI) jobs – Can't describe in one line :-(
Accessing...
● GUI (qmon)
● Command Line / textual (qsub etc)
● Programmatic (DRMAA)