Beruflich Dokumente
Kultur Dokumente
by
B oontee Kruatrachue
A THESIS
submitted to
Oregon State University
in partial fulfillment of
the requirements for the
degree of
Doctor of Philosophy
.Page
I. INTRODUCTION 1
1.3 Scope 11
2.1 Background 17
3.1 Introduction 30
3.2 Definitions 30
TABLE OF CONTENTS
(continuation)
.Page
5.1 Introduction 79
VI. CONCLUSION 93
BIBLIOGRAPHY 96
APPENDIX 101
LIST OF FIGURES
Figure .Page
sent from one node to another. A node must wait for all of its input
messages before it can start execution. A node N1 is called an
immediate predecessor to node N2, if N1 sends a message to N2.
Likewise, node N2 is called the immediate successor node of N1
because N2 receives a message from N1. A successor (predecessor)
node may have zero, one or more immediate predecessors
(successors).
*
where P = Number of Processors, T = Task execution time, Ar
= Arbitrary (precedence, number of processors, execution times).
Yu's [Yu 84] heuristics are based on Hu's algorithm [Hu 61].
Yu's improvements were to consider communication delays when
9
as possible.
5: (Min Communication) : Communication time delays must be
suppressed as much as possible.
analyzed),
2: (Locality): Transmission time between nodes located on the
same processor is assumed to be zero time units,
3: (Identicality): Each processor is the same (speed, function).
4: (Co-Processor I /O): A processor can execute tasks and
communicate with another processor at the same time. This is
typical with IO-processors (channel) and direct memory
access. Hence, the term "processing element" (PE) is used
instead of "processor" to imply the existence of an I0-
processor,
5: (Single Application): Only one program is executed at a time
on the parallel processing system--this is to maximize
execution speed.
y = communication delay
x = Node Number
z = Node size
Program Parallel;
Var a,b,c,d,e,f,g,h,i,j,k,l,m,n,o : real;
Begin
a:= 1; b:= 2; c:= 3; d:=4; e:=5; f:=6; {Node 1}
g := a*b; {Node 2}
h := c*d; {Node 3)
i := d*e; {Node 4}
j := e*f; {Node 5}
k := d*f; {Node 6}
1 := j*k; {Node 7)
m := 4*1; {Node 8)
n := 3*m; {Node 9}
o := n*i; {Node 10}
p := o*h; {Node 11}
q := p*g; {Node 12}
end.
CHAPTER II
THE PARALLEL PROCESSOR SCHEDULING ENVIRONMENT
2.1 Background
2.1.2. Critical Path Nature of the Task Graph and Gantt Chart.
The length of a path in a task graph is the sum of all node and
branch sizes along the path. The critical path of a task graph is the
longest path from the start node to the exit node. For example, there
are two identical longest length paths in Figure 2.1 so there are two
critical paths. One is through nodes 1,5,7,8,9,10,11,12 and the other
through nodes 1,6,7,8,9,10,11,12. The lower bound on the execution
time of the task graph is the length of the critical path. That is, a
program cannot execute to completion in less time than given by the
length of the critical path regardless of the number of processors in
the system.
Gantt Chart in Figure 2.1 , the longest run time belongs to processor
PE 1, and the execution time of the task graph is 10 time units. Notice
that this time includes the communication delay time (2 units, from
node 1 to 5 and node 5 to 7) plus node execution time in the critical
path (8 units, node 1, 5, 7, 8, 9, 10, 11, 12).
node 5 can be assigned and executed before nodes 2,3, and 4 then
an overall shorter execution time is obtained as shown in the
scheduling Gantt chart of Figure 2.1.
The schedulers in this class differ only in the way that each
scheduler assigns priorities to nodes. Priority assignment results in
different schedules because nodes are selected in a different order.
The comparison between different node priority (level, co-level,
random) has been studied by Adam et al [Adam 74]. The comparison
suggests that the use of level number as node priority is the nearest
to optimal.
2.2 The Parallel Processor Scheduling Problem
Figure 2.4 shows the technique used to define the best grain
size. The grain size is defined by grouping the scheduled tasks
obtained from the small grain schedule shown in Gantt chart A. This
forms the larger grain schedule shown in Gantt chart B. The
grouping decision depends on the underlying parallel processor
system hardware and software. Usually, the more we can group
smaller grains, the shorter the task graph execution time because of
the reduction of overhead.
[Adam 74] showed that among all priority schedulers, level priority
schedulers are the best at getting close to the optimal schedule.
y = communication delay
x = Node Number
z = Node size
1 1
6 6
2 2
2 5
3 3
3 2
4 4 7
4 3
5 5
5 8 4
6 6
9
7 7
7 10
8 8
8 11
9 9
9 12
10 10
10
11 11
11
12 12
12
13 13
14 14
I
0 0
1
1
1 1
1
Dx
2
2
Dx
2
2
T
3
3
3 T 3
3
1
D3
1 1
1 1
2
3
T1 2
3 T 2
3
Dx
4 T 4
4
3 2
3
7 3 II
4 4
8 4 IV
5 5
9
6
6
10
7
7
V
11 8
8
12 9
9
CHAPTER III
DUPLICATION SCHEDULING HEURISTIC (DSH) & INSERTION
SCHEDULING HEURISTIC (ISH)
3.1. Introduction
Two new scheduling heuristics are proposed to solve the
scheduling problem as discussed in Chapter I. They are the insertion
scheduling heuristic (ISH) and the duplication scheduling heuristic
(DSH). Both heuristics are improvements over the Hu heuristic which
solve the communication delay and the max-min problems. ISH is
essentially Hu's heuristic with an improvement to the
communication delay problem by inserting tasks in available
communication delay time slots. However, ISH does not solve the
max-min problem. DSH solves the max-min problem using task
duplication scheduling.
3.2. definitions
A node is ready if all of its immediate predecessors have been
assigned. Node 8 is ready since node 7 was assigned to PE2.
finished its assigned task and is ready for execution of the next task.
The ready times for PEI, PE2, PE3 are 10,14,9 respectively.
The finishing time (tf)of a node for a PE is the time when the
node finishes its execution on that PE. It is the starting time of that
node plus the size of that node.
tf = tsn + node size
32
The first ready PE (PERF) is the first PE in the set of all PEs
to become ready after each assigned node is scheduled on the
assigned PE. After node 7 is assigned, PE3 is PERF and is ready at
time unit 9.
ISH and DSH share the same list scheduling heuristic shown in
pseudocode in Figure 3.2. The heuristic tries to minimize the
schedule length by minimizing the finishing time of each assigned
node. At first, the level of each node in the task graph is calculated
and used as each node's priority. (An example of node level is shown
in Figure 3.4). The ready queue is then initialized by inserting all
nodes with no immediate predecessors into the ready queue. This
33
usually means that only one node is inserted into the ready queue
initially. Then, the node at the top of the queue (with the highest
priority, i.e. highest level) is assigned to a PE. The PE is selected by
the processor selection routine called Locate-PE. For the first node,
any PE can be selected because no PE has been assigned a task, yet.
The heuristic continues assigning nodes and updating the ready
queue until all the nodes in the task graph are assigned.
ready nodes from the front of the ready queue to find nodes (hole
tasks) that can be inserted into the idle time slot. Searching
continues until the idle time slot is filled or no hole task is found.
The details of the Assign_node routine are shown in Figure 3.6.
Figure 3.7 starts with the Locate_PE for node 7 (Node 1,4,5,
and 6 have already been assigned, Gantt chart A). The Locate_PE
returns PEi as the assigned PE with node 7 start time is 4 time unit
and the idle time slot is 3-4 time unit. Then, the Assign_node tries to
assign the hole task to the idle time slot, and finally Assign_node
assigns node 7 to PEi start at 4 time unit.
modification, the Idle time slot is empty and the hole task
assignment step (Step 2) is done. Then, node 7 is assigned at its start
time on PE 1.
start time, since that node has a tendency to start execution on the
other PE earlier (Figure 3.7, node 2 is not a hole task since its
message ready time on PE1 is 1 time unit and the idle time slot start
time is 3 time unit).
For the second case, the LIP has already been duplicated in the
idle time slot of the PEE . To start the assigned node sooner, the LIP
has to start it's execution sooner. Thus, the CTlst is searched to find
the node that affects the start time of the LIP node. The search
starts with the LIP of the LIP of the assigned node. If that node is
assigned to some other PE, the search process stops and that node is
the search node. Otherwise, the process searches deeper levels until
it finds the LIP of the LIP node that is located in some other PE, and
41
Once the search node is found, it is copied into the idle time
slot of the PEc . Then all the duplicated task nodes that start after the
copied node are removed and re-copied due to the duplication of the
search node. The reason for re-copying is that the start time (and
the order in CTlst) of the node located after the copied-search node
may change due to the presence of the search node. The re-copy
may indirectly cause the LIP node starting time to decrease so the
assigned node can start execution sooner. The duplication process
continues until duplication fails or the LIP was already assigned to
PEC (No reason to copy). If some idle time remains, hole tasks are
inserted into the time slot as in the ISH heuristic.
order to find the starting time and the duplicated task list for each
assigned node on any specified PE.
From Figure 3.14, all node in the task graph have already been
assigned except the last node (node 11, exit node, Gantt chart A).
Locate_PE of DSH have to find out the start time of node 11 by using
TDP on all three PEs in order to compare and make the assignment
decision. Figure 3.14 shows only TDP (CTlst) on PEi which is also the
one that Locate_PE will select for node 11's assignment.
execute serially.
ready queue, the complexity of step 4 and step 8.1 is 0(N). Also, N
nodes are inserted into a ready queue so the node insertion is 0(N).
But before each insertion of ready node, each ready node
predecessor must be assigned to some PE. To check predecessor
node assignment each branch has to be traversed once. So the
complexity of step 3 and step 7 is 0(N+B). Each PE is initialized in
step 2 with complexity 0(P). In step 6, PERF is located before each
node assignment, so the complexity of step 6 is 0(NP). The
complexity of the main algorithm is
0(B+N+P+NP+O(Locate_PE)+0(Assign_node)).
Gantt Chart
x x 5
9
4 x
10
6
11
i
12
i
x = don't care 13
i = idle time slot 7
14
repeat
6) Update_R_queue(RQ,AN,TG); (update the ready queue
using the assigned node)
7) if RQ, ready queue, is not empty then
7.1) Get node (AN,RQ); (get a new AN from front of RQ)
7.2) Locate_PE (AN,PE_GC,PEL,ST); (return PEL that has
the smallest ST, AN's start time)
7.3) Assign_node(AN,ST,PE_GC,PEL,RQ);
until all nodes in TG are assigned
Figure 3.2 Main List scheduler.
49
Update_R_queue (AN,RQ,TG);
Begin
Let IMS be a set of immediate successor nodes of EN
For all IN node in IMS
Do
1) N = Num of Immediate Pred(1N) (N = Number of
Immediate Predecessors of node IN)
2) N = N 1; (store N as number of predecessors of node
IN for next Update_r_queue called)
3) if N = 0 then
Ordered Insert(IN,Ready_queue)
{ Keep the ready_queue in order by level number}
After initializing Gantt chart and Ready queue (step 3 of Figure 3.2)
Gantt Chart Ready Queue N# #Pr Level
PE1 PE2
1 1 0 4
2 1 2
1
3 1 3
4 2 1
4 2 1
After update Ready queue using node 1, Event node (Step 6).
4 2 1
N# = Node Number
#Pr = Number of Immediate Predecessor of node N#
Level = Level number of node N#
After updAte Ready queue using node 3 (repeat loop back to Step 6)
Iiiett ivy Ready Queue N# #Pr Level
1 2 1 0 4
2 0 2
3 3 0 3
4 1 1
Begin
Gantt chart A and ready queue A after Locate_PE routine selects PE1 as assigned PE for
node 7 start at 4 time unit with idle time slot 4-5.
Gantt Chart B is after node 3 (hole task) assignment followed by node 7's assignment.
Gantt Chart A Gantt Chart B
PE1 PE2 PE3 Time PE1 PE2 PE3 Ready Queue B
Time Ready Queue A NodeLevel
0 Node Level 0
1 1 2 2
1 1
4 4
2 2
5 6 5 6
3 3
3
4 4
7* 7
5 5
* Locate_PE return PE1 for node 7 at time 4, but node 7 has not been assigned to PE1 yet.
The final Gantt Chart of ISH in comparison to Gantt chart of level-list scheduling
without task insertion (NISH).
Gantt Chart with Gantt Chart without
Task insertion Task insertion
Time PE1 PE2 PE3 Time PE1 PE2 PE3
0 0
1 1
1 1
4 4
2 2
5 6 2 5 6 2
3 3
3
4 4
7 7 3
5 5
8 8
6 6
9
7 7
9
8 8
-- NISH
- ISHO
ISM
- ISH2
12 16 20 24 28 32 36 40 44 48
1 1* 1
1 1
2 3 2
2 2
a 5 3
3 3
4 5
4 4
4
5
Copy_LIP (LIP,LIPLIP,CT1st,STL,CTcnt,COPY)
2.1) Search_CT1st_LIP(LIPLIP,LIP,Found);
{ Search ancestor of LIPLIP from CT1st to find LIP node that
is not in CTlst or LIP becomes an entry node)
Gantt Chart A is an intermediate DSH's scheduling result before Node 11's assignment.
Gantt Chart B is a final result after Assign_Node (node 1 I,CTIst).
Gantt Chart C is a Gantt chart B after removing redundant tasks
1
1 1 1 1
1 I 1 1
1
2 3 4 2 3 4 2 4
2 2 2
1 it. 2_
3
2. 4 a 3
a
3
5 6 7 5 6 7
4 4 4
8 9 10 8 9 10 8
5 5 5
6_ b.
6 6 6
2 2.
7 7 7
2.. 1
8 8 8
IQ IQ
9 9 9
11 11
10 10 10
Figure 3.13 The Sample Task Graph of 11 Nodes and Its
Intermediate Gantt Chart using DSH.
61
N# LIP ST FT MSRT
0 10 11 11 0
N# LIP ST FT MSRT
0 8 5 5 0
1: re-calculated start time of node 11
(LIP=10, ST=9). 6 4 5 6 5
2: No idle time slot left TDP terminated 9 6 6 7 6
with Node 11's start time on PE1 is 9
and four nodes duplicated. 7 4 7 8 5
3: Final Gantt chart (B) is shown on 10 7 8 9 8
Figure 3.13
0 10 9 9 0
CHAPTER IV
EXPERIMENT RESULTS
This Chapter describes the test results for ISH and DSH
compared to the results for Hu's heuristic [Hu 61] and Yu's heuristic
D[Yu 84]. The comparison test consists of applying the heuristics to a
wide range of randomly generated task graphs with 20, 50, 100,
150, 250, and 350 nodes for a total of 340 task graphs. This
approach follows closely the approach used by [Adam 74]. The only
difference from Adam's test is that we compared our results with
Hu's and Yu's results instead of the optimal solution because there is
no algorithm to find the optimal solution for precedence task graphs
that include communication delays.
The random graphs can be classified into two groups. The first
group has identical node sizes and identical communication delays in
order to study the effects of communication delay on the
performance of each heuristic. The second group has variable node
sizes and variable communication delays in order to study the
stability of the heuristic when node and communication delay sizes
66
change.
For the first group, the task graph node size is one time unit
and communication delay sizes are varied; 1, 3, 5, 10, and 20 time
units. At each communication delay size, 10 task graphs were
randomly generated and scheduled. An average speedup ratio was
compared from the 10 task graph runs for each communication
delay and the results plotted in Figures 4.1-4.6. The minimum,
maximum and average of each data point in Figures 4.1-4.6 are
summarized in the Appendix. The speedup ratios at the saturation
points Figures 4.1-4.6 and the percentage improvements of ISH and
DSH are summarized in Table 4.1 and Table 4.2, respectively. Table
4.3 summarizes the percent of the average speedup ratio versus the
average speedup ratio for unit delay task graphs for all four
heuristics. This percent, which is the ratio of speedup ratio at each
delay to the speedup ratio of unit delay, shows how speedup
decreases as the delay increases.
For the second group, the task graphs have 20, 150, 250, and
350 nodes. For each number of nodes 10 task graphs were randomly
generated with different node size and communication delay as
shown in Table 4.4. The schedule results were plotted in Figure 4.8.
The speedup ratios at the saturation points marked in Figures 4.8
and the percentage improvements of DSH are summarized in Table
4.4. The results are about the same as the first group for the same
communication delay ratios.
20 nodes, delays - i 20 nodes, delays - 10
2.8
1.15
2.
1.0
2.4 DSH
DSH
2.2 0.95
ISH ISH
O
tx 2 1// 0.
1.2 0.45
4- 9+ ii t 13 145
0.
5
t f-4
9
I
13
I
i5
Number of PEs Number of PEs
0.
1.5
0._
DSH DSH
ISH ISH
\.O.
Li
Yu 0.5 -- Yu
Hu
0.4 ' ... Hu
0.7
0.2
2.1 2.5 Hu Hu
1.5
13 15 17 19 21 23 25 27 11 13 i5 17
1.35
-- Yu 1:2 -- Yu
a 0.75
cn
Hu Hu
0.55
0.35
11 13 15 17 19 21 23 25 27 ii 13 15 17 19 21 23 25 27
Number of PEs Number of PEs
OSH DSH
ISH ISH
-- Yu -- Yu
Hu 11U
1 4 1 1 I I I 1 1 1 1 1 1 1 1 1 1 1 1
12 16 20 24 28 32 36 40 44 48
Number of PEs Number of PEs
1.0
ISH 40 1.4
ISH
CO
IX
1. 2
0. -- Yu
-- Yu
2.5 '0
Hu 0. Hu
0.8
1.5 0.6
1 1 1 1 1 0.4
12 16 20 24 28 32 36 40 44 48 12 16 20 24 28 32 36 40 44 48
3.5
1...f..1.....
DSH DSH
a
..,
4.,
ISH 0 ISH
rc 2.5
a.
-- Yu a
.1:3 -- Yu
a))
a
cx
U3
Hu Hu
1.5
I i i 1 1 1 1 1 1 i 1 1 1 1 1 1
12 16 20 24 28 32 36 40 44 48 16 20 24 28 32 36 40 44 48
Number of PEs Number of PEs
= 3.../
CC
Q
Ca.
m -- Yu 1.5-- -- Yu
-0
5 ,../----
2.5 ;,-- ----N. Hu ia)
Hu
2
.....,,.," ....
A. -
1.5 \-----------"'"
...
I 0.5
20 24 28 32 36 40 44 48 12 16 20 24 28 32 36 40 44 48
Number of PEs Number of PEs
13 5
4
ISH ISH
9
3.5
0. -- Yu
" " Yu
3
Hu Si 2.5 Hu
1.5
1 4 12 16 20 24 28 32 36 40 44 48 2 56 60 64 68 12 16 20 24 28 32 36 40 44 48 2 56 60 64 60
Number of PEs Number of PEs
DSH DSH
0
2.8 ISH
ISH
CC
CL
2.3 -- Yu
-- Yu
CL
Hu cn 1.8 Hu
4 8 12 16 20 24 28 32 36 40 44 48 52 56 60 64 68 12 16 20 24 28 32 36 40 44 48 52 56 60 64 68
Number of PEs Number of PEs
Figure 4.5 The Average Speedup Ratio Comparison (250 Nodes). t.)
350 nodes, delays - i 350 nodes, delay = 10
DSH DSH
0
ISH 4-1
03
ISH
CC
-- Yu 7
.0 4
-- Yu
Hu Hu
-1-1-4-+-1-4-1 I I I I
4 8 12 16 2 0 24 28 32 36 40 44 48 b2 bb 60 64 68 8 12 16 20 24 28 32 36 401 44 4181 421 461 601 64 I 68
Number of PEs Number of PEs
10
DSH DSH
4-1
ISH ISH
7
----------------------------
-- Yu Yu
m
m
Hu . -- Hu
2
1 1 I I I I I +-4-+++-141- I I 1 1 -I-1-1-++
1 4 8 12 16 20 24 20 32 36 40 44 48 52 56 60 64 68 1 4 8 12 16 20 24 28 32 36 40 44 48 52 56 60 64 68
Number of PEs Number of PEs
10.5
8.5
0
CC 8.5
0
0. 1.8
4.
0. 1.3
2.5
1V-- I 10 I 12 I 14 I 10 14 18 22 26 30 34 38 42 48 50
18
7.2
16
6.2
0 g 10
/./
D
6
4.4.....40
4
2.2
2
1.2 62 66
10 14 18 22 28 30 34 38 42 46 18 22 26 30 34 38 42 46 50 4 0
Figure 4.7 The Average DSH Speedup Ratio Comparison for Different Delay
20 nodes 250 nodes
1.2
DSH DSH
ISH ISH
0.8
-- Yu 4
- - Yu
0.6
Hu Hu
------------------------------------
0.4
0.2
1 7 9 11 15 4 8 12 16 20 24 28 32 36 56 60 64
Number of PEs Number of PEs
10
5T
9
4.5- DSH DSH
4 0 8
0
4-1 ISH
ISH CO
cc
7
CO
cc4.1
3.5 ca. 6
0. 3 -- Yu -a
a)
5
- - - Yu
2.5 Hu 4 Hu
2 3
1.5 2
1 1 III
4 8
1-1111-4-1-4-1111111111111
1216 20 24 28 32 36 40 44 48 4 12 16 20 24 28 32 36 40 44 48 52 56 60 64 68
Number of PEs Number of PEs
D = communication delay size (time unit), Nodes size are 1 time unit.
% A, percentage improvement over heuristic A = (ISH's speedup A's speedup)/ A's speedup * 100
D % Hu % Yu % Hu % Yu % Hu % Yu % Hu % Yu
20 Nodes
D DSH S S% Hu S% Yu S% ISH S% DSH
1 2.714 100.00 100.00 100.00 100.00
3 1.964 52.03 50.00 57.26 72.37
5 1.650 30.02 36.85 43.77 60.80
10 1.220 16.43 21.04 26.01 44.95
20 1.000 11.07 10.10 14.74 36.85
150 Nodes
D DSH S S% Hu S% Yu S% ISH S% DSH
1 10.018 100.00 100.00 100.00 100.00
3 7.162 54.51 55.36 57.82 71.49
5 5.559 32.83 36.17 42.58 55.49
10 3.774 16.95 16.40 23.07 37.67
20 2.552 9.88 10.88 12.58 25.47
250 Nodes
D DSH S S% Hu S% Yu S% ISH S% DSH
1 14.726 100.00 100.00 100.00 100.00
3 10.795 54.17 55.71 57.59 73.31
5 8.341 32.73 37.99 38.57 56.60
10 5.735 19.10 20.73 23.71 38.94
20 3.814 9.69 11.23 12.74 25.90
350 Nodes
D DSH S S% Hu S% Yu S% ISH S% DSH
1 18.524 100.00 100.00 100.00 100.00
3 13.802 51.28 54.37 56.17 74.51
5 11.30 35.63 38.92 41.08 61.00
10 7.695 20.68 21.39 24.84 41.54
20 4.865 9.10 10.88 12.82 26.26
ISH DSH
Node Size 1-5(3) means node sizez varied from 1 to 5 with average
of 3 time unit.
Delay Size (Communication Delay) used the same format as node
size.
RD is a ratio of average communication delay
average node size
Table 4.4 DSH and ISH Speedup Ratio Improvement over Hu and YU
D Heuristics for Variable Node Size aand Communication Delay.
79
CHAPTER V
OPTIMAL GRAIN DETERMINATION FOR PARALLEL
PROCESSING SYSTEMS
5.1 Introduction
Solutions to the "grain size" problem for parallel processing
systems and an example of grain size determination are given in this
Chapter. This grain size problem as previously described in Chapter
II is stated as follows:
Grain Size Problem. How to partition a given program into
concurrent modules to obtain the shortest possible program
execution time? What is the "best" size for each concurrent
module?
Since the grain size and the user program's execution time
depends on the scheduler used, the choice of scheduler is very
critical. The choice of scheduler used in grain packing must provide
a solution to the max-min problem, and give monotonically growing
improvement as the number of processors is increased (Speedup
ratio >=1).
SEQ
All Al2 B11 B12 C11 C127
Chan lin ? All * =
Chan lin ? Al2
Chan lin? B11 [A21 A22 B21 B22 C21 C22
Chan lin 7 B21
C11 := (All*B11)+(Al2*B21)
Chan lout ! C11 C11 = All*B11 + Al2*B21
INT All,Al2,B12,B22,C12 : C12 = Al l*B12 + Al2*B22
SEQ
Chanlin ? All C21 = A21*B11 + A22*B21
Chan 2in ? Al2
Chan 2in? B12
Chan 2in 7 B22 C22 = A21*B12 + A22*B22
C12 := (A1l*B12)+(Al2*B22)
Chan 2out ! C12 Sum = C11 + C12 + C21 + C22
INT A21,A22,B11,B21,C21 :
SEQ
Chan3in ? A21 Task Graph representation of OCCAM Program
Chan 3in ? A22
Chan 3M7 B11
Chan 3M 7 B21
C21 := (A21*B11)+(A22*B21)
Chan 3out ! C21
INT A21,A22,B12,B22,C22 :
SEQ
Chan4in ? A21
Chan 4in 7 A22
Chan 4in? B12
Chan 4in 7 B22
C22 := (A21*B12)+(A22*B22)
Chan 4out ! C22
INT C11,C12,C21,C22 :
SEQ
Chanlout? C11
Chan 2out? C12
Chan 3out C21
Chan 4out C22
Sum := C11+C12+C21+C22
Chan 5out ! Sum
CPU CYCLE
MOVE.W Axx, DI 15
MOVE.W Bxx, D2 15
MULU DI, D2 71
CM. OPT.* MOVE.L D2, PAR 20
Ann * Bnn O Node size = 101
O Node size = 8
0 0 0 0
20 20+ 20
a a 0
20+ 0+
PE1 PE2
A
T
I B
( PE1 ) ( PE2 )
T1
T3
DMA T5
T2 Serial Link
MEM. MEM.
4 ci1 a il
1
es
212 12
Time P1 P2 P3 P4 P5 P6 P7 P8 Time P1 P2 P3 P4 P5 P6 P7 P8 P1
0 0
3 4 5 6 7 8 1 2 3 4 5 6 7 8 1
101
101
4 6 8 2
202 11 12
210 3
313 313
4
5
361
533
6
8
761
9
TZ-
13
14
864
INT A22,B22,Mu17
SEQ
Chanlin 7 A22
Chanlin ? B22
Mu17 := A22*B22
Chanlout! Mu17
INT C11,Mul2,Mul3,Mul4,Mul5,Mul6,Mul7,Sum Grain after Grain Packing
SEQ
Chanlout? C11
Clian2out? Mu12
Chanlout? Mu17
Sum := Cll+Mu12+Mu13+Mul4+Mu15+Mu16+Mu17
Chan8out! Sum
CHAPTER VI
CONCLUSION
BIBLIOGRAPHY
Adam 74 T. L. Adam, K. M. Chandy, J. R. Dickson, "A Comparison of
List Schedules for Parallel Processing Systems," Comm.
ACM., Vol. 17, pp. 685-690, Dec. 1974.
Babb 84 R. G. Babb, "Parallel Processing with Large-Grain Data
Flow Techniques", Computer, Vol. 17, No. 7, July 1984,
pp. 55-61.
Bash 83 A. F. Bashir, V. Susarla, and K. Varavan, "A Statistical
Study of a Task Scheduling Algorithm," IEEE Trans.
Comput., Vol. C-32, No. 8, Aug. 1983, pp 774-777
Blaz 84 J. Blazewicz and J. Weglarz, "Scheduling Independent 2-
Processor Tasks to Minimize Scheduling," Information
Processing Letter, Vol 18, No. 5, June 1984, pp. 267-273.
Bokh 81 S. H. Bokhari, "On the Mapping Problem," IEEE Trans.
Computers, Vol. C-30, No. 3, pp. 207-214, March 1981.
Chen 75 N. F. Chen, and C. L. Liu, On a Class of Scheduling
Algorithm for Multiprocessing Systems, Proc. 1974
Segamore Computer Conference on Parallel Processing, T.
Feng, ed., Springer, Berlin, 1975, pp. 1-16.
Chou 82 T. Chou and J. Abraham, "Load Balancing in Distributed
Systems," IEEE Transactions on Software Engineering,
Vol. SE-8, No. 4, July 1982, pp. 401-412.
Chu 80 W. W. Chu et al., "Task Allocation in Distributed Data
Processing," Computer, Vol. 13, No. 11, pp. 57-69, Nov.
1980.
Clar 52 W. Clark, The Gantt Chart, 3rd edition, London: Pittman
and Sons, 1952.
Coff 72 E. G. Coffman, Jr., and R. L. Graham, "Optimal Scheduling
for two Processor System," Acta Information, Vol. 1, No.
3, pp. 200-213, 1972.
Coff 76 E. G. Coffman, Computer and Job-Shop Scheduling Theory.
New York: Wiley, 1976.
Dogr 78 A. Dogramaci and J. Surkis, "Limitation of a Parallel
97
APPENDIX
Data points of Figure 4.1 (20 nodes, delays = 1)
P # min Hu max Hu Ave Hu min Yu max Yu ave Yu min ISH max ISH ave ISH min DSH max DSH ave DSH
2 1.6670 1.6670 1.6670 1.6670 1.8180 1.7878 1.6670 1.8180 1.7878 1.8180 1.8180 1.8180
3 2.0000 2.2220 2.0444 2.0000 2.2220 2.1332 2.0000 2.2220 2.1776 2.2220 2.2220 2.2220
4 2.2220 2.5000 2.3332 2.2220 2.5000 2.4444 2.5000 2.5000 2.5000 2.5000 2.5000 2.5000
5 2.2220 2.2220 2.2220 2.5000 2.8570 2.6428 2.5000 2.8570 2.5714 2.5000 2.8570 2.6428
6 2.2220 2.5000 2.2776 2.5000 2.8570 2.7142 2.5000 2.8570 2.7142 2.5000 2.8570 2.7142
7 2.2220 2.2220 2.2220 2.5000 2.8570 2.7142 2.5000 2.8570 2.7142 2.5000 2.8570 2.7142
8 2.2220 2.2220 2.2220 2.5000 2.8570 2.7142 2.5000 2.8570 2.7142 2.5000 2.8570 2.7142
9 2.2220 2.2220 2.2220 2.5000 2.8570 2.7142 2.5000 2.8570 2.7142 2.5000 2.8570 2.7142
10 2.2220 2.2220 2.2220 2.5000 2.8570 2.7142 2.5000 2.8570 2.7142 2.5000 2.8570 2.7142
11 2.2220 2.2220 2.2220 2.5000 2.8570 2.7142 2.5000 2.8570 2.7142 2.5000 2.8570 2.7142
12 2.2220 2.2220 2.2220 2.5000 2.8570 2.7142 2.5000 2.8570 2.7142 2.5000 2.8570 2.7142
13 2.2220 2.2220 2.2220 2.5000 2.8570 2.7142 2.5000 2.8570 2.7142 2.5000 2.8570 2.7142
14 2.2220 2.2220 2.2220 2.5000 2.8570 2.7142 2.5000 2.8570 2.7142 2.5000 2.8570 2.7142
15 2.2220 2.2220 2.2220 2.5000 2.8570 2.7142 2.5000 2.8570 2.7142 2.5000 2.8570 2.7142 .
Data points of Figure 4.1 (20 nodes, delays = 5)
P # min Hu max Hu Ave Hu min Yu max Yu ave Yu min ISH max ISH ave ISH min DSH max DSH ave DSH
2 0.6450 0.7410 0.7016 0.7140 0.8700 0.7670 0.9090 1.1760 0.9978 1.1760 1.3330 1.2370
3 0.6450 0.8000 0.7096 0.7690 0.8000 0.7938 1.0000 1.1760 1.1162 1.3330 1.5380 1.4342
4 0.5710 0.8330 0.7342 0.8000 1.0000 0.9312 1.0000 1.3330 1.1772 1.4290 1.6670 1.5420 .
5 0.5560 0.8330 0.7250 1.0000 1.0000 1.0000 1.0530 1.3330 1.1878 1.4290 1.6670 1.5678
6 0.5560 0.8000 0.6670 1.0000 1.0000 1.0000 1.0530 1.3330 1.1878 1.5380 1.8180 1.6500
7 0.5560 0.8000 0.6670 1.0000 1.0000 1.0000 1.0530 1.3330 1.1878 1.5380 1.8180 1.6500
8 0.5560 0.8000 0.6670 1.0000 1.0000 1.0000 1.0530 1.3330 1.1878 1.5380 1.8180 1.6500
9 0.5560 0.8000 0.6670 1.0000 1.0000 1.0000 1.0530 1.3330 1.1878 1.5380 1.8180 1.6500
10 0.5560 0.8000 0.6670 1.0000 1.0000 1.0000 1.0530 1.3330 1.1878 1.5380 1.8180 1.6500
11 0.5560 0.8000 0.6670 1.0000 1.0000 1.0000 1.0530 1.3330 1.1878 1.5380 1.8180 1.6500
12 0.5560 0.8000 0.6670 1.0000 1.0000 1.0000 1.0530 1.3330 1.1878 1.5380 1.8180 1.6500
13 0.5560 0.8000 0.6670 1.0000 1.0000 1.0000 1.0530 1.3330 1.1878 1.5380 1.8180 1.6500
14 0.5560 0.8000 0.6670 1.0000 1.0000 1.0000 1.0530 1.3330 1.1878 1.5380 1.8180 1.6500
15 0.5560 0.8000 0.6670 1.0000 1.0000 1.0000 1.0530 1.3330 1.1878 1.5380 1.8180 1.6500 .
Data points of Figure 4.1 (20 nodes, delays = 10)
P # min Hu max Hu Ave Hu min Yu max Yu ave Yu min ISH max ISH ave ISH min DSH max DSH ave DSH
2 0.2630 0.4350 0.3716 0.4350 0.5410 0.4922 0.5560 0.8000 0.6478 1.0530 1.1110 1.0646
3 0.3030 0.4350 0.3690 0.4350 0.5560 0.5288 0.5560 0.7690 0.7152 1.1110 1.1760 1.1370
4 0.2630 0.4440 0.4078 0.4550 0.5710 0.5418 0.5710 0.8000 0.7056 1.1110 1.2500 1.1926
5 0.2630 0.4550 0.3828 0.5560 0.5880 0.5714 0.5710 0.8000 0.7056 1.1760 1.2500 1.2056
6 0.2630 0.4440 0.3646 0.5560 0.5880 0.5714 0.5710 0.8000 0.7056 1.1760 1.2500 1.2204
7 0.2630 0.4440 0.3646 0.5560 0.5880 0.5714 0.5710 0.8000 0.7056 1.1760 1.2500 1.2204
8 0.2630 0.4440 0.3646 0.5560 0.5880 0.5714 0.5710 0.8000 0.7056 1.1760 1.2500 1.2204
9 0.2630 0.4440 0.3646 0.5560 0.5880 0.5714 0.5710 0.8000 0.7056 1.1760 1.2500 1.2204
10 0.2630 0.4440 0.3646 0.5560 0.5880 0.5714 0.5710 0.8000 0.7056 1.1760 1.2500 1.2204
11 0.2630 0.4440 0.3646 0.5560 0.5880 0.5714 0.5710 0.8000 0.7056 1.1760 1.2500 1.2204
12 0.2630 0.4440 0.3646 0.5560 0.5880 0.5714 0.5710 0.8000 0.7056 1.1760 1.2500 1.2204
13 0.2630 0.4440 0.3646 0.5560 0.5880 0.5714 0.5710 0.8000 0.7056 1.1760 1.2500 1.2204
14 0.2630 0.4440 0.3646 0.5560 0.5880 0.5714 0.5710 0.8000 0.7056 1.1760 1.2500 1.2204
15 0.2630 0.4440 0.3646 0.5560 0.5880 0.5714 0.5710 0.8000 0.7056 1.1760 1.2500 1.2204 .
Data points of Figure 4.1 (20 nodes, delays = 20)
P # min Hu max Hu Ave Hu min Yu max Yu ave Yu min ISH max ISH ave ISH min DSH max DSH ave DSH
2 0.2330 0.2990 0.2462 0.2330 0.3080 0.2744 0.3030 0.4260 0.3996 1.0000 1.0000 1.0000
3 0.1900 0.2380 0.2258 0.2330 0.3030 0.2478 0.3080 0.4440 0.3606 1.0000 1.0000 1.0000
4 0.1900 0.3080 0.2412 0.2350 0.3080 0.2784 0.3080 0.4440 0.3878 1.0000 1.0000 1.0000
5 0.1900 0.2350 0.2170 0.2380 0.3130 0.2810 0.3080 0.4440 0.3616 1.0000 1.0000 1.0000
6 0.1600 0.2380 0.2030 0.3080 0.4440 0.3372 0.3080 0.4440 0.3616 1.0000 1.0000 1.0000
7 0.1600 0.2350 0.1934 0.3080 0.4440 0.3372 0.3080 0.4440 0.3616 1.0000 1.0000 1.0000
8 0.1600 0.2350 0.1934 0.3080 0.4440 0.3372 0.3080 0.4440 0.3616 1.0000 1.0000 1.0000
9 0.1600 0.2350 0.1934 0.3080 0.4440 0.3372 0.3080 0.4440 0.3616 1.0000 1.0000 1.0000
10 0.1600 0.2350 0.1934 0.3080 0.4440 0.3372 0.3080 0.4440 0.3616 1.0000 1.0000 1.0000
11 0.1600 0.2350 0.1934 0.3080 0.4440 0.3372 0.3080 0.4440 0.3616 1.0000 1.0000 1.0000
12 0.1600 0.2350 0.1934 0.3080 0.4440 0.3372 0.3080 0.4440 0.3616 1.0000 1.0000 1.0000
13 0.1600 0.2350 0.1934 0.3080 0.4440 0.3372 0.3080 0.4440 0.3616 1.0000 1.0000 1.0000
14 0.1600 0.2350 0.1934 0.3080 0.4440 0.3372 0.3080 0.4440 0.3616 1.0000 1.0000 1.0000
15 0.1600 0.2350 0.1934 0.3080 0.4440 0.3372 0.3080 0.4440 0.3616 1.0000 1.0000 1.0000 .
Data points of Figure 4.6 (350 nodes, delays = 1)
P # min Hu max Hu Ave Hu min Yu max Yu ave Yu min ISH max ISH ave ISH min DSH max DSH ave DSH
2 1.9770 1.9770 1.9770 1.9770 1.9890 1.9866 1.9770 1.9890 1.9866 1.9890 1.9890 1.9890
4 3.8890 3.8890 3.8890 3.8890 3.9330 3.9066 3.8890 3.9330 3.9066 3.8890 3.9330 3.9242
6 5.7380 5.7380 5.7380 5.7380 5.8330 5.7570 5.7380 5.8330 5.7570 5.7380 5.8330 5.7570
8 7.4470 7.6090 7.4794 7.4470 7.6090 7.5442 7.4470 7.6090 7.5442 7.6090 7.6090 7.6090
10 8.9740 9.2110 9.1162 8.9740 9.2110 9.1636 8.9740 9.2110 9.1636 9.2110 9.4590 9.3598
12 10.6060 10.9380 10.6724 10.6060 10.9380 10.6724 10.6060 10.9380 10.6724 10.9380 10.9380 10.9380
14 12.0690 12.0690 12.0690 12.0690 12.0690 12.0690 12.0690 12.0690 12.0690 12.5000 12.5000 12.5000
16 12.9630 13.4620 13.3622 12.9630 13.4620 13.3622 12.9630 13.4620 13.3622 14.0000 14.0000 14.0000
18 14.0000 14.5830 14.4664 14.5830 14.5830 14.5830 14.0000 14.5830 14.4664 15.2170 15.2170 15.2170
20 14.5830 15.9090 15.3670 15.2170 15.9090 15.4938 15.2170 15.9090 15.4938 15.9090 16.6670 16.3638
22 12.9630 16.6670 15.7746 15.2170 16.6670 16.2254 15.2170 17.5000 16.3920 15.9090 17.5000 17.0152
24 11.6670 17.5000 15.8486 15.2170 17.5000 16.7252 15.2170 18.4210 16.9094 15.9090 18.4210 17.3836
26 10.9380 18.4210 15.7486 15.2170 18.4210 17.0936 15.2170 18.4210 17.0936 16.6670 19.4440 17.9240
28 10.6060 18.4210 14.5958 15.2170 18.4210 16.9094 15.2170 19.4440 17.2982 16.6670 19.4440 17.9240
30 10.6060 18.4210 13.7344 15.2170 18.4210 16.9094 15.2170 19.4440 17.4824 16.6670 19.4440 17.9240
32 10.2940 15.2170 12.3156 15.2170 18.4210 17.0936 15.2170 19.4440 17.4824 16.6670 19.4440 18.1286
34 10.0000 13.4620 11.8304 15.2170 18.4210 17.0936 15.2170 19.4440 17.4824 16.6670 19.4440 18.1286
36 9.7220 12.9630 11.3300 15.2170 18.4210 17.0936 15.2170 19.4440 17.4824 16.6670 19.4440 18.1286
38 9.7220 12.9630 11.3300 15.2170 18.4210 17.0936 15.2170 19.4440 17.4824 16.6670 19.4440 18.1286
40 9.7220 12.9630 11.1792 15.2170 18.4210 17.0936 15.2170 19.4440 17.4824 16.6670 19.4440 18.2952
42 9.7220 12.9630 11.1792 15.2170 18.4210 17.2778 15.2170 19.4440 17.4824 16.6670 19.4440 18.2952
44 9.7220 12.9630 11.1792 15.2170 18.4210 17.2778 15.2170 19.4440 17.4824 16.6670 20.5880 18.5240
46 9.7220 12.9630 11.1792 15.2170 18.4210 17.2778 15.2170 19.4440 17.4824 16.6670 19.4440 18.2952
48 9.7220 12.9630 11.1128 15.2170 18.4210 17.2778 15.2170 19.4440 17.4824 16.6670 20.5880 18.5240
50 9.7220 12.9630 11.1128 15.2170 18.4210 17.2778 15.2170 19.4440 17.4824 16.6670 20.5880 18.5240
52 9.7220 12.9630 11.1128 15.2170 18.4210 17.2778 15.2170 19.4440 17.4824 16.6670 20.5880 18.5240
54 9.7220 12.9630 11.1128 15.2170 18.4210 17.2778 15.2170 19.4440 17.4824 16.6670 20.5880 18.5240
56 9.7220 12.9630 11.1128 15.2170 18.4210 17.2778 15.2170 19.4440 17.4824 16.6670 20.5880 18.5240
58 9.7220 12.9630 11.1128 15.2170 18.4210 17.2778 15.2170 19.4440 17.4824 16.6670 20.5880 18.5240
60 9.7220 12.9630 11.1128 15.2170 18.4210 17.2778 15.2170 19.4440 17.4824 16.6670 20.5880 18.5240
62 9.7220 12.9630 11.1128 15.2170 18.4210 17.2778 15.2170 19.4440 17.4824 16.6670 20.5880 18.5240
64 9.7220 12.9630 11.1128 15.2170 18.4210 17.2778 15.2170 19.4440 17.4824 16.6670 20.5880 18.5240
66 9.7220 12.9630 11.1128 15.2170 18.4210 17.2778 15.2170 19.4440 17.4824 16.6670 20.5880 18.5240
68 9.7220 12.9630 11.1128 15.2170 18.4210 17.2778 15.2170 19.4440 17.4824 16.6670 20.5880 18.5240
70 9.7220 12.9630 11.1128 15.2170 18.4210 17.2778 15.2170 19.4440 17.4824 16.6670 20.5880 18.5240 .
Data points of Figure 4.6 (350 nodes, delays = 5)
P# min Hu max Hu Ave flu min Yu max Yu ave Yu min ISH max ISH ave ISH min DSH max DSH ave DSH
2 1.9020 1.9340 1.9212 1.9130 1.9440 1.9294 1.9340 1.9440 1.9400 1.9550 1.9660 1.9616
4 3.3330 3.5710 3.4546 3.3980 3.6080 3.5158 3.5710 3.6840 3.6234 3.6460 3.7630 3.7156
6 4.0700 4.6050 4.3400 4.3750 4.9300 4.6040 4.7950 5.2240 4.9350 5.0000 5.3030 5.1800
8 4.3210 5.2240 4.7350 4.6050 5.3030 4.9832 5.5560 6.0340 5.7624 5.8330 6.3640 6.1054
10 4.7300 5.3850 4.9810 5.0720 5.8330 5.2850 6.2500 6.6040 6.4132 6.6040 7.1430 6.8946
12 5.0720 5.6450 5.2624 5.2240 6.0340 5.4666 6.6040 7.0000 6.8660 7.2920 7.7780 7.5146
14 5.3030 5.6450 5.4042 5.6450 6.1400 5.7812 6.6040 7.1430 7.0352 7.9550 8.3330 8.1432
16 5.3030 5.6450 5.5246 5.7380 6.1400 5.9982 6.4810 7.4470 7.1310 8.5370 8.9740 8.6670
18 5.3850 5.9320 5.7252 5.8330 6.1400 6.0574 6.6040 7.2920 7.0650 8.9740 9.4590 9.1658
20 5.3850 5.9320 5.7442 5.8330 6.3640 6.1674 6.7310 7.4470 7.1810 9.2110 9.7220 9.5146
22 5.7380 6.3640 6.0234 5.8330 6.8630 6.2900 6.7310 7.2920 7.1202 9.7220 10.0000 9.8332
24 5.3030 6.4810 6.0462 6.1400 7.0000 6.5968 6.7310 7.4470 7.1512 10.0000 10.2940 10.1764
26 5.3030 6.3640 6.0228 6.2500 7.0000 6.6680 6.7310 7.4470 7.1810 10.0000 10.6060 10.2976
28 4.9300 6.3640 5.9050 6.2500 6.8630 6.5642 6.7310 7.4470 7.1512 10.0000 10.6060 10.4224
30 4.0700 6.3640 5.3178 6.2500 7.0000 6.6680 6.6040 7.4470 7.1258 10.2940 10.9380 10.6764
32 4.0700 6.2500 4.9670 6.2500 7.1430 6.5968 6.7310 7.4470 7.1512 10.2940 10.9380 10.7428
34 3.8890 4.8610 4.2168 6.2500 7.1430 6.6202 6.7310 7.4470 7.1512 10.6060 11.2900 10.9460
36 3.8460 4.8610 4.1720 6.2500 7.0000 6.6954 6.7310 7.4470 7.1810 10.6060 11.2900 10.9460
38 3.8460 4.8610 4.1634 6.2500 7.1430 6.7240 6.7310 7.4470 7.1810 10.6060 11.2900 10.9460
40 3.8460 4.8610 4.1634 6.2500 7.1430 6.7240 6.7310 7.4470 7.1810 10.6060 11.2900 10.9460
42 3.8460 4.8610 4.1634 6.2500 7.1430 6.7240 6.7310 7.4470 7.1810 10.6060 11.6670 11.0918
44 3.8460 4.7950 4.0988 6.2500 7.1430 6.7240 6.7310 7.4470 7.1810 10.6060 11.6670 11.1672
46 3.4650 4.7950 3.9826 6.2500 7.1430 6.7240 6.7310 7.4470 7.1810 10.6060 11.6670 11.1672
48 3.4650 4.7950 3.9826 6.2500 7.1430 6.7240 6.7310 7.4470 7.1810 10.6060 11.6670 11.1672
50 3.4650 4.7950 3.9826 6.2500 7.1430 6.7240 6.7310 7.4470 7.1810 10.9380 11.6670 11.3000
52 3.4650 4.7950 3.9826 6.2500 7.1430 6.7240 6.7310 7.4470 7.1810 10.9380 11.6670 11.3000
54 3.4650 4.4870 3.9210 6.2500 7.1430 6.7240 6.7310 7.4470 7.1810 10.9380 11.6670 11.3000
56 3.4650 4.4870 3.9210 6.2500 7.1430 6.7240 6.7310 7.4470 7.1810 10.9380 11.6670 11.3000
58 3.4650 4.4870 3.9210 6.2500 7.1430 6.7240 6.7310 7.4470 7.1810 10.9380 11.6670 11.3000
60 3.4650 4.4870 3.9210 6.2500 7.1430 6.7240 6.7310 7.4470 7.1810 10.9380 11.6670 11.3000
62 3.4650 4.4870 3.9210 6.2500 7.1430 6.7240 6.7310 7.4470 7.1810 10.9380 11.6670 11.3000
64 3.4650 4.4870 3.9210 6.2500 7.1430 6.7240 6.7310 7.4470 7.1810 10.9380 11.6670 11.3000
66 3.4650 4.4870 3.9210 6.2500 7.1430 6.7240 6.7310 7.4470 7.1810 10.9380 11.6670 11.3000
68 3.4650 4.4870 3.9210 6.2500 7.1430 6.7240 6.7310 7.4470 7.1810 10.9380 11.6670 11.3000
70 3.4650 4.4870 3.9210 6.2500 7.1430 6.7240 6.7310 7.4470 7.1810 10.9380 11.6670 11.3000 .
Data points of Figure 4.6 (350 nodes, delays = 10)
P # min Hu max Hu Ave Hu min Yu max Yu ave Yu min ISH max ISH ave ISH min DSH max DSH ave DSH
2 1.7070 1.8320 1.7724 1.7410 1.8320 1.7862 1.8040 1.8620 1.8308 1.9020 1.9340 1.9210
4 2.1210 2.6920 2.4662 2.2440 2.8230 2.5910 2.7780 3.0700 2.9698 3.0430 3.3020 3.2076
6 2.3490 2.8000 2.5654 2.4820 2.9660 2.7986 3.2410 3.7630 3.5160 3.7630 4.1180 3.9540
8 2.5180 2.8690 2.7376 2.6520 3.0700 2.9216 3.5710 4.0230 3.7874 4.3210 4.6050 4.5006
10 2.5930 3.0970 2.8620 2.7340 3.0970 2.9680 3.5710 4.3210 4.0302 4.9300 5.1470 5.0452
12 2.8460 3.1250 2.9512 2.8690 3.3980 3.1286 3.8040 4.6050 4.1292 5.3030 5.5560 5.4538
14 2.8690 3.3980 3.1120 3.0970 3.4650 3.2900 3.7630 4.2680 4.0970 5.5560 5.9320 5.8180
16 2.8690 3.3980 3.1544 3.0970 3.7230 3.4028 3.8040 4.3750 4.1868 5.8330 6.2500 6.1014
18 3.0970 3.4650 3.2542 3.1250 3.7630 3.3874 4.0230 4.4870 4.2550 6.0340 6.4810 6.2986
20 3.1250 3.5000 3.3348 3.1250 3.8040 3.5396 3.8040 4.5450 4.2118 6.1400 6.7310 6.5140
22 3.1820 3.5000 3.3590 3.1530 3.8040 3.5452 4.0230 4.5450 4.2666 6.3640 6.8630 6.7114
24 3.1530 3.8460 3.5054 3.4310 3.8460 3.6868 3.8040 4.7300 4.2574 6.4810 7.0000 6.8150
26 2.6320 3.8460 3.2582 3.4310 3.8890 3.6954 3.8460 4.7300 4.2872 6.6040 7.4470 7.0400
28 2.5930 3.8890 3.3358 3.4310 3.8890 3.6332 3.6840 4.7300 4.2658 6.7310 7.4470 7.1524
30 2.2290 3.5350 2.9758 3.4310 3.8890 3.6262 3.9770 4.7950 4.3724 6.7310 7.4470 7.2430
32 1.9770 3.1250 2.7066 3.4310 3.8890 3.6332 4.0700 4.7300 4.3430 6.7310 7.6090 7.2754
34 1.9770 3.1530 2.7098 3.4310 3.8890 3.6954 3.9330 4.4870 4.2670 6.8630 7.7780 7.4302
36 1.9770 2.8460 2.5498 3.4310 3.8890 3.6954 3.9770 4.6050 4.2590 6.8630 7.7780 7.4302
38 1.9770 2.8460 2.5452 3.4310 3.8890 3.6954 3.9330 4.6050 4.2502 6.8630 7.7780 7.4640
40 1.9770 2.6120 2.3482 3.4310 3.8890 3.6954 3.9330 4.5450 4.2382 7.0000 7.7780 7.5224
42 1.9770 2.4480 2.3120 3.4310 3.8890 3.6954 3.9330 4.5450 4.2900 7.0000 7.9550 7.5578
44 1.9770 2.4480 2.3120 3.4310 3.8890 3.6954 3.9330 4.5450 4.2606 7.0000 7.9550 7.5916
46 1.9770 2.4480 2.3120 3.4310 3.8890 3.6954 3.9330 4.7950 4.3222 7.0000 7.9550 7.6270
48 1.9770 2.4480 2.3120 3.4310 3.8890 3.6954 3.9330 4.4870 4.2376 7.0000 7.9550 7.6270
50 1.9770 2.4480 2.3120 3.4310 3.8890 3.6954 3.9330 4.5450 4.2382 7.0000 7.9550 7.6624
52 1.9770 2.4480 2.3120 3.4310 3.8890 3.6954 3.9770 4.7950 4.3430 7.0000 7.9550 7.6948
54 1.9770 2.4480 2.3120 3.4310 3.8890 3.6954 3.9770 4.7950 4.3430 7.0000 7.9550 7.6948
56 1.9770 2.4480 2.3120 3.4310 3.8890 3.6954 3.9770 4.7950 4.3430 7.0000 7.9550 7.6948
58 1.9770 2.4480 2.3120 3.4310 3.8890 3.6954 3.9770 4.7950 4.3430 7.0000 7.9550 7.6948
60 1.9770 2.4480 2.3120 3.4310 3.8890 3.6954 3.9770 4.7950 4.3430 7.0000 7.9550 7.6948
62 1.9770 2.4480 2.3120 3.4310 3.8890 3.6954 3.9770 4.7950 4.3430 7.0000 7.9550 7.6948
64 1.9770 2.4480 2.3120 3.4310 3.8890 3.6954 3.9770 4.7950 4.3430 7.0000 7.9550 7.6948
66 1.9770 2.4480 2.3120 3.4310 3.8890 3.6954 3.9770 4.7950 4.3430 7.0000 7.9550 7.6948
68 1.9770 2.4480 2.3120 3.4310 3.8890 3.6954 3.9770 4.7950 4.3430 7.0000 7.9550 7.6948
70 1.9770 2.4480 2.3120 3.4310 3.8890 3.6954 3.9770 4.7950 4.3430 7.0000 7.9550 7.6948 .
Data points of Figure 4.6 (350 nodes, delays = 20)
P# min Hu max Hu Ave Hu min Yu max Yu ave Yu min ISH max ISH ave ISH min DSH max DSH ave DSH
2 1.0800 1.2540 1.1440 1.0940 1.2680 1.1988 1.4290 1.5770 1.5026 1.6360 1.7240 1.6706
4 1.1820 1.2730 1.2306 1.2150 1.3510 1.3080 1.7330 1.9230 1.8390 2.3490 2.4140 2.3878
6 1.1780 1.3620 1.2846 1.3260 1.4400 1.3650 1.8920 2.1470 2.0018 2.8000 2.9170 2.8558
8 1.1860 1.4770 1.3524 1.3730 1.6130 1.4632 1.8040 2.2010 1.9810 3.0970 3.2710 3.1712
10 1.2680 1.4960 1.4140 1.3730 1.6360 1.5064 1.8320 2.2580 2.1142 3.3650 3.5000 3.4316
12 1.2730 1.5150 1.4312 1.4770 1.7680 1.6032 1.8320 2.3180 2.1356 3.5350 3.6840 3.6314
14 1.3670 1.6430 1.5014 1.4960 1.8040 1.6414 1.9890 2.3180 2.1540 3.6840 3.8460 3.7968
16 1.3780 1.7950 1.5708 1.5150 1.8230 1.6806 2.0000 2.3330 2.1848 3.8890 4.0700 3.9964
18 1.4960 1.8230 1.6242 1.6430 1.8230 1.7176 2.0350 2.3490 2.2294 4.0230 4.2170 4.1190
20 1.5090 1.8320 1.6332 1.5150 1.8320 1.7290 2.0000 2.5360 2.2236 4.1180 4.3750 4.2390
22 1.6590 1.8320 1.7572 1.6590 1.8320 1.7592 2.0110 2.3490 2.1912 4.2170 4.4300 4.3222
24 1.5020 1.8320 1.6968 1.6590 1.8320 1.7644 2.1470 2.5000 2.2992 4.3210 4.4870 4.4092
26 1.1180 1.8420 1.4726 1.8230 1.8420 1.8324 2.1340 2.3490 2.2458 4.3210 4.6050 4.4666
28 1.1180 1.8320 1.4356 1.6590 2.0470 1.8744 2.1600 2.2880 2.2418 4.4300 4.6670 4.5354
30 1.1080 1.8420 1.4194 1.6590 2.0470 1.8792 2.1340 2.4820 2.2724 4.4300 4.7300 4.5834
32 1.0320 1.6590 1.3646 1.8230 2.0470 1.9102 2.1340 2.4820 2.2610 4.4870 4.7300 4.6188
34 0.9830 1.1860 1.1146 1.8230 2.0470 1.9078 2.0830 2.5360 2.2422 4.5450 4.8610 4.6816
36 0.9310 1.1860 1.0920 1.6590 2.0470 1.8798 2.1340 2.3650 2.2292 4.5450 4.8610 4.7070
38 0.9260 1.1860 1.0902 1.6590 2.0470 1.8798 2.1340 2.3650 2.2552 4.5450 4.9300 4.7334
40 0.8790 1.1860 1.0808 1.6590 2.0470 1.8798 2.0350 2.5550 2.2030 4.6050 4.9300 4.7454
42 0.8790 1.1860 1.0660 1.6590 2.0470 1.8798 2.0350 2.3650 2.2014 4.6050 4.9300 4.7454
44 0.8790 1.1860 1.0660 1.6590 2.0470 1.8798 2.1340 2.3650 2.2610 4.6050 5.0000 4.7850
46 0.8790 1.1860 1.0556 1.6590 2.0470 1.8798 2.0590 2.4310 2.2592 4.6050 5.0720 4.8126
48 0.8370 1.1860 1.0472 1.6590 2.0470 1.8798 2.0590 2.3970 2.2552 4.6050 5.0720 4.8256
50 0.8370 1.1860 1.0324 1.6590 2.0470 1.8798 2.0590 2.3970 2.2552 4.6050 5.0720 4.8388
52 0.8370 1.1110 1.0174 1.6590 2.0470 1.8798 2.0590 2.3330 2.2424 4.6050 5.0720 4.8388
54 0.8370 1.1110 1.0174 1.6590 2.0470 1.8798 2.0590 2.3330 2.2424 4.6670 5.0720 4.8512
56 0.8370 1.1110 1.0174 1.6590 2.0470 1.8798 2.0590 2.3330 2.2424 4.6050 5.0720 4.8526
58 0.8370 1.1110 1.0174 1.6590 2.0470 1.8798 2.0590 2.3330 2.2424 4.6670 5.0720 4.8650
60 0.8370 1.1110 1.0174 1.6590 2.0470 1.8798 2.0590 2.3330 2.2424 4.6670 5.0720 4.8650
62 0.8370 1.1110 1.0174 1.6590 2.0470 1.8798 2.0590 2.3330 2.2424 4.6670 5.0720 4.8650
64 0.8370 1.1110 1.0174 1.6590 2.0470 1.8798 2.0590 2.3330 2.2424 4.6050 5.1470 4.8676
66 0.8370 1.1110 1.0174 1.6590 2.0470 1.8798 2.0590 2.3330 2.2424 4.6670 5.1470 4.8800
68 0.8370 1.1110 1.0174 1.6590 2.0470 1.8798 2.0590 2.3330 2.2424 4.6670 5.1470 4.8800
70 0.8370 1.1110 1.0174 1.6590 2.0470 1.8798 2.0590 2.3330 2.2424 4.6670 5.1470 4.8800 .