Beruflich Dokumente
Kultur Dokumente
1{1
1{3
Denitions
1{4
World War II
{ Hand-
omputed artillery tables: ENIAC (USA)
{ Break Nazi
odes: Bombe, Colossus (UK)
Cold War
{ Nu
lear weapons design
{ Air
raft, submarine, et
. design
{ Intelligen
e gathering
{ Code breaking
1{5
ENIAC, 1943
E
kert and Mau
hly build the ENIAC (Ele
troni
Numeri
al Integrator And
Cal
ulator) | the rst stored-program \ele
troni
omputer"
1{6
Today
{ Consumer produ
ts
{ Pharma
euti
al design
{ Cir
uit simulation
{ ...
1{9
64-node hyper
ube based on Intel 8086 + 8087, 128 KB RAM per node
8 MHz, 10 MFLOPS, $80,000
Programmed in Pas
al or C, with message-passing library.
1{10
Tried to model the human brain: variable-
onne
tivity 12-D hyper
ube
65,536 1-bit pro
essing elements, 4 Kbit (CM-1) or 64 Kbits (CM-2) per
pro
essor. 2,500 MIPS and 2,500 MFLOPS (CM-2)
2-D mesh, up to 16K 1-bit (MP-1) and 32-bit (MP-2) pro
essors
Full-
edged SIMD, with Xnet and global router
ommuni
ation
Programmed in MPL (MasPar Language) and HPF (High-Performan
e
Fortran)
1{12
512-PE linear SIMD array, 8-bit Pro
essing Elements (PE), 20 MHz
64 PEs/
hip, 0.5 m CMOS (HP), 256 bytes SRAM per PE
30 GOPs (integer, 8-bit), 1 W peak power per
hip
1{14
64K nodes (32 32 64) in a 3-D torus, 2 PPC 440 at 700 MHz per node
360 TFLOPS peak (World's Fastest Super
omputer)
Starting at only $1.5 million per ra
k (1024 nodes)
1{15
1{16
IBM
NEC
Cray In
.
Dell
Hewlett-Pa
kard
Sun Mi
rosystems
Sili
on Graphi
s
1{17
Data parallelism
Fun
tional parallelism
1{19
P
Q
T
1{20
= (X + Y) * (X - Y)
= Z - W
= P + Q
Data parallelism
1{21
1{22
1{23
Disadvantages:
1{24
Strategy 1: Example
www.parallelsp. om
1{25
Disadvantages:
Strategy 2: Examples
All most popular parallel programming tools belong to this
lass:
1{27
Lower layer:
{ Single-pro
ess
omputation (
ore of the
omputation)
{ Expressed in any sequential programming language
Uppper layer:
{ Creation and syn
hronization of pro
esses
{ Partitioning of data among pro
esses
1{28
Strategy 3: Example
Advantages:
Disadvantages:
1{30
Strategy 4: Examples
INMOS' O
am language
High-Performan e FORTRAN
1{31
Current status
Low-level approa
h is most popular:
Advantages:
E
ien
y
Portability
Disadvantages:
Harder to program
Harder to debug
1{32
MPI
CPU
CPU
Cache
Cache
Cache
Memory
I/O
devices
Memory
I/O
devices
Memory
I/O
devices
INTERCONNECTION NETWORK
1{33
OpenMP
CPU
CPU
CPU
Cache
Cache
Cache
Cache
BUS
MAIN
MEMORY
1{34
I/O Dev.
Pthreads
CPU
CPU
CPU
Cache
Cache
Cache
Cache
BUS
MAIN
MEMORY
1{35
I/O Dev.
Pra ti e
1{36
Pra ti e
1{37