Tabulating machine (H. Hollerith, 1889)
Difference Engine (C. Babbage, 1822)
Harvard Mark I (IBM, H. Aiken, 1944)
Computer Abstractions and Technology-5 Computer Architecture
• ENIAC (Electronic Numerical Integrator and Computer)
• Work started in 1943 at the Moore School of Electrical Engineering, University of Pennsylvania, by John Mauchly and J. Presper Eckert
• Completed in 1946
• About 25 m long and 2.5 m high
• 20 10-digit registers, each 2 feet long
• 18,000 vacuum tubes (electronic switches, invented in 1906)
• 1900 additions per second
• Programmed manually by plugging cables and setting switches
This is why IBM is nicknamed “Big Blue”!
5 IC
• 1958, Jack Kilby: integrated a transistor with resistors and capacitors on a single semiconductor chip, creating a monolithic IC
[Figure: PC and embedded computer. Labels: input devices, output device, network cable]
747-400:
• Capacity: 470 persons
• Range: 4150 miles
• Cruising speed: 610 mph
[Figure: bar charts comparing airplanes, including the BAC/Sud Concorde and the Douglas DC-8-50. Horizontal axes: 0 to 500 and 0 to 10000]
$$\frac{\text{Performance}_X}{\text{Performance}_Y} = \frac{\text{Execution time}_Y}{\text{Execution time}_X} = n$$
• Example: time taken to run a program
  - 10 s on A, 15 s on B
  - Execution Time_B / Execution Time_A = 15 s / 10 s = 1.5
  - So A is 1.5 times as fast as B
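The ratio in the example above can be checked with a few lines of Python. This is a minimal sketch; the function name `relative_performance` is chosen here for illustration and does not come from the slides.

```python
def relative_performance(time_x: float, time_y: float) -> float:
    """Return n such that X is n times as fast as Y.

    Performance is taken as 1 / execution time, so
    Performance_X / Performance_Y = time_y / time_x.
    """
    if time_x <= 0 or time_y <= 0:
        raise ValueError("execution times must be positive")
    return time_y / time_x


# Values from the example: 10 s on A, 15 s on B.
n = relative_performance(time_x=10.0, time_y=15.0)
print(f"A is {n} times as fast as B")
```

Note that the slower machine's time goes in the numerator, because performance is the reciprocal of execution time.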
[Figure: clock timing diagram. Labels: clock period, clock (cycles), data transfer and computation, update state]
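$$\text{Clock Cycles} = \sum_{i=1}^{n} \left( \text{CPI}_i \times \text{Instruction Count}_i \right)$$

$$\text{CPI} = \frac{\text{Clock Cycles}}{\text{Instruction Count}} = \sum_{i=1}^{n} \left( \text{CPI}_i \times \frac{\text{Instruction Count}_i}{\text{Instruction Count}} \right)$$

Here $\text{Instruction Count}_i / \text{Instruction Count}$ is the relative frequency of class $i$.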
Class               A   B   C
CPI for class       1   2   3
IC in sequence 1    2   1   2
IC in sequence 2    4   1   1
• Sequence 1: IC = 5
  - Clock Cycles = 2×1 + 1×2 + 2×3 = 10
  - Avg. CPI = 10/5 = 2.0
• Sequence 2: IC = 6
  - Clock Cycles = 4×1 + 1×2 + 1×3 = 9
  - Avg. CPI = 9/6 = 1.5
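The two sequences can be recomputed directly from the per-class CPI values. The sketch below is illustrative only; the names `CPI_PER_CLASS`, `clock_cycles`, and `average_cpi` are assumptions made for this example.

```python
# CPI per instruction class, taken from the table in the slides.
CPI_PER_CLASS = {"A": 1, "B": 2, "C": 3}


def clock_cycles(counts: dict) -> int:
    """Sum of CPI_i * instruction count_i over all classes."""
    return sum(CPI_PER_CLASS[cls] * n for cls, n in counts.items())


def average_cpi(counts: dict) -> float:
    """Total clock cycles divided by total instruction count."""
    return clock_cycles(counts) / sum(counts.values())


# Instruction counts per class for the two code sequences.
sequence_1 = {"A": 2, "B": 1, "C": 2}
sequence_2 = {"A": 4, "B": 1, "C": 1}

for name, seq in (("Sequence 1", sequence_1), ("Sequence 2", sequence_2)):
    print(f"{name}: IC = {sum(seq.values())}, "
          f"cycles = {clock_cycles(seq)}, avg CPI = {average_cpi(seq)}")
```

Sequence 2 executes more instructions but needs fewer clock cycles, which is why instruction count alone is a poor predictor of execution time.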
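• In CMOS IC technology:
$$\text{Power} = \text{Capacitive load} \times \text{Voltage}^2 \times \text{Frequency}$$
  - Power: ×30
  - Supply voltage: 5 V → 1 V
  - Frequency: ×1000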
$$\sqrt[n]{\prod_{i=1}^{n} \text{Execution time ratio}_i}$$
Name         Description                     IC ×10^9   CPI    Tc (ns)   Exec time (s)   Ref time (s)   SPECratio
perl         Interpreted string processing   2,118      0.75   0.40      637             9,777          15.3
hmmer        Search gene sequence            2,783      0.80   0.40      890             9,330          10.5
libquantum   Quantum computer simulation     1,623      1.61   0.40      1,047           20,720         19.8
omnetpp      Discrete event simulation       587        2.94   0.40      690             6,250          9.1
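The benchmark rows above can be cross-checked with a short script. It is a sketch under one assumption: that the numeric columns are, in order, instruction count (×10^9), CPI, clock period in ns, execution time in s, reference time in s, and SPECratio. The helper names are made up for this example. Because the tabulated CPI values are rounded, the recomputed execution times differ from the table by a few seconds.

```python
import math

# (name, instruction count in units of 1e9, CPI, clock period in ns,
#  reference time in s), copied from the four rows listed above.
rows = [
    ("perl", 2118, 0.75, 0.40, 9777),
    ("hmmer", 2783, 0.80, 0.40, 9330),
    ("libquantum", 1623, 1.61, 0.40, 20720),
    ("omnetpp", 587, 2.94, 0.40, 6250),
]


def execution_time_s(ic_billions: float, cpi: float, tc_ns: float) -> float:
    """Execution time = IC * CPI * Tc; the 1e9 and 1e-9 factors cancel."""
    return ic_billions * cpi * tc_ns


def geometric_mean(values) -> float:
    """n-th root of the product, computed via logarithms."""
    values = list(values)
    return math.exp(sum(math.log(v) for v in values) / len(values))


ratios = []
for name, ic, cpi, tc, ref in rows:
    exec_time = execution_time_s(ic, cpi, tc)
    ratio = ref / exec_time  # SPECratio = reference time / execution time
    ratios.append(ratio)
    print(f"{name:12s} exec = {exec_time:7.1f} s   SPECratio = {ratio:5.1f}")

# Geometric mean over these four rows only, not over a full benchmark suite.
print(f"geometric mean of the four ratios = {geometric_mean(ratios):.2f}")
```

The geometric mean is used rather than the arithmetic mean so that the summary of ratios does not depend on which machine is chosen as the reference.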
$$\text{Overall ssj\_ops per Watt} = \left( \sum_{i=0}^{10} \text{ssj\_ops}_i \right) \Big/ \left( \sum_{i=0}^{10} \text{power}_i \right)$$
$$\text{speedup} = \frac{4+1}{1+1} = 2.5$$
$$T_{\text{improved}} = \frac{T_{\text{affected}}}{\text{improvement factor}} + T_{\text{unaffected}}$$
• Example: multiply accounts for 80 s out of 100 s
  - How much improvement in multiply performance is needed to get 5× overall?
$$20 = \frac{80}{n} + 20$$
  - Can’t be done! The equation would require 80/n = 0.
• Corollary: make the common case fast
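The multiply example can be explored numerically. The following is a minimal sketch of the improved-time formula; `improved_time` and `max_speedup` are hypothetical helper names introduced here.

```python
def improved_time(t_affected: float, t_unaffected: float, factor: float) -> float:
    """T_improved = T_affected / improvement factor + T_unaffected."""
    if factor <= 0:
        raise ValueError("improvement factor must be positive")
    return t_affected / factor + t_unaffected


def max_speedup(t_affected: float, t_unaffected: float) -> float:
    """Upper bound on overall speedup as the improvement factor grows without limit."""
    if t_unaffected == 0:
        return float("inf")
    return (t_affected + t_unaffected) / t_unaffected


# Multiply accounts for 80 s of a 100 s run.
T_AFFECTED, T_UNAFFECTED = 80.0, 20.0
total = T_AFFECTED + T_UNAFFECTED

for factor in (2, 10, 100, 1000):
    t_new = improved_time(T_AFFECTED, T_UNAFFECTED, factor)
    print(f"factor {factor:5d}: time = {t_new:6.2f} s, overall speedup = {total / t_new:.3f}")

# The bound is approached but never reached for any finite factor.
print(f"upper bound on overall speedup: {max_speedup(T_AFFECTED, T_UNAFFECTED)}")
```

Even a 1000× faster multiply leaves the total time above 20 s, so the 5× target stays out of reach.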
The claim that power consumption is low when idle is wrong!
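$$\text{MIPS} = \frac{\text{Instruction count}}{\text{Execution time} \times 10^6} = \frac{\text{Instruction count}}{\dfrac{\text{Instruction count} \times \text{CPI}}{\text{Clock rate}} \times 10^6} = \frac{\text{Clock rate}}{\text{CPI} \times 10^6}$$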
• CPI varies between programs on a given CPU
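A short sketch shows how the MIPS figure follows CPI on a fixed clock. The 2.5 GHz clock rate and the two CPI values below are illustrative assumptions (they match a 0.40 ns clock period and the lowest and highest CPI in the benchmark rows listed earlier), and the function names are made up for this example.

```python
def mips_from_rate(clock_rate_hz: float, cpi: float) -> float:
    """MIPS = clock rate / (CPI * 10^6)."""
    return clock_rate_hz / (cpi * 1e6)


def mips_from_counts(instruction_count: float, execution_time_s: float) -> float:
    """MIPS = instruction count / (execution time * 10^6)."""
    return instruction_count / (execution_time_s * 1e6)


CLOCK_RATE_HZ = 2.5e9  # assumed: corresponds to a 0.40 ns clock period

# Same CPU, two programs with different CPI (illustrative values).
for program, cpi in (("low-CPI program", 0.75), ("high-CPI program", 2.94)):
    print(f"{program}: CPI = {cpi}, MIPS = {mips_from_rate(CLOCK_RATE_HZ, cpi):.0f}")

# Both forms of the formula agree when time = IC * CPI / clock rate.
ic = 1e9
time_s = ic * 2.0 / CLOCK_RATE_HZ
assert abs(mips_from_counts(ic, time_s) - mips_from_rate(CLOCK_RATE_HZ, 2.0)) < 1e-6
```

Because the same processor yields very different MIPS values for different programs, a single MIPS rating says little about a CPU's performance in general.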