CS G553
Lectures 1, 2, 3
Introduction: Motivation, Goals, etc
Introduction
Research in computer (processor) architecture
o The investigation goals vary according to
Target applications
Price of the final equipment
Programmability of the system
The environment in which processors will be deployed
Many others
Introduction
Computing anytime, anywhere (pervasive and ubiquitous)
PDA
PC
Car
Home Networking
Game console
Household
Body
Super Computer
Entertainment
Medicine
Communication
Introduction
... communication also.
Introduction
Explosive growth in
o Computing
o Communication
Information technology
o Grows hand in hand with computing and communication
Performance VS Cost
Computing Paradigms
The Von Neumann Computer
Domain specific processors
Application specific instruction-set processors (ASIPs)
Application specific processors or ASICs
Reconfigurable Processors
[Figure: block diagram of the Von Neumann computer. Labels: Memory (data and instructions); Datapath (registers, data); Control path (instruction register, PC, address register, address).]
Program execution
o Instruction Fetch (IF): The next instruction to be executed is
fetched from the memory
o Decode (D): The instruction is decoded to determine the
operation
o Read operand (R): The operands are read from the memory
o Execute (EX): The required operation is executed on the ALU
o Write result (W): The result of the operation is written back to
the memory
o Each instruction is executed in one cycle of five phases (IF, D, R, EX, W)
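The five phases above can be sketched as a toy interpreter. This is only an illustration: the instruction format `(op, src1, src2, dst)`, the two operations, and the memory layout are hypothetical and do not come from the lecture. The only property it is meant to show is that a single memory holds both instructions and data, and that every instruction passes through IF, D, R, EX and W in sequence.

```python
# Toy Von Neumann machine: one shared memory holds instructions and data.
# The instruction format (op, src1, src2, dst) is hypothetical.

def run(memory, start=0):
    pc = start            # program counter
    trace = []            # phases executed, in order
    while True:
        instr = memory[pc]                 # IF: fetch the next instruction
        trace.append("IF")
        op, a, b, dst = instr              # D: decode to find the operation
        trace.append("D")
        if op == "HALT":
            break
        x, y = memory[a], memory[b]        # R: read operands from memory
        trace.append("R")
        if op == "ADD":                    # EX: execute on the "ALU"
            result = x + y
        elif op == "MUL":
            result = x * y
        else:
            raise ValueError(f"unknown operation {op!r}")
        trace.append("EX")
        memory[dst] = result               # W: write the result to memory
        trace.append("W")
        pc += 1
    return trace

memory = {
    0: ("MUL", 10, 11, 12),   # mem[12] = mem[10] * mem[11]
    1: ("ADD", 12, 10, 13),   # mem[13] = mem[12] + mem[10]
    2: ("HALT", 0, 0, 0),
    10: 3, 11: 4, 12: 0, 13: 0,
}
trace = run(memory)
```

Each non-halting instruction contributes exactly five entries to `trace`, which is the strictly sequential behaviour that the drawbacks on the next slide refer to.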
Drawbacks
o Speed efficiency: Not efficient, due to the sequential program execution (temporal resource sharing).
o Resource efficiency: Only one part of the hardware resources is required for the execution of an instruction. The rest remains idle.
o Memory access: Memories are about 10 times slower than the processor.
Pipelining:
case.
9*tcycle on a Harvard architecture.
Increased throughput
Even with pipelining and other improvements such as caches, execution remains sequential.
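The worked example behind the 9*tcycle figure did not survive on this slide, so the following is only one reading that is consistent with it. It assumes an ideal five-stage pipeline (IF, D, R, EX, W) with no stalls, which is plausible on a Harvard architecture because separate instruction and data memories let a fetch overlap with an operand access. The function names and the choice of five instructions are assumptions made for illustration.

```python
def sequential_cycles(n_instructions, n_stages=5):
    """Cycles needed when each instruction finishes before the next starts."""
    return n_instructions * n_stages

def pipelined_cycles(n_instructions, n_stages=5):
    """Cycles needed on an ideal pipeline with no stalls or hazards.

    The first instruction takes n_stages cycles to fill the pipeline;
    every further instruction completes one cycle after the previous one.
    """
    if n_instructions == 0:
        return 0
    return n_stages + (n_instructions - 1)

seq = sequential_cycles(5)    # 5 instructions, one after another
pipe = pipelined_cycles(5)    # same 5 instructions, overlapped
```

Under these assumptions five instructions need 25 cycles sequentially and 9 cycles when pipelined. Throughput increases, but the instructions still enter and leave the pipeline in program order.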
Flexible
Each algorithm can be implemented on a VN machine only if it is coded
according to the VN rules.
Modification in VN
many MAC-operations
many memory accesses
special support for efficient looping
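The slide title for the three characteristics above is missing. Assuming they describe signal-processing workloads, a FIR filter is a common kernel that shows all three: its inner loop is one multiply-accumulate (MAC) per coefficient, each MAC reads a sample and a coefficient from memory, and the whole computation is a tight nested loop. The function below is a plain software sketch, not a model of any particular DSP.

```python
def fir(samples, coeffs):
    """Direct-form FIR filter: y[n] = sum over k of coeffs[k] * samples[n - k]."""
    out = []
    for n in range(len(samples)):
        acc = 0
        for k, c in enumerate(coeffs):
            if n - k >= 0:
                acc += c * samples[n - k]   # one MAC, two memory reads
        out.append(acc)
    return out

y = fir([1, 2, 3, 4], [1, 1])   # two-tap filter: sum of adjacent samples
```

On a general-purpose Von Neumann machine each MAC costs several instructions (loads, multiply, add, loop bookkeeping), which is the overhead that dedicated MAC units and looping support are meant to remove.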
Example
o ASIP for Image processing
ASIC Example:
c = a*b;
}
else
{
d = a+1;
c = b-1;
}
At least 3 instructions
run-time >= 3*tinstruction = 3*5*tcycle = 15*tcycle
The VN computer needs to be clocked at least 15 times faster than the ASIC
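The estimate above can be checked with a few lines of arithmetic. It reads the figures as three instructions of five cycles each (IF, D, R, EX, W), and it assumes the ASIC evaluates the whole fragment in a single clock cycle. That last figure is not stated on the slide; it is the assumption under which "15 times faster" follows.

```python
CYCLES_PER_INSTRUCTION = 5   # IF, D, R, EX, W
MIN_INSTRUCTIONS = 3         # "at least 3 instructions" on the VN computer
ASIC_CYCLES = 1              # assumption: the ASIC needs one clock cycle

# Lower bound on the VN run-time, in clock cycles.
vn_cycles = MIN_INSTRUCTIONS * CYCLES_PER_INSTRUCTION

# How much faster the VN clock must be to finish in the same wall-clock time.
clock_ratio = vn_cycles / ASIC_CYCLES
```

With these numbers `vn_cycles` is 15 and `clock_ratio` is 15.0. A different ASIC latency would change the ratio proportionally.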
Overall Conclusion
Von Neumann computer:
General purpose, used for any kind of function.
High degree of flexibility.
However, it imposes strong restrictions on program coding and on the execution scheme:
the program has to adapt to the machine.
ASICs are
Tailored for one application.
Very efficient in speed and resource.
Conclusion
[Figure: spectrum from general purpose (max flexibility, min performance) through domain specific to application specific (min flexibility, max performance).]
Performance VS Flexibility
Reconfigurable Computing
The Ideal device should combine:
o the flexibility of the Von Neumann computer
o the efficiency of ASICs
[Figure: Flexibility vs. Efficiency. Axes: flexibility and performance. Plotted: Von Neumann (general-purpose computing), DSP (domain-specific computing), reconfigurable systems (reconfigurable computing), ASIP and ASIC (application-specific computing).]
Spatial-based execution
(reconfigurable computing)
Hardware (ASICs)
Advantages
o very high performance and efficiency
Disadvantages
o not flexible (can't be altered after fabrication)
o expensive
Reconfigurable computing
Advantages
o fills the gap between hardware and software
o much higher performance than software
o higher level of flexibility than hardware
Software-programmed processors
Advantages
o software is very flexible to change
Disadvantages
o performance can suffer if the clock is not fast
o instruction set is fixed by the hardware
Reconfigurable Computing
Ideally, we would like to have the flexibility of the general-purpose processor (GPP) and the performance of the ASIC in the same device.
o Such a device is called reconfigurable hardware, a reconfigurable device, or a reconfigurable processing unit (RPU), in analogy to the Central Processing Unit (CPU).
Reconfigurable Computing
Definition: Reconfigurable computing can be defined as the study of computations involving reconfigurable devices. This includes architectures, algorithms, and applications.
o For a given application, the spatial structure of the device is modified so as to use the best computing approach to speed up that application.
o For a new application, the device structure is modified again to match the new application.
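The idea of modifying the device structure per application can be illustrated with a lookup table (LUT), the basic configurable element of many FPGAs. The lecture does not name LUTs at this point, so this choice, the class name `Lut2`, and the two configurations are assumptions made for illustration. The same "hardware" object computes AND under one configuration and XOR under another, without any change to its evaluation logic.

```python
class Lut2:
    """Two-input lookup table: its configuration is a 4-entry truth table."""

    def __init__(self):
        self.table = [0, 0, 0, 0]

    def configure(self, table):
        # Loading a new truth table plays the role of reconfiguring the device.
        if len(table) != 4 or any(bit not in (0, 1) for bit in table):
            raise ValueError("configuration must be four bits")
        self.table = list(table)

    def evaluate(self, a, b):
        # The inputs select one entry of the table; no instruction is decoded.
        return self.table[(a << 1) | b]

AND_CONFIG = [0, 0, 0, 1]   # entries for inputs 00, 01, 10, 11
XOR_CONFIG = [0, 1, 1, 0]

lut = Lut2()
lut.configure(AND_CONFIG)   # first application
and_out = [lut.evaluate(a, b) for a in (0, 1) for b in (0, 1)]
lut.configure(XOR_CONFIG)   # new application: modify the structure again
xor_out = [lut.evaluate(a, b) for a in (0, 1) for b in (0, 1)]
```

A real device combines many such elements with configurable interconnect, so a configuration describes a whole spatial circuit rather than a single function.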
Rapid prototyping
Testing hardware in real conditions
before fabrication
o Software simulation
Relatively inexpensive
Slow
Accuracy ?
o Hardware emulation
Hardware testing under real operation conditions
Fast
Accurate
Allows several iterations
ITALTEL FLEXBENCH
Manufacturer
functions can be
executed on the fly
during system
debugging
phones, etc.
Built-in Digital Camera
Video phone service
Games
Internet
Navigation system
Emergency
Diagnostics
Different standards and protocols
o Monitoring
o Entertainment
service request
Configuration
Wireless
o Self-organization
Repositioning for better coverage
o Fusion of information
Better coverage of a complete area through self-organization
o Data transmission
Characteristics of a suspect in the covering range
Operation on failure
[Figure: an application mapped onto nodes 1 to 5, shown as a physical topology and a virtual topology (two panels).]
The Microprocessor
Ten years of Moore's-law progress led to the microprocessor
Raised engineers' productivity
Problem-solving became programming
Grew to billions of units/year
Further speed gains are no longer expected, due to the unreliability and higher variability of transistors
Stalled progress in design methods for thirty years
Microprocessor bottlenecks
The End
Questions?