Beruflich Dokumente
Kultur Dokumente
PERFORMANCE
By
Sateesh D
154553
PERFORMAN
CE
Which is faster?
Which is bigger?
Which plane moves most number of
passengers in least
time
Performance Metrics
Purchasing perspective
given a collection of machines, which has the
best performance ?
least cost ?
best cost/performance?
Design perspective
faced with design options, which has the
best performance improvement ?
least cost ?
best cost/performance?
Both require
basis for comparison
metric for evaluation
CPU time
Time spent processing a given job
Discounts I/O time, other jobs shares
Relative Performance
Performance = 1/Execution Time
X is n time faster than Y
performance x/performance y = n
Example: time taken to run a program
10s on A, 15s on B
Execution Time B / Execution Time A
= 15s / 10s = 1.5
So A is 1.5 times faster than B
CPU Time
CPU Time CPU Clock Cycles Clock Cycle Time
CPU Clock Cycles
Clock Rate
Performance improved by
Reducing number of clock cycles
Increasing clock rate
Hardware designer must often trade off
clock rate against cycle count
CPI Example
CPU Time
CPU Time
CPU Time
B I 600ps 1.2
CPU Time
I 500ps
A
Pitfall :
Expecting the improvement of
one aspect a
computer to increase
performance by an amount proportional to
Effective CPI
Computing the overall effective CPI is done by
looking at the different types of instructions and
their individual cycle counts and averaging
n
(CPIi x ICi)
CPU time
clock_cycle
= Instruction_count x CPI x
Instruction
_count
Algorithm
CPI
Programming
language
Compiler
ISA
Processor
organization
Technology
clock_cycle
X
X
A Simple Example
Op
Freq
CPIi
Freq x CPIi
ALU
50%
.5
Load
20%
.4
Store
10%
.4
Branch
20%
.2
1.5
Performance evaluation
Agenda
Why we need benchmarks
What makes up a good benchmark
What are some commonly used BAD
benchmarks
Choosing the right benchmark
Benchmarks used by the industry
Why Benchmarks
Want to see which processor is better
quickly. But better in what aspect?
Benchmarks can be blend of
anything
Raw speed
Power consumption
Memory usage
Cost
Repeatable
Relevant
Fair
Have comparable results
Applications Contd
May measure the evaluation kit that
came with the DSP processor; not
just the DSP processor.
Which benchmark to
choose?
Architecture independence
Should perfectly reflect what the DSP
chip will be used for
Should blend in factors such as cost,
power usage, in the proportions you
care about
***It probably doesnt exist***
Benchmark results:
Application profiling:
Conclusions
Benchmarks should be repeatable,
relevant, fair, and readily-comparable
Benchmarks test an application that
probably isnt the same as yours
Take benchmarks with a grain of salt
Questions?