Beruflich Dokumente
Kultur Dokumente
Supported by
CPU Computing
CPU performance is the product of many related advances Increased transistor density Increased transistor performance Wider data paths Pipelining Superscalar execution Speculative execution Caching Chip- and system-level integration
Supported by
Supported by
Superscalar Execution
Superscalar and, by extension, out-of-order execution is one solution that has been included on CPUs for a long time
Out-of-order scheduling logic requires a substantial area of the CPU die to maintain dependence information and queues of instructions to deal with dynamic schedules throughout the hardware Speculative instruction execution necessary to expand the window of out-of-order instructions to execute in parallel results in inefficient execution of throwaway work
Supported by
Out-of-order Execution
Supported by
VLIW
VLIW is a heavily compiler-dependent method for increasing instruction-level parallelism in a processor
VLIW moves the dependence analysis work into the compiler
Supported by
Supported by
Multithreading
Supported by
10
Supported by
10
Multi-Core Architectures
A large number of threads interleave execution to keep the device busy, whereas each individual thread takes longer to execute than the theoretical minimum
11
Supported by
11
12
Supported by
12
13
Supported by
13
14
Supported by
14
15
Supported by
15
16
Supported by
16
17
Supported by
17
18
Supported by
18
Intels Nehalem
Four-wide superscalar Out of order, speculative execution Simultaneous multithreading Multiple branch predictors On-die power gating On-die memory controllers Large caches Multiple interprocessor interconnects
19
Supported by
19
20
Supported by
20
Source: http://www.2cpu.com/review.php?id=109
21
Supported by
21
Shown in 4 socket configuraton, 8 GB/sec per link Northbridge/HyperTransport is on die Glueless logic
to DDR, DDR2 memory PCI-X/PCIe bridges (usually implemented in Southbridge)
22
Supported by
22
23
Supported by
23
24
24