Beruflich Dokumente
Kultur Dokumente
1 | Page
1.9 Support for multithreading and different types of multithreading being supported
It has single core. And it doesnt support multi threading.
1.10 Turbo boost
It doesnt support turbo boost.
1.11Types and hierarchy of caches being used
1.11.1 Register
Register are at the top of memory hierarchy with size of 32 bits in Pentium Pro, it provides very
fast access of data in X8086 CPU.They are very expensive memory unit.
1.11.2 Level 1 Cache
This is the next level of memory access in Intel Pentium pro. Level 1 cache in Pentium pro is a
fixed sized cache with the size of 16 KB, 8KB for instructions and 8KB for data. The cost of
memory access in level 1 cache in Pentium pro is much lower than register. It is 2-Way Set
Associative
1.11.3 Level 2 Cache
After Register access, Level 1 cache, Level 2 cache come in hierarchy of caches in Pentium pro.
That is an integrated level 2 cache that makes cache run at same clock speed as the processor
usually at 0.5ns.
As it has integrated cache memory it is not possible to expand memory without expanding
CPU.Level 2 cache is of 1Mb. It is also very expensive and large chip is required for level 2
cache that is also consider as a drawback and Intel didnt use integrated level 2 cache in next
versions.
3 | Page
Chapter 2
2 MIPS R5000
2.1 Overview
In 1996 the R5000 processor was released by MIPS Computer Systems Inc.
The design goal MIPS had with the R5000 was to provide several advanced features at a
Low cost. The way this was accomplished was that they borrowed a lot of features from
The more advanced R10000 processor, while they still managed to retain a low chip
Complexity. The role of the R5000 was to provide good performance for mid- to low-range
Segments of workstations. Its main strength was its good price/performance ratio in regards to
graphics processing.
2.2 Instruction Set architecture
The R5000 is a 64-bit superscalar RISC processor, but it is also capable of running in 32-bit
mode. It has 32 integer registers and 32 floating-point registers, all of which are 64 bits wide.
The processor can execute both a single precision floating-point Instruction and a dissimilar (i.e.
integer, load/store etc.) instruction simultaneously.
2.3 Type of instruction set
MIPS-IV that uses Register to register architecture.
4 | Page
5 | Page
6 | Page
three 41-bit instruction slots and a template field. Each instruction can take one or two slots
depending on the size needed. The template field contains information about the instructions in
the bundle and it tells if the bundle can be executed in parallel with the next bundle.
3.4 Micro architecture Diagram
Rename
Word-Line Decode
Register Read
Execute
Exception Detect
Write Back
Its clock speed is 800Mhz and and clock cycle time is 0.12ns.
the time, many functional units are under used. Since a lot of modern code is multi-threaded
(particularly server code, and servers will be targetted first) it makes sense to try to interleave
multiple threads simultaneously. The hardware pretends to be multiple virtual processors. This
has to be done carefully. For example, suppose we have a simple machine with one integer unit
and one floating point unit, executing mainly integer code. The limiting factor in such a machine
is likely to be the single integer unit, and unless we can interleave a separate mainly floating
point thread, we are unlikely to see any increase in performance. In fact, we are likely to slow
things down. However, suppose we have many integer units. Inter-instruction dependencies are
now likely to be the limiting factor, and one integer thread is unlikely to keep many units busy
simultaneously
3.10 Turbo boost
It doesnt support turbo boost.
3.11 Types and hierarchy of caches being used
3.11.1 Caches
The cache memory of the Itanium consists of 2 or 4 MB off-die L3 cache, 96 KB L2 cache and
32 KB L1. The L1 cache is divided in half between instructions and data. As can be seen in
Figure 8.1 the latency of the L3 cache is quite high which one of the main bottlenecks of the
CPU is. The other latencies in the memory hierarchy is also quite high and the bandwidth of the
memory bus is a bit poor. All these things were vastly improved in the next generation of the
CPU
10 | P a g e
Chapter 4
4.1 Critical thinking
Here is the comparative analysis between the three processes that are discussed in previous
chapters.
Manufacturer
Version
Intel
Pentium Pro
Intel
Intel Itanium (IA-64
Merced)
MIPS
R5000
ISA Support
X86
Very wide RISC
instructions
RISC
MIPS-4- RISC
No. of Pipelining
stages,
clock speed
No. of transistors
Multiple
Instruction Issue
Benefits/Drawback
s
14
10
200 MHz
5.5 Million
Yes 2 way
800 MHz
221 Million
200 MHz
3.6 Million
Yes 2 way
Very difficult to
increase cache size,
because of its integrated
cache.
Costly at its time
because its target was
server and gaming
machines.
Sometimes Processes
Crashing problem with
optimum usage
Cost Analysis
Memory hierarchy
Multi-Threading
No
Yes
memory slot.
L1, L2 , Internal and
external cache plus it
has secondary cache
support
no
It has been terribly troublesome for Intel to manufacture the Pentium professional at the volumes
and value levels necessary for it to become a thought processor. There are unit 2 main reasons for
this. First, the cache itself is much miniaturized and so rather pricier to provide than the everyday
SRAM chips used on a Pentium motherboard for level a pair of cache. Second, some issues with
the cache don't seem to be found till when it's been mated with the processor and put in in their
shared package; once this happens the total package (including the processor) should be thrown
away, reducing yields and increasing prices. Owing to the issues with its style, Intel has
abandoned the integrated-cache construct and it's unlikely that any future computer processors
can use it within the same means that the Pentium professional will.
The R5000 processor obtains high performance at low cost. New features include the following:
The R5000 processor runs the MIPS IV instruction set which contains additional floating-point
instructions such as the multiply-add (MADD) which accelerates geometry-processing in 3-D
graphics applications.
A dual-issue instruction mechanism allows floating-point load (or store) and floating-point
ALU instructions to be dispatched in the same cycle. Highly pipelined floating point ALU
instructions greatly enhance 3D graphics performance.
An on-chip write buffer enhances bus performance by facilitating pipelined write operations.
Separate integer and floating-point ALUs enhance performance in mixed integer/floating point
applications by allowing long latency instructions such as floating-point divide and square-root
operations to be performed at the same time as integer ALU operations.
13 | P a g e
Large on-chip instruction and data caches, each 32 Kbytes in size, allows many common
applications to run within the primary cache, enhancing performance by reducing the number of
accesses to both secondary cache and main memory.
The multi-processing capability of the R5000 processor enables multiprocessor servers with
additional processors added at low-cost. This system configuration is especially valuable for the
Windows NT midrange server market.
4.5 Conclusion
In a Nutshell, every processor has its own drawbacks and benefits like intel Pentium pro and
MIPS r500 were initially designed of servers machines, but Pentium pro failure was due the not
availability of expandability factor. Its cache is fixed with motherboard and we cannot increase
the size of memory. But we can increase memory in R5000 because it has extra memory slot. All
of the three have reduced instructions set computer and they have less number of pipelining
stalls.
Intel itanium clock speed was very much high as compare to the above two mentioned. It has 800
MHz speed with 221 million transistor and it is also latest as compared to Pentium pro and MIPS
R500 while Intel Pentium pro have 5.5 million and R5000 have 3.6 we cannot increase them.
Pentium pro and r5000 dont have multithreading support and we cannot implement multithreading in it, but Itanium have support of multi-threading From these above mentioned three
processors I would recommend to buy and build a machine of Intel Itanium processor, because of
its high clock speed multi-threading. As I mentioned above Intel Itanium had heating problem
that why it failed and Intel overcome this issue with next processors.
References
1) https://www.pctechguide.com/pentium-cpus/pentium-pro
2) https://en.wikipedia.org/wiki/Pentium_Pro
3) http://www.pcguide.com/ref/cpu/fam/g6PPro-c.html
4) http://www.cpu-world.com/CPUs/Pentium-II/Intel-Pentium%20Pro
%20200%20256%20KB%20-%20KB80521EX200%20256K
%20%28BP80521200%20256K%29.html
5) http://www.cpushack.com/CIC/announce/1996/MIPSR5000.html
6) http://www.sgidepot.co.uk/depot/R5000_Pr_Ov.pdf
7) http://www.princeton.edu/~ajavadia/ELE475.pdf
8) http://home.deib.polimi.it/silvano/FilePDF/ARCMULTIMEDIA/Lesson_2_Branch_Prediction.pdf
9) https://en.wikipedia.org/wiki/R5000
10) http://euler.mat.uson.mx/~havillam/ca/CS323/0708.cs-323014.html
11)
14 | P a g e
15 | P a g e