Beruflich Dokumente
Kultur Dokumente
(CA)
Spring 2019
Section V3
Lecture 1: Introduction
Course Page
Moodle (http://lms.umt.edu.pk/)
Search course ( Computer Architecture or Tanzeela
Shakeel)
SST - School of Systems and Technology - undergraduate
courses
Textbook
Textbook
Patterson, David A. and John L. Hennessy. Computer Organization
and Design: The Hardware/Software Interface, Fifth edition
(softcopy available)
Reference book
Heuring, Vincent P. and Harry F. Jordan. Computer Systems Design
and Architecture, 2nd Edition. ISBN: 0-13-048440
William Stallings, Computer Organization and Architecture: Designing for
Performance, Sixth Edition, Pearson Prentice Hall, Pearson Education,
Inc. Upper Saddle River, New Jersey, 2003
Technology
MARS IDE (MIPS)
Xilinx
Grading Policy
Instrument Description Weight
Class Exercises In-class exercises and evaluation 5%
Assignments Assigned during important stages of the course to apply and 10%
practice the learnt concepts
Final Exam Will cover the entire course. At least 75% of the material 40%
would be post mid term.
10 Pipelining
Course Outline (cont.)
Week Contents Assessments
13 Using predictions
Quiz 6
14 MemoryTechnology, Basic of Cache, cache Performance
Processor
datapath (functional units) manipulate the bits
control hardware manages the manipulation
Memory
Registers – 100s of bytes, very fast, on the CPU
cache memory – 1000s of bytes, fast, on the CPU
main memory – millions of bytes, slower, off the CPU
Input / Output
interface to the rest of the world
A typical organization
I/O bus
software
instruction set
hardware
Architecture and Organization
Applications
Computer
Architecture
Operating
Systems
History
(A = F / M)
Below Your Program
Application software
Written in high-level language
System software
Compiler: translates HLL code to machine
code
Operating System: service code
Handling input/output
Managing memory and storage
Scheduling tasks & sharing resources
Hardware
Processor, memory, I/O controllers
Levels of Program Code
High-level language
Level of abstraction closer to
problem domain
Provides for productivity and
portability
Assembly language
Textual representation of
instructions
Hardware representation
Binary digits (bits)
Encoded instructions and data
Levels of Representation
High Level Language temp = v[k];
Program v[k] = v[k+1];
v[k+1] = temp;
Compiler
lw $15, 0($2)
Assembly Language lw $16, 4($2)
Program sw $16, 0($2)
sw $15, 4($2)
Assembler
0000 1001 1100 0110 1010 1111 0101 1000
Machine Language 1010 1111 0101 1000 0000 1001 1100 0110
Program 1100 0110 1010 1111 0101 1000 0000 1001
0101 1000 0000 1001 1100 0110 1010 1111
Machine Interpretation
Output
device
Network
cable
Input Input
device device
Inside the Processor
AMD Barcelona: 4 processor cores
A Safe Place for Data
Volatile main memory
Loses instructions and data when power off
Non-volatile secondary memory
Magnetic disk
Flash memory
Optical disk (CDROM, DVD)
The von Neumann Computer Architecture
51 EADS 11/01/2008
Memory bottleneck
The CPU can add two numbers in less than one nanosecond.
If they are both in registers
Putting a number from memory into a register takes about
100 nanoseconds.
“Stall” – the CPU waits on memory.
52
Intel Haswell
Size/speed
1K / 1 ns
128Kb / 5 ns
1 Mb / 20 ns
4 Gb /125 ns
1 Tb / 1 ms
53
GPU architecture
GPUs have much less space devoted to cache.
GPUs have multiple (100-1000) cores, which are simpler,
slower processing units.
GPU cores all perform the same instructions, but on
different data.
Not all the cores can be active at once. When one stalls,
another one starts up.
54
GPU and CPU: The Differences
ALU ALU
Control
ALU ALU
Cache
DRAM DRAM
CPU GPU
GPU
More transistors devoted to computation, instead of caching
or flow control
Suitable for data-intensive computation
High arithmetic/memory operation ratio
Intel Core i5-3470 Ivy Bridge Processor:
57