
CPEG655-High-Perf Computing Lab 0

Haoke Xu
UD ID: 702367277

1. Environment
Processor: Intel Core i7-4710MQ CPU @ 2.50 GHz
RAM: 8.00 GB
OS: Windows 7 64-bit SP1
Compiler: Visual Studio 2015 Community
2. Introduction
To gain a deeper understanding of how the CPU and memory
work, we double the value of every element in an array in
different orders. We then measure the time the machine takes
for each order and analyze the results.
3. Experiment design
My design processes the elements by strides: process
element n, then element n+i, then n+2i, then n+3i, and so
on. After one pass over the array is finished stride by
stride, we start again at element n+1, then n+i+1, and
repeat this kind of processing until every element in the
array has been doubled.
It is easy to see that if the stride is 1, we simply
process the elements one by one. If the stride is 2, we do
all the elements at odd positions first and then all the
even ones.
So my design is to measure the processing time using strides
from 1 to 100, recording the time of each run.
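
A minimal code sketch of this design is shown below. It is only an
illustration of the idea: the array size N, the use of double
elements, and clock() for timing are my own assumptions, and the
actual lab code may differ.

/* Strided-doubling experiment: for each stride, double every
   element of the array once, visiting it stride by stride. */
#include <stdio.h>
#include <stdlib.h>
#include <time.h>

#define N (1 << 24)   /* assumed array length, larger than the cache */

int main(void)
{
    double *a = (double *)malloc(N * sizeof(double));
    if (a == NULL)
        return 1;

    for (int stride = 1; stride <= 100; stride++) {
        for (int i = 0; i < N; i++)      /* reset before each run */
            a[i] = 1.0;

        clock_t start = clock();
        /* Pass over n, n+stride, n+2*stride, ..., then restart at
           n+1, n+1+stride, ... until every element is doubled once. */
        for (int offset = 0; offset < stride; offset++)
            for (int i = offset; i < N; i += stride)
                a[i] *= 2.0;
        clock_t end = clock();

        printf("stride %3d: %.3f s\n", stride,
               (double)(end - start) / CLOCKS_PER_SEC);
    }

    free(a);
    return 0;
}

The array is reset before each run so that every stride setting does
the same amount of arithmetic and only the access order differs.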
4. Experiment result
This is part of my results.

To observe it intuitively, I made the diagram below:


[Figure: computing time for each stride setting; x-axis: stride, y-axis: computing time (seconds)]

We can see that the time increases until the stride reaches
about 50, and then it decreases.
It is also obvious that the points are quite dense at the
beginning; as the stride increases, they become more spread out.
5. Analysis
After doing some research, I think there are two main factors
behind this result:
1. The time to read the array from memory and load it into
the cache.

2. The number of iterations. Because the length of the array
is fixed, fewer iterations mean less time wasted on
loop-structure overhead.
When the stride becomes large, each time the program
processes the next element it cannot find that element in the
cache (the cache only holds the elements around the one most
recently accessed; once the stride is large enough to jump
outside the cached range, the cache must reload the elements
from memory). This makes the time increase as the stride
increases.
As for factor 2, since the number of iterations is
(array size / stride), the iteration count decreases as the
stride increases, which reduces the time.
These two factors both affect the result: one makes the time
increase and the other makes it decrease. So why does the
final result increase and then decrease?
An example helps to understand it. When the stride first
begins to increase, the next three strided elements are still
covered by the cache, so the cache must reload only every 4
accesses. As the stride grows, only two strided elements are
covered, and the cache must reload every 3 accesses. As the
stride keeps increasing, the cache must reload more and more
often, which makes the time increase up to a stride of about 50.
As for factor 2 (the decreasing loop-structure cost), its
effect is much smaller and cannot offset the increase in
cache-loading cost.
But when the stride is larger than the range covered by the
cache, the cache must reload on every access; this cost
becomes fixed and does not increase any more. Meanwhile,
factor 2, the loop-structure cost, keeps decreasing because of
the smaller number of iterations. So the time begins to decrease.
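
A rough, hypothetical calculation can make this concrete (the numbers
are my assumptions, not measurements from this machine). Suppose a
cache line holds 8 array elements (64-byte lines, 8-byte doubles) and
the array is far larger than the cache. Then approximately:

stride 1: 1 cache miss per 8 accesses
stride 2: 1 cache miss per 4 accesses
stride 4: 1 cache miss per 2 accesses
stride 8 or larger: 1 cache miss on every access

Once every access misses, the cache-loading cost stops growing, while
factor 2 keeps shrinking the loop overhead, which matches the
rise-then-fall shape of the diagram. (On the real machine, prefetching
and multiple cache levels shift where the turning point appears, but
the mechanism is the same.)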
6. Conclusion
In my deduction, there are two different factors that affect
the time cost: one is the cache-loading time, and the other is
the loop-structure cost of the program itself. Together, their
individual behaviors produce the result plotted in the diagram.
