Beruflich Dokumente
Kultur Dokumente
16/2016
MBF- 122
Quantitate Methods in Finance
by
Dr.MawMaw Khin
Professor/Head
DepartmentofStatistics
YangonUniversityofEconomics
16/2016
LearningObjective
Understand why we study statistics
Explain what is meant by descriptive statistics and inferential statistics
Distinguish between a quantitative variable and a qualitative variable.
Describe how a discrete variable is differ from a continuous variable.
Distinguish among the nominal, ordinal, interval, and ratio levels of
measurement.
16/2016
Business
Psychologists
Engineers
Education
Finance
Whyisstatisticsrequiredinsomany
majors?
t reason is that
erical information is everywhere.
newspapers, news magazines,
magazines(People),...
business
magazines,
or
general
e national average weekly income rose 2.5% from $598 in 2007 to $613 in
n increased just over 0.1 % during the same period.
s conducted by various organizations.
16/2016
16/2016
ird reason for taking a statistics course is that the knowledge of sta
ds will help you understand how decisions are made and give you a
standing of how they affect you.
nalysis is helpful.
16/2016
at is Statistics?
16/2016
AStatistic
16/2016
TwoTypesofStatistics:
16/2016
16/2016
Figure1
OverviewofStatistics
Statistics
MakingInferences
fromSamples
Collectingand
DescribingData
Sampling
and
Surveys
Visual Numerical
Displays Summaries
Probability
Models
Estimating
Parameters
Testing
Hypotheses
Regressio
nand
Trend
Quality
Control
16/2016
Categorical
(qualitative)
Numerical
(quantitative)
VerbalLabel
Coded
Vehicletype
Vehicletype
Discrete
X=car,truck,SUV
X=1,2,3
Gender(binary)
Gender(binary)
Brokeneggsina
carton
X=male,female
X=0,1
Continuous
X=0,1,2,3,,12
Patientwaiting
time
X=14.27
minutes
Annualdentalvisits
Customer
satisfaction
X=0,1,2,3,
X=85.2%
CategoricalData
ical data (also called qualitative) have values that are described by words rath
rs.
obile style
= full, midsize, compact, subcompact).
16/2016
16/2016
binaryvariable hasonlytwovalues,indicatingthepresence(1)orabsence(0)of
characteristicofinterest.Forexample,foranindividual:
Employment
1=employed
0=notemployed
Education
1=collegegraduate
0=notcollegegraduate
Maritalstatus
1=currentlymarried
0=notcurrentlymarried
Thecodesarearbitrary.Avariablelikegendercouldbecodedinmanyways:
16/2016
LikeThis
1=female
0=male
OrlikeThis
0=female
1=male
OrlikeThis
1=female
2=male
hecodingitselfhasnonumericalmeaning,sobinaryvariablesa
categoricaldata.
16/2016
16/2016
NumericalData
MeasurementScales
1.NominalMeasurement
16/2016
16/2016
2.OrdinalMeasurement
16/2016
16/2016
3.IntervalMeasurement
xt step up the measurement scale is interval data, which not only is a rank but a
gful intervals between scale points.
es are the Celsius or Fahrenheit scales of temperature.
tervals between numbers represent distances, we can do mathematical operatio
ng an average. But because the zero point of these scales is arbitrary, we can't s
twice as warm as 30F or that is 30F is 50 percent warmer than 20F.
ratios are not meaningful for interval data.
ence of a meaningful zero is a key characteristic of interval data.
16/2016
4.RatioMeasurement
16/2016
16/2016
16/2016
Figure 3:
Measurement
Level
Ratio
Nominal
Vehicletype
X=SUV,car,truck
Binarydata
X=0,1(male,
female)
Weeklypay
Ordinal
X=$457.14
Rateanewsong
X=poor,OK,good,great
*Likert scale
Rateyourdormfood
Very12345Very
Interval
TemperatureF
X=72.3F
Annual
dentalvisits
X=0,1,2,
3,
PoorGood
Likert Scales
16/2016
Statisticsisadifficultsubject.
16/2016
Stronglyagree
Slightlyagree
Neitheragreenordisagree
Slightlydisagree
Stronglydisagree
DescribingData
I.PicturesofData
gram
andleafdisplay
resofonecategoricalvariables:
BarChartsandPieChart
graphsfornumericaldata:
Scatterplot
Sequentialdata:timeseriesplots
orialdatathatcomeinpairs:
Contingencytables
Reportingpercentages
16/2016
Histogram
16/2016
16/2016
1Drawandlabelthexandyaxes.
Thexaxisisalwaysthe
horizontalaxis,andtheyaxisis
alwaystheverticalaxis.
2Representthefrequency onthe
yaxisandtheclass boundaries
onthexaxis.
3Usingthefrequenciesasthe
heights,drawverticalbars for
eachclass.
16/2016
16/2016
Figure 1:
Histogram
18
16
14
12
frequency
10
8
6
4
2
0
classboundary
16/2016
Figure2:
Histogram
50
45
40
35
30
25
20
15
10
5
outlier
FrequencyPolygon
16/2016
Findthemidpoint ofeachclass.
Drawthexandyaxes.Labelthexaxis
iththemidpointofeachclass,andthen
seasuitablescaleontheyaxisforthe
equencies.
Usingthemidpointsforthexvaluesand
hefrequenciesastheyvalues,plotthe
oints.
Connectadjacentpointswithline
gments.Drawalinebacktothexaxis
thebeginning andend ofthegraph,at
esamedistancethatthepreviousand
extmidpointswouldbelocated.
16/2016
16/2016
ple(3)
nstructafrequencydistributionforthese32observationsont
erofcustomerstouseadowntownCitiBank ATMduringthen
on32consecutiveworkdays.Using6classes.
keahistogram,afrequencypolygon.
16/2016
253918333732269 2321341626261832
3019313540273542 2532211526253324
ForATMdata,
16/2016
16/2016
Exercise(1)
ThesedatarepresenttherecordhightemperaturesinF. foreachofthe
50states.(i)Constructagroupedfrequencydistributionforthedatausing
7classes.(ii)Drawahistogram.(iii)Commentonyourresults.
112 100 127 120 134 118 105 110 109 112
110 118 117 116 118 122 114 114 105 109
107 112 114 115 118 117 118 122 106 110
116 108 110 121 113 120 119 111 104 111
120 113 120 117 105 110 118 112 114 114
16/2016
Range=134100=34
Width=34/7=4.9 5
Class limit
Class boundary
frequency
Cumulative
frequency
100-104
99.5-104.5
105-109
104.5-109.5
10
110-114
109.5-114.5
18
28
115-119
114.5-119.5
13
41
120-124
119.5-124.5
48
125-129
124.5-129.5
49
130-134
129.5-134.5
50
16/2016
16/2016
16/2016
Atlanta
Philadelphia
55
70
44
36
40
61
40
38
32
30
63
40
44
34
38
58
40
40
25
30
60
47
52
32
32
54
40
36
30
30
50
53
32
28
31
53
39
36
34
33
52
32
34
32
50
50
38
36
39
32
26
29
Solution
1
Arrange the data for both data sets in order.
2
Construct a stem and leaf plot using the same digits as s
the digits for the leaves for Atlanta on the left side of the stem
igits for the leaves for Philadelphia on the right side, as sh
16/2016
AtlantaPhiladelphia
9 8 6
2
5
8 6 4 4 2 2 2 2 2 1
0 0 0 0 2 2 3 4 6 6 6 8 8 9 9
74 4 0 0
0 0 0 0
5 3 2 2 0 0
0 3 4 8
3 0
16/2016
BarChart
16/2016
16/2016
Figure4: BarCharts
Goodyear
Firestone
Michelin
General
BFGoodrich
Bridgestone
Uniroyal
Continental
Dunlop
40
35
Persent
30
25
20
15
10
5
0
10
15
20
25
30
35
40
PieChart
16/2016
Figure 3:
16/2016
Pie Chart
Retail
Outlet
25%
Another
Student
3%
PieGraph
16/2016
16/2016
Class
A
B
O
AB
Frequency
5
7
9
4
25
Percent
20
28
36
16
100
ution
16/2016
Step1Findthenumberofdegreesforeachclass,
usingtheformula
Degree=
Foreachclass,then,thefollowingresultsare
obtained.
A= .
B= .
O= .
AB= .
=
=
=
=
Findthepercentages.
Grapheachsectionandwriteitsnameandcorrespondingpercentage,asshowni
gFigure.
16/2016
Type B
Type O 28%
36%
16/2016
Customer
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
Quarter
NW
NE
SE
NW
NW
SW
NE
NE
NW
SW
SW
NW
SE
SW
NW
Customer
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Quarter
NW
NE
NW
SE
SW
NW
SW
SE
SW
NW
SW
SE
SE
NW
NE
hefrequenciesandrelativefrequencie
are:
16/2016
Quarter
NE
NW
SE
SW
Frequency
5
11
6
8
30
Relative
frequency
0.167
0.367
0.200
0.267
1.00
egoricaldatacanalsobereportedaspercentagesasshownbe
16/2016
Quarter
NE
NW
SE
SW
Total
Frequency
5
11
6
8
30
Percentage
16.7
36.7
20.0
26.7
100
Sequentialdata:timeseriesplots
16/2016
FuelConsumption
Year
Consumption
9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 41 43 45 47 49 51 53 55 57 59 61 63 65 67 69 71 73 75 77 79 81 83
16/2016
Categoricaldatathatcomeinpairs
16/2016
16/2016
Graduateno.
Gender
EmploymentStatus
MALE
UNEMP
MALE
TEMP
FEMALE
TEMP
FEMALE
PERM
FEMALE
TEMP
MALE
PERM
7
.
MALE
PERM
.
16/2016
Employment
Status
PERM
TEMP
UNEMP
PERM
TEMP
UNEMP
Frequency
170
28
20
57
27
8
16/2016
EmploymentStatus
Permanent
temporary
Unemployed
Male
170
28
20
Female
57
27
Gender
tice that each row corresponds to category of gender and each colum
ry of employment status and the numbers in the cells of the table a
ncies of the corresponding combination of gender and employment statu
example, of the 310 graduates, 170 are male and in permanent emplo
as 27 are female and in temporary employment.
16/2016
Itisusualtoincluderowandcolumntotalsasfollows
Male
Female
Total
Permanent
Temporary
Unemployed
Total
170
57
227
28
27
55
20
8
28
218
92
310
e that the row total gives the frequencies of each category of gende
lumn totals the frequencies of each category of employment statu
16/2016
VisualDescription
FrequencyDistributionandHistograms
ency distribution is a table formed by classifying n data values into k classes cal
opt this terminology from Excel).
limits define the values to be included in each bin. Usually, all the bin widths
16/2016
ConstructingFrequencyDistribution
16/2016
CategoricalFrequencyDistribution
16/2016
16/2016
Example(2)
Twentyfivestudentsweregivenabloodtestto
determinetheirbloodtype.Thedatasetis
A
B
B
AB O
O
O
B
AB B
B
B
O
A
O
A
O
O
O
AB
AB A
O
B
A
Constructafrequencydistributionforthedata.
16/2016
Class
Tally
Frequency
Percent
////
20
//// //
28
//// ////
36
////
16
25
100
AB
Total
For the sample, more students have type O blood than any other type.
16/2016
GroupedFrequencyDistributions
Class boundaries
Lower limit 0.5
Upper limit +0.5
Class width
The class width for a class in a frequency distribution is found
by subtracting the lower (or upper) class limit of one class
form the lower (or upper) class limit of the next class.
Upper class boundary Lower class boundary
16/2016
Class midpoint
Theclassmidpoint Xm isobtainedbyaddingthe
lowerandupperboundariesanddividingby2,or
addingthelowerandupperlimitsanddividingby2:
II.Summarizingdata
1.MeasuresofCentralTendenc
n
mean is the sum of the values, divided by the total numb
s.
16/2016
Rawdata
Thepopulationmean
Thesamplemean
16/2016
GroupedData
Where,
f =classfrequency
xm =classmidpoint
n=totalfrequency
16/2016
Median
median is the halfway point in a data set. Before one can find this poi
must be arranged in order.
the data set is ordered, it is called a data array.
edian either will be a specific value in the data set or will fall betwee
es.
edian is the midpoint of the data array. The symbol for the median is M
16/2016
16/2016
16/2016
Median
Upper 50%
Lower 50%
15
17
21
32
MD = (15+17)/2= 16
16/2016
12 23 23 25 27 34 41
MD = 25
The position of median in the
sorted array is (n+1)/2
xample,
derthethreestudentsscoresonfivequizzes:
scores:20,40,70,75,80
=57,median=70(Tomwhomeanispulleddownbyafewlow
s)
scores:60,65,70,90,95
=76,median=70(Jakewhomeanispulledupbyafewhighsco
s scores:50,65,70,75,90
=70,median=70(shehassymmetricscores)
16/2016
16/2016
GroupedData
where
n=sumoffrequencies
Fm1 =cumulativefrequencyofclassimmediately
precedingthemedianclass
Cm =widthofmedianclass
fm =frequencyofmedianclass
Lm =lowerboundaryofmedianclass