Bum2413 - Applied Statistics 21112

lUniversiti
$ Malaysia
PAHANG
I-
En
FACULTY OF INDUSTRIAL SCIENCES & TECHNOLOGY

FINAL EXAMINATION
COURSE
APPLIED STATISTICS
COURSE CODE
BUM2413/BSU1023/BCT20531BPF3313/
BKU20321BAM3022
LECTURER
DR ROSLINAZAIRIMAII BINTI ZAKARIA

NOR HAFIZAH BINTI MOSLIM
AZLYNA BINTI SENAWI
MOHD RASHID BIN AB HAMID
NOR AZILA BINTI CHE MUSA
NOOR FADHILAH BINTI AHMAD RADI
DATE
5 JUNE 2012
DURATION
3 HOURS
SESSION/SEMESTER :
SESSION 2011/2012 SEMESTER II
PROGRAMME CODE :
BSB/BSK1BAAIBAE/BCNIBCG1BCS/BPPIBPT/
BPS/BFFIBFMIBKCIBKGIBKB/BMMIBM1/
BMBIBMF/BMAJBEE/BEP/BEC
INSTRUCTIONS TO CANDIDATES
1.
2.
3.
4.
This question paper consists of SEVEN (7) questions. Answer all questions.
All answers to a new question should start on new page.
All the calculations and assumptions must be clearly stated.
Candidates are not allowed to bring any material other than those allowed by
the invigilator into the examination room.
EXAMINATION REQUIREMENTS:
Statistical Table
1.
Scientific Calculator
2.
DO NOT TURN THIS PAGE UNTIL YOU ARE TOLD TO DO SO
This examination paper consists of FIFTEEN(15) printed pages including front page.
BSBIBSBAAIBAEIBCNIBCG/BCS1BPPIBPTIBPS1BFF1BFW
BKCIBKG/BKBIBMMJBMIJBMBIBMFIBMA/BEEIBEPIBEC/
1 1121118UM24131BSU1023BCT20531BPF33131BKU20321BAM3022
CONFIDENTIAL
QUESTION 1
An article in the Journal of Strain Analysis compares Karlsruhe and Lehigh methods for
predicting the shear strengths for steel plate girders. Data of these two methods, are
shown in Table 1.
Girder
Gi
G2
G3
G4
G5
G6
G7
G8
G9
Karlsruhe
Method
1.186
1.151
1.322
1.339
1.200
1.402
1.365
1.537
1.559
Lehigh
Method
1.067
0.992
1.063
1.062
1.062
1.178
1.037
1.086
1.052
Table 1: Shear strengths for steel plate girders of two methods

(a)
Find the mean and standard deviation for the difference of methods in Table 1.
(3 Marks)
(b)
Find a 98% confidence interval for the mean difference in shear strengths between
Karlsruhe and Lehigh methods.
(4 Marks)
(c)
Is there any mean difference between the two methods? By assuming the data is
normally distributed, test the hypothesis at 5% level of significance.
(7 Marks)
CONFIDENTIAL
BSBIBSKAMBAE1BCN/BCGIBCSIBFP1BPT1BPSIBFF'BFW
BKCiBKG/BKBIBMMiBMUBMBIBMF/BMA1BEEfBEPC/
1 112IIIBUM2413IBSU1O23BCT2O53/BPF33131BKU20321BAM3022
QUESTION 2
The variability in the thickness of the oxide layers is a critical characteristic of the
semiconductor wafers. Low variability of the oxide thickness is desirable for subsequent
processing steps. Two different mixtures of gases are being studied to determine whether
one is superior in reducing the variability of the oxide thickness. Twenty one wafers are
etched in each gas. For gas A, the mean of oxide thickness is 10.05 angstroms and
standard deviation is 1.96 angstroms while for gas B the mean is 13.22 angstroms and
standard deviation is 2.13 angstroms.
(a)
Find a 98% confidence interval for population mean for mixture of gas B and give
the interpretation of the parameter estimate.
(5 Marks)
(b)
Determine whether the two mixtures of gases have different in variability of oxide
layers thickness at the 0.1 level of significance.
(8 Marks)
(c)
Can we conclude that the mean mixture of gas A is not more than gas B at 10%
significance level? Your assumption of the condition of population variances is
based on your answer in (b).
(8 Marks)
BSBfBSKIBAAIBAEIBCN/BCGIBCS1BPP1BPTIBPS1BFFIB
BKCIBKG/BKB/BMMIBMIJBMEIBMF/BMAI'BEEJBEP/BEC/
11 12IIIBUM2413IBSU1023BCT2053IBPF3313IBKU2032AM3022
CONFIDENTIAL
QUESTION 3
A researcher wishes to see whether there is any difference in the weight gains of athletes
following one of three special diets. Athletes are randomly assigned to three groups and
placed on three different diets for six weeks. The weight gains (kg) are shown below.
Type of diet
Diet
Diet
Diet
Weight Gains (kg)

2
4
1
Table 2: Weight gains (kg) for different type of diets
(a)
How many treatments involved in the experiment?

(1 Mark)
(b)
Based on the data in Table 2, is there any treatment effect between the type of
diets at 5% level of significance?
(16 Marks)
CONFIDENTIAL
BKC/BKG1BKB/BMM/BMIJBMBIBMFIBMAIBEE/BEPIBEC/
11 12111BUM24131BSU1023BCT20531BPF33131BKU20321BAM3022
QUESTION 4
(a)
What is the different between simple linear regression and multiple linear
regression?
(1 Mark)
(b)
A fitted simple linear regression equation is given by j 12.9 + 2.34x where

n=1O, S,=929.98 and S=389.93.
(i)
Calculate the value of correlation coefficient, r and comment on the

value.
(4 Marks)
(ii)
Complete the ANOVA table below and test the hypothesis that the
linearity of the regression line at a = 0.05 significance level.
Source of
variation
Sum of squares
Degrees of
freedom
Mean of
squares
F lest
Regression
Residual
Total
929.98
Table 3: ANOVA
(12 Marks)
BSB[BSKIBAAIBAEBCNIBCGIBCSIBPPJBPTIBPSIBFFIBFMI
BKCIBKGIBKB/BMMIBMIIBMB/BMF/BMAIBEEIBEP113EC/
11 1211JBUM24131BSU1023BCT2053111PF33131BKU2032113AM3022
CONFIDENTIAL
QUESTION 5
A study was performed to investigate the car performance for car models produced by the
U.S., Japan, Germany and Sweden between 1978 and 1979. The fuel mileage in
kilometers per liter is believed to be related to car weight (in thousand kilograms), driveratio and horsepower. A multiple regression analysis is conducted to determine the
multiple linear regression equation which gives the best fit to the data. The following
show the Excel outputs of the multiple regression analysis for the study.
SUMMARY OUTPUT
Regression Statistics
0.9031
Multiple R
0.8155
RSquare
0.8104
Adjusted R Square
2.9508
Standard Error
38!
Observations
ANOVA
1
36
37
Residual
Total
SignficanceF
8.88939E-15!
1293.5156 1293.5155 159.1610

8.12711
292 5752
1586.0908;
Lower 95%
Upper 95% Lower 95.0% Upper 95.0%

52.1665.7:
-7.0199
447452
-9.7093.
52 6697
-7.0199
447452
-9.7093!
0.0000;
0.0000
249311
-12.5159
19537
0.6630
4871
-8.36.
P-value
tStat
Coefficients Standard Error

Intercept
Weight
MS
SS
df
Regression
SUMMARY OUTPUT:
Multiple
RSquare
Adjusted RSquare
Standard Error
.
Observations
0.4172
0.1741
01511
6.0323
38.
..
.
.
ANOVA
df
Regression
Residual
Total
1:
36
37
276.1009
13099899
15860908
Coefficients . Standard Error

Intercept
8.44
Drive Ratio
5.28
MS
SS
6.0065
1.9158
276.1009
353886
tStat
7.5876:
I P-value
1.4046
2.7546.
0.1687
0.0092
Significance F
0.0092
..
. Upper95% Lower95.0% Upper 95.0%

-3.7453
20.6181!
20.5181
-3.7453
9.1624
1.39171
1.3917
9.1524.
Lower 95%
BSB/BSKBAAIBAEIBCNIBCG/BCS/BPPIBPTIBPS/BFFIBFM/
BKCIBKGIBKB[BMMIBMIJBMBIBMF/BMAIBEEIBEP1BEC/
I 112111BUM2413/BSU1O23BCT2O53IBPF3313IBKU2O32IBAM302Z
CONFIDENTIAL
SUMMARY OUTPUT
0. 8713
0.7591
0.7524
3.2.1576:
38
'MultipleR
RSquare
re
Adjusted RS
Standard Error
Observations
:ANOVA
df
Regression
Residual
Total
2.1270
0.0203
46.71!
-0.22
Intercept
Horsepower
Significance
MS
SS
1204.0530: 1204.0530 113.45<38,

332.0377 10.61211
15860908
1
36
37
1.11961E-12!
La wer 95%
P-value
tStat
21.9587
-10.6517
0.6000
0.0000;
42.3928
-0.2568!
Upper95% Lower95.0% Upper95.0%
42.3928
-0.2568
51.0205
-0.1746
51.0205
-0.1746
SUMMARY OUTPUT
Multiple
R Square
Adjusted Rsquare
Standard Error
Observations
0.9858!
0.8945:
0.6885
2.1864
38:
.
ANOVA
df
Regression
Residual
Total
:
:
2!
35
37
70.92
-10.83!
-4.90!
MS
SS
Coefficients
Intercept
Weight
Drive Ratio
Signficonce F:
1418.7830: 709.3915! 148.4014; 8.04992E-18:

..
167.3079, 4.7802,
1536.0908
.
Standard Error
P-value
tStcit
4.5904: 15.4495
0.7006! -15.4610!
0.9566; -5.1191 ;
0.0000
0.0000
0.0000
Lower 95%
51.6001,
-12.2538
-6.8392.
Upper 95%
80.2380
-9.4093;
-2.9551
tower95.0% . Upper 95.0%
61.6001
-12.2538
_6.8392
80.2330
-9.4093
19551
BSB/BSKJBAAIBAEfBCNIBCG[BCS1BPP1BPTiBPSlBFFFM/
BKC/BKG/BKB/BMMJBMI[BMBIBMF1BMA1BEEEPIBEC/
11 12111BUM24131BSU1023BCT20531BPF33131BKU203218AM3022
CONFIDENTIAL
SUMMARY OUTPUT
0.9G951
Multiple R
0.82721
RSquare
0.8173:
Adjusted R Square
27986
Standard Error
38
Observations
ANOVA
df
SS

1.9240
4894
1.6338
-6.06.
0.0437
-0.07
Intercept
Weight
Horsepower
MS
83.7553
1311 9559 655.9830i

78321
2741249
1586.0908.
2.
35
37
Regression
Residual
Total
P-value
tStat
0 0000
254379
0.0007
-3.7119
0.1338,
-1.5348;:
Significance
14
4.55362&14
Upper95% Lower 95.0% Upper 95.0%

Lower 95016
528479
450361
52.8475
45.0361:
2.7477
-9.3814
-2.7477
-9.3814:
0.0216.
-0.15571
0.0216
-0.1557
SUMMARY OUTPUT
Multiple
RSquare
Adjusted R Square
Standard Error
Observations
08793
07732
07602
3.2051
38.
ANOVA
df
Regression
Residual
Total
Intercept
Drive Ratio
Horsepower
2
35
37
MS
SS
1226.3751 613.1875
359.7157 10.2776
1586.0908
Coefficients iStandard Error

5 7676
54.63
1.2597E
1 86
0.0247
-0.24.
i
SignficanceF:
F
5.29113E-12
59.6626
P-value
tStat
9.4714
00000
0.1495
-1.4737 :
0.0000
-9.6157
Upper 95% Lower 95.0% Upper 95.0%

Lower 95%
663359
42 9182
56 3359
42 9182
07009
44140
0.7009,:
-4.4140,
-0.1871.
-0.2872.
-0.1871
-0.2872
BSBSAAAE1BCN/BCGJBCSIBPPIBPT1BPS/BFF1BFM1
BKCIBKG/BKIBIBMMIBMIJBMBIBMF/BMAIBEEJBEPIBEC/
11 I2IUBUM2413IBSUIO23BCT2O53[BPF3313/BKU2032/BAM3022
CONFIDENTIAL
SUMMARY OUTPUT
Multiple R
Rsquare
Adjusted R Square
Standard Error
Observations
0.9482
0.8991
0.8902
2.1595
38
ANOVA
3
34
3T
70.28
928
472
-0.04:
Significance F
1426.0575 475.3525 1 100.9914i

150.0333 4.7069:
1585.0908
Coefficients : Standard Error
Intercept
Weight
Drive Ratio
Horseoower
MS
SS
df
Regression
Residual
Total
tStat
P4'alue
4.5838: 15.3326i
14255 65133
09595 49234
0.0342; -1.2432 :
0.0000
00000
0.0000 !
5.26413E17
Lower 95%
60.96611
121813
-6.6736;
-0.11211
Upper 9556 Lower 95.0% 1 Upper95.0%
79.5969
63875
-2.7739
60.9561
121813
-6.67361
-0.1121;
Based on the given Excel outputs, answer the following.

(a)
State the response and predictor variables?

(2 Marks)
(b)
Extract the relevant informations and summarize your results in a table.

(7 Marks)
(c)
Hence, determine the best regression equation for predicting the mileage (kilometer
per liter) value.
(2 Marks)
IN
79.5969
63876
27739
0.0270
BSB/BSK/BAA/BAE/BCN/BCG/BCS/BPP/BPT/BPS/BFF/BFNF
BKCIBKGIBKBIBMMJBMUBMBIBMFIBMAIBEEIBEPIBEC/
11 12111BUM24131BSU1023BCT20531BPF33131BKU2032/BAM3022
CONFIDENTIAL
QUESTION 6
The goals scored per match by MyKid football team gave the following results:
Number of goals per match

Number of matches
1
IL_L 18
0
3 1 4
2
29 1 18 1 10
7
1
Table 4: Goals scored in all football matches

Test whether the number of goals per match follows a Poisson distribution at 10%
significance level.
(12 Marks)
QUESTION 7
In a study of the television viewing habits of children, a developmental psychologist
selects a random sample of 300 primary students; 100 boys and 200 girls. Each student is
asked which of the following TV programs they preferred most; Word World, Dibo the
Gift Dragon or Mickey Mouse Clubhouse. Results are shown in Table 5 below.
Boys
Girls
Column total
Viewing Preferences
Mickey Mouse
Dibo the Gift
Word World
Clubhouse
Dragon
20
30
50
70
80
50
90
110
100
Row total
100
200
300
Table 5: TV program viewing preferences

At a = 0.01, is there enough evidence to support the claim that the proportions of
viewing preferences for boys are equal for each of the three TV programs?
(8 Marks)
END OF QUESTION PAPER
10
BSBIBSKIBAAIBAEIBCN/BCG/BCSIBPP1BPTIBPSIBFFIBFMI
BKCIBKG/BKBIBMMIBMIIBMBIBMF1BMAIBEEIBEP1BEC/
11 12111BUM24131BSU1023BCT20531BPF33131BKU2032/BAM3O2Z
CONFIDENTIAL
Appendix - Table Of Formulas

Confidence Intervals, Sample Sizes and Hypothesis Testing
Hypothesis testing for p
Confidence intervals for p
I.
x_z
a/ 7;
2
X - Z
Xt
X+Zai7=J
, X + Za/2
Ztt/T
Ztest
7J
a12vJ_ , X+ta/2v_j=J
where
ttest =
v=n-1
Hypothesis testing for A . -
Confidence intervals for du - 4u2
i2),u0
2
Ztest
JcY22
cT
Foro^o:
Foro-^o:
(i 2)u0
____
Zte
fl
11
L
2
111
fl2
BSBIBSKJBAA/BAEIBCNIBCGIBCSIBPPIBPTIBPSIBFFIBFM/
BKCIBKGIBKBIBMMIBMIJBMBIBMFIBMAIBEEIBEPIBEC/
11 12111BUM24131BSU1023BCT20531BPF3313/BKU2032/BAM3022
CONFIDENTIAL
For o
For o
#
2)u0
F2
Vi
12
2\
where v
n1
n2 )
n2
n)
n1-1
S2 2
n2-1
For o = o:
o- :
Ztest =
For cr
2s\2
n2)
n2 -1
'2
In1
where v =
12 1fl2
ni)
n1 -1
41+
V'i
X2)tai2v /L+L
I (
For o
ttest
)Z
12 sj-_-+_1_
x2)P0
J
For cr = o:
(yi
2)ta12v
test
where v-n1 +n2 -2
where
ri^ + n2
v=n1+n2-2
Pooled estimator, s
/(n-1)s+(n2-1)s
n1+n2-2
Hypothsis Testing for PD
Confidence Interval for PD
d _z/iTd
,,,Fn
ttest
S D
t\
SD)
Za//=s +Za/T)
12
d-,u D
SD/'.I
where v = n -1
BSBIBSKBAAIBAEIBCN/BCG/BCS/BPPJBPT[BPSIBFFIBFMI
BKCIBKG/BKBJBMM/BMIJBMBJBMFIBMAIBEEIBEP/BEC/
11 12111BUM24131BSU1023BCT20531BPF33131BKU20321BAM3022
CONFIDENTIAL
n-I
f'
a/2,n1
Hypothesis testing for 7r
Confidence intervals for ir
[JP(1_P) ,
-_______
Ztest
1-7r0)
p+ai2ifr(1_
]
Confidence intervals for 7t1
Hypothesis testing for it1 - it2
7t2
If ,r
[(p1 _p2)Za12
(1
)+
(p, p2 )r0
0 1 Zt.,
= o,

___________
it1 (1irk ) 92(1-7r2)
Z. =
+ n2
PP (I --PP
F
where pp=
x1 + x 2
fl1 + fl2
Hypothesis testing for a2
Confidence intervals for a2
((n_1 )s 2 (n _l)s2
II X.2
' 2
k
(n-1)s
Z1-a/2,v )
/2,
where
II
Zest
co
v=n-1
Hypothesis testing for
Confidence intervals for

U2
1
2
S2 faizvi,v2
where
' S2 2

fa12
V2 V1 I
)
v1 =nj 1
v2= fl2 1
.test - 2
S2
Sample sizes
n = ['
n=p(1_p)2-J
0 J2
13
a2
BSB/BSK/BAA/BAE1BCNIBCGIBCSIBPPIBPTIBPSIBFFJBFMJ
BKCIBKG/BKBIBMMIBMIJBMB/BMF/BMA[BEE[BEPIBEC/
ii 12111BUM24131BSU1023BCT20531BPF33131BKU20321BAM3022
CONFiDENTIAL
Analysis of Variance (ANOVA)

Two-way ANOVA
One-way ANOVA
SST =
k
SST = >jx
ar
1=1 j1 k=1
1
__x2
H jl
1
___x2
SSA =--x2
br 1=1 '
abr
___x2
SSB= - x.
ar 1
11
fl
iab
ar
1
SSAB=--x2' -------x2 SSASSB

abr
r
1=1 j1
SSE = SST SSA SSBSSAB
SSE = SSTSS(Tr)
Goodness of Fit Test and Contingency Tables

Test using Contingency Tables
Goodness of Fit Test
n,.
1.
E.
2' (0 i E)
L4
1=1
fl
Xtest _
Free distribution DoF; v = ki

Hypothesized distribution DoF;
c(O_E)
ZtSt
V
=k - p 1
Ejj
where v_( r - 1)(c 1)
Simple Linear Regression and Correlation

Simple Linear Regression and Correlation
r-
[Exi)t i J
S,=x1y1 '
i=1
Regression line equation: 5' =
.'jsxxs.y
SXY
[nJ2
______
S=Jx '
1=1
S,=y
fl
1=1
and
+/3 x where /3 =
xx
14
fl = -
1=1
BSBIBSKIBAAIBAE/BCNIBCGIBCSIBPPIBPTIBPSIBFFIBFMJ
BKCIBKGIBKB1BMMJBMI1BMB1BMF/BMA/BEEIBEPEC/
11 12II1BUM2413IBSU1O23BCT2O531BPF33131BKU20321BAM3OZ2
CONFIDENTIAL
Hypothesis Testing for Slope, /3
Hypothesis Testing for Intercept, '80
ttest
s.e(flo)
_i-i
s.e(i)
y2
F
MSR- !+T
FMSR-
Sum of Squares Regression,
Mean Square Residual, MSReS
SSR
MSRes_[
SSR/31SXY
15
n-2

Bum2413 - Applied Statistics 21112

Hochgeladen von

Dokumentinformationen

Copyright

Verfügbare Formate

Dieses Dokument teilen

Dokument teilen oder einbetten

Freigabeoptionen

Stufen Sie dieses Dokument als nützlich ein?

Sind diese Inhalte unangemessen?

Copyright:

Verfügbare Formate

Bum2413 - Applied Statistics 21112

Hochgeladen von

Copyright:

Verfügbare Formate

lUniversiti

FACULTY OF INDUSTRIAL SCIENCES & TECHNOLOGY

DR ROSLINAZAIRIMAII BINTI ZAKARIA

SESSION 2011/2012 SEMESTER II

Table 1: Shear strengths for steel plate girders of two methods

Weight Gains (kg)

Table 2: Weight gains (kg) for different type of diets

How many treatments involved in the experiment?

A fitted simple linear regression equation is given by j 12.9 + 2.34x where

Calculate the value of correlation coefficient, r and comment on the

1293.5156 1293.5155 159.1610

Upper 95% Lower 95.0% Upper 95.0%

Coefficients Standard Error

Coefficients . Standard Error

. Upper95% Lower95.0% Upper 95.0%

Coefficients Standard Error

1204.0530: 1204.0530 113.45<38,

Upper95% Lower95.0% Upper95.0%

1418.7830: 709.3915! 148.4014; 8.04992E-18:

tower95.0% . Upper 95.0%

Coefficients Standard Error

1311 9559 655.9830i

Upper95% Lower 95.0% Upper 95.0%

Coefficients iStandard Error

Upper 95% Lower 95.0% Upper 95.0%

1426.0575 475.3525 1 100.9914i

Coefficients : Standard Error

Upper 9556 Lower 95.0% 1 Upper95.0%

Based on the given Excel outputs, answer the following.

State the response and predictor variables?

Extract the relevant informations and summarize your results in a table.

Number of goals per match

Table 4: Goals scored in all football matches

Table 5: TV program viewing preferences

END OF QUESTION PAPER

Appendix - Table Of Formulas

Confidence intervals for p

Hypothesis testing for A . -

Confidence intervals for du - 4u2

where v-n1 +n2 -2

Confidence Interval for PD

Hypothesis testing for 7r

Confidence intervals for ir

Confidence intervals for 7t1

Hypothesis testing for it1 - it2

Hypothesis testing for a2

Confidence intervals for a2

Hypothesis testing for

Confidence intervals for

Analysis of Variance (ANOVA)

SSAB=--x2' -------x2 SSASSB

SSE = SST SSA SSBSSAB

Goodness of Fit Test and Contingency Tables

Goodness of Fit Test

Free distribution DoF; v = ki

where v_( r - 1)(c 1)

Simple Linear Regression and Correlation

Regression line equation: 5' =

Hypothesis Testing for Slope, /3

Hypothesis Testing for Intercept, '80

Sum of Squares Regression,

Mean Square Residual, MSReS

Das könnte Ihnen auch gefallen