Beruflich Dokumente
Kultur Dokumente
Test of Significance:
Assess the evidence provided by data in favor of
some claim about the population
Start by setting up a hypothesis
This is a statement about a population parameter
Results of the test are expressed in terms of a
probability
Usually called a p-value
This probability (p-value) measures how well the data
and the hypothesis agree.
3
HYPOTHESES
Null hypothesis
Alternative hypothesis
ALTERNATIVE HYPOTHESIS
Predicts a relationship between two or more
variables
It is called a research hypothesis because its
formulation is based on gathering empirical
evidence and deducing from theory
The statement we hope or suspect is true
What we are trying to prove or the effect we
are hoping to see
Also, called “RESEARCH hypothesis”,
“empirical hypothesis” or substance
hypothesis
Denoted by H1 OR Ha
RESEARCH HYPOTHESIS
Examples
The educational history of high school
freshmen is related to their achievement
Non directional
Affirms the relationship but failed to qualify it
A more favorable organizational climate is
associated with greater efficiency at work
Directional
Suggests directionality between variables
NULL HYPOTHESIS
The null hypothesis assumes that any kind of
difference or significance you see in a set of
data is due to chance.
It is considered null because it denies the
existence of any relationship between or among
variables
Denoted by Ho.
Also called as “statistical hypothesis” because
it is more suitable for the application of
statistical test
Why null hypothesis?
Statistically speaking, we temporarily adopt the
critical stance that our independent variable does
NOT matter. So we have to test it. The
acceptance or non acceptance of it provides
support or no support for the research
hypothesis. The fate of the research hypothesis
depends upon what happens to Ho
Hypotheses flows from
research problem
True State
H0 True Correct Type I Error
Decision
H0 False Type II Error Correct
Decision
15
Significance Level
Basics of Significance
16 3/9/2018 Testing
One-tailed & Two-tailed
test
One-tailed test only considers one end distribution. It is also called as
directional test.
H a : 0
or 0
17
One-tailed & Two-tailed
test
Two-tailed test is one that considers both ends of the distribution.
There is no predictions concerning the direction of the difference in
means.
H a : 0
18
Rejection Region or Critical
Region
It is a area of sampling distribution that lies beyond the test statistic’s
critical value; when the score falls within this region, the null
hypothesis is rejected (Jackson, 2012)
19
P-value
P-value ≡ the probability the test statistic
would take a value as extreme or more extreme
than observed test statistic, when H0 is true
Smaller-and-smaller P-values → stronger-and-
stronger evidence against H0
Conventions for interpretation
P > .10 evidence against H0 not significant
.05 < P ≤ .10 evidence marginally significant
.01 < P ≤ .05 evidence against H0 significant
P ≤ .01 evidence against H0 very significant
Basics of Significance
20 3/9/2018 Testing
DECISION
Decision rule
P ≤ α statistically significant evidence
P > α nonsignificant evidence
For example, if we set α = 0.01, a P-value of
0.0006 is considered significant
MEANING, we will reject the null
hypothesis.so, accept the alternative
hypothesis.
21
Statistical Treatment
Data Parametric
DECISION:
We failed to reject the null hypothesis (p > 0.05)
Conclusion :the
mean number of hours per day using
the computer by students at The University is
the same as the typical student’s usage
TWO SAMPLE
INDEPENDENT TTEST
A teacher drew a sample of students in his Math class & randomly assigned 10
of them to an experimental group and 10 to a control group. The teacher
taught the female group by use of PSI and the Male group with the traditional
technique. At the end of the semester, a standardized mathematics
achievement test was given to both groups. On the basis of these data, should
the teacher conclude that the PSI is more effective than the traditional
method?
Female Male
25 30
23 18
40 20
50 25
48 45
27 20
30 16 27
Hypotheses
28
29
Minitab output…
Two-sample T for experimental vs control
31
Significance of Difference
Between Two Means of
Correlated Samples
32
Example
The following are the pre and post-test scores for ten subjects in an
experiment to determine whether learning has taken place as a result of
some specific experience. Test the significance of the difference between
the two means at the 5% level using a directional test.
Subject Pretest Posttest
no.
1 16 20
2 11 8
3 8 9
4 12 13
5 7 10
6 14 17
7 9 11
8 13 15
9 10 12 33
Hypotheses
H0 :There is no significant difference
between the two tests
Ha:There is no significant difference
between the two tests
Significance level: 0.05
Analysis
34
35
Analysis
increase t
DECISION
The p-value is 0.008 less than 0.05 level of significance. Thus, we can
reject the null hypothesis
36
Conclusion:
Therefore, we have enough evidence to
prove that the post-test mean is
significantly different from the pre-test.
It would imply that exposure of the
students to the specific experience did
increase their achievement.
37
Example
Suppose that a researcher wished to learn if a particular chemical is toxic to a
certain species of beetle. She believes that the chemical might interfere with the
beetle’s reproduction. She obtained beetles and divided them into two groups.
She then fed one group of beetles with the chemical and used the second group
as a control. After 2 weeks, she counted the number of eggs produced by each
beetle in each group. The mean egg count for each group of beetles is below.
Group 1 Group 2
fed chemical not fed chemical
33 35
31 42
34 43
38 41
32
28
38
The researcher believes that the chemical interferes with
beetle reproduction. She suspects that the chemical
reduces egg production. Her hypotheses are:
Ho: There is no significant difference in the number of
eggs in two groups or
Ha: The mean number of eggs in group 1 is less than
the mean number of group 2.
A t-test can be used to test the probability that the two
means do not differ.
This is a 1-tailed test because her hypothesis proposes
that group B will have greater reproduction than group
1. If she had proposed that the two groups would have
different reproduction but was not sure which group
would be greater, then it would be a 2-tailed test.
39
Minitab output
Two-sample T for C1 vs C2
Difference = mu C1 - mu C2
Estimate for difference: -7.58
95% upper bound for difference: -3.47
T-Test of difference = 0 (vs <): T-Value = -3.43 P-Value = 0.005 DF = 8
Both use Pooled StDev = 3.43
40
Decision
The researcher concludes that the mean
of group 1 is significantly less than the
mean for group 2 because the value of
P(0.005) < 0.05. She accepts her
alternative hypothesis that the chemical
reduces egg production because group 1
had significantly less eggs than the
control.
41
Conclusion
Therefore, the chemical interferes with
beetle reproduction. It can reduces egg
production.
42
ANOVA:
Analysis of Variation
ED 202
Lynn Mangin - Remo
What does ANOVA do?
At its simplest (there are extensions) ANOVA
tests the following hypotheses:
H0: The means of all the groups are equal.
R ANOVA Output
Df Sum Sq Mean Sq F value Pr(>F)
treatment 2 34.7 17.4 6.45 0.0063 **
Residuals 22 59.3 2.7
Where’s the Difference?
Once ANOVA indicates that the groups do not all
appear to have the same means, what do we do?