You are on page 1of 2

CHI SQUARE GOODNESS OF FIT CASE STUDY

CASE DESCRIPTION:

A study conducted by a team of researchers wanted to analyze if children belonging to a particular


region exhibited a marked pattern of eye and hair color.
For the above research objective, the eye and hair color of children from two different regions of
Europe were recorded. The eye colors observed were found to range between blue, green and brown
and the hair color was found to vary between; fair, dark, medium, black or red.
The basic aim of the study was to see if the childrens hair color follows a specified distribution with
respect to the two geographical regions or not.
DATA SET:
The name of the data set is eye and hair color data.xls. There are four variables with 27 observations on
each variable. A total of 762 children were observed.
VARIABLE DESCRIPTION:
a) Region: specifies the geographical region in which children with a particular hair and eye color reside.
It is a character variable recording two responses: 1 for Region 1 and 2 for Region 2.
b) Eye color: denotes the eye color of children. It is a character variable with three responses, viz., blue,
green and brown recording the three different eye colors.
c) Hair color: denotes the hair color of the children under observation. It is a character variable, too;
with five responses, viz., fair, dark, medium, black and red.
d) Count: It is a numeric variable denoting the number of children exhibiting each of the fifteen eye and
hair color combinations.

The information provided on these pages remains, unless otherwise stated, the copyright of the respective authors. All layout , design, original graphics, concepts and
other World Wide Web Intellectual Property Rights barring the information mentioned above, remains the property and copyright of OrangeTree Global

PROBLEM OF THE STUDY:

a) Suppose the researchers set a premeditated hypothesis for each region that states:

i) 30% of the observed hair color of the children would pertain to fair and medium hair colors
each
ii) 12% of the observed hair color would pertain to children having red hair.
Iii) 25% of the observed hair color will be dark and
iv) 3% of the observed hair color of children will be black.

Test the hypothesis at 5% level of significance (based on the given data) for each region to
determine if the childrens actual hair color follows such a distribution or not. State the null and
research hypotheses clearly.

The information provided on these pages remains, unless otherwise stated, the copyright of the respective authors. All layout , design, original graphics, concepts and
other World Wide Web Intellectual Property Rights barring the information mentioned above, remains the property and copyright of OrangeTree Global