Beruflich Dokumente
Kultur Dokumente
Block: 4
Problem (question)
Does the Chi- Square analysis of the amount of M&Ms in one bag accept or reject the
null hypothesis? Does the number of observed M&Ms deviate from the expected amount
Data Table 2: The Calculation of the Chi- Square Value and the Observed Number of
M&Ms for All of the Class Data
Color Categories
Calculations:
Brown
Observed (O)
315
Expected (E)
297.18
Difference (O-E)
17.82
Difference Squared
317.55
2
(O-E)
(O-E)2/E
1.07
N/A
Blue
532
548.64
-16.64
276.89
Orange
474
457.2
16.8
282.24
Green
408
365.76
42.24
1784.22
Red
281
297.18
-16.18
38.19
Yellow
276
320.04
-44.04
1939.52
Total
2286
2286
N/A
N/A
0.50
N/A
0.62
N/A
2.14
N/A
0.13
N/A
6.06
N/A
N/A
10.52
Some errors that may have affected the data could have been counting errors
when tallying the number of M&Ms for each color. Additionally, calculation
errors may have occurred when calculating the Chi- Square value. For example,
there may be errors when subtracting the expected value from the observed value.
Mistakes may have occurred when squaring the difference between the observed
and expected data and when dividing that by each expected quantity. Lastly,
totaling all of these values may have been slightly incorrect or not completely
accurate.
324.44g. A fourth bag was 357.49g, but this may have been an error because the bag was
filled with too many M&Ms off of a conveyor line. This random filling of each bag
causes the percentage of the number of each color M&M to vary quite widely.
Additionally, it shows why having a large number of bags and a greater data set better
depicts the intended percentages of each color. Different packages of M&Ms all from the
same run will indeed match the percentages that the Mars Company gives to the public
with little variation, which is due to chance as stated by the null hypothesis. This shows
why the null hypothesis was rejected for the first trial, and the results found were not
highly out of the ordinary. The number of each color of M&M in a single package varies
widely because the percentages are not used to fill each bag. The bags are packed
randomly out of large amount of M&Ms that do support the null hypothesis. This large
group, or run, is produced based upon the percentages that the Mars Company stated
were in each bag. These runs do support the null hypothesis because there is very little
deviation from the expected data. This means that any variation seen in the observed data
for second data set is due to chance. The more data there is the closer it is to the expected
results because each run is created with a certain amount of M&Ms.
In the future one could research if, based upon these results, a smaller bag of
M&Ms would deviate farther from the expected percentages because it is a smaller pool
of data. If multiple runs were calculated would they be even closer to the expected
percentages, as well? Additionally, one could question what would make a large group of
data, such as a run or multiple runs, reject the null hypothesis and if this is even possible.
Some recommendations for the future could be to have a more precise or efficient way of
counting the M&Ms. It may also be more accurate if one person counted all of the
M&Ms to have the most precise number. Lastly, the data that used a calculator should be
double checked to ensure that all of the math was done correctly and accurately.
Sources
https://joshmadison.com/2007/12/02/mms-color-distribution-analysis/