Beruflich Dokumente
Kultur Dokumente
Holly Kersich
Roman Franco
Diana Lucas
Introduction:
This term project allows my group and I to pull together many concepts that we’ve been studying this semester. This includes
organizing and analyzing data, drawing conclusions, using confidence intervals and hypothesis testing. Each person in our class
purchased a 2.17 ounce bag of Original Skittles, and formed into groups of 3-5 people. Each group recorded the proportion of skittle
colors per bag as well as the amount of skittles per bag. These are my group’s findings.
0.058
Our group’s observation of the data is pretty comparable
to what we thought our data would look like. We 0.000
assumed the amount of skittles per color would be fairly
consistent, with a small sample standard deviation. Our
Orange Green Yellow Purple Red
data shows minimal proportion differences between the
Skittle colors overall. The most significant difference in
proportion would be for the yellow Skittles, which Class Proportion
showed a 2.6% difference. Group Proportion
1
Organizing and Displaying Quantitative Data: the Number of Candies per Bag
Mean 58.8
Standard Deviation 1.81
Min 54
Q1 58
Med 59
Q3 60
Max 61
Reflection:
The difference between qualitative and quantitative data is that quantitative data has
numbers that you can do something with, such as counts or measurements, while
qualitative data takes numbers and places them into categories. Pie charts and Pareto
charts are best used for qualitative data because these charts show the separate
categories. For our Skittles project, the color proportions in each bag would be considered
2
charts are best used for qualitative data because these charts show the separate
categories. For our Skittles project, the color proportions in each bag would be considered
our qualitative data and are represented by both a Pie & Pareto chart. Histogram graphs
and box plots would not be appropriate for qualitative data because the numbers given do
not show a greater significance over the other (Ie. red skittles aren’t superior to green
skittles) and therefore these graphs are unable to show the categories.
Histograms and box plots are used for quantitative data, and these graphs are able to
show what we can do with these numbers (Ie. there is a greater amount of red skittles vs.
green skittles). For our Skittles project, the amount of skittles per bag is our quantitive
data.Interpretations for qualitative data would be counts or percentages, while
interpretations for quantitative data would be the distribution of the data.
A confidence interval is a range of values that we are fairly certain that our true value lies
in. The general purpose and meaning of a confidence interval to take a sample of
something and make an inference about the entire population + or - error.
3
Hypothesis Tests:
The conditions that need to be met when doing an interval estimates and hypothesis tests
include: the data must be random, the data must have normal distribution, >10 success
and failure on each, n<10% then population size, and the observations must be
independent. For a hypothesis, they also must have two possible outcomes. One that fails
and one that is a success. Errors could have occurred in the counting of the samples, not
counting all of the samples and errors in the math we did. We could improve our sampling
by doing a higher number of samples.
In conclusion, we can say that there is not sufficient evidence to support the claim that
20% of all Skittles are red. We can also say that there is sufficient evidence to warrant
rejection that the mean number of candies in a bag of skittles is 55. We can say that with
a 99% confidence that the proportion of total of yellow skittles is between (.17 and .228)
We can say that we are 98% confident that the mean number of candies in a bag is
between (58.63, 58.89)