Beruflich Dokumente
Kultur Dokumente
MATH 1040
Intro to Statistics
A Brief Explanation
a real world example that was unique to us. We were each tasked with
purchasing a bag of Skittles, counting the total, counting the total of each
color, recording both of those, and recording our height. Once the
complete data set had been created we did a statistical analysis on the
Reflection
physical height. Joking aside, this project has taught me that while stats
can be useful they are also vulnerable to misinterpretation and can easily
There was a pretty large range of total candies in the bags that were preset
Does that mean that the company should be held liable for not supplying
Dax Allen
MATH 1040
Intro to Statistics
enough candies to the smallest bag or that they are losing money on the
largest bag? Learning how to find the answer to that question is something
information that I was given in a much more concise and detailed way.
calculations and then take the answers and create a story with them is
daily.
I don’t know if I can say, yet, how this has influenced my view of real
animations for movies, whether it be the whole movie or just some special
effects.
Dax Allen
MATH 1040
Intro to Statistics
One of the biggest things that I will take away from this course is how
driven world people like to through in correct conclusions around all the
time.
Dax Allen
MATH 1040
09/21/2019
Which of the two variables (candy color, number of candies per bag) is qualitative and which is
quantitative and an explanation of how you know?
State what the individuals (single objects being observed) are for measuring each variable.
Column n Mean Std. Median Range Min Max Q1 Q3 IQR Unadj. Mode
dev. std.
dev.
Red 87 12.230 3.259 12 17 6 23 10 15 5 3.240 11
Orange 87 11.322 3.215 11 21 2 23 10 13 3 3.197 12
Yellow 87 12.218 2.503 12 10 8 18 10 14 4 2.489 Multiple
modes
Green 87 11.678 3.197 12 14 5 19 9 14 5 3.179 12
Purple 87 11.218 3.353 11 18 4 22 9 13 4 3.334 9
Total 87 58.667 2.380 59 21 51 72 58 60 2 2.367 59
Height 87 65.802 3.893 65 20 58 78 63 68 5 3.870 65
Dax Allen
MATH 1040
09/21/2019
Lower Upper
Fence Fence Outliers LF Calc UF Calc
Red 2.5 22.5 23 10 - (1.5 * 5) 15 + (1.5 * 5)
Orange 5.5 17.5 2, 5, 18, 19, 23 10 - (1.5 * 3) 13 + (1.5 * 3)
Yellow 4 20 No Outliers 10 - (1.5 * 4) 14 + (1.5 * 4)
Green 1.5 21.5 No Outliers 9 - (1.5 * 5) 14 + (1.5 * 5)
Purple 3 19 20, 22 9 - (1.5 * 4) 13 + (1.5 * 4)
Total 55 63 51, 54, 72 58 - (1.5 * 2) 60 + (1.5 * 2)
Height 55.5 75.5 77, 78 63 - (1.5 * 5) 68 + (1.5 * 5)
There were 61 total candies in my bag. 6 Red, 15 Orange, 9 Yellow, 19 Green, & 12 Purple. My bag isn’t
an outlier, and neither are any of my colors. My height is an outlier at 78 Inches.
I think that it is appropriate to discuss the shape of the graphs for the individual colors, the totals per
bad, and the height of the student. The colors are all mostly symmetric in shape though yellow has a bit
of a skew to the right. The totals per bag is very symmetric while the height of the students is skewed
right.
Dax Allen Project Part 3 Stats 1040
Parameter estimates:
Problem 1: Suppose all of the Skittles in the class data set are
combined into one large bowl and you are going to randomly
select one Skittle.
o What is the probability that you select a green Skittle? (4
points)
1016 / 5104 = 0.1991
o What is the probability that you select a Skittle that is NOT
green? (4 points)
1 0.1991 = 0.8009
o What is the probability that you select a Skittle that is red
OR yellow? (4 points)
Red 1064 / 5104 = 0.2085
Yellow 1063 / 5104 = 0.2083
0.2085 + 0.2083 = 0.4168
o What is the probability that you select a Skittle that is
orange GIVEN that it is a secondary color (secondary
colors are green, orange and purple)? (4 points)
985 (Or) + 1016 (Gr) + 976 (Pr) = 2977
985 / 2977 = 0.3309
Problem 3: Suppose all of the Skittles in the class data set are
combined into one large bowl and you are going to randomly
select ten Skittles with replacement and count how many are
yellow.
o List the requirements of the binomial probability
distribution and show that this meets them, including
identifying the values for n and p. (6 points)
Binary choice i or i n Yellow or not Yellow
With replacement probability remains constant
Fixed number of trials, n = 10
Choo ing a ello and replacing i doe n impac he
ability to choose another yellow, or any other color trials
are independent
n = 10, p = 0.2083
o What is the probability that exactly 4 of the 10 Skittles are
yellow? (4 points)
binomialpfd(10, 0.2083, 4) = 0.0974
o What is the probability that at most 2 of the 10 Skittles are
yellow? (4 points)
binomialcdf(10, 0.2083, 2) = 0.6526
o For samples of size 10, what is the expected value and
standard deviation for the number of yellow skittles that
will be included? (4 points)
10 * 0.2083 = = 2.083 or 2
Sqrt(10 * 0.2083 * (1 0.2083) ) = = 1.284
Dax Allen Project Part 5 MATH 1040
Using values for the class data that you computed in Part 2 of
the project, construct a 99% confidence interval estimate for
the true proportion of yellow candies using the class data as
your sample. Remember that for this computation, n is the
number of CANDIES for the entire class data. Include all your
work, showing the formula used and appropriate values
inserted (neatly written and scanned or typed) or including the
appropriate calculator commands and inputs. (5 points)
o TInterval (
o x-bar = 12.218
o Sx = 2.503
o n = 87
o C-Level = .99 )
o Lower Bound: 11.511
o Upper Bound: 12.925
Using values for the class data that you computed in Part 2 of the project and a 0.1
significance level, test the claim that 20% of all Skittles candies are red. Show all the
steps (neatly written and scanned, typed, or copied from StatCrunch) including:
2. the requirements for performing the hypothesis test, along with discussing
ho he a e me o no me hin : he a e no all me ! In o di c ion,
describe how you selected the bag of skittles you bought (give me some
details, like where you bought it and how you picked the bag to buy), and
identify what type of sampling method it was. (6 points)
When selecting my bag of Skittles, I just walked into my local Harmon s
and chose the first one my hand touched. I would say this was
convenience sampling.
The number of red Skittles per bag also has one outlier but the sample size
is large enough.
5104 (0.2) (1-0.2) = 816.64 ≥ 10 is True
5104 ≤ 0.05(Total Skittles population) is True – Sample values are
independent of each other
x: 1064
n: 5104
Prop: ≠ P0
P-Value = .1306
Using values for the class data that you computed in Part 2 of the project and a 0.05
significance level, test the claim that the mean number of candies in a bag of Skittles
is more than 58. Show all the steps (neatly written and scanned, typed, or copied
from StatCrunch) including:
2. the requirements for performing the hypothesis test, along with discussing
ho he a e me o no me hin : he ame one ha a n me abo e i n
met here! (4 points)
The samples were not selected at random; it was done via convenience
sampling.
There are three outliers in the sample, but the sample size is 87, which is
large enough to offset that. 87 ≥ 30
87 < 0.005 (Skittles population) – Sample values are independent of each
other
µ0: 58
x̄: 58.667
Sx: 2.380
N: 87
µ > µ0
t = 2.61
4. the p-value (3 points)
T Test:
µ0: 58
x̄: 58.667
Sx: 2.380
N: 87
µ > µ0
p = .0053
5. the appropriate decision about the null hypothesis and an appropriate
conclusion (4 points)
At a level of significance, = 0.05, we will reject the null hypothesis as
the P-Value is less than . The P-Value of 0.0053 means that we could
expect for every 1000 samples we would get a mean of 58 in 5 of them.
That is a very rare occurrence.
6. Also describe the Type I and Type II errors for this test. (6 points)
Type I would be rejecting the null when it should not be rejected. Let s
say the Skittles factory needs the mean to be 58 per bag. If our results
were accurate and we got a Type I error, they might re-calibrate the
Skittles factory when they didn t need to.
Type II would be not rejecting the null when they need to. Using the
example above, this could be a serious problem if we needed to average
per bag to be 62. We would be promising a certain amount of Skittles per
bag and not delivering on that promise which could end up in legal issues.