Beruflich Dokumente
Kultur Dokumente
MWF10:00am11:00am
Statistics1
042716
SkittlesProject
Introduction:
ThisSkittlesProjectisaprojectinwhichwewillapplystatisticstothenumberofcolored
skittlesinasampleof2.7OZbagofskittles.Doingthisprojectwillhelpustofigureouthow
toapplystatisticstofindingthefivenumbersummary,thestandarddeviationanda
differentgraphssuchashistograms,piecharts,andbargraphs.Tobegintheproject,we
willobtaina2.7OZbagoforiginalskittles.Thenopenthebagandseparatetheskittles
accordingtocolor.Thegoalistodeterminetheprobabilityofgettingacolorinabagof
skittles.
DataCollection:
Numberofred
candies
12
Numberof
orangecandies
14
Numberof
yellowcandies
19
Numberof
greencandies
Numberof
purplecandies
14
ClassData:
Numberofred
candies
Numberof
orangecandies
Numberof
yellowcandies
Numberof
greencandies
Numberof
purplecandies
513
438
444
487
483
Whenlookingatthetwocharts,Iamabletonoticethatthemajorityoftheskittlesarewithin
thesamerange.Ihaveonlyoneoutlier.Otherthanthat,thetwographsarewithinthesame
range.Theprobabilitiesofthecolorsbeinginthebackareallaroundtherangeof20%.
Thisroughlygiveseachcolorthesameprobabilityofbeinginthebagofskittles.
MyData:
IQR:7,Min:8,Q1:10,Median:14,Q3:16.5,Max:19
ClassData:
IQR5,Min52,Q157,Median60,Q362,Max67
Betweenmydataandtheclassesdata,itappearsthatthetwotypesofgraphshavevery
closesimilarities.LikeIpredicted,mydatahasonnoticeableoutlier,whereastheclass
datahasnone.Meaningtheirprobabilityisallwithinacertainrange.Becausemysample
wassosmallitiscleartoseethattherewouldbevariationfromthemuchlargersample
sizeoftheclass.Thedistributionseemstohaveabellshapeforboththeclassdataand
mydata.Thesamplesseemtohavenormaldistribution.
Reflection:
Thedifferencebetweencategoricalandquantitativedataisthat,categoricalisdatathat
usescategories,meaningitdoesnothavemathematicalsignificance.Itishaslables.
Quantitativedatadoeshavenumberthathavemathematicalsignificance.Theboxplot
andparetochartsaresensibletorelatetocategoricaldatabecausetheyrepresentthe
colorandarenotrepresentingtheamountorindividualvalues.Thehistogramandpiechart
aresensibletorelatetoquantitativedatabecausetheyaremadebasedontheamountor
numbers.ThePiechartshowswhichcolorhasthemostnumberstoit.Forcategorical,the
numbersofeachcolorwouldmakesensebecausethenumberisrepresentingthecolor.
Forquantitativedata,wewouldusethefivenumarysystem,whichshowshowthenumber
ofskittleshavemathematicalsignificance.
HypothesisTestAndConfidenceIntervalEstimate:
Reflection:
Theconditionsfordoinghypothesistestare,weneedtohaveasamplesizethathasat
leastapopulationsizethatistentimesthesizeofsample(samplesizeisgreaterthanor
equalto1/10thofthepopulation).TheconditionsfordoingaConfidenceIntervalEstimate
are,thesamplemustberandomlyselectedandthesamplingdistributionisapproximately
normal.TheconditionsforboththeHypothesisTestandtheConfidenceIntervalEstimate
weremet.ApossibleerrordoingthisisthatIcouldmessupthenumberofbagsandthe
numberofindividualcandiesasthepopulation.Ihavetokeepbothofthemintheirown
position.Thesamplingmethodcouldbeimprovedbyverifyingallthedatabyhavingthe
classeachbringtheirownbagofskittlesintotheclassandverifythatitisthecorrect
weightandthencountouttheskittlesrighttheir.Thiswouldensurethatourdatawouldbe
moreaccurate.TheconclusionIcametois,the95%ConfidenceIntervalforyellowcandies
is(0.16705,0.20842).The95%confidenceIntervalforthemeanofthenumberofcandies
foreachbagis,(57.945,60.255).The98%ConfidenceIntervalforthestandarddeviation
foreachisbagis(22.164,63.691).Themeannumberofcandiesperbagisgreaterthan
55.