23 views

Uploaded by api-337959039

- Statistics
- Beneke 2015
- IE 27 Ch 9 Slides
- Statistics review
- Hypothesis Testing
- Worksheet1-solns (1)
- Statistical Test Excel File
- Value of the p value
- jane thomas math 1040 term project
- Statistics Saturday
- Eviews_7.0_Manual.pdf
- 6300p2a
- Hypothesis Testing 1
- 2138
- Business Development Plan
- Significance in Statistics
- R Studio Cheat Sheet for Math1041
- Class 04 - Basic Concepts of Statistics and Probability (2 of 2)
- 33230 SP09 Final Exam
- Clodes Class Data Science

You are on page 1of 7

MATH1040

Skittles Project

For this assignment, each student had to record each color of skittles from a small bag. Once all

the students had the numbers of their skittles, they were put together and combined. That was the

first part of this assignment. Below there is a list of the numbers I got from my skittles bag.

The following section shows the overall sample that was gathered by our class. It will be shown

in a Pie Chart and Pareto chart. It will be separated by their color. The sample size for this data is

After looking at the data, the numbers for each color close to each other. With the overall class

they go from Red, Orange, Green, Purple, and Yellow. When comparing this to my own data I

can see that mine isnt in that same order, mine goes from Orange, Purple, Green, Yellow and

Red. Although, mine is different, a lot of the numbers are pretty close to the overall data. There

are definitely a variety of combination in each bag, so each one will look different.

For this section there is going to be a frequency histogram and a boxplot will be displayed and a

5-number summary of the data. We are focusing on the number of candies per bag.

Although the boxplot and the histogram are different graphs, using the same information, it still

follows the same pattern. They are both skewed left. The mean number of candies per bag was

60.3 with a standard deviation of 3.44. To explain the boxplot a little more it shows that the

minimum amount of Skittles per bag was 50, the maximum amount per bag was 65. The first line

you see is the first interquartile range (Q1), it was 59 Skittles, the second line is the median which

was 61 Skittles, and the third line seen is the third interquartile range (Q3) was 62 skittles. The 5

number-summary proves the shape of the distribution for both graphs. Each individual person

may have different numbers per bag, mine in particular had 54, which is close to the minimum.

There is a difference between categorical data and quantitative data. This project consists of both,

when youre counting how many candies per bag there are is an example of quantitative data

because it they are all different. It is used to see the sample size of the data as well. Once you

combine them and split them into groups of color, like this project, then its categorical data.

For the following section, we are going to do a confidence interval. A confidence interval is to

show if the value we are looking for falls within a specific parameter. The higher the percentage

the more confident you will be to have a value fall within the parameters. An example for this

data, is to find the true proportion of yellow candies and be 99% confident. The number of

Yellow Skittles from the total number of skittles is 581. Reminder that the number of all the

Skittles is 3076. Below I will insert an image with the problem I worked out to make a 99%

confidence interval. After working out the numbers both by hand and by calculator we can make

a conclusion. The conclusion would say, we are 99% confident that the true proportion of Yellow

Another example would be constructing a 95% confidence interval for the true mean number of

candies per bag. Below I will insert an image of the work by hand. I also did it on the calculator

as well. The mean number of Skittles for all classes is 615.2 So after conducting the 95%

confidence interval for the true mean number of candies per bag, we can say that we are 95%

confident that the true mean number of Skittles is between 587.11 and 643.29.

For the following section we are going to focus on a hypothesis test. What we want to find with

this is whether our predictions based on the sample we have is true or not. I will insert an image

below showing both hypothesis tests. Using this data, we will make a hypothesis test, using a

significance level of 0.05, to test the claim that 20% of all Skittles of candies are red. To begin

with the hypothesis test we have to write a null hypothesis and an alternative hypothesis. The

null hypothesis will always be equal to the proportion, or mean we are trying to test. The

alternative is what we are trying to find, whether it is more than, less than, or not equal. For this

case we will write the null hypothesis as H0: p=20, and the alternative hypothesis as H1: p 20.

We put the values in the calculator and find that our p-value is .23, and our z value is 1.21. With

this data we compare the p-value to the level of significance, which in this case our p-value of .

23 is more than our level of significance of 0.05, because of this we fail to reject the null

hypothesis. We do not have sufficient evidence to support the claim that 20% of the Skittles are

not Red.

Another example using the data using a significance level to test the claim that the mean number

of candies in bag of Skittles is 55. We will do the same process as above and begin with the null

hypothesis and alternative hypothesis. The null hypothesis would be H0: = 55, and the

alternative hypothesis would be H1: 55. We put the values in the calculator and find that the

p-value is 0.000, and the t value is 11. Comparing the p-value to the level of significance we find

that the p-value is less than the level of significance, because of this we reject the null

hypothesis. There is sufficient evidence to support the claim that the mean number of Skittles per

Before we do a

hypothesis test, we have to meet 3 conditions, first that they are independent, more than or equal

to 10, and are less than 5% of the population. If these conditions are not met we cannot do the

hypothesis test. During this project there are many possible errors that could have been made, but

using a calculator eliminates most of these errors. There couldve been errors when entering the

data of each individuals bag of skittles and when adding up the total number of skittles. The

sampling method couldve improved if each person could compare the data they got to theirs, but

to do it with 51 people is a lot. Doing this project I was able to put the concepts learned in class

to something real and not just a story problem in the book, which makes it interesting.

- StatisticsUploaded byRobert Deligero
- Beneke 2015Uploaded byKiran Saleem
- IE 27 Ch 9 SlidesUploaded byPia Ortiz
- Statistics reviewUploaded byNickolas Morgan
- Hypothesis TestingUploaded bySiti Qomariyah
- Worksheet1-solns (1)Uploaded byameer
- Statistical Test Excel FileUploaded byrb
- Value of the p valueUploaded bybijugeorge1
- jane thomas math 1040 term projectUploaded byapi-299103922
- Statistics SaturdayUploaded byThomas Adam Johnson
- Eviews_7.0_Manual.pdfUploaded byBimo Sakti
- 6300p2aUploaded byGellie Vale Managbanag
- Hypothesis Testing 1Uploaded byNguyen Son Tung
- 2138Uploaded bysushan19
- Business Development PlanUploaded bykidszalor1412
- Significance in StatisticsUploaded byFatima Mussadiq Haidari-Raza
- R Studio Cheat Sheet for Math1041Uploaded byOliver
- Class 04 - Basic Concepts of Statistics and Probability (2 of 2)Uploaded byb_shadid8399
- 33230 SP09 Final ExamUploaded bymoses
- Clodes Class Data ScienceUploaded bySantanu Boral
- 09104005_psdpUploaded byEthanAhamed
- CHI SQUARE TEST.pptUploaded byMiszz Bella
- Coeficientul riscului financiarUploaded byLivia Preda
- 1237_paper_Rossi_Citrus.pdfUploaded bymustafa
- PJC_2008_H1_PrelimsUploaded byChia Pei Xian
- Extent of Parent-Teacher Association Involvement in the Implementation of Universal Basic Education Program in Primary Schools in Northern Senatorial District of Ondo State, NigeriaUploaded byAkinfolarin Akinwale Victor
- Airlines IndustryUploaded bySabin Lal
- dataanalysis-160525103051.pdfUploaded byJauhar Jauharabad
- clodes class data science.txtUploaded bySantanu Boral
- SPSS Tutorials _ SPSS Correlation TestUploaded byViverly Joy De Guzman

- SRT757-Acoustic+Presentation.pdfUploaded bysharmane88
- PDH User GuideUploaded bykprasadkumar87
- Exercise Cht 3 Emv Evpi EolUploaded byReiki Channel Anuj Bhargava
- Topo Base Feature Rule ReferenceUploaded byMESSAOUDI
- KOA2_Jailbreak.pdfUploaded byGhshs
- 163203-5959-IJET-IJENSUploaded byGarudaOzo
- 29257146 Tekla Advanced Functions CopyUploaded byqaadil
- Creative BriefUploaded byExodius
- ProposalUploaded byTesfaye Belaye
- Procedure TextUploaded byanon_443618438
- MDM-udfsUploaded byGalina Ilieva Ilieva
- TechEd2015 BarcelonaUploaded bymirandes69
- SAP R3 UpgradeUploaded byKumarReddy
- Dennett Free WillUploaded byLeo Pacuare
- tshoot ch 4Uploaded byDaniel Corfu
- Action Strings Manual EnglishUploaded byAlberto Guillén
- Study of Sram and Its Low Power TechniquesUploaded byIAEME Publication
- 1983almanaseerphdUploaded byma sungsang
- MSE604 Ch. 14 - Decision Making Considering Multi-AttributesUploaded byEthan Kiu
- Thinking Skills and Problem Solving ModelUploaded byJohn Evans
- YLP SAS Advanced Certification SlidesUploaded bynare999
- Tor_Forensics_On_Windows_OS_Mattia_Epifani.pdfUploaded byCristi Lupu
- d 05422227Uploaded byIOSRjournal
- Katalog_compl_2016_en_2016_03_09Uploaded byÄpriolia Su
- Cartridge CatalogUploaded bybrainandspirit
- System IntegratorsUploaded byGopi Ganapathy
- Great Resources for StartupsUploaded byPen Levit
- Functional Safety HandbookUploaded byVeera Ragavan
- Apache_Spark_Analytics_Made_Simple.pdfUploaded byprerit_t
- HP Proliant ML10 v2 DatasheetUploaded byoon