Beruflich Dokumente
Kultur Dokumente
Administrative Notes
Homework 3 is due on Monday, June 15th
Covers chapters 1-5 in textbook
Last Class
Focused on models for continuous data: using the sample mean as our estimate of population mean Sampling Distributionof the Sample Mean
how does the sample mean change over different samples? Sample 1 of size n Sample 2 of size n Sample 3 of size n Sample 4 of size n Sample 5 of size n Sample 6 of size n . . .
Stat 111 - Lecture 9 - Proportions
Population
Parameter:
x x x x x x
Todays Class
We will now focus on count data: categorical data that takes on only two different values Success (Yi = 1) or Failure (Yi = 0)
Examples
Gender: our class has 83 women and 42 men What is proportion of women in Penn student population? Presidential Election: out of 2000 people sampled, 1150 will vote for McCain in upcoming election What proportion of total population will vote for McCain? Quality Control: Inspection of a sample of 100 microchips from a large shipment shows 10 failures What is proportion of failures in all shipments?
June 10, 2008 Stat 111 - Lecture 9 - Proportions 5
Population
Parameter: p
Binomial Distribution for Sample Count Sample count Y (number of Yi=1 in sample of size n) has a Binomial distribution The binomial distribution has two parameters:
number of trials n and population proportion p P(X=k) = nCk * pk (1-p)(n-k)
Can add up bars of histogram to get any probability we want: eg. P(Y < 4) Different values of n and p have different histograms, but Table C in book has probabilities for many values of n and p
June 10, 2008 Stat 111 - Lecture 9 - Proportions 9
Binomial Table
10
Example: Genetics
If a couple are both carriers of a certain disease, then their children each have probability 0.25 of being born with disease Suppose that the couple has 4 children P(none of their children have the disease)? P(X=0) = 4!/(0!*4!) * .250 * (1-.25)4 P(at least two children have the disease)?
P(Y 2) = P(Y = 2) +P(Y = 3) +P(Y = 4) = 0.2109 +0.0469 +0.0039 (from table) = 0.2617
11
12
Sample Proportions
Usually, we are more interested in a sample proportion = Y/n instead of a sample count P ( < k ) = P( Y < n*k) Example: a worker inspects a sample of 20 microchips from a large shipment with probability of a microchip being faulty is 0.1 What is the probability that our sample proportion of faulty chips is less than 0.05?
P(
June 10, 2008
0.05 x 20
13
We can calculate the mean and variance of a Binomial distribution with parameters n and p: Y = n*p 2 = n*p*(1-p) = (n*p*(1-p))
June 10, 2008 Stat 111 - Lecture 9 - Proportions 14
16
We can instead use the fact that for large samples, the Binomial distribution is closely approximated by the Normal distribution
June 10, 2008 Stat 111 - Lecture 9 - Proportions 17
18
Also works for sample proportion: = Y/n a Normal distribution with mean and variance
follows
19
21
23