Sie sind auf Seite 1von 22

.Jv Uldpter J U1screte Random Variabl es and Their Probability Distributions 3.

tions 3.3 The Expected Value of a Random Variable or a Function of a Random Variable 91

Such simulation studies are very useful. By repeating some experiments over and
3.9 In order to verify the accuracy of their financial accounts, companies use auditors on a regular
over again, we can generate measurements of discrete random va1iables that possess basis to verify accounting entries. The company's employees make erroneous entries 5% of
frequency distributions very similar to the probability distributions derived in this the time. Suppose that an auditor randomly checks three entries.
chapter, reinforcing the conviction that our models are quite accurate.
a Find the probability distribution for Y, the number of en·ors detected by the auditor.
b Construct a probability histogram for p(y).
c Find the probability that the auditor will detect more than one error.
Exercises
3.1 0 A rental agency, which leases heavy equipment by the day, has found that one expensive piece
3.1 When the health department tested private wells in a county for two impurities commonly found of equipment is leased, on the average, only one day in five. If rental on one day is independent
in drinking water, it found that 20% of the wells had neither impurity, 40% had impurity A, of rental on any other day, find the probability distribution of Y , the number of days between
and 50% had impurity B. (Obviously, some had both impurities.) If a well is randomly chosen a pair of rentals.
from those in the county, find the probability distribution for Y , the number of impurities found 3.11 Persons entering a blood bank are such that 1 in 3 have type o+ blood and 1 in 15 have type o-
in the well. blood. Consider three randomly selected donors for the blood bank. Let X denote the number of
3.2 You and a friend play a game where you each toss a balanced coin. If the upper faces on the donors with type o + blood and Y denote the number with type o- blood. Find the probability
coins are both tails, you win $1; if the faces are both heads, you win $2; if the coins do not match distributions for X and Y. Also find the probability distribution for X + Y, the number of
(one shows a head, the other a tail), you lose $1 (wi n ( -$ 1)). Give the probability distribution donors who have type 0 blood.
for your winnings, Y , on a single play of this game.

3.3 A group of four components is known to contain two defectives. An inspector tests the compo-
nents one at a time until the two defectives are located. Once she locates the two defectives, she
3.3 The Expected Value of a Random Variable
stops testing, but the second defective is tested to ensure accuracy. Let Y denote the number of
the test on which the second defective is found. Find the probability distribution for Y.
or a Function of a Random Variable
3.4 Consider a system of water flowing through valves from A to B. (See the accompanying We have observed that the probability distribution for a random variable is a theoret-
diagram.) Valves I, 2, and 3 operate independently, and each correctly opens on signal with ical model for the empirical distribution of data associated with a real population. If
probability .8. Find the probability distribution for Y, the number of open paths from A to B the model is an accurate representation of nature, the theoretical and empirical dis-
after the signal is given. (Note that Y can take on the values 0, 1, and 2.)
tributions are equivalent. Consequently, as in Chapter 1, we attempt to find the mean
and the variance for a random variable and thereby to acquire numerical descriptive
measures, parameters, for the probability distribution p(y) that are consistent with
those discussed in Chapter I.
A B

DEF I NITION 3.4 Let Y be a discrete random variable with the probability function p(y). Then
2 3
the expected value of Y, E(Y), is defined to be2
3.5 A problem in a test given to small children asks them to match each of three pictures of animals
to the word identifying that animal. If a child assigns the three words at random to the three E(Y) = LYP(y).
y
pictures, find the probability distribution for Y , the number of correct matches.

3.6 Five balls, numbered I, 2, 3, 4, and 5, are placed in an urn . Two balls are randomly selected
from the five, and their numbers noted. Find the probability distribution for the following: If p(y) is an accurate characterization of the population frequency distribution,
a The largest of the two sampled numbers then E (Y) = JJ,, the population mean.
Definition 3.4 is completely consistent with the definition of the mean of a set of
b The sum of the two sampled numbers
measurements that was given in Definition 1.1. For example, consider a discrete
3.7 Each of three balls are randomly placed into one of three bowls. Find the probability distribution
for Y =the number of empty bowls. 2. To be precise, the expected value of a discrete random variable is said to exist if the sum, as given
earlier, is absol utely convergent- that is, if
3.8 A single cell can either die, with probability .I, or split into two cells, with probability .9,
producing a new generation of cells. Each cell in the new generation dies or splits into two cells .L lyJp(y) < oo.
y
independently with the same probabilities as the initial cell. Find the probability distribution
This absolute convergence will hold for all examples in this text and will not be mentioned each lime an
for the number of cells in the next generation.
96 Chapter 3 Discrete Random Variables and Their Probability Distributions Exercises 97

Solution The expected daily cost for A is


THEOREM 3.6 Let Y be a discrete random variable with probability function p(y) and mean
E(Y) = p,; then E[CA(t)] = E[lOt + 30Yn =lOt+ 30E(YD
2
V(Y) = a2 = E[(Y- p,) 2 ] = E(Y 2)- p, 2 . = 10t+30{V(YJ)+[E(YJ)fJ = 10t+30[ . 10t+(.l0t) ]

= l3t+.3t 2 .
Proof
a 2 = E[(Y- p,) 2 ] = E(Y 2 - 2p,Y + p, 2) In this calculation, we used the known values for V(Y 1 ) and E(Y 1) and the fact
= E(Y 2 ) - E(2p,Y) + E(p, 2 ) (by Theorem 3.5). that V(Y 1) = E(Y12) - [E(Y 1)f to obtain that ECYF) = V(Y 1) + [E(Y 1)] 2 =
.lOt+ (.10t) 2 . Similarly,
Noting that p, is a constant and applying Theorems 3.4 and 3.3 to the second
and third terms, respectively, we have E[C 8 (t)] = E[8t + 30Y~] = 8t + 30E(YD
2
a 2 = E (Y 2 ) - 2p,E (Y) + p, 2
. = 8t + 30{ V (Y2) + [E(Y2)fJ = 8t + 30[ .12t + (.12t) ]

But p, = E(Y) and, therefore, = 11.6t + .432t 2.


a2 = E(Y2)- 2p,2 + p,2 = E(Y2)- p,2. Thus, for scenario (a) where t = I 0,

E[CA(lO)] = 160 and E[C 8 (10)] = 159.2,


Theorem 3.6 often greatly reduces the labor in finding the variance of a discrete
random variable. We will demonstrate the usefulness of this result by recomputing which results in the choice of machine B.
the variance of the random variable considered in Example 3.2. For scenario (b), t = 20 and

E[CA(20)] = 380 and E[C 8 (20)] = 404.8,


resulting in the choice of machine A.
EXAMPLE 3.3 Use Theorem 3.6 to find the variance of the random variable Y in Example 3.2. In conclusion, machines of type B are more economical f or short time periods
because of their smaller hourly operating cost. For long time p eriods, however, ma-
Solution The mean p, = 1.75 was found in Example 3.2. Because chines of type A are more economical because they tend to be repaired less frequently .

E(Y 2 ) = Ll
y
p(y) = (0) 2(1/8) + (1) 2(1/4) + (2) 2(3/8) + (3) 2(1/4) = 4, •
Theorem 3.6 yields that The purpose of this section was to introduce the concept of an expected value and
to develop some useful theorems for finding means and variances of random variables
cr 2 = E(Y 2 ) - p, 2 = 4- (1.75) 2 = .9375. • or functions of random variables. In the following sections, we present some specific
types of discrete random variables and provide formulas for their probability distribu-
tions and their means and variances. As you will see, actually deriving some of these
expected values requires skill in the summation of algebraic series and knowledge of
a few tricks. We will illustrate some of these tricks in some of the derivations in the
upcoming sections.
EXAMPLE 3.4 The manager of an industrial plant is planning to buy a new machine of either type A
or type B. If t denotes the number of hours of daily operation, the number of daily
repairs Y 1 required to maintain a machine of type A is a random variable with mean
and variance both equal to .lOt. The number of daily repairs Y 2 for a machine of Exercises
type B is a random variable with mean and variance both equal to .12t. The daily
cost of operating A is CA (t) = lOt+ 30Yf; forB it is Cs(t) = 8t + 30Y~. Assume 3.12 Let Y be a random variable with p(y) given in the accompanying table. Find E(Y), E(l I Y),
that the repairs take negligible time and that each night the machines are tuned so E(Y 2 - 1), and V(Y).
that they operate essentially like new machines at the start of the next day. Which
machine minimizes the expected daily cost if a workday consists of (a) 10 hours and y 11 2 3 4
(b) 20 hours? p(y) .4 .3 .2 .1
Exercises 99

3.13 Refer to the coin-tossing game in Exercise 3.2. Calculate the mean and variance of Y, your N is proportional to the area of a circleR blocks from the firehouse, then N = Crr R 2 , where Cis
winnings on a single play of the game. Note that E(Y) > 0. How much should you pay to play
a constant, rr = 3. 1416 ... , and R, a random variable, is the number of blocks that a fire engine
this game if your net winnings, the difference between the payoff and cost of playing, are to
have mean 0? can move in the specified time interval. For a particular fire company, C = 8, the probability
distribution for R is as shown in the accompanying table, and p(r) = 0 for r :::: 20 and r :::_ 27.
3.14 The maximum patent life for a new drug is 17 years. Subtracting the length oftime required by
r 1 21 22 23 24 25 26
the FDA for testing and approval of the drug provides the actual patent life for the drug-that
is, the length of time that the company has to recover research and development costs and to p(r) I .05 .20 .30 .25 . 15 .05
make a profit. The distribution of the lengths of actual patent lives for new drugs is given below·
Find the expected value of N, the number of homes that the fire department can serve.
Years, y 3 4 5 6 7 8 9 10 ll 12 13
3.22 A single fair die is tossed once. Let Y be the number facing up. Find the expected value and
p(y) .03 .05 .07 . 10 .14 .20 .18 .12 .07 .03 .01 variance of Y.

a Find the mean patent life for a new drug. 3.23 In a gambling game a person draws a single card from an ordinary 52-card playing deck. A
b Find the standard deviation of Y =the length of life of a randomly selected new drug. person is paid $15 for drawing a jack or a queen and $5 for drawing a king or an ace. A person
who draws any other card pays $4. If a person plays this game, what is the expected gain?
c What is the probability that the value of Y falls in the interval /.L ± 2a?
3.15 3.24 Approximately I 0% of the glass bottles coming off a production line have serious flaws in the
Who is the king of late night TV? An Internet survey estimates that, when given a choice glass. If two bottles are randomly selected, find the mean and variance of the number of bottles
between David Letterman and Jay Leno, 52% of the population prefers to watch Jay Lena. that have serious flaws.
Three late night TV watchers are randomly selected and asked which of the two talk show
hosts they prefer. 3.25 Two construction contracts are to be randomly assigned to one or more of three firms: I, II,
and Ill. Any firm may receive both contracts. If each contract will yield a profit of $90,000 for
a Find the probability distribution for Y , the number of viewers in the sample who prefer the firm, find the expected profit for firm I. If firms I and II are actually owned by the same
Leno. individual, what is the owner's expected total profit?
b Construct a probability histogram for p(y).
*3.26 A heavy-equipment salesperson can contact either one or two customers per day with proba-
c What is the probability th•t exactly one of the three viewers prefers Leno? bility l/3 and 2/3, respectively. Each contact will result in either no sale or a $50,000 sale,
d What are the mean and standard deviation for Y? with the probabilities .9 and .I , respectively. Give the probability distribution for daily sales.
e What is the probability that the number of viewers favoring Leno falls within 2 standard Find the mean and standard deviation of the daily sales. 3
deviations of the mean?
3.27 A potential customer for an $85,000 fire insurance policy possesses a home in an area that, ac-
3.16 The secretary in Exercise 2.121 was given n computer passwords and tries the passwords at cording to experience, may sustain a total loss in a given year with probability of .00 I and a 50%
random. Exactly one password will permit access to a computer file. Find the mean and the loss with probability .0 I. Ignoring all other partial losses, what premium should the insurance
variance of Y , the number of trials required to open the file, if unsuccessful passwords are company charge for a yearly policy in order to break even on all $85,000 policies in this area?
eliminated (as in Exercise 2.121 ). 3.28 Refer to Exercise 3.3. If the cost of testing a component is $2 and the cost of repairing a
3.17 Refer to Exercise 3.7. Find the mean and standard deviation for Y =the number of empty defective is $4, find the expected total cost for testing and repairing the lot.
bowls. What is the probability that the value of Y falls within 2 standard deviations of the mean? *3.29 If Y is a discrete random variable that assigns positive probabilities to only the positive integers,
3.18 Refer to Exercise 3.8. What is the mean number of cells in the second generation?
show that

3.19 00
An insurance company issues a one-year $1000 policy insuring against an occurrence A that
historically happens to 2 out of every I 00 owners of the policy. Administrative fees are $15 per
E(Y) = L P(Y :::. k).
i= l
policy and are not part of the company's "profit." How much should the company charge for the
policy if it requires that the expected profit per policy be $50? [Hint: If Cis the premium for the 3.30 Suppose that Y is a discrete random variable with mean /.Land variance a 2 and let X = Y + l.
policy, the company's "profit" is C- 15 if A does not occur and C- 15-1000 if A does occur.]
a Do you expect the mean of X to be larger than, smaller than, or equal to /.L = E(Y)? Why?
3.20 A manufacturing company ships its product in two different sizes of truck trailers. Each ship- b Use Theorems 3.3 and 3.5 to express E(X) = E(Y +I) in terms of /.L = E(Y). Does this
ment is made in a trailer with dimensions 8 feet x I 0 feet x 30 feet or 8 feet x I 0 feet x 40 feet. result agree with your answer to part (a)?
If 30% of its shipments are made by using 30-foot trailers and 70% by using 40-foot trailers, c Recalling that the variance is a measure of spread or dispersion, do you expect the variance
find the mean volume shipped per trailer load. (Assume that the trailers are always full.) of X to be larger than, smaller than, or equal to a 2 = V (Y)? Why?
3.21 The number N of residential homes that a fire company can serve depends on the distance r (in
city blocks) that a fire engine can cover in a specified (fixed) period of time. If we assume that 3. Exercises preceded by an asterisk are optional.
100 Chapter 3 Discrete Random Variables and Their Probability Distributions 3 .4 The Bin omi a l Probability Di stributi on 101

d Use Definition 3.5 and the result in part (b) to show that questioned prior to a local election either favors candidate Jones or does not. In this
section we are concerned with experiments , known as binomial experiments, that
V(X) = E{[(X- E(X)] 2 } = E[(Y - /L) 2 ] = a 2; exhibit the following characteristics .
that is, X = Y + I and Y have equal variances.
3.31 Suppose that Y is a discrete random variable with mean fL and variance a 2 and let W = 2Y.
DEF I NITION 3.6 A binomial experiment possesses the following properties:
a Do you expect the mean of W to be larger than, smaller than, or equal to fL = E(Y)? Why?
b Use Theorem 3.4 to express E(W) = E(2Y) in terms of fL = E(Y) . Does this result agree 1. The experiment consists of a fixed number, n, of identical trials.
with your 11nswer to part (a)? 2. Each trial results in one of two outcomes: success, S, or failure, F.
c Recalling that the variance is a measure of spread or dispersion, do you expect the variance 3. The probability of success on a single trial is equal to some value p and
of W to be larger than, smaller than, or equal to a 2 = V (Y)? Why? remains the same from trial to trial. The probability of a failure is equal to
d Use Definition 3.5 and the result in part (b) to show that q = (1 - p).
2 4 . The trials are independent.
V(W) = E{[W- E(W)] 2 } = E[4(Y - fL) ] = 4a 2 ;
5. The random variable of interest is Y , the number of successes observed
that is, W = 2Y has variance four times that of Y. during the n trials.
3.32 Suppose that Y is a discrete random variable with mean fL and variance a 2 and let U = Y I I0.
a Do you expect the mean of U to be larger than, smaller than, or equal to fL = E(Y)? Why?
Determining whether a particular experiment is a binomial experiment requires
b Use Theorem 3.4 to express E(U) = E(Y/ 10) in terms of fL = E(Y) . Does this result
examining the experiment for each of the characteristics just listed. Notice that the
agree with your answer to part (a)?
random variable of interest is the number of successes observed in the n trials. It is
c Recalling that the variance is a measure of spread or dispersion, do you expect the variance
important to realize that a success is not necessarily "good" in the everyday sense of
of U to be larger than, smaller than, or equal to a 2 = V (Y) ? Why?
the word. In our discussions , success is merely a name for one of the two possible
d Use Definition 3.5 and the result in part (b) to show that
outcomes on a single trial of an experiment.
2
V(U) = E{[U- E(U)] 2 } = E[.OI(Y - fL) ] = .Ola 2 ;
that is, U = YjiO has variance .01 times that of Y.
EXAMPLE 3.5 An early-warning detection system for aircraft consists of four identical radar units
3.33 Let Y be a discrete random variable with mean fL and variance a 2 If a and b are constants,
use Theorems 3.3 through 3.6 to prove that operating independently of one another. Suppose that each has a probability of .95
of detecting an intruding aircraft. When an intruding aircraft enters the scene, the
a E(aY+b)=aE(Y)+b=afL+b . random variable of interest is Y, the number of radar units that do not detect the
b V(aY+b)=a 2 V(Y)=a 2 a 2 . plane. Is this a binomial experiment?
3.34 The manager of a stockroom in a factory has constructed the following probability distribution
for the daily demand (number of times used) for a particular tool. Solution To decide whether this is a binomial experiment, we must determine whether each
of the five requirements in Definition 3.6 is met. Notice that the random variable of
y 0 2 interest is Y, the number of radar units that do not detect an aircraft. The random
p(y) I .I .5 .4 variable of interest in a binomial experiment is always the number of successes ;
consequently, the present experiment can be binomial only if we call the event do not
It costs the factory $10 each time the tool is used. Find the mean and variance of the daily cost detect a success. We now examine the experiment for the five characteristics of the
for use of the tool. binomial experiment.

I . The experiment involves four identical trials. Each trial consi sts of determining
3.4 The Binomial Probability Distribution whether (or not) a particular radar unit detects the aircraft.
2. Each trial results in one of two outcomes. Because the random variable of
Some experiments consist of the observation of a sequence of identical and inde- interest is the number of successes, S denotes that the aircraft was not detected,
and F denotes that it was detected.
pendent trials, each of which can result in one of two outcomes. Each item leaving
a manufacturing production line is either defective or nondefective. Each shot in a 3. Because all the radar units detect aircraft with equal probability, the probability
of anSon each trial is the same, and p = P(S) = P(do not detect)= .OS.
sequence of firings at a target can result in a hit or a miss, and each of n persons
the proportion of "successes" in our sample, in this case 6/20. In the next section, we 3.39 A complex electronic system is built with a certain number of backup components in its
will apply this same technique to obtain an estimate that is not initially so intuitive. As subsystems. One subsystem has four identical components, each with a probability of .2 of
we will see in Chapter 9, the estimate that we just obtained is the maximum likelihood failing in less than I000 hours. The subsystem will operate if any two of the four components
estimate for p and the procedure used above is an example of the application of the are operating. Assume that the components operate independently. Find the probability that
method of maximum likelihood. • a exactly two of the four components last longer than I 000 hours.
b the subsystem operates longer than I 000 hours.

3.40 The probability that a patient recovers from a stomach disease is .8. Suppose 20 people are
known to have contracted this disease. What is the probability that

a exactly 14 recover?
Exercises b at least I0 recover?
c at least 14 but not more than 18 recover?
3.35 Consider the population of voters described in Example 3.6. Suppose that there are N = 5000 d at most 16 recover?
voters in the population, 40% of whom favor Jones. Identify the event favors Jones as a
3.41 A multiple-choice examination has 15 questions, each with five possible answers, only one of
success S. It is evident that the probability of Son trial I is .40. Consider the event B that s
which is correct. Suppose that one of the students who takes the examination answers each of
occurs on the second trial. Then B can occur two ways: The first two trials are both succes-
the questions with an independent random guess. What is the probability that he answers at
ses or the first tlial is a failure and the second is a success. Show that P(B) = .4. What is
least ten questions correctly?
P(Bi the first trial isS) ? Does this conditional probability differ markedly from P(B)?
3.42 Refer to Exercise 3.41. What is the probability that a student answers at least ten questions
3.36 a A meteorologist in Denver recorded Y =the number of days of rain during a 30-day period.
COITectly if
Does Y have a binomial distribution? If so, are the values of both n and p given?
b A market research firm has hired operators who conduct telephone surveys. A computer a for each question, the student can correctly eliminate one of the wrong answers and sub-
is used to randomly dial a telephone number, and the operator asks the answering person sequently answers each of the questions with an independent random guess among the
whether she has time to answer some questions. Let Y =the number of calls made until the remaining answers?
first person replies that she is willing to answer the questions. Is this a binomial experiment? b he can correctly eliminate two wrong answers for each question and randomly chooses
Explain. from among the remaining answers?
3.37 In 2003, the average combined SAT score (math and verbal) for college-bound students in the 3.43 Many utility companies promote energy conservation by offering discount rates to consumers
United States was I026. Suppose that approximately 45% of all high school graduates took who keep their energy usage below certain established subsidy standards. A recent EPA report
this test and that I00 high school graduates are randomly selected from among all high school notes that 70% of the island residents of Puerto Rico have reduced their electricity usage
grads in the United States. Which of the following random variables has a distribution that can sufficiently to qualify for discounted rates. If five residential subscribers are randomly selected
be approximated by a binomial distribution ? Whenever possible, give the values for n and p. from San Juan, Puerto Rico, find the probability of each of the following events:
a The number of students who took the SAT a All five qualify for the favorable rates.
b The scores of the I 00 students in the sample b At least four qualify for the favorable rates.
c The number of students in the sample who scored above average on the SAT
3.44 A new surgical procedure is successful with a probability of p. Assume that the operation is
d The amount of time required by each student to complete the SAT
performed five times and the results are independent of one another. What is the probability
e The number of female high school grads in the sample that
3.38 The manufacturer of a low-calorie dairy drink wishes to compare the taste appeal of a new a all five operations are successful if p = .8?
formula (formula B) with that of the standard formula (formula A). Each of four judges is given b exactly four are successful if p = .6?
three glasses in random order, two containing formula A and the other containing formula B.
c less than two are successful if p = .3?
Each judge is asked to state which glass he or she most enjoyed. Suppose that the two formulas
are equally attractive. Let Y be the number of judges stating a preference for the new formula. 3.45 A fire-detection device utilizes three temperature-sensitive cells acting independently of each
a Find the probability function for Y . other in such a manner that any one or more may activate the alarm. Each cell possesses a
probability of p = .8 of activating the alarm when the temperature reaches I 00° Celsius or
b What is the probability that at least three of the four judges state a preference for the new more. Let Y equal the number of cells activating the alarm when the temperature reaches 100 .
formula?
c Find the expected value of Y. a Find the probability distribution for Y .
d Find the variance of Y . b Find the probability that the alarm will function when the temperature reaches IOOo.
112 Chapter 3 Discrete Random Variables and Their Probability Distributions Exercises 113

3.46 Construct probability histograms for the binomial probability distributions for n = 5, p = . I, 3.54 Suppose that Y is a binomial random variable based on n trials with success probability p and
.5 , and .9. (Table 1, Appendix 3, will reduce the amount of calculation.) Notice the symmetry consider Y* = n - Y .
for p = .5 and the direction of skewness for p = . I and .9.
a Argue that for y* = 0, I , ... , n
3.47 Use Table I, Appendix 3, to construct a probability histogram for the binomial probability
distribution for 11 = 20 and p = .5 . Notice that almost all the probability falls in the interval P(Y* = y *) = P(n- Y = y*) = P(Y = n- y*) .
5 :::: y:::: 15.
b Use the result from part (a) to show that
3.48 A missile protection system consists of n radar sets operating independently, each with a
probability of .9 of detecting a missile entering a zone that is covered by all of the units. P(Y* = y*) = ( n ) p" -y" qY" = ( n )q y" p" _Y .
n- y • y*
a If n = 5 and a missile enters the zone, what is the probability that exactly four sets detect
the mi ssile? At least one set? c The result in part (b) implies that Y* has a binomial distribution based on n trials and
b How large must 11 be if we require that the probability of detecting a missile that enters the "success" probability p* = q = I - p. Why is this result "obvious"?
zone be .999? 3.55 Suppose that Y is a binomial random variable with n > 2 trials and success probability p.
3.49 A manufacturer of floor wax has developed two new brands, A and B , which she wishes to Use the technique presented in Theorem 3. 7 and the fact that E {Y ( Y - 1) ( Y - 2)) = E ( Y 3 ) -
subject to homeowners' evaluation to determine which of the two is superior. Both waxes, 3E(Y 2 ) + 2E(Y) to derive E(Y 3 ) .
A and B , are applied to floor surfaces in each of 15 homes. Assume that there is actually no 3.56 An oil exploration firm is formed with enough capital to finance ten explorations. The probabil-
difference in the quality of the brands. What is the probability that ten or more homeowners ity of a particular exploration being successful is .1. Assume the explorations are independent.
would state a preference for Find the mean and variance of the number of successful explorations.
a brand A? 3.57 Refer to Exercise 3.56. Suppose the firm has a fixed cost of$20,000 in preparing equipment prior
b either brand A or brand B ? to doing its first exploration. If each successful exploration costs $30,000 and each unsuccessful
exploration costs $15 ,000, find the expected total cost to the firm for its ten explorations.
3.50 In Exercise 2. I 5 I, you considered a model for the World Series. Two teams A and B play a series
of games until one team wins four games. We assume that the games are played independ~ntly 3.58 A particular sale involves four items randomly selected from a large lot that is known to contain
and that the probability that A wins any game is p. Compute the probability that the series lasts 10% defectives. Let Y denote the number of defectives among the four sold. The purchaser of
exactly five games. [Hint: Use what you know about the random variable, Y , the number of the items will return the defectives for repair, and the repair cost is given by C = 3Y 2 + Y + 2.
games that A wins among the first four games.] Find the expected repair cost. [Hint: The result of Theorem 3.6 implies that, for any random
variable Y, E(Y 2) = a 2 + Jl 2.]
3.51 In the I 8th century, the Chevalier de Mere asked Blaise Pascal to compare the probabilities of
two events . Below, you will compute the probability of the two events that, prior to contrary 3.59 Ten motors are packaged for sale in a certain warehouse. The motors sell for $100 each, but a
gambling experience, were thought by de Mere to be equally likely. double-your-money-back guarantee is in effect for any defectives the purchaser may receive.
Find the expected net gain for the seller if the probability of any one motor being defective is
a What is the probability of obtaining at least one 6 in four rolls of a fair die? .08. (Assume that the quality of any one motor is independent of that of the others.)
b If a pair of fair dice is tossed 24 times, what is the probability of at least one double six?
3.60 A particular concentration of a chemical found in polluted water has been found to be lethal to
3.52 The taste test for PTC (phenylthiocarbamide) is a favorite exercise in beginning human genetics 20% of the fish that are exposed to the concentration for 24 hours. Twenty fish are placed in a
classes. It has been establi shed that a single gene determines whether or not an individual is a tank containing this concentration of chemical in water.
"taster." If 70% of Americans are "tasters" and 20 Americans are randomly selected, what is
a Find the probability that exactly 14 survive.
the probability that
b Find the probability that at least 10 survive.
a at least I 7 are "tasters"?
c Find the probability that at most 16 survive.
b fewer than I 5 are "tasters"? d Find the mean and variance of the number that survive.
3.53 Tay-Sachs disease is a genetic disorder that is usually fatal in young children. If both parents are 3.61 Of the volunteers donating blood in a clinic, 80% have the Rhesus (Rh) factor present in their
can·iers of the disease, the probability that their offspring will develop the disease is approxi- blood.
mately .25 . Suppose that a husband and wife are both carriers and that they have three children.
If the outcomes of the three pregnancies are mutually independent, what are the probabilities a If five volunteers are randomly selected, what is the probability that at least one does not
of the following events? have the Rh factor?
b lf five volunteers are randomly selected, what is the probability that at most four have the
a All three children develop Tay-Sachs.
Rh factor?
b Only one child develops Tay-Sachs.
c What is the smallest number of volunteers who must be selected if we want to be at least
c The third child develops Tay-Sachs, given that the first two did not. 90% certain that we obtain at least five donors with the Rh factor?
1 1,. Lhapter 3 Discrete Random Variables and Their Probability Distributions 3.5 The Geometric Probability Distribution 115

3.62 Goranson and Hall ( 1980) explain that the probability of detecting a crack in an airplane wing The sample spaceS for the experiment contains the countably infinite set of sample
is the product of p 1, the probability of inspecting a plane with a wing crack; p 2 , the probability points:
of inspecting the detail in which the crack is located ; and p 3 , the probability of detecting the
damage. £,: s (success on first trial)
£2: FS (failure on first, success on second)
a What assumptions justify the multiplication of these probabilities?
£3: FFS (first success on the third trial)
b Suppose P1 = .9, P2 = .8, and P3 = .5 for a certain fleet of planes. If three planes are FFFS (first success on the fourth trial)
£4:
inspected from this fleet, find the probability that a wing crack will be detected on at least
one of them.
*3.63 Consider the binomial distribution with n trials and P(S) = p. Ek: FFFF ... FS (first success on the k 1h trial)
'----.,--'

a Show that
p(y) (n- y + l)p k-1
for y = I, 2, ... , 11. Equivalently, for y ==
p(y- I) yq
(n- y + l)p
I, 2, ... , 11, the equation p(y) = p(y - I) gives a recursive relationship
yq Because the random variable Y is the number of trials up to and including the first
between the probabilities associated with successive values of Y . success, the events (Y = I) , (Y = 2), and (Y = 3) contain only the sample points
b If 11 = 90 and p = .04, use the above relationship to find P(Y < 3). £ 1, £ 2, and £ 3, respectively. More generally, the numerical event (Y = y) contains
p(y) (n-y+l)p . p(y) . only E y. Because the trials are independent, for any y = 1, 2 , 3 , ... ,
c Show that = > I 1f v < (11 + I) p , that < I 1f y >
p(y-1) yq . p(y-1)
1
(n+l)p,andthat p(y) =I if(11+l)pisanintegerandy = (n+I)p.Thisestablishes p(y) = P(Y = y) = P(Ey) = P(FFF~ ... F, S) = ~p = qY - p.
p(y- I)
y-1 y- 1
that p(y) > p(y- I) if y is small (y < (11 + l)p) and p(y) < p(y- I) if y is large
(y > (11 + 1) p). Thus, successive binomial probabilities increase for a while and decrease
from then on.
d Show that the value of y assigned the largest probability is equal to the greatest intege~ less DE FIN IT I0 N 3.8 A random variable Y is said to have a geometric probability distribution if and
than or equal to (n + l)p. If (n + l)p = m for some integer m, then p(m) = p(m- 1). only if

*3.64 Consider an extension of the situation discussed in Example 3.1 0. If there are n trials in a p(y) = q y- 1 p, y = I , 2, 3, . .. , 0 :S p :S 1.
binomial experiment and we observe y0 "successes," show that P(Y = y0 ) is maximized when
p = Yo/11. Again, we are determining (in general this time) the value of p that maximizes the
probability of the value of Y that we actually observed. A probability histogram for p(y), p = .5, is shown in Figure 3.5. Areas over
intervals correspond to probabilities, as they did for the frequency distributions of
*3.65 Refer to Exercise 3.64. The maximum likelihood estimator for p is Y j 11 (note that Y is the data in Chapter 1, except that Y can assume only discrete values, y = I , 2, ... , oo.
binomial random variable, not a particular value of it).
That p(y) ::: 0 is obvious by inspection of the respective values. In Exercise 3.66
a Derive E(Y /11). In Chapter 9, we will see that this result implies that Y / n is an unbiased you will show that these probabilities add up to 1, as is required for any valid discrete
estimator for p. probability distribution.
b Derive V(Y j11). What happens to V(Y /n) as 11 gets large?
FIGURE 3.5 p(y)
The geometric
3.5 The Geometric Probability Distribution probability
.5

distribution, p = .5
.4
The random variable with the geometric probability distribution is associated with
an experiment that shares some of the characteristics of a binomial experiment. This
.3
experiment also involves identical and independent trials, each of which can result in
one of two outcomes: success or failure. The probability of success is equal to p and
.2
is constant from trial to trial. However, instead of the number of successes that occur
in n trials, the geometric random variable Y is the number of the trial on which the
.I
first success occurs. Thus, the experiment consists of a series of trials that concludes
with the first success. Consequently, the experiment could end with the first trial if a
0
success is observed on the very first trial, or the experiment could go on indefinitely. 2 3 4 5 6 7 8 )'
1 1o Lnapter 3 U1screte Random Variables and Their Probability Distributions
Exercises 119

P(Y .::::: 2) that was implicitly used in Example 3. I 1. Note that the argument in these
commands is the value y0 - l, not the value y0 . This is because some authors preferto Exercises
define the geometric distribution to be that of the random variable Y* = the number of
failures before the .first success. In our formulation, the geometric random variable y 3.66 Suppose that Y is a random variable with a geometric distribution. Show that
is interpreted as the number of the trial on which the first success occurs. In Exercise a L:, p(y) = L: ~ ~ q> - 1P = I.
3.88, you will see that Y* = Y -1. Due to this relationship between the two versions of
b p(y) = q, for y = 2, 3, ... . This ratio is less than 1, implying that the geomet-
geometric random variables, P(Y =Yo) = P(Y -1 = y0 -1) = P(Y* =Yo -1). R p(y- 1)
computes probabilities associated with Y*, explaining why the arguments for dgeom ric probabilities are monotonically decreasing as a function of y. If Y has a geometric
and pgeom are Yo - 1 instead of YO· distribution, what value of Y is the most likely (has the highest probability)?
The next example, similar to Example 3.10, illustrates how knowledge of the 3.67 Suppose that 30% of the applicants fora certain industrial job possess advanced training in com-
geometric probability distribution can be used to estimate an unknown value of p, puter programming. Applicants are interviewed sequentially and are selected at random from
the probability of a success. the pool. Find the probability that the first applicant with advanced training in programming
is found on the fifth interview.

3.68 Refer to Exercise 3.67. What is the expected number of applicants who need to be interviewed
in order to find the first one with advanced training?
EXAMPLE 3.13
Suppose that we interview successive individuals working for the large company 3.69 About six months into George W. Bush's second term as president, a Gallup poll indicated that
discussed in Example 3.10 and stop interviewing when we find the first person who a near record (low) level of 41% of adults expressed "a great deal" or "quite a lot" of confidence
likes the policy. If the fifth person interviewed is the first one who favors the new in the U.S . Supreme Court (http://www.gallup.com/poll/content/default.aspx?ci= 170 ll, June
policy, find an estimate for p, the true but unknown proportion of employees who 2005). Suppose that you conducted your own telephone survey at that time and randomly called
favor the new policy. people and asked them to describe their level of confidence in the Supreme Court. Find the
probability distribution for Y, the number of calls until the first person is found who does not
Solution If Y denotes the number of individuals interviewed until we find the first person who express "a great deal" or "quite a lot" of confidence in the U.S. Supreme Court.
likes the new retirement plan, it is reasonable to conclude that Y has a geometric 3.70 An oil prospector will drill a succession of holes in a given area to find a productive well. The
distribution for some value of p. Whatever the true value for p, we conclude that the probability that he is successful on a given trial is .2.
probability of observing the first person in favor of the policy on the fifth trial is a What is the probability that the third hole drilled is the first to yield a productive well?
b If the prospector can afford to drill at most ten wells, what is the probability that he will
P(Y = 5) = (1 - p) 4 p.
fail to find a productive well?
We will use as our estimate for p the value that maximizes the probability of observing 3.71 Let Y denote a geometric random variable with probability of success p.
the value that we actually observed (the first success on trial 5). a Show that for a positive integer a,
To find the value of p that maximizes P(Y = 5), we again observe that the value
P(Y > a) = qa.
of p that maximizes P(Y = 5) =(I - p) 4 p is the same as the value of p that
maximizes In[(!- p) p] = [4/n(l- p) + ln(p)].
4 b Show that for positive integers a and b,
lf we take the derivative of [4ln( I - p) + ln(p )] with respect to p, we obtain P(Y > a+biY >a) =qb = P(Y >b).

d[4ln(l - p) + ln(p)] -4 1 This result implies that, for example, P(Y > 71Y > 2) = P(Y > 5). Why do you think
dp =--+-.
1- p p
this property is called the memoryless property of the geometric distribution?
c In the development of the distribution of the geometric random variable, we assumed
Setting this derivative equal to 0 and solving, we obtain p = l /5. that the experiment consisted of conducting identical and independent trials until the first
Because the second derivative of [41n(l- p) + In(p)] is negative when p = 1/5, success was observed. In light of these assumptions, why is the result in part (b) "obvious"?
it follows that [4ln(l- p) + ln(p)] [and P(Y = 5)] is maximized when p = Jj5. 3.72 Given that we have already tossed a balanced coin ten times and obtained zero heads, what is
Our estimate for p, based on observing the first success on the fifth trial is 1/5 . the probability that we must toss it at least two more times to obtain the first head?
Perhaps this result is a little more surprising than the answer we obtained in
3.73 A certified public accountant (CPA) has found that nine of ten company audits contain sub-
Example 3.10 where we estimated p on the basis of observing 6 in favor of the new
stantial errors. lf the CPA audits a series of company accounts, what is the probability that the
plan in a sample of size 20. Again, this is an example Qf the use of the method of
first account containing substantial errors
maximum likelihood that will be studied in more detail in Chapter 9. •
a is the third one to be audited?
b will occur on or after the third audited account?
1 ..w Lhapter 3 LJiscrete Random Variables and Their Probability Distributions 3.6 The Negative Binomial Probability Distribution (Optional) 121

3.74 Refer to Exercise 3.73. What are the mean and standard deviation of the number of accounts *3.87 Refer to Exercise 3.86. The maximum likelihood estimator for p is 1/ Y (note that Y is the
that must be examined to find the first one with substantial errors? geometric random variable, not a particular value of it). Derive £(1/Y). [Hint: If lrl < I ,
L:~ 1 r ;/ i =-In( I - r).]
3.75 The probability of a customer arrival at a grocery service counter in any one second is equal
to . I. Assume that customers arrive in a random stream and hence that an arrival in any one *3.88 If Y is a geometric random variable, define Y * = Y - l. If Y is interpreted as the number of
second is independent of all others. Find the probability that the first arrival the trial on which the first success occurs, then Y * can be interpreted as the number of failures
before the first success. If Y * = Y- I , P(Y* = y) = P(Y- I = y) = P(Y = y +I) for
a will occur during the third one-second interval.
y = 0 , I , 2, . ... Show that
b will not occur until at least the third one-second interval.
P(Y* = y) = q·'' p , y = 0, 1,2, ....
3.76 If Y has a geometric distribution with success probability .3, what is the largest value, y , such
0
that P(Y > y0 ) ~ .1? The probability distribution of Y * is sometimes used by actuaries as a model for the distribution
of the number of insurance claims made in a specific time period.
3.77 If Y has a geometric distribution with success probability p, show that
*3.89 Refer to Exercise 3.88. Derive the mean and variance of the random variable Y *
P(Y = an odd integer)=~- a by using the result in Exercise 3.33 and the relationship Y * = Y- I, where Y is geometric.
1-q
*b directly, using the probability distribution for Y * given in Exercise 3.88.
3.78 Of a population of consumers, 60% are reputed to prefer a particular brand, A , of toothpaste.
If a group of randomly selected consumers is interviewed, what is the probability that exactly
five people have to be interviewed to encounter the first consumer who prefers brand A? At
least five people? 3.6 The Negative Binomial Probability
3.79 In responding to a survey question on a sens1t1ve topic (such as "Have you ever tried Distribution (Optional)
marijuana?"), many people prefer not to respond in the affirmative. Suppose that 80% of
the population have not tried marijuana and all of those individuals will truthfully answer no A random variable with a negative binomial distribution originates from a context
to your question. The remaining 20% of the population have tried marijuana and 70% of those
much like the one that yields the geometric distribution. Again, we focus on inde-
individuals will lie. Derive the probability distribution of Y , the number of people you would
need to question in order to obtain a single affirmative response.
pendent and identical trials, each of which results in one of two outcomes: success or
failure. The probability p of success stays the same from trial to trial. The geometric
3.80 Two people took turns tossing a fair die until one of them tossed a 6. Person A tossed first, B distribution handles the case where we are interested in the number of the trial on
second, A third, and so on. Given that person B threw the first 6, what is the probability that B which the first success occurs. What if we are interested in knowing the number of
obtained the first 6 on her second toss (that is, on the fourth toss overall)?
the trial on which the second, third, or fourth success occurs? The distribution that
3.81 How many times would you expect to toss a balanced coin in order to obtain the first head? applies to the random variable Y equal to the number of the trial on which the rth
success occurs (r = 2, 3, 4, etc.) is the negative binomial distribution.
3.82 Refer to Exercise 3.70. The prospector drills holes until he finds a productive well. How many
The following steps closely resemble those in the previous section. Let us select
holes would the prospector expect to drill ? Interpret your answer intuitively.
fixed values for r and y and consider events A and B, where
3.83 The secretary in Exercises 2.121 and 3.16 was given n computer passwords and tries the
passwords at random. Exactly one of the passwords permits access to a computer file. Suppose A = {the first (y - I) trials contain (r - I) successes}
now that the secretary selects a password, tries it, and-if it does not work-puts it back in and
with the other password s before randomly selecting the next password to try (not a very clever
secretary!). What is the probability that the correct password is found on the sixth try? B ={trial y results in a success}.
3.84 Refer to Exercise 3.83. Find the mean and the variance of Y, the number of the trial on which Because we assume that the trials are independent, it follows that A and B are inde-
the correct password is first identified. pendent events, and previous assumptions imply that P(B) = p. Therefore,
*3.85 Find E[Y(Y- I)] for a geometric random variable Y by finding d 2 j dq 2 ('£'; 1 q-'} Use this p(y) = p(Y = y) = P(A n B) = P(A) x P(B).
result to find the variance of Y.
Notice that P(A) is 0 if (y- 1) < (r- 1) or, equivalently, if y < r. If y 2: r, our
*3.86 Consider an extension of the situation discussed in Example 3.13. If we observe y 0 as the value
previous work with the binomial distribution implies that
for a geometric random variable Y, show that P(Y = Yo) is maximized when p = 1/ y 0 . Again.
we are determining (in general this time) the value of p that maximizes the probability of the
value of Y that we actually observed.
P(A) = (Y- ')pr-lqy-r.
r- 1
1 "1."1. Chapter 3 Discrete Random Variables and Their Probability Distributions Exercises 123

Finally,

( )_ (Y
p y -
- 1)
r-1 pq
r y-r
' y = r, r + 1, r + 2, . . .. THE OREM 3.9 If Y is a random variable with a negative binomial distribution,

f.L = E(Y) = !._ and a 2 = V(Y) = r(l ~ p)


p p

DEFINITION 3.9 A random variable Y is said to have a negative binomial probability distribution
if and only if

y- 1)
p(y) = ( r - 1 pr qY- r, y = r, r + 1, r + 2, ... , 0 :::; p :::; 1.
EXA MPLE 3.15 A large stockpile of used pumps contains 20% that are in need of repair. A maintenance
worker is sent to the stockpile with three repair kits. She selects pumps at random and
tests them one at a time. If the pump works, she sets it aside for future use. However,
if the pump does not work, she uses one of her repair kits on it. Suppose that it takes
10 minutes to test a pump that is in working condition and 30 minutes to test and
EXAMPLE 3.14 A geological study indicates that an exploratory oil well drilled in a particular region repair a pump that does not work. Find the mean and variance of the total time it takes
should strike oil with probability .2. Find the probability that the third oil strike comes the maintenance worker to use her three repair kits.
on the fifth well drilled.
Solution Let Y denote the number of the trial on which the third nonfunctioning pump is
Solution Assuming independent drillings and probability .2 of striking oil with any one well , found. It follows that Y has a negative binomial distribution with p = .2. Thus,
let Y denote the number of the trial on which the third oil strike occurs. Then it is E(Y) = 3/(.2) = 15 and V(Y) = 3(.8)/(.2) 2 = 60. Because it takes an additional
reasonable to assume that Y has a negative binomial distribution with p = .2. Because 20 minutes to repair each defective pump, the total time necessary to use the three
we are interested in r = 3 and y = 5, kits is
T = lOY+ 3(20).
P(Y = 5) = p(5) = G)(.2) \ .8) 2
Using the result derived in Exercise 3.33, we see that
= 6(.008)(.64) = .0307.
• and
E(T) = IOE(Y) + 60 = 10(15) + 60 = 210

If r = 2, 3 , 4, . .. and Y has a negative binomial distribution with success prob- V(T) = 102 V(Y) = 100(60) = 6000.
ability p, P(Y = Yo) = p(yo) can be found by using the R (or S-Plus) command Thus, the total time necessary to use all three kits has mean 210 and standard deviation
dnbinom (yo-r, r, p ). If we wanted to useR to obtain p(5) in Example 3.14, we .J6600 = 77 .46. •
use the command dnbinorn ( 2, 3, . 2 ). Alternatively, P(Y :::: y 0 ) is found by using
the R (or S-Pius) command pnbinorn (y0 -r, r, p ). Note that the first argument in
these commands is the value y 0 - r , not the value y0 . This is because some authors
prefer to define the negative binomial distribution to be that of the random variable
Y* = the number of failures before the rth success. In our formulation, the negative Exercises
binomial random variable, Y, is interpreted as the number of the trial on which
the rth success occurs. In Exercise 3.1 00, you will see that Y* = Y - r. Due to 3.90 The employees of a firm that manufactures insulation are being tested for indications of asbestos
this relationship between the two versions of negative binomial random variables. in their lungs. The firm is requested to send three employees who have positive indications
of asbestos on to a medical center for further testing. If 40% of the employees have positive
P(Y = yo) = P(Y- r = y 0 - r) = P(Y* = y 0 - r). R computes probabilities
indications of asbestos in their lungs, find the probability that ten employees must be tested in
associated with Y*, explaining why the arguments for dnbinorn and pnbinorn are
order to find three positives.
Yo - r instead of YO·
The mean and variance of a random variable with a negative binomial distribution 3.91 Refer to Exercise 3.90. If each test costs $20, find the expected value and variance of the total
can be derived directly from Definitions 3.4 and 3.5 by using techniques like those cost of conducting the tests necessary to locate the three positives.
previously illustrated. However, summing the resulting infinite series is somewhat 3.92 Ten percent of the engines manufactured on an assembly line are defective. If engines are
tedious. These derivations will be much easier after we have developed some of the randomly selected one at a time and tested, what is the probability that the first nondefective
techniques of Chapter 5. For now, we state the following theorem without proof. engine will be found on the second trial?
3. 7 The Hypergeometric Probability Distribution 125
3.9 3 Rofocto borei<o 3.92. Whot ;, tho pcobobiJity thottho thinJ ooodofocti" oogioo wi II bo
a on the fifth trial?
b on or before the fifth trial? *3.1 01 a We observe a sequence of independent identical trials with two possible outcomes on each
trial, Sand F, and with P(S) = p. The number of the trial on which we observe the fifth
3.94 Rofoc to bore;" 3.92. Fiod tho moao '"d ""'"'" of tho ""mboc of tho tci•l oo which
success, Y, has a negative binomial distribution with parameters r = 5 and p. Suppose
a the first nondefective engine is found. that we observe the fifth success on the eleventh trial. Find the value of p that maximizes
[ P(Y = 11).
b the third nondefective engine is found.

~
b Generalize the result from part (a) to find the value of p that maximizes P(Y = y0 ) when
Y has a negative binomial distribution with parameters r (known) and p.
3.95 Rofoc to E>ere;., 3.92. Gi"o thot tho firnt two oogioo., to,ted weco dofocti>o, Whot ;,
pcob,biJity !hot ot loa.r two"'"'" oogioo. "'"-" bo to,tod bofore tho fiN ooodofocti>o ;, f""""

3.96 Tho tolophooo lio., .O<viog oo '''""' ""'"""" office •re •JJ b"<y '"""' 60% of tho time.
3.7 The Hypergeometric Probability
o Jf yoo
first oreThe
try? COIIiog thi,try?
second office,
TheWhot tho pmbobiJity thot yo, will comploto yooc oall 0o t!,
third;,try?
Distribution
b If yo" ood o hiood m"" both complete coli, to thi, office, wh" ;, tho pmbobiJity th.,,
total of four tries wi 11 be necessary for both of you to get through?
In Example 3.6 we considered a population of voters, 40% of whom favored candidate
3.97 A gooJogkol ""dy iodi""' thot '" <>plo,tory oil Well •ho"ld Mciko oil with pmMility 2 Jones. A sample of voters was selected, and Y (the number favoring Jones) was to be
observed. We concluded that if the sample size n was small relative to the population
• Whot ;, tho pmb,bility th" tho firnt '""'como, oo tho thicd Woll dcillod?
size N, the distribution of Y could be approximated by a binomial distribution. We also
b Whot ;, tho pco"'bility thot tho thin!""'' como, oo tho '<>ooth well dtillod? determined that if n was large relative to N, the conditional probability of selecting
c Whot "'"mptio,. did yo" m•ko to obtoio tho '"'worn to port, (•) '"d (b)? a supporter of Jones on a later draw would be significantly affected by the observed
d wants
Fiod tho mo'"
to set '"d ""'""
up three tho ""mboc of well, th" m"<t bo dcillod if tho compooy
of wells.
producing preferences of persons selected on earlier draws. Thus the trials were not independent
and the probability distribution for Y could not be approximated adequately by a
•J.98 Co"<idoc tho "<got;>o bioomiol di'"ih"tioo gi"" '" Dofioitioo 3.9. binomial probability distribution. Consequently, we need to develop the probability

(~)
distribution for Y when n is large relative to N.
---l!..S.!J_
+ 1, p(y- Suppose that a population contains a finite number N of elements that possess

~
a Show that if y ::=: r 1) == y- r q. This establishes a recursive re1ation-

(-Y-1) q.
one of two characteristics. Thus, r of the elements might be red and b = N - r,
<hip betwoo" '"""'''" "<goti>o bioomi•l pmb•bilitb, b'"'""' p(y) p(y- I) >
y -r black. A sample of 11 elements is randomly selected from the population, and the
random variable of interest is Y, the number of red elements in the sample. This
---l!..S.!J_
(-Y-1)
b Show that random variable has what is known as the hypergeometric probability distribution.
p(y- I) == q > I ify <
r-q y -r
r- q p(y) ·t· For example, the number of workers who are women, Y, in Example 3.1 has the
Y>-. - . Similarly, - < I 1
1-q I - q p(y- I) hypergeometric distribution.

~ ~
The hypergeometric probability distribution can be derived by using the combina-

*3.99
c which tho > p(y-
Apply p(y) re'"''
1). (b) foe tho"·"'
io p•n 7, p 5 to dotonnioo tho "''"" of y foe torial theorems given in Section 2.6 and the sample-point approach. A sample point
in the sample space S will correspond to a unique selection of n elements, some red

~
and the remainder black. As in the binomial experiment, each sample point can be
Jo' <oq"'"" of iodopeodom idootioal tciru, with two po..,iblo o"tcomo, oo oach "'''· S ood
success? characterized by an n-tuple whose elements correspond to a selection of n elements
F, ood with P(S) p, Whot ;, tho pcob,biJciy thot '>octly y tci•J• will "''"' bofoc, tho cth
*3.100 from the total of N. If each element in the population were numbered from 1 toN, the

If Y ;, ' oogoti>o bioomi,J condom ""'bJo, dofioo Y'


occo~.
~
Y - '· If Y ;, iotocpcotod " th<
thoo Y' "" bo iotecpcotod " tho ""mboc
sample point indicating the selection of items 5, 7, 8, 64, 17, ... , 87 would appear
as then-tuple
of failures
""mbtoc tho tcioJtheoorth
ofbefore success.
which tho nh '"ceo,.

0 Jf Y' ~ Y- c, P(Y• ~ ~ P(Y- ~ ~ P(Y ~


y) c y) Y+ c) foe y ~ 0, I, 2, .. ., •how
(5, 7, 8, 64, 17, '.'' 87).
that P(Y* == y) == (y ,._,
+ 1
r - )p'"q·'', y ==: 0, I, 2, .... n positions

b Do,;" tho moao 'Od "'''""of tho nmdom ""''blo r• by "''"• tho rel,tio,hip Y " The total number of sample points in S, therefore, will equal the number of ways of
Y- r , where Y is negative binomial and the result in Exercise 3.33.
selecting a subset of 11 elements from a population of N, or(~)- Because random
selection implies that all sample points are equiprobable, the probability of a sample
it will be rejected? What is the expected number of defectives in the sample of size ?
What is the variance of the number of defectives in the sample of size 5? 5· 3.104 Twenty identical looking packets of white power are such that 15 contain cocaine and 5 do
not. Four packets were randomly selected. and the contents were tested and found to contain
Solution cocaine. Two additional packets were selected from the remainder and sold by undercover
Let Y equal the number of defectives in the sample. Then N = 20, r = 4, and n ::::
police officers to a single buyer. What is the probability that the 6 packets randomly selected
The lot will be rejected if Y = 2 , 3, or 4. Then 5·
are such that the first 4 all contain cocaine and the 2 sold to the buyer do not?
?(rejecting the lot) = P(Y 2:: 2) = p(2) + p(3) + p(4) In southern California, a growing number of individuals pursuing teaching credentials are
3.105
= I - p(O) - p(l) choosing paid internships over traditional student teaching programs. A group of eight candi-
dates for three local teaching positions consisted of five who had enrolled in paid internships
- I (~)C6) eH~6) and three who enrolled in traditional student teaching programs. All eight candidates appear
- - (~0) - (~0) to be equally qualified, so three are randomly selected to fill the open positions. Let Y be the
number of internship trained candidates who are hired.
1 - .2817- .4696 = .2487.
a Does Y have a binomial or hypergeometric distribution? Why?
The mean and variance of the number of defectives in the sample of size 5 are
b Find the probability that two or more internship trained candidates are hired.
(5)(4) 4 ) (20- 4) (20- 5) c What are the mean and standard deviation of Y?
f.k = 2Q =1 and a-
?
=5 (

20
20 _ = .632. 1
20 1 3.106 Refer to Exercise 3.1 03. The company repairs the defective ones at a cost of $50 each. Find
the mean and variance of the total repair cost.

3.107 A group of six software packages available to solve a linear programming problem has been
Example 3.17 involves sampling a lot of N industrial products, of which r are ranked from I to 6 (best to worst). An engineering firm, unaware of the rankings, randomly se-
defective. The random variable of interest is Y, the number of defectives in a sa mple lected and then purchased two of the packages. Let Y denote the number of packages purchased
of size n. As noted in the beginning of this section, Y possesses an approximately by the firm that are ranked 3, 4 , 5, or 6. Give the probability distribution for Y.
binomial distribution when N is large and n is relatively small. Consequently, we
3.108 A shipment of 20 cameras includes 3 that are defective. What is the minimum number of
would expect the probabilities assigned to values of Y by the hypergeometric distri-
cameras that must be selected if we require that P(at least I defective) =::: .8?
bution to approach those assigned by the binomial distribution as N becomes large
and r IN, the fraction defective in the population, is held constant and equal to p. 3.109 Seed are often treated with fungicides to protect them in poor draining, wet environments.
You can verify this expectation by using limit theorems encountered in your calc ulus A small-scale trial, involving five treated and five untreated seeds, was conducted prior to a
courses to show that large-scale experiment to explore how much fungicide to apply. The seeds were planted in wet
soil, and the number of emerging plants were counted. If the solution was not effective and
CH~;;)
lim
N->oo C) = C) p'' (l - p)" -Y ,
four plants actually sprouted, what is the probability that
a all four plants emerged from treated seeds?
where
r b three or fewer emerged from treated seeds?
- =p.
N c at least one emerged from untreated seeds?
(The proof of this result is omitted.) Hence, for a fixed fraction defective p = r IN, the 3.110 A corporation is sampling without replacement for n = 3 firms to determine the one from
hypergeometric probability function converges to the binomial probability functio n which to purchase certain supplies. The sample is to be selected from a pool of six firms, of
as N becomes large. which four are local and two are not local. Let Y denote the number of nonlocal firms among
the three selected.

a P(Y = 1).
Exercises b P(Y 0:: 1).
c P(Y .:S 1).
3.102
An urn contains ten marbles, of which five are green , two are blue, and three are red. Three
3.111 Specifications call for a thermistor to test out at between 9000 and I 0,000 ohms at 2SC Celcius.
marbles are to be drawn from the urn, one at a time without replacement. What is the probability
that all three marbles drawn will be green? Ten thermistors are available, and three of these are to be selected for use. Let Y denote the
number among the three that do not conform to specifications. Find the probability distributions
3.103
A warehouse contains ten printing machines, four of which are defective. A company selects for Y (in tabular form) under the following conditions:
five of the machines at random, thinking all are in working condition. What is the probability
that all five of the machines are nondefective? a Two thermistors do not conform to specifications among the ten that are available.
b Four thermistors do not conform to specifications among the ten that are available.
3.112 Used photocopy machines are returned to the supplier, cleaned, and then sent back out on lease
agreements. Major repairs are not made, however, and as a result, some customers receive 3.8 The Poisson Probability Distribution
malfunctioning machines. Among eight used photocopiers available today, three are malfunc- Suppose that we want to find the probability distribution of the number of automobile
tioning. A customer wants to lease four machines immediately. To meet the customer's deadl ine accidents at a particular intersection during a time period of one week. At first glance
four of the eight machines are randomly selected and, without further checking, shipped to th~ this random variable, the number of accidents, may not seem even remotely related
customer. What is the probability that the customer receives
to a binomial random variable, but we will see that an interesting relationship exists.
a no malfunctioning machines? Think of the time period , one week in this example, as being split up into n
b at least one malfunctioning machine? subintervals, each of which is so small that at most one accident could occur in it
with probability different from zero. Denoting the probability of an accident in any
3.113 A jury of 6 persons was selected from a group of 20 potential jurors, .of whom 8 were African
subinterval by p, we have, for all practical purposes,
American and 12 were white. The jury was supposedly randomly selected, but it contained
only I African American member. Do you have any reason to doubt the randomness of the P(no accidents occur in a subinterval) = l - p ,
selection?
P(one accident occurs in a subinterval) = p ,
3.114 Refer to Exercise 3.1 13. If the selection process were really random, what would be the mean
and variance of the number of African American members selected for the jury? P(more than one accident occurs in a subinterval) = 0.

3.115 Suppose that a radio contains six transistors, two of which are defective. Three transistors Then the total number of accidents in the week is just the total number of subin-
are selected at random, removed from the radio, and inspected. Let Y equal the number of tervals that contain one accident. If the occurrence of accidents can be regarded as
defectives observed, where Y = 0, I , or 2. Find the probability distribution for Y. Express independent from interval to interval , the total number of accidents has a binomial
your results graphically as a probability histogram.
distribution.
3.116 Simulate the experiment described in Exercise 3.115 by marking six marbles or coins so that Although there is no unique way to choose the subintervals, and we therefore
two represent defectives and four represent nondefectives. Place the marbles in a hat, mix, know neither n nor p, it seems reasonable that as we divide the week into a greater
draw three, and record Y , the number of defectives observed. Replace the marbles and repeat number n of subintervals, the probability p of one accident in one of these shorter
the process untiln = 100 observations of Y have been recorded. Construct a relative frequency subintervals will decrease. Letting A. = np and taking the limit of the binomial
histogram for this sample and compare it with the population probability distribution (Exercise = C:.)pY(i- p) -y as n ~ oo, we have
11
probability p(y)
3.115).

3.117 In an assembly-line production of industrial robots, gearbox assemblies can be installed in .


!Ill
11l-+oo
(n)
y
" (I
p· - p
) 11 - r . n(n- I)·· · (n- y
. = 1!Ill
y!
+ I) (;._)Y(I
-
n
-
)..)
-
n
11

-y

one minute each if holes have been properly drilled in the boxes and in ten minutes if 11 ->oo

the holes must be redrilled. Twenty gearboxes are in stock, 2 with improperly drilled holes.
Five gearboxes must be selected from the 20 that are available for installation in the next five =
. ).. Y( I -
ltm -
A.)
-
11
n(n-l)···(n-y+l)( A.)-y
l - -
11 ->oo y! n nY n
robots .

a Find the probability that all 5 gearboxes will fit properly.


=)..Y
- lim ( 1 - )..)
-
11
(
I--A.)-y (l - -I)
b Find the mean, variance, and standard deviation of the time it takes to install these y! 11 ->oo n n n
5 gearboxes.

3.118 Five cards are dealt at random and without replacement from a standard deck of 52 cards.
x (1- ~) x ... X (1- )': l).
What is the probability that the hand contains all 4 aces if it is known that it contains at least
3 aces? Noting that
3.119 Cards are dealt at random and without replacement from a standard 52 card deck. What is the
probability that the second king is dealt on the fifth card?
lim
H -HXJ
(1--A)ll
n
= e-!..

*3.120 The sizes of animal populations are often estimated by using a capture- tag- recapture method. and all other terms to the right of the limit have a limit of I, we obtain
In this method k animals are captured, tagged, and then released into the population. Some time
later 11 animals are captured, andY , the number of tagged animals among then , is noted. The
v
p(y) = -e-!..
probabilities associated with Yare a function of N, the number of animals in the population, y!
so the observed value of Y contains information on this unknown N. Suppose that k === 4 (Note: e = 2.718 .... ) Random variables possessing this distribution are said to have
animals are tagged and then released. A sample of n = 3 animals is then selected at randorn a Poisson distribution. Hence, Y, the number of accidents per week, has the Poisson
from the same population. Find P(Y = I) as a function of N . What value of N will maximize
P(Y= I) ? - distribution just derived.
Exercises situation. Find the probability that the number of autos entering the tunnel during a two-minute
period exceeds three. Does the Poisson model seem reasonable for this problem?
3.121 Let Y denote a random variable that has a Poisson distribution with mean A = 2. Find Assume that the tunnel in Exercise 3.132 is observed during ten two-minute intervals, thus
3.133
a P(Y = 4). giving ten independent observations Y1, Yz, ... , Y10 , on the Poisson random variable. Find the
probability that Y > 3 during at least one of the ten two-minute intervals.
b P(Y 0: 4).
c P(Y < 4). 3.134 Consider a binomial experiment for n = 20, p = .OS. Use Table I, Appendi x 3, to calculate
d P(Y0:4[Y 0: 2). the binomial probabilities for Y = 0 , J, 2. 3, and 4. Calculate the same probabilities by using
the Poisson approximation with A= np. Compare.
3.122 Customers arrive at a checkout counter in a department store according to a Poisson distribution
3.135 A salesperson has found that the probability of a sale on a single contact is approximately .03.
at an average of seven per hour. During a given hour, what are the probabilities that
If the salesperson contacts 100 prospects, what is the approximate probability of making at
a no more than three customers an·ive? least one sale?
b at least two customers arrive?
3.136 Increased research and discussion have focused on the number of illnesses involving the organ-
c exactly five customers arrive?
ism Escherichia coli (I 0257:H7), which causes a breakdown of red blood cells and intestinal
3.123 The random variable Y has a Poisson distribution and is such that p(O) = p(l ). What is p(2)? hemorrhages in its victims (http://www.hsus.org/ace/11831. March 24, 2004). Sporadic out-
breaks of E. coli have occurred in Colorado at a rate of approximately 2.4 per I 00,000 for a
3.1 24 Approximately 4% of silicon wafers produced by a manufacturer have fewer than two large period of two years.
flaws. If Y , the number of flaws per wafer, has a Poisson distribution, what proportion of the
wafers have more than five large flaws? [Hint: Use Table 3, Appendix 3.] a If this rate has not changed and if J00,000 cases from Colorado are reviewed for this year,
what is the probability that at least 5 cases of E.coli will be observed?
3.125 Refer to Exercise 3.122. If it takes approximately ten minutes to serve each customer, find
b If 100,000 cases from Colorado are reviewed for this year and the number of E.coli cases
the mean and variance of the total service time for customers arriving during a 1-hour period.
exceeded 5, would you suspect that the state's mean E. coli rate has changed? Explain.
(Assume that a sufficient number of servers are availab le so that no customer must wait for
service.) Is it likely that the total service time will exceed 2.5 hours? 3.137 The probability that a mouse inoculated with a serum will contract a certain disease is .2.
3.126 Refer to Exercise 3.122. Assume that an·ivals occur according to a Poisson process wi th an
Using the Poisson approximation, find the probability that at most 3 of30 inoculated mice will
contract the disease.
average of seven per hour. What is the probability that exactly two customers arrive "i n the
two-hour period of time between 3.138 Let Y have a Poisson distribution with mean A. Find E[Y(Y- I)] and then use this to show
that V(Y) =A.
a 2:00P.M. and 4:00P.M. (one continuous two-hour period)?
b I :00 P.M. and 2:00P.M. or between 3:00P.M. and 4:00P.M. (two separate one-hour periods 3.139 In the daily production of a certain kind of rope, the number of defects per foot Y is assumed
that total two hours)? to have a Poisson distribution with mean A = 2. The profit per foot when the rope is sold is
3.127 given by X, where X = SO - 2 Y - Y 2 . Find the expected profit per foot.
The number of typing errors made by a typist has a Poisson distribution with an average of
four en·ors per page. If more than four errors appear on a given page, the typist must retype the *3.140 A store owner has overstocked a certain item and decides to use the following promotion to
whole page. What is the probability that a randomly selected page does not need to be retyped? decrease the supply. The item has a marked price of $100. For each customer purchasing the
3.128 item during a particular day, the owner will reduce the price by a factor of one-half. Thus,
Cars an·ive at a toll both according to a Poisson process with mean 80 cars per hour. If the
the first customer will pay $50 for the item, the second will pay $25, and so on. Suppose that
attendant makes a one-minute phone call , what is the probability that at least I car arrives
the number of customers who purchase the item during the day has a Poisson distribution with
during the call?
mean 2. Find the expected cost of the item at the end of the day. [Hint: The cost at the end of
*3.129 Refer to Exercise 3.128. How long can the attendant's phone ca ll last if the probabil!tY is at the day is I00( 1/ 2) Y, where Y is the number of customers who have purchased the item.]
least .4 that no cars arrive during the call?
3.141 A food manufacturer uses an extruder (a machine that produces bite-size cookies and snack
3.130 A parking lot has two entrances. Cars arrive at entrance I according to a Poisson distribution at food) that yields revenue for the firm at a rate of $200 per hour when in operation. However, the
an average of three per hour and at entrance fi according to a Poisson distribution at an average extruder breaks down an average of two times every day it operates. If Y denotes the number
of four per hour. What is the probability that a total of three cars will arrive at the parking lot in of breakdowns per day, the daily revenue generated by the machine is R = 1600 - SOY 2 . Find
a given hour? (Assume that the numbers of cars arriving at the two entrances are independent. ) the expected daily revenue for the extruder.
3.131 The number of knots in a particular type of wood has a Poisson distribution with an average of *3.142 Let p(y) denote the probability function associated with a Poisson random variable with
l.S knots in I 0 cubic feet of the wood. Find the probability that a I0-cubic-foot block of the mean A.
wood has at most I knot. A
a Sh ow that t he rat1o . pro b a b"l"
. o f success1ve . sat1s
1 1t1es p(y)
. fi es --'--'-- for y = I , 2, ....
3.132 The mean number of automobiles entering a mountain tunnel per two-minute period is one. An p(y- I) y
excessive number of cars entering the tunnel during a brief period of time produces a hazardous b For which values of y is p(y) > p(y - I)?
138 Chapter 3 Discrete Random Variables and The ir Probability Distributions 3.9 Moments and Moment-Generating Functions 139

c Notice that the result in part (a) implies that Poisson probabilities increase for awhile as v into one simple expression. We will first define the moment-generating function and
increases and decrease thereafter. Show that p(y) maximized when y =the greatest integ~r then explain how it works.
less than or equal to A..

3.143 Refer to Exercise 3.142 (c). If the number of phone calls to the fire department, Y, in a day has
DEFINITION 3.14 The moment-generating function m (t) for a random variable Y is defined to be
a Poisson distribution with mean 5.3, what is the most likely number of phone calls to the fire
department on any day?
m(t) = E(e 1Y). We say that a moment-generating function for Y exists if there
exists a positive constant b such that m(t) is finite for It I :S b.
3.144 Refer to Exercises 3.142 and 3.143. If the number of phone calls to the fire department, Y, in
a day has a Poisson distribution with mean 6, show that p(5) = p(6) so that 5 and 6 are the
two most likely values for Y. Why is E(e 1Y) called the moment-gene rating functionfor Y? From a series expan-
sion for e1Y, we have

3.9 Moments and Moment-Generating elY = I + ty + (ty)2 + (ty)3 + (ty)4 + ....


2! 3! 4!
Functions Then, assuming that /L~ is finite fork = I , 2, 3, ... , we have
3
The parameters fL and a are meaningful numerical descriptive measures that locate 1 11
~ e ' p(y) = "'"""' [
E(e y) = "'"""' L: (ty? + 3!
I+ t y + 2! (ty) + · · ·] p(y)
the center and describe the spread associated with the values of a random variable
Y. They do not, however, provide a unique characterization of the distribution of Y. t2 t3
3
Many different distributions possess the same means and standard deviations. We = LP(Y)+ tLyp(y)+ 2! L l P(Y)+ 3! L Y P(Y)+ ...
now consider a set of numerical descriptive measures that (at least under certain y y y y

conditions) uniquely determine p(y) . I (2 I t3 I


= I + t f1, I + - IL 2 + - /LJ + ....
2! 3!
DEFINITION 3.12 The kth moment of a random variable Y taken about the origin is defined to be This argument involves an interchange of summations, which is justifiable if m(t)
E(Yk) and is denoted by fL~. exists. Thus, E(e 1 Y) is a function of all the moments /L~ about the origin, fork =
I, 2, 3, .... In particular, /L~ is the coefficient of t k I k! in the series expansion of
m(t) .
Notice in particular that the first moment about the origin, is E ( Y) = f1, 11 = f1, and The moment-generating function possesses two important applications. First, if
that /L ; = E(Y 2 ) is employed in Theorem 3.6 for finding a 2 . we can find E(e 1 Y), we can find any of the moments for Y .
Another useful moment of a random variable is one taken about its mean.
THEOREM 3.12 If m(t) exists, then for any positive integer k,
DEFINITION 3.13 The kth moment ofa random variable Y taken about its mean, or the kth central
dkm(t)] - (k)(O) - I
moment of Y, is defined to be E [ (Y - fL )k] and is denoted by ILk. -m -ILk·
dtk 1=0

In other words, if you find the kth derivative of m(t) with respect to t and
In particular, a 2 = f1, 2. then set t = 0, the result will be /L~.
Let us concentrate on moments /L~ about the origin where k = I, 2, 3, · · · ·
1
Suppose that two random variables Y and Z possess finite moments with /L 1Y ::::; Proof dk m(t) I dtk, or m <k) (t), is the kth derivative of m(t) with respect tot. Because
1 1 1 1 1 h . · 1 That
/L1z• !L 2r = !L 22 , ... , fLJY = fLJz• w ere J can assume any mteger va ue. . .is. 2 3
IY I t I f 1
the two random variables possess identical corresponding moments about the ongln· m(t) = E(e ) =I +t!L 1 + -f1,2 + -f1,3 + · · ·'
2! 3!
Under some fairly general conditions, it can be shown that Y and Z have identical
it follows that
probability distributions. Thus, a major use of moments is to approximate the prob·
ability distribution of a random variable (usually an estimator or a decision maker). ( I) I 2f I 3f2 I
m (t) = 11 + -2! ~-"2 + -3! IL3 + .. · '
Consequently, the moments fL~, where k = 1, 2, 3, ... , are primarily of theoretical
11
~-"I

value fork > 3. ( 2) I 2f I 3f 2 I


Yet another interesting expectation is the moment-generating function for a randorn m (t) = f1, 2 + -f1,3 + -f1,4 + · · ·,
2! 3!
variable, which, figuratively speaking, packages all the moments for a random variable
Exercises 3.158 If Y is a random variable with moment-generating function m(t) and if W is given by
W =a Y + b, show that the moment-generating function of W is e 1bm(at).
3.145 If Y has a binomial distribution with n trials and probability of success p, show that the 3.159 Use the result in Exercise 3.158 to prove that, if W = aY + b, then E ( W) = a E (Y) + b and
moment-generating function for Y is V(W) = a 2 V(Y).
1
m(t) = (pe +q)", whereq = I - p. 3.160 Suppose that Y is a binomial random variable based on n trials with success probability p and
3.146 Differentiate the moment-generating function in Exercise 3.145 to find £(Y) and £(Y2). Then let y• = n- Y.
find V(Y).
a Use the result in Exercise 3.159 to show that £(Y') = nq and V(Y') = npq, where
3.147 If Y has a geometric distribution with probability of success P: show that the moment-generating q = I - p.
function for Y is b Use the result in Exercise 3.158 to show that the moment-generating function of Y' is
pel
m(l) = - - - . whereq = I - p. m'(t) =
(qe 1 + p)" , where q =
I - p.
I -qe 1
c Based on your answer to part (b), what is the distribution of Y'?
3.148 Differentiate the moment-generating function in Exercise 3.147 to find E (Y) and E (Y 2). Then
find V(Y). d If Y is interpreted as the number of successes in a sample of size 11 , what is the interpretation
of Y'?
3.149 Refer to Exercise 3.145. Use the uniqueness of moment-generating functions to give the dis- e Based on your answer in part (d), why are the answers to parts (a), (b), and (c) "obvious"?
tribution of a random variab le with moment-generating function 111(1) = (.6e 1 + .4) 3.
3.161 Refer to Exercises 3.147 and 3.158.lf Y has a geometric distribution with success probability p,
3.150 Refer to Exercise 3.147. Use the uniqueness of moment-generating functions to give the dis-
.3e1
consider Y' = Y- I. Show that the moment-generating function of Y' is m'(t) = __P_'
tribution of a random variab le with moment-generating function m(t) = - - - . I - qe1
I - .7e1 whereq = I - p.
3.151 Refer to Exercise 3.145. If Y has moment-generating function m(t) = (.7e1 + .3) 10 , what is *3.162 Let r(t) = ln[m(t )l and r(k)(Q) denote the kth derivative of r(t) evalu ated fort = 0. Show that
P(Y :<::: 5) ? r(I)(Q) = ~-t '1 = J-t and r <2)(Q) = J-t; - (~-t'1 f = a 2 [Hint: m(O) = 1.]
3.152 Refer to Example 3.23. If Y has moment-generating function m(t) = e 6 <e'-l), what is *3.163 Use the results of Exercise 3.162 to find the mean and variance of a Poisson random variable
P(IY - ~-tl :<::: 2a)? with m(t) = e5<e' - l>. Notice that r(t) is easier to differentiate than m(t) in this case.

3.153 Find the distributions of the random variables that have each of the following moment-
generating functions:
a m(t) = [(l / 3)e1 + (2/3)] 5 .
el 3.1 0 Probabi Iity-Generati ng
b m(t) = - - .
C
2- e 1

m(t) = e 2 <e'-l)_
Functions (Optional)
3.154 Refer to Exercise 3.153. By inspection, give the mean and variance of the random variables An important class of discrete random variables is one in which Y represents a count
associated with the moment-generating functions given in parts (a), (b), and (c). and consequently takes integer values: Y = 0, I, 2, 3, .... The binomial, geometric,
3.155 Let 111 (t) = (l / 6)e1 + (2/6)e 21 + (3/6)e 31 . Find the following: hypergeometric, and Poisson random variables all fall in this class. The following
a E(Y)
examples give practical situations that result in integer-valued random variables. One,
involving the theory of queues (waiting lines), is concerned with the number of persons
b V(Y)
(or objects) awaiting service at a particular point in time. Knowledge of the behavior of
c The distribution of Y
this random variable is important in designing manufacturing plants where production
3.156 Suppose that Y is a random variable with moment-generating function m (t). consists of a sequence of operations, each taking a different length of time to complete.
a What is m(O)? An insufficient number of service stations for a particular production operation can
b If W = 3Y , show that the moment-generating function of W is 111(31).
result in a bottleneck, the formation of a queue of products waiting to be serviced,
and a resulting slowdown in the manufacturing operation. Queuing theory is also
c If X = Y- 2, show that the moment-generating function of X is e - 21m(l).
important in determining the number of checkout counters needed for a supermarket
3.15 7 Refer to Exercise 3. 156.
and in designing hospitals and clinics.
a If W = 3Y , use the moment-generating function of W to show that £(W) = 3£(Y) and Integer-valued random variables are also important in studies of population growth.
V(W) = 9V(Y). For example, epidemiologists are interested in the growth of bacterial populations and
b If X = Y - 2, use the moment-generating function of X to show that £(X) = E(Y)- 2 the growth of the number of persons afflicted by a particular disease. The numbers of
and V(X) = V(Y). elements in each of these populations are integer-valued random variables.
146 Chapter 3 Discrete Random Variables and Their Probability Distributions Exe rc ises 147

Because we already have the moment-generating function to assist in finding the


moments of a random variable, of what value is P(t)? The answer is that it may be
difficult to find m(t) but much easier to find P(t). Thus, P(t) provides an additional
tool for finding the moments of a random variable. It may or may not be useful in a
given situation.
---
EXAMPLE 3.28
The number of customers per day at a sales counter, Y , has been observed for a long
period of time and found to have mean 20 and standard deviation 2. The probability
distribution of Y is not known. What can be said about the probability that, tomonow,
Y will be greater than 16 but less than 24?
Finding the moments of a random variable is not the major use of the probability.
generating function. Its primary app lication is in de~iving the probability function (and Solution We want to find P(16 < Y < 24). From Theorem 3.14 we know that, for any k ~ 0,
hence the probability distribution) for other related integer-valued random variables. P(IY- t.Li < ka) ~I- ljk 2 , or
For these applications, see Feller ( 1968) and Parzen ( 1992). I
P [ (t.L - ka) < Y < (t.L + ka)] ~ 1 - k2 .

Because f.L = 20 and a = 2, it follows that f.L - ka = 16 and f.L + ka = 24 if k = 2.


Exercises Thus,

*3.164 I 3
Let Y denote a binomial random variable with n trials and probability of success p. Find the P(l6 < Y < 24) = P(f.L - 2a < Y < f.L + 2a) > 1- - 2
- (2)
= -.
4
probability-generating function for Y and use it to find E(Y).
*3.165 Let Y denote a Poisson random variable with mean A.. Find the probability-generating function In other words, tomonow's customer total will be between 16 and 24 with a fairly
fo r Y and use it to find E(Y) and V(Y). high probability (at least 3/ 4).
Notice that if a were 1, k would be 4, and
*3.166 Refer to Exercise 3.165. Use the probability-generating function found there to find E(Y 3) .
I 15
P ( 16 < Y < 24) = P (t.L - 4a < Y < f.L + 4a) > I - - ? = - .
- (4)- 16
3.11 T chebysheff' s Theorem Thus, the value of a has considerable effect on probabilities associated with intervals .

We have seen in Section 1.3 and Example 3.22 that if the probability or population

histogram is roughly bell-shaped and the mean and variance are known, the em pirical
rule is of great help in approximating the probabilities of certain intervals. However,
in many instances, the shapes of probability histograms differ markedly from a mound
shape, and the em pirical rule may not yield useful approximations to the probabilities
Exercises
of interest. The following result, known as Tchebysheff's theorem, can be used to
determine a lower bound for the probability that the random variable Y of interest
3.167 Let Y be a random variable with mean II and variance 9. Using Tchebysheff's theorem, find
fal ls in a n interval f.L ± ka. a a lower bound for P(6 < Y < 16).
b the value of C such that P(IY - Ill 2: C) ::;:: .09.
THEOREM 3.14 Tchebysheff's Theorem Let Y be a random variable with mean f.L and finite
3.168 Would you rather take a multiple-choice test or a full-recall test? If you have absolutely no
variance a 2 . Then, for any constant k > 0,
know ledge of the test material, you will score zero on a full-recall test. However, if you are
1 1 given 5 choices for each multiple-choice question , you have at least one chance in five of
P(IY- t.Li < ka) ~ I - k 2 or P(IY- f.Li ~ ka) :S k 2 .
guessing each correct answer! Suppose that a multiple-choice exam contains I 00 questions,
each with 5 possible answers, and guess the answer to each of the questions.

Two important aspects of this result shou ld be pointed out. First, the result applies a What is the expected value of the number Y of questions that will be correctly answered ?
for any probability distribution , w hether the probability histogram is bell-shaped or b Find the standard deviation of Y.
not. Second, the results of the theorem are very conservative in the sense that th~ c Calcu late the intervals 11- ± 2a and 11- ± 3a.
actual probability that Y is in the interval f.L ± ka usually exceeds the lower boU~ d If the results of the exam are curved so that 50 correct answers is a passing score, are you
10
for the probability, I - I I k 2 , by a considerable amount. However, as discussed likely to receive a passing score? Explain.
Exercise 3. 169, for any k > I, it is possible to construct a probability distribU· 3.169 This exercise demonstrates that, in general , the results provided by Tchebysheff' s theorem
tion so that, for that k, the bound provided by Tchebysheff's theorem is actuallY at·
cannot be improved upon. Let Y be a random variable such that
tained. (You should verify that the results of the empirical rule do not contradict those
g iven by Theorem 3.14.) The proof of this theorem will be deferred to Section 4. 1°· I 16 I
p(-t) =IS' p(O) =IS' p(l) =IS'
The usefulness of this theorem is illustrated in the following example.
a Show that E(Y) = 0 and V(Y) = 1/9.
regarding banning of clothes that display gang symbols, comment on the probability of ob-
·b Use the probability distribution of Y to calculate P(jY- J.l/ 2:: 3u). Compare this ex
serving this survey result (that is, observing 85% or more in a sample of 549 who are in favor
probability with the upper bound provided by Tchebysheff's theorem to see that the bouacr
provided by Tchebysheff's theorem is actually attained when k = 3. Od of banning clothes that display gang symbols). What assumption must be made about the sam-
pling procedure in order to calculate this probability? [Hint: Recall Tchebysheff's theorem and
*c In part (b_) we guaranteed E(Y) =
0 by placing all probability mass on the values - I ,
the empirical rule.]
and 1, wah p(-1) = p(l). The variance was controlled by the probabilities ass igned. 0
to p(- 1) and p( 1). Using this same basic idea, construct a probability distribution for 3.1 77 For a certain section of a pine forest , the number of diseased trees per acre, Y , has a Poisson
random variable X that will yield P(jX- J.lx/ 2:: 2ux) =
1/4. a distribution with mean ).. = I 0. The diseased trees are sprayed with an insecticide at a cost of
*d If any k > I is
2 specified, how can a random variable W be constructed so that P(j W -/.lwj ::.
$3 per tree, plus a fixed overhead cost for equipment rental of $50. Letting C denote the total
kuw) = 1/k ? ' - spraying cost for a randomly selected acre, find the expected value and standard deviation for
C. Within what interval would you expect C to lie with probability at least .75?
3. 170 The U.S. mint produces dimes with an average diameter of .5 inch and standard deviation .OJ
Using Tchebysheff's theorem, find a lower bound for the number of coins in a lot of 400
that are expected to have a diameter between .48 and .52.
coin~ 3.178 It is known that I 0% of a brand of television tubes will burn out before their guarantee has
expired. If I 000 tubes are so ld, find the expected value and variance of Y, the number of original
tubes that must be replaced. Within what limits would Y be expected to fall?
3. 1 71 Fo" cort,;o typ<of 'O;J tho oombe< of wi"w"'m' pocoobic foot h" ' mo,o of I00. A'<omfog
a Poisson distribution of wireworms, give an interval that will include at least 5/9 of the sample 3.179 Refer to Exercise 3.91. ln this exercise, we determined that the mean and variance of the costs
values of wireworm counts obtained from a large number of !-cubic-foot samples. necessary to find three employees with positive indicat ions of asbestos poisoning were 150 and
4500, respectively. Do you think it is highly unlikely that the cost of completing the tests will
3.1 72 Refer to Exercise 3.1 15. Using the probability histogram, find the fraction of values in the exceed $350?
population thattheorem.
Tchebysheff's fall within 2 standard deviations of the mean. Compare your result with that of

3.173 A balanced coin is tossed three times. Let Y equal the number of heads observed.

a Use the formula for the binomial probability distribution to calculate the probabilities
3.1 2 Summary
associated with Y = 0, I , 2, and 3.
b Construct a probability distribution similar to the one in Table 3.1. This chapter has explored discrete random variables, their probabi lity distributions,
and their expected values. Calculating the probability distribution for a discrete ran-
c Find
V(Y) the expected value and standard deviation of Y, using the formulas E(Y)
=: npq.
=: np and
dom variable requires the use of the probabilistic methods of Chapter 2 to evaluate
the probabilities of numerical events. Probability functions , p(y) = P(Y = y),
d Using the probability distribution from part (b), find the fraction of the population mea- were derived for binomial, geometric, negative binomial, hypergeometric, and Pois-
surements lying within 1 standard deviation of the mean. Repeat for 2 standard deviations.
son random variables. These probability functions are sometimes called probability
rule? do your results compare with the results of Tchebysheff's theorem and the empirical
How
mass functions because they give the probability (mass) assigned to each of the finite
or countably infinite possible values for these discrete random variables.
3. 174 Suppose that a coin was definitely unbalanced and that the probability of a head was equal to
The expected values of random variables and functions of random variables pro-
p = .I. Follow instructions (a), (b), (c), and (d) as stated in Exercise 3.173. Notice that the
vided a method for finding the mean and variance of Y and consequently measures
probability distribution loses its symmetry and becomes skewed when pis not equal to 1/2.
of centrality and variation for p(y). Much of the remaining material in the chapter
3.1 75 In May 2005, Tony Blair was elected to an historic third term as the British prime minis ter. was devoted to the techniques for acquiring expectations, which sometimes involved
A Gallop U.K. poll (http://gallup.com/poll/contentldefault.aspx?ci=l 710, June 28, 2005) con- summing apparently intractable series. The techniques for obtaining closed-form ex-
ducted after Blair's election indicated that only 32% of British adults would like to see their son pressions for some of the resulting expected values included (l) use of the fact that
or daughter grow up to become prime minister. If the same proportion of Americans would pre-
fer that their son or daughter grow up to be president and 120 American adults are interviewed.
L yp(y) = I for any discrete random variable and (2) E(Y 2 ) = E[Y (Y -1)]+ E(Y).
The means and variances of several of the more common discrete distributions are
a president?
what is the expected number of Americans who would prefer their child grow up to be summarized in Table 3.4. These results and more are also found in Table A2.1 in
Appendix 2 and inside the back cover of this book.
b to
what is the standard deviation of the number Y who would prefer that their child grow up
be president? Table 3.5 gives the R (and S-Plus) procedures that yield p(yo) = P(Y = Yo)
and P(Y :S y0 ) for random variables with binomial, geometric, negative binomial,
c is it likely
exceeds that the number of Americans who prefer that their child grow up to be president
40? hypergeometric, and Poisson distributions.
We then discussed the moment-generating function associated with a random vari-
3. 176 A national poll of 549 teenagers (aged 13 to 17) by the Gallop poll (http://gallup.com/contenr/ able. Although sometimes useful in finding f.L and a, the moment-generating function
default.aspx?ci=l7110), April, 2005) indicated that 85% "thi~k
that clothes that display gang is of primary value to the theoretical statistician for deriving the probability distribu-
symbols" should be banned at school. If teenagers were really evenly split in their opinions tion of a random variable. The moment-generating functions for most of the common
random variables are found in Appendix 2 and inside the back cover of this book.

Das könnte Ihnen auch gefallen