Beruflich Dokumente
Kultur Dokumente
and
Probability
Statistics and
Probability
Illustrating a Random Variable
(Discrete and Continuous)
Statistics and Probability
Illustrating a Random Variable (Discrete and
Continuous)
First Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
Telefax: 02-8682-5773/8684-4914/8647-7487
Welcome to the Grade 11 Statistics and Probability Alternative Delivery Mode (ADM) Module.
This module was collaboratively designed, developed and reviewed by educators both from public and private institutions to assist
you, the teacher or facilitator in helping the learners meet the standards set by the K to 12 Curriculum while overcoming their
personal, social, and economic constraints in schooling.
This learning resource hopes to engage the learners into guided and independent learning activities at their own pace and time.
Furthermore, this also aims to help learners acquire the needed 21 st century skills while taking into consideration their needs and
circumstances.
As a facilitator you are expected to orient the learners on how to use this module. You also need to keep track of the learners'
progress while allowing them to manage their own learning. Furthermore, you are expected to encourage and assist the learners as
they do the tasks included in the module.
Welcome to the Grade 11 Statistics and Probability Alternative Delivery Mode (ADM) Module.
The hands are one of the greatest assets of the human body. No other beings in the world has hands that can grasp, hold, move,
and manipulate objects like human hands. Through our hands, we may learn, create and accomplish. Hence, the hand in this
learning resource signifies that you as a learner is capable and empowered to successfully achieve the relevant and essential
competencies and skills at your own pace and time. Your academic success lies in your own hands!
This module was designed to provide you with fun and meaningful opportunities for guided and independent learning at your own
pace and time. You will be enabled to process the contents of the learning resource while being an active learner.
What’s In
This is a brief drill or review to help you link the current lesson with the previous
one.
What’s New
In this portion, the new lesson will be introduced to you in various ways such as a story, a song, a poem, a problem opener, an
activity or a situation.
What is It
This section provides a brief discussion of the lesson. This aims to help you discover
and understand new concepts and skills.
What’s More
This comprises activities for independent practice to solidify your understanding and
skills of the topic. You may check the answers to the exercises using the Answer Key
at the end of the module.
What I Have Learned
This includes questions or blank sentence/ paragraph to be filled in to process what
you learned from the lesson.
What I Can Do
This section provides an activity which will help you transfer your new knowledge or
skill into real life situations or concerns.
Assessment
This is a task which aims to evaluate your level of mastery in achieving the learning
competency.
Additional Activities
In this portion, another activity will be given to you to enrich your knowledge or skill
of the lesson learned. This also tends retention of learned concepts.
Answer Key
This contains answers to all activities in the module.
1
illustrating random variables (discrete and continuous) which are essential in
solving real life problems.
Week
The scope of this module permits it to be used in many different learning
situations. The language used recognizes the diverse vocabulary level of students.
The lesson is arranged to follow the standard sequence of the course.
The module consists of only one lesson entitled illustrating random variables 1
(discrete and continuous).
After going through this module, you are expected to:
1. define random variable; and
2. illustrate random variables (discrete and continues).
What I Know
Before studying this module, take this test to determine what you already know
about the topic covered.
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
D. Nominal
5. A variable where the information or data can take infinitely many values is
a. Quantitative variable
b. Discrete variable
c. Qualitative variable
d. Continuous variable
2
7. A set of numerical values assigned to a sample space is called
a. Random experiment
b. Random sample
c. Random variable
d. None of the above
8. A variable whose value could be a finite and countable number is a
a. Continuous variable
b. Discrete variable
c. Qualitative variable
d. Poison variable
9. Which of the following statement describe a continuous random variable?
a. The average distance traveled by a jeep in a week
b. The number of students present in a Class Anthurium
c. The number of motorcycle owned by a randomly selected household
d. The number of girls taller than 5 feet in a random sample of 10
girls 10. Which of the following is discrete random variable?
a. Jerwin is 165 cm tall
b. Jerwin has three sisters
c. Jerwin weighs 68 kilograms
d. Jerwin ran 500 meters in two minutes
11. Which of the following variable is discrete random variable?
A. The amount of unleaded gasoline in a Suzuki car
B. The temperature of a cup of coffee served at a restaurant.
C. The number of boys in a randomly selected three-child family
D. The average amount spent on water bill every month of December by a
randomly selected household in Quezon Province.
12. Which of the following is NOT a discrete random variable?
A. Number of refrigerator sell each day
B. Height of dragon fruit as measured each day
C. Number of students late in going to school each day
D. Number of people went to the doctor from Monday to Friday
13. You decided to conduct a survey of families with five children. You are
interested in counting the number of girls (out of five children) in each family. Is
this a random variable?
A. Maybe
B. Cannot be determined
C. Yes, it is a random variable
D. No, it is not a random variable
14. Which of the following statement DOES NOT describe a continuous random
variable?
A. Height of students in a certain class
B. The average weight of chicken each day
C. The number of towns belong to Quezon Province
D. The distance travelled by a delivery van in an hour
3
Lesson 1 Illustrating a Random
Variable (Discrete and Continuous)
This module will assist you with understanding the way toward illustrating
random variables (discrete and continuous). Let’s proceed and appreciate learning.
What’s In
In the study of basic probability, you have discovered that an experiment is
any movement that should be possible more than once under comparative
condition. The arrangement of every possible outcomes of an experiment is what we
called a sample space. You have additionally figured out how to mathematically list
down the conceivable outcome of a given experiment. In tossing a coin, for example,
the potential results are turning up a head or a tail.
For you to begin let us all understand that probability distributions can be
illustrated or classified as discrete probability distributions or as continuous
probability distributions, depending on whether they define probabilities associated
with discrete variables and continuous variables.
What’s New
Tossing a coin
As you can see in one- peso coin, it has Dr. Jose P. Rizal on one side, which
we will call it as Head (H), and the other side is the Tail (T). Toss your one-peso coin
three times and record in your notebook the results of the three tosses. In order to
write the result easily, use letter H for the heads and letter T for the tails.
If the results of your three tosses are heads, tails, heads, then you will write
HTH on your notebook.
Continue tossing your coin and record the time. If possible, use mobile
phone timer and record up to the last minutes.
Let say in a minute, how many times the heads and tails appeared. Then,
record all the possible answers on your notebook.
Write all eight possible outcomes. You can do this systematically so that you
do not get confused later on.
4
Looking at the table we see just
1 case of Three Head, but 3
cases of Two Heads, 3 cases of
One Head, and 1 case of Zero
Heads.
So:
• P(X=3) = 1/8
• P(X=2) = 3/8
• P(X=1) = 3/8
• P(X=0) = 1/8
In Graphical Form:
= 1 – 1/8 = 7/8
Example 2;
For a fair coin tosses twice, the probability of each of the possible values for
Number of Heads can be tabulated as shown:
Let x is equal to the number of heads observed. x is what we called random
variable.
Number of 0 1 2
Heads
• P( X=2) = 1/4
• P( X=1) = 2/4
• P( X=0) = 1/4
5
This is again an example of a discrete variable. Thus, a discrete random
variable X has possible values x1, x2 , x3, .....
In Graphical Form:
= 1 – 1/4 = 3/4
What’s More
This comprises activities for independent practice to solidify your
understanding and skills of the topic.
A Random Variable is a capacity that connects a real number with every
component in the sample space. It is a variable whose qualities are controlled by
chance. In this manner, a Random Variable is a numerical amount that is derived
from the results of an arbitrary trial or experiment. The word “random” is used
often in everyday life.
Types of Random Variables:
6
At that point, recognize the two types of arbitrary factors. These are the
discrete and continuous random variables.
Continuous Random Variables, then again, are random variables that take
an interminably uncountable number of potential values, regularly measurable
amounts. Examples are the height or weight of an individual, the time an individual
takes for an individual to wash, time, temperature, item thickness, length, age, etc.
Now, let us try to look to some examples of random variables from the table below.
Types of
Number X or the Random
Experiment Random
Variable X
Variable
7
1. Number of rings before the phone is
answered
What I Can Do
Answer the following.
Classify whether the given experiment implies a discrete random variable or a
continuous random variable. Write D if discrete and C if continuous.
Assessment
Multiple Choice. Choose the letter of the best answer. Write your chosen letter on
a separate sheet of paper.
1. A variable where the information or data can take infinitely many values is
A. Continuous variable
B. Discrete variable
C. Quantitative
D. Qualitative variable
2. Which of the following statement describe a continuous random variable?
A. The number of students present in a Class Temperance
B. The average distance travelled by a tricycle in a month
C. The number of motorcycle owned by a randomly selected household
D. The number of girls taller than 5 feet in a random sample of 6 girls
8
5. Which of the following is NOT a discrete variable?
A. Number of book per student
B. Number of green marbles in the box
C. The number of arrivals of customers in the clinic between 8:00 a. m to 4:00
p.m.
D. The weight of a box of soft drinks labeled 12 ounces.
6. Which of the following is an example of discrete variable?
A. Distance travelled between cars
B. Height of the students in a section Prudence
C. Number of blue marbles in the box
D. Weight of potatoes in the basket
7. A set of numerical values assigned to a sample space is called
A. Random experiment
B. Random sample
C. Random variable
D. None of the above
8. A variable whose value could be a finite and countable number is a
A. Continuous variable
B. Discrete variable
C. Qualitative variable
D. Quantitative variable
9. This term can best describe a variable that can be counted
A. Continuous
B. Discrete
C. Interval
D. Ratio
10. Which of the following is NOT a discrete random variable?
A. Height of eggplant as measured each day
B. Number of refrigerator sell each day
C. Number of late comers in going to school each day
D. Number of people went to the Rizal Park from Monday to Friday
9
A. Maybe
B. Cannot be determined
C. Yes, it is a random variable
D. No, it is not a random variable
15. Which of the following statement DOES NOT describe a continuous random
variable?
A. Height of students in a certain class
B. The average weight of chicken each day
C. The number of streets at barangay Tahimik
D. The distance travelled by a delivery van in an hour
Additional Activities
Hondagua National High School-Senior High School would like to conduct election
for the Accountancy Business and Management (ABM) officers. Complete the table
for the possible outcomes from a sample of four voters and identify also the value
of random variable of the number of “yes” votes.
Value of Random
Event Voter # 1 Voter #2 Voter #3 Voter #4 Variables
(Number of Yes votes)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
Statistics and
Probability
10
Distinguishing Between a
Discrete and a Continuous
Random Variable
Statistics and Probability
Distinguishing Between a Discrete and a Continuous Random Variable
First Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.
Telefax: 02-8682-5773/8684-4914/8647-7487
11
What I Know
Week Before studying this module, take this pre-test to find out how much you already
know about the topic covered. Write the chosen letter on a separate sheet of paper.
12
10.Which of these is NOT a discrete random variable?
A. The number of absent in a class
B. The number of death per year attributed to lung cancer
C. The average amount of electricity consumed per household per
month
D. The number of people who drive through a red light each day during
rush hour
11.Which of the following statement DOES NOT describe a discrete random
variable?
A. The number of siblings in your family
B. The height of your brother in centimeters
C. The number of affected individuals by “Covid- 19” pandemic D. The
number of qualified family in “Social Amelioration Program”
12.Which of the following is NOT a continuous random variable?
A. Amount of water in a pail
B. A person’ s height each year
C. A person’s weight on each birthday
D. Number of cars manufactured in a factory each day
13.Which of the following statement DOES NOT describe a continuous random
variable?
A. The amount of sugar in a cup of coffee
B. The amount of water in a cup of coffee
C. The average earnings per day in a cafeteria
D. The number of cups of coffee sold in a cafeteria during lunch 14. The
distance that a cyclist rides each day is what sort of variable?
A. Continuous
B. Discrete
C. Ordinal
D. Qualitative
15. x is the time it takes a chef to cook a specific dish, is a continuous random
variable. Why do think the variable x in the statement becomes
continuous?
A. The variable x is continuous because it takes a range of values
B. The variable x is continuous because it takes an integral values.
C. The variable x is continuous because it takes a countable value
D. The variable x is continuous because it takes a specific
numerical values
What’s In
13
Remember that a variable is a quantity that may change within the context of a
mathematical problem or experiment. Typically, we use a single letter to represent a
variable. The letters x, y, and z are common generic symbols used for variables. In
this lesson, we shall discuss variables that are associated with probabilities, called
random variables.
To find out if you are ready to learn this new lesson, do the following.
List the sample space of the following experiments.
Experiment Sample Space
1. Flipping a coin
2. Rolling a die
3. Drawing a card at random from a deck of 6
identical card suits labeled 1 to 6
4. Spinning the wheel
the
shown, assuming that
arrow will never fall on the
lines separating the 8 sectors
14
Height of individuals Height of each member
of the family
(You must only use a meter
stick or ruler and avoid using
a carpenter’s rule because you
might accidentally get lose of it
and will end up wounding
your fingers or hand.)
This time, you need to reflect on the following questions because it will help you to
understand the appropriate way of classifying a random variable. Answer the
following questions:
1. How do you find the activity?
2. Were you able to complete the table? If yes, how? If no, why?
3. If a random variable takes on values on continuous scale, it is continuous
random variable. If a random variable takes on outcomes that are countable
then it is discrete random variable. Now, will you be able to classify those
experiments in activity 1 as discrete or continuous? If yes, how? If no, why?
4. In your own words, how will you differentiate between discrete and
continuous random variables?
What is It
In Algebra, “variable” means one thing - an unknown value, you’ll come across
different types of variables in statistics. Some of these are: categorical variable
(variable that can be put into categories like tooth paste brands); ordinal variable
(similar to categorical variable, but there is a clear order like socioeconomic
status ); quantitative variable (a broad category that includes any variable that can
be counted or has a numerical values associated with it); qualitative variable (a
broad category that for any variable that can’t be counted) and there’s a lot more
variable that you may encounter in the field of statistics, but our focused in this
module, is on random variable.
A Random Variable is a function that associates a real number with each element
in the sample space. It is a variable whose values are determined by chance. Thus,
in simple words, a Random Variable is a numerical quantity that is derived from
the outcomes of a random experiment.
In the experiment of tossing a coin, the number of times the coin turns up a head is
an example of random variables. Below are some examples of random variables.
a. Suppose two dice are rolled. The sum of the two numbers that face up is
an example of a random variable. This variable
make take on integers from 2 to 12.
b. Suppose the spinner shown below is spun. An
example of a random variable for this experiment
is the number of times that the spinner stops at
number 3.
There are two types of random variables, the
discrete and continuous random variables
15
Types of Random Variables
Discrete Continuous
A discrete random variable A continuous random
is a random variable variable is a random
whose values represented variable that takes values
by count data. This on a continuous scale.
happens when the set of This hap pens when the
possible outcomes from an set of possible outcomes
experiment can be from an experiment
counted such as can be cannot be counted but
counted such as the can be measured such as
number of household in a height of Grade 11
barangay, thus making students. Continuous
the values of the random random variables
variable countable of the represent measured data
random variable countable like height, weight and
just like whole numbers 0, temperature.
1, 2, 3.
From the example in the table above, Examples 1, 2, and 3 are all discrete while
example 4 and 5 are continuous
Example 6. Brand Z is a company that manufactures batteries. They record
the number of defective batteries that are contained in each box
before it is sent out for delivery. If X is the number of defective
batteries per box, then what type of random variable is X?
Solution. X is a discrete random variable, and X can be a set {0, 1, 2, …, n} where
n is the total number of batteries in each box.
Example 7. Suppose Brand Z in example number 6 wants to check the average
life span of their batteries. If Y is the average number of days that
each battery lasts, then what can you conclude about the variable
Y?
Solution. We can conclude from above, that Y is a continuous random
variable. It may be assigned time intervals that corresponds to each
battery’s life span.
16
Because of the way the discrete random variable is defined, we can say that the
range of values that can be assigned to it is confined to the set of whole numbers.
That is, a discrete random variable may NOT take on non-integers.
What’s More
Independent Activity 1.
Look Back and Reflect
1. How do you determine the values of a random variable?
10. the number of fouls committed by a basketball team during the games
17
1. A well - defined function from a sample space to the set of real numbers is called
a _______________________. It is commonly denoted by capital letters, such as X, Y
and Z. The specific values of a random variable is denoted by small letters such as
x, y, and z.
2. A random variable is a _______________________if its set of possible outcomes is
countable. Mostly, discrete random variable represents count data, such as the
number of enrolled students in a particular school.
3. A random variable is _______________________if it takes on values on a continuous
scale. Often, continuous random variables represent measured data, such as
heights, weights temperatures and lifespan.
What I Can Do
This time, it’s your turn to shine. Answer the following questions.
1. Eight students were asked to solve the Rubik’s cube. The time it took for them to
complete the puzzle is summarized in the table below.
Time (in min) Less than
1 min 1 - 2 min 2 - 3 min Greater than 3 min
Number of children who completed the Rubik’s cube 2 2 3 1
Classify what type of random variable is shown.
2. Suppose a Meter-man Apprentice was told by his superior to measure the
voltage of a certain electric outlet. The voltage reading ranges from 118 V <
x < 122 V. Does the scenario represent a discrete or a continuous random
variable?
Assessment
Let us test how far you understood the lesson on random variables. Answer the test
below. Write your answers on a separate sheet of paper. I believe that with your
effort and determination, you can do it once again!
18
B. Crust: thin or deep dish
Toppings: cheese, bacon or pepperoni
C. Crust: thin or deep dish
Toppings: cheese, bacon, sausage or pepperoni
D. Crust: thin or deep dish
Toppings: cheese, bacon, sausage, pepperoni or hotdog
5. A drawer has 2 red ties, 3 black ties and 5 blue ties. A tie is picked at
random. What is the total number of possible outcomes?
A. 2
B. 3
C. 5
D. 10
6. Which of the following statement describe a continuous random variable?
A. The number of students present in a class
B. The average distance traveled by a jeep in a week
C. The number of correct guesses on a multiple choice test
D. The number of women taller than 68 inches in a random sample of 5
Women
7. Which of the following is a continuous random variable?
A. Height of sunflower as measured each day
B. Number of gadgets a business sell per day
C. Number of people attending mass every Sunday D. Number of people
eating at the restaurant everyday
8. Which of the following is discrete random variable?
A. Hipolito weighs 65 kg
B. Hipolito is 160 cm tall
C. Hipolito has two brother
D. Hipolito ran 100 meters in 10.2 seconds
9. Which of the following variable is discrete random variable?
A. Lifetime of a AAA battery
B. The amount of gasoline in a car
C. Number of goals of a football team
D. The time it takes to commute to word
10. Which of the following is NOT a discrete random variable?
A. Height of rose as measured each day
B. Number of appliances a business sell each day
C. Number of people went to church every Sunday
D. Number of students late in going to school each day
11. Which of the following statement DOES NOT describe a discrete random
variable?
A. The number of children who do household chores everyday
B. The average amount of milk consumption of a baby each day
C. The number of students who excelled in mathematics in a certain
grade level
D. The number of children who love to pray a rosary every 6 o’clock in
the afternoon
12. Which of the following statement DOES NOT describe a continuous
random variable?
A. The distance traveled by a truck in an hour
B. The average height of a coconut tree each day
19
C. The number of provinces belong to Region IV-A
D. The intensity of an earthquake that happens last month
13. Which of the following is NOT a continuous random variable?
A. The amount of water on a jug
B. The length of time attending a “Webinar”
C. The number of OFW who traveled abroad each day
D. The height of the tallest mountain in the Philippine
14. You decide to collect a bunch of cans of soda and measure the volume of
soda in each can. Let x be the number of soda in each can. What type of
variable is x?
A. x is a constant
B. x is a place holder
C. x is a discrete random variable
D. x is a continuous random variable
15. You decided to conduct a survey of families with two children. You
are interested in counting the number of girls (out of 2 children) in
each family. Is this a random variable?
A. Yes, it is a random variable
B. No, it is not a random variable
C. Maybe
D. Cannot be determine
Additional Activities
Answer the following.
Identify whether the given experiment involves a discrete random variable or a
continuous random variable. Write D if discrete and C if continuous.
_____ 1. Getting the distance travelled by a car
_____ 2. Collecting data about the weights of students in a certain school
_____ 3. The temperature in Quezon at noon times
_____ 4. Picking a multiple of 3 less than 200
_____ 5. Tallying the number of PUI (Person Under Investigation) due to
Covid-19 pandemic.
Statistics and
Probability
Finding Possible Values of a
Random Variable
Statistics and Probability
Finding Possible Values of a Random Variable
20
First Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
Telefax: 02-8682-5773/8684-4914/8647-7487
21
The module has one lesson:
Lesson 1 – Finding possible values of a random variable.
Week
After going through this module, you are expected to find possible values of random
variables.
What I Know 1
Before studying this module, take this pre-test to find out how much you already
know about the topic covered. Write the chosen letter on a separate sheet of paper.
1. A quantity resulting from an experiment by chance that can assume
different values is called ________________.
A. Ran dom sample
B. Random variable
C. Random process
D. Random experiment
2. Which of the following is NOT a true statement?
A. Random variables can only have one value.
B. The value of a random variable could be zero.
C. The probability of all the value of a random variable could be zero.
D. The sum of all the probabilities in a probability distribution is always
equal to one.
3. If a coin is tossed, what are the possible values of the random variable for
the number of heads?
A. 0, 1
B. 0, 1, 2
C. 1, 2, 3
D. 0, 1, 2, 3
For numbers 4 - 9. Suppose you tossed two coins.
4. What are the sample spaces for the experiment above?
A. HH, TT
B. HH, HT, TT
C. HH, TH, TT
D. HH, HT, TT, TH
5. How many possible outcomes will occur from the experiment above?
A. 2
B. 3
C. 4
D. 5
6. What are the possible values of the random variable for the number of
heads?
A. 0, 1
B. 0, 1, 2
C. 1, 2, 3
D. 0, 1, 2, 3
7. What are the possible values of the random variable for the number of tails?
A. 0, 1
B. 0, 1, 2
C. 1, 2, 3
D. 0, 1, 2, 3
8. Which of the following is NOT a possible value of the random variable for the
number of heads?
22
A. 1
B. 2
C. 3
D. 4
9. Which of the following is NOT a possible value of the random variable for the
number of tails?
A. 4
B. 3
C. 2
D. 1
For numbers 10 – 11. You decide to conduct a survey of families with two
children. You are interested in counting the number of girls (out of 2 children)
in each family.
10. Is the above experiment “counting the number of girls (out of 2 children in
each family” a random variable?
A. Yes, it is a random variable.
B. No, it is not a random variable.
C. Maybe.
D. It cannot be determined.
11. If the experiment above shows a random variable, then what are the
possible values of it?
A. Its value can be 1 or 2.
B. Its value can be 0, 1, or 2.
C. Its value can be 2 or 4.
D. None, since it is not a random variable.
12. In a local community, a couple was asked the questions, “Are you satisfied
with the work of the current president?” If the husband and the wife both
said “yes”, the response is written as YY. If the husband said yes and the
wife said “no”, the response is YN. Let X be the number of “yes” responses,
what are the possible values of the random variables?
A. 0, 1, 2
B. 1, 2, 3
C. 2, 3, 4
D. 1, 1, 2
For numbers 13 - 15. Suppose two laptops are tested. Let D represent the
defective laptop and N represent the non-defective laptop.
13. What are the possible sample spaces for the experiment?
A. DD, NN
B. DD, DN, NN
C. DD, ND, NN
D. DD, DN, NN, ND
14. How many possible outcomes will occur from the experiment?
A. 2
B. 3
C. 4
D. 5
15. If we let X be the random variable representing the number of defective
laptops. What are the possible values of the random variable?
A. 0, 1
23
B. 0, 1, 2
C. 1, 2, 3
D. 0, 1, 2, 3
Well, what do you think about the pre-test? Did you do it well? Compare
your answers with those in the Answer Key on the last part of this module to find
out your level of performance in this test. If all your answers are correct, then it’s
excellent! This shows that you already know much about the topic. You may SKIP
this module and PROCEED to the next module.
If you got a low score or missed a point, it’s ok. This module was made to
help you understand important concepts about random variables that you can
apply in your daily life. If you study this module carefully, you will learn the
answers to all the items in the test and a lot more!
If three coins are tossed, what numbers can be assigned for the frequency of heads
that will occur? If three cards are drawn from a deck, what number can be
assigned for the frequency of face cards that will occur? These questions may be
answered using random variables which you will learn in this module.
What’s In
Recall that a variable is an attribute that can assume different values. We use
letters to denote or represent a variable. In this lesson, we shall discuss variables
that are resulted from an experiment by chance, called random variables.
To find out if you are ready to learn this new lesson, try to determine whether each
of the following experiments/situations involved discrete or a continuous random
variable.
1. Choosing an even number less than 100.
2. Tallying the number of households in a subdivision that has a least one
vehicle.
3. Gathering information about the average monthly income of a household.
4. Teacher applicants in the Division of Quezon.
5. Recording the number of patients who recovered from COVID-19.
What’s New
To understand how one finds possible values of random variables, do the activity
below.
Suppose two iPhone are tested at random. We want to find out the number of
defective iPhone. Thus, to each outcome in the sample space we shall assign a
value. These are 0, 1, or 2. If there is no defective iPhone, we assign the number 0;
if there is 1 defective iPhone, we assign the number 1; and 2, if there are two
defective iPhone. The number of defective iPhone is a random variable. The possible
values of this random variable are 0, 1, and 2.
24
Read and analyze the given situation below.
Illustration
Let D represent the defective iPhone and N for the non-defective iPhone. If we let Y
be the random variable representing the number of defective iPhone, show the
values of the random variable Y. Complete the table below to show the values of the
random variable.
Possible Outcomes Value of the Random Variable Y
(number of defective iPhone)
DD 2
NN 0
DN 1
ND 1
Suppose three laptops are tested at random. We want to find out the number of
non-defective laptops. Thus, to each outcome in the sample space we shall assign a
value. These are 0, 1, 2, or 3. If there is no defective laptop, we assign the number
0; if there is 1 non-defective laptop, we assign the number 1; if there are two
nondefective laptops, we assign the number 2 and 3, if there are three
nondefective laptops. The number of non-defective laptops is a random variable. Let
D represents the defective laptop and N represents the non-defective laptop. If we
let X be the random variable representing the number of non-defective laptops,
show the values of the random variable X. Complete the table below to show the
values of the random variable.
Possible Outcomes Value of the Random Variable X
(number of non-defective laptops)
25
This time, you need to reflect on the following questions because it will help you to
understand the appropriate way in finding the possible values of a random
variable.
Answer the following questions:
1. How do you find the activity?
2. Were you able to complete the table? If yes, how? If no, why?
3. Will you be able to find the values of a random variable? If yes, how?
If no, why?
4. In your own words, how will you describe a random variable?
5. How do you find the possible values of a random variable?
What is It
A random variable is a numerical quantity that is derived from the outcomes of
random experiments.
The random variable in the activity above is a discrete random variable because the
set of possible outcomes is countable. The possible values of random variable Y
(number of defective iPhone) are 0, 1, and 2, while the possible values of random
variable X (number of non-defective laptops) are 0, 1, 2, and 3.
Example 1. Suppose two coins are tossed, let Z be the random variable representing
the number of heads that occur. Find the values of the random variable Z.
Steps Solution
1. Determine the sample space. The sample space for this experiment is:
Let H represent head and T for S = {TT, TH, HH, HT}
tail.
2. Count the number of heads in
each outcome in the sample Possible Outcomes Value of the
space and assign this number
Random variable Z
to this outcome.
(Number of heads)
TT 0
TH 1
HH 2
HT 1
Steps Solution
1. Determine the sample space. Let The sample space for this experiment is: S
W represent the white ball and Y = {WW, YY, WY, YW}
for the yellow ball.
2. Count the number of white balls
in each outcome in the sample Value of the
space and assign this number to random variable X
Possible
this outcome.
Outcomes (Number of white
balls)
WW 2
YY 0
WY 1
YW 1
.
So, the possible values of the random variable X are 0, 1, and 2.
26
What’s More
Independent Activity 1.
Understanding Possible Values of Random Variables.
Answer the following questions.
1. How do you find the values of a random variable?
2. How is this variable, as described in Algebra, similar to a random variable?
How do they differ?
Independent Assessment 1
Two cards are drawn from a deck. How many possible values can each of the
following variables take?
1. sum of the numbers on the cards
4. Number of times the first card is six and the second card is red
5. Number of times the first card is face card and the second card is not a
face card
Independent Activity 2. Getting Balls from a Box
Two balls are picked in succession without replacement - 4 white balls and 5 green
balls. Let Y be the random variable representing the number of green balls. Find
the values of the random variable Y. Complete the table below.
Value of Random Variable Y
Possible Outcomes (number of green balls)
Independent Assessment 2
Four coins are toss. Let Y be the random variable representing the number of
tails that occur. Find the values of the random variable Y.
Value of Random Variable Y
Possible Outcomes
(number of tails)
Assessment
Multiple Choice. Choose the letter of the best answer. Write the chosen letter on a
separate sheet of paper. .
1.A set of numerical values assigned to a sample space is called
A. Random sample C. Random variable B.
A. 0, 1 C. 1, 2, 3
B. 0, 1, 2 D. 0, 1, 2, 3
For numbers 4 - 9. Suppose you tossed three coins.
4. What are the sample spaces for the experiment above?
A. HHH, TTT
B. TTT, HHH, TTH, THT, HTH
C. TTT, HHH, HHT, THT, HTH
D. TTT, TTH, THT, HTT, HHT, HTH, THH, HHH
28
5. How many possible outcomes will occur from the experiment above?
A. 2 C. 6
B. 4 D. 8
6.What are the possible values of the random variable for the number of
heads?
A. 0, 1 C. 1, 2, 3
B. 0, 1, 2 D. 0, 1, 2, 3
7. What are the possible values of the random variable for the number of
tails?
A. 0, 1 C. 1, 2, 3
B. 0, 1, 2 D. 0, 1, 2, 3
8. Which of the following is NOT a possible value of the random variable for
the number of heads?
A. 1 C. 3
B. 2 D. 4
9. Which of the following is NOT a possible value of the random variable for
the number of tails?
A. 1 C. 3
B. 2 D. 4
For numbers 10 - 11. You decide to conduct a survey of families with two children.
You are interested in counting the number of boys (out of 2 children) in each family
10. Is the above experiment “counting the number of boys (out of 2
children) in each family” a random variable?
A. Yes, it is a random variable. C. Maybe.
A. 0, 1 C. 1, 2, 3
B. 0, 1, 2 D. 0, 1, 2, 3
29
For numbers 14 - 15. Suppose three laptops are tested. Let D represent the
defective laptop and N for the non-defective laptop.
14. How many possible outcomes will occur from the experiment?
A. 3 C. 8
B. 4 D. 9
15. If we let X be the random variable representing the number of
non-defective laptops. What are the possible values of the random variable?
A. 0, 1 C. 1, 2, 3
B. 0, 1, 2 D. 0, 1, 2, 3
Additional Activities
Answer the following.
An umbrella - check girl returns 2 umbrellas at random to 2 customers who had
previously checked in, namely Andrea and Althea. If Andrea’s umbrella was black
and Althea’s umbrella was Yellow:
1. list the elements of the sample space, S - all possible orders in which the
umbrellas can be returned, and;
2. list all possible values of the random variable X, the number of correct
matches.
30
Statistics and
Probability
Illustrating a Probability
Distribution for a Discrete
Random Variable and its
Properties
Statistics and Probability
Illustrating a Probability Distribution for a Discrete Random Variable and its Properties First
Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
31
Cainta, Rizal 1800
After going through this module, you are expected to illustrate a probability
distribution for a discrete random variable and its properties.
What I Know
Let us see how far you will go and how much you know about this pretest. Choose
the letter of the best answer. Write the chosen letter on a separate sheet of paper.
1. What is the sum of the probabilities of all values of the random variable?
A. ∑ 𝑃(𝑋))= 0
B. ∑ 𝑃(𝑋)) = 1/10
C. ∑ 𝑃(𝑋)) = 1
D. ∑ 𝑃(𝑋)) = 10
Refer to the given problem in answering numbers 2-3
2. If two balls are drawn in succession without replacement from an urn
containing 5 red balls and 6 blue balls. If the value of the random variable X
represents the number of blue balls, what is the probability of getting two
blue balls?
A. 1/8
B. 1/4
C. 1/3
D. 1/2
3. Which probability distribution represents the given problem?
A. X 0 1 2
P(X) 1/2 1/4 1/4
B.
X 0 1 2
P(X)
X 1/4
0 1/2
1 1/4
2 3
C.
P(X) 1/8 3/8 3/8 1/8
32
X 0 1 2 3
P(X) 3/8 1/8 3/8 1/8
D.
B. P(X) =
6
C. P(X) =
𝑋
1
D. P(X) =
6
9. The following table shows the probability distribution of a discrete
random variable X. Find the value of n.
X 2 4 6
P(X) 0.40 0.15 n
A. 0.15
B. 0.25
C. 0.45
D. 0.6
10. Which of the following table represents probability distribution?
A. X 0 1 2 3
P(X) 0.11 0.15 0.42 0.44
X 1 3 5 7
33
P(X) 1/3 1/2 1/3 1/3
B.
C.
X 1 2 3 4
P(X) 0.32 0.28 0.28 0.12
D
.
X 0 2 4 6
11. If the values of the random
P(X) 1/5 1/5 1/5 1/5
variable X are 0,1,2 and 3
where X represents the number of heads in tossing a coin thrice,
what is the probability that three heads will come up?
A. 1/2
B. 3/8
C. 1/4
D. 1/8
𝑋
12. What are the values of P(x) if P(x)= , for x=0,1, and 2? 3
A. 0, 0.33, 0.67
B. 0, 1,2
C. 0, 0.45, 0.55
D. 0.2, 0.3, 0.5
13. Which of the following values can serve as the probability distribution
of a random variable X?
1
A. P(X)= for x =1,2, 3…..9
6
B. P(X)= for x = 1, 2, 3, 4
1
C. P(X)= for x = 1,2,3……8
8
3
D.PX) = for x = 1,2,3
14. The number of absences from June to March of a student based on his
class adviser’s record is shown below.
A. X 0 1 2 3
P(X) 2/10 2/10 3/10 3/10
B. X 0 1 2 3
P(X) 1/10 3/10 3/10 3/10
X 0 1 2 3 4
C. P(X) 1/10 1/10 3/10 3/10 1/5
34
X 0 1 2 3 4
P(X) 1/5 3/10 3/10 1/10 1/10
D.
A. X 5 8 9 12 15
P(X) 0.06 0.14 0.20 0.28 0.32
B.
X 5 8 9 12 15
P(X) 0.32 0.28 0.20 0.14 0.06
C.
X 5 8 9 12 15
P(X) 0.06 0.20 0.14 0.32 0.28
D.
X 5 8 9 12 15
Lesson P(X) 0.06 0.14 1 0.32 0.28 0.20
Illustrating
a Probability Distribution for a
Discrete Random Variable and its
Properties
In this module, the concept of the probability distribution for a discrete
random variable will be introduced. Some decisions are made by assigning
probabilities to all possible outcomes related to the situation. When we draw
a conclusion from an experiment like tossing a coin, tossing dice, or from
other situations, it requires the use of random variable and probability
distribution. You will also learn how to determine if the given distribution
represents a probability distribution.
What’s In
In your previous lesson, you have learned how to find the possible values of a
random variable. In this module, you will learn how to illustrate a probability
distribution of a discrete random variable. To find out if you are ready to learn in
this new lesson, answer the following questions.
A. Determine whether the statement is True or False. If the answer is false, you
can modify the statement to make it true.
Statement True False
1. There are 4 outcomes if you tossed two coins.
2. If you tossed three coins where X be the random variable
representing the number of tails that occur. The possible
values of the random variable X are 0, 1 and 2
35
6 2
Solution:
1. True.
2. False: The possible values of the random variable X are 0,1,2 and 3.
3. False:
4. True
5. False: If P(x)= , the value of P(1) is
6. True
B. Recall how to determine the values of the random variable by answering the
given problem.
Find the values of the random variable Y representing the number of green balls
when 2 balls are drawn in succession without replacement from a jar containing 4
red balls and 5 green balls.
Solution:
a. Determine the sample space. Let R represent the red ball and G represent
the green ball. The sample size for this experiment is: { RR, RG, GR, GG }.
b. Count the number of green balls in each outcome in the sample space
and assign the number to this outcome.
What’s New
Activity
Number of Defective Computers
36
Based on the above problem, observe, analyze, and answer the following questions:
a. List the sample space in the given experiment. How many outcomes
are possible?
b. Construct a table showing the number of defective computers in each
outcome and assign this number to this outcome. What is the value of the
random variable X?
c. Illustrate a probability distribution. What is the probability value P(X)
to each value of the random variable?
d. What is the sum of the probabilities of all values of the random
variable?
e. What do you notice about the probability of each value of the random
variable?
What is It
To solve the problem above, you have to consider first the steps in determining the
values of the random variable that you have learned from your previous lesson.
There are four possible values of the random variable X representing the number of
defective computers. The possible values that X can take are 0, 1, 2, and 3.
If each of the outcomes is equally likely to occur, then the probability is:
P (E)=
Assign probability values P(X) to each value of the random variable. Since the
number of outcomes is 8, the probability that 0 defective computer will come
out is or P (0) = , the probability that 1 defective computer will come out is
or
37
3
P(1)= is , the probability that 2 defective computers will come
out is or P(2)=
8
1 and the probability that 3 defective computers will come out is or P(3)= .
8
X 0 1 2 3
P(X) 1 3 3 1
8 8 8 8
you can also use decimals in determining the sum of the probabilities
0.125 + 0.375 + 0.375 + 0.125 = 1
If you add all the probabilities, the sum is equal to 1.
e. From the given activity, you can see that the values of the probability
range from 0 to 1.
The probability distribution of a discrete random variable X is a list of the possible
values of X and the corresponding probabilities of the values. It specifies the
probability associated with each possible value of the random variable. The
distribution functions of discrete random variables are concentrated as a mass for
a particular value, and generally known as Probability Mass Function.
X 1 5 7 8 9
PX) 1
1 1
1
38
3 3 3 3
The distribution must satisfy that each probability value P(X) must be
a.) from 0 to 1 and b) the sum of all the values of the probabilities must be
equal to 1.
By adding all the values of P(X)
Σ P(X)=
The probability of each value of the random variable has the same value which is and
this value lies between 0 and 1 but the sum of its probabilities is not equal to 1.
Hence, this is not a probability distribution because ΣP(X) ≠ 1
Ex. 2. Determine if the distribution below is a discrete probability distribution You
can also express the values of the probabilities in decimal form.
X 1 3 5 7
Ex. 3. Determine whether the given values can serve as the values of a
probability distribution.
b. P(1) = , ,
The probability of each value of the random variable lies between 0 and 1 and the
sum of its probabilities is equal to 1 or ΣP(X) =1. Therefore, this is a probability
distribution.
Ex. 4. Determine whether the following can serve as the probability distribution of a
random variable X.
a. P(X)= for x= 1,2,3,…….9
This means that the value of P(1) to P(9)=
X 1 2 3 4 5 6 7 8 9
P(X) 1 1 1 1 1 1 1 1 1
7 7 7 7 7 7 7 7 7
39
The probability of each value of the random variable has the same value which is
and this value lies between 0 and 1 but the sum of its probabilities is which is not
equal to 1. Hence, this is not a probability distribution because ΣP(X) ≠ 1
P(1)=
P(2)=
P(3)=
P(4)=
Illustrating this in a table would give as:
X 1 2 3 4
P(X) 0.48 0.24 0.16 0.12
The probability of each value of the random variable lies between 0 and 1 and the
sum of its probabilities is equal to 1 or ΣP(X) =1. Therefore, this is a probability
distribution.
What’s More
Independent Activity 1
A. Determine whether the distribution represents a probability distribution or not.
Explain your answer.
1 X 1 5 7 8
1 1 1 1
P(X) 4 8 4 8
2
X 0 2 3 4 6
1 1 1
3 X 1
1 3 51 7
P(X) 6 6 6 3 6
P(X) 0.35 0.25 0.22 0.12
4. P(1)=0.42,
5. 𝟏𝟐 𝟖 𝟑 P(1)=
𝟑𝟓 , P(2)= 𝟑𝟓 , P(3)= 𝟕
Independent Activity II
Number of Girls
Let X be a random variable giving the number of girls in a randomly selected
threechild family. Assuming that boys and girls are equally likely to be selected,
illustrate the probability distribution of a random variable X.
40
Rolling Two Dice
Using the sample space for rolling two dice, illustrates a probability distribution for
the random variable X representing the sum of the numbers that appear.
Independent Activity IV
Face Mask
Julia is a boutique shop owner in her town. Due to COVID-19 pandemic, wearing a
face mask of a person is required for their safety. Since there are limited stocks
available, she decided to start another business by making a face mask. She
started selling face masks from day 1 to day 10. The data she collected is shown in
the table below.
What I Can Do
COVID -19 Recoveries
The ongoing pandemic of corona virus disease 2019, a novel infectious disease
caused by severe acute respiratory syndrome was spread in the Philippines on
41
January 30, 2020. Thousands of people in the country tested positive in this virus,
and some of them recovered from the disease. The following table shows the
number of recoveries from April 24-May 3, 2020.
b. What safety precautions will you undertake to help avoid the spread of
the COVID-19 pandemic?
Assessment
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
1. What must be the value of the probability of each random variable?
A. P(X) ≥ 0
B. 0≤ P(X) ≤ 1
C. -1≤ P(X) ≤ 1
D. P(X) ≤ 1
2. What is the sum of the probabilities of all values of the random
variable?
A. ∑ 𝑃(𝑋))= 0
B. ∑ 𝑃(𝑋)) = 1/10
C. ∑ 𝑃(𝑋)) = 1
D. ∑ 𝑃(𝑋)) = 10
Refer to the given problem in answering numbers 3-4
3. If two balls are drawn in succession without replacement from an
urn containing 4 red balls and 5 blue balls. If the value of the
random variable X represents the number of blue balls. What is the
probability of getting two blue balls?
1
A.
2
1
B.
3
C.
1
D.
8
42
X 0 1 2
1 1 1
P(X) 2 4 4
X 0 1 2
1 1 1
P(X)
4 2 4
C
X 0 1 2 3
1 3 3 1
D P(X) 8 8 8 8
X 0 1 2 3
5. Which of 1 1 the following values
is not the value of the probability of the
random variable? P(X) 8 8
A. 1.01
B.1
C.0.50
D. 0.05
6. Which of the following table represents probability distribution?
A. X 0 1 2 3
P(X) 0.11 0.15 0.42 0.44
B. X 1 3 5 7
1 1
1
P(X) 3
3 3
C. X 1 2 3 4
P(X) 0.32 0.28 0.28 0.12
D. X 0 2 4 6
1 1 1 1
P(X)
5 5 5 5
7. Which of the following
values can serve as the values of a probability distribution?
A. P(1) = 0.42, P2) = 0.31, P(3 )= 0.37
B. P(1) = 9/14, P2) = 4/14, P(3) = 1/14
C. P(1) = 0.08, P2) = 0.12, P(3 )= 1.03
D. P(1) = 10/33, P2) = 12/33, P(3) = 10/33
43
8. If P(X)= , what are the possible values of X for it to be a
probability distribution?
A.0,2,3
B. 1,1,2
C. 2,3,4
D. 1,2,3
9. Which formula gives the probability distribution shown by the
table?
X 2 3 6
P(X) 1/2 1/3 1/6
A. P(X) =
𝑋
B. P(X) =
6
C. P(X) =
D. P(X) =
10. The following table shows the probability distribution of a
discrete random variable X. Find the value
of n.
X 2 4 6
P(X) 0.4 0.25 n
A. 0.15
B. 0.25
C. 0.28
D. 0.35
11. The probabilities that a customer buys 5,8,9,12 and 15 items
in a grocery store are 0.06, 0.14, 0.32, 0.28, and 0.20
respectively. Which probability distribution represents the given
problem?
A. X 5 8 9 12 15
P(X) 0.06 0.14 0.2 0.28 0.32
B.
X 5 8 9 12 15
P(X) 0.32 0.28 0.2 0.14 0.06
C.
X 5 8 9 12 15
P(X) 0.06 0.14 0.32 0.28 0.20
D.
X 5 8 9 12 15
12. P(X) 0.06 0.14 0.2 0.28 0.32 What are the
possible
values of each random variable X if it represents the
number of heads in tossing a coin thrice?
A. X = {0,1,2,3}
B. X = {0, 1,2}
C. X = {1, 2, 3}
D. X = {1,2,}
44
13. What are the values of P(x) if P(x)= , for x=0,1, and 2
A.0,1,2
B.0,0.33, 0.67
C. 0,0.45, 0.55
D. 0.2, 0.3, 0.5
14. Which of the following can serve as the probability distribution of a
random variable X?
A. P(X) = for x =1,2, 3…...8
3+𝑋
B. P(X) = for x = 1, 2, 3, 4
1
C. P(X) = for x = 1,2,3……9
6
A. X 0 1 2 3
P(X)
B. X 0 1 2 3
P(X)
X 0 1 2 3 4
1
C. P(X) 5
X 0 1 2 3 4
D. 1
P(X) 5
Additional Activities
45
1. Tossing Four Coins
Suppose that a coin is to be tossed four times, and let X represent the number of
tails that occur. Illustrate a probability distribution for a random variable X.
2.Spin and Add
A spinner has 3 equal sections that are labeled 1,2, and 3. If
the spinner spins twice, find the sum of the two numbers it 3 1
lands on. Let X be the random variable that represents the
sum of the two numbers.
Statistics and
Probability
Computing Probability
Corresponding to a Given
Random Variable
Statistics and Probability
Computing Probability Corresponding to a Given Random Variable First
Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
46
Development Team of the Module
Week Writers: Melba S. Remojo
Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Nenita N. De Leon and Ma. Theresa B. Yanto
2 Illustrator: Kelvin A. Ramintas and Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Team:
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann
O. Briones Celestina M. Alba, Asuncion C. Ilao
Telefax: 02-8682-5773/8684-4914/8647-7487
What I Know
Let us see how far you will go and how much you know about this pretest. Choose
the letter of the best answer. Write the chosen letter on a separate sheet of paper.
3 6 6 3
1
A. B.
C. D.
6
47
Refer to the given table in answering numbers 2-3.
In the experiment of tossing a coin twice, the following table shows the probability
distribution. What is P (X≤ 2)?
X 0 1 2
P(X) 1 1 1
4 24
4. Find P (X ≥ 1)
A. 0.125 B. 0.5 C. 0.625 D. 0.875
5. Find P (X ≤ 1)
Y 0 1 2 3 4
P(Y)
A. B . 2
C. 1
D.
5 2
7. Find P (1 ≤ Y ≤ 4)
A. B. C.
D.
8. Find P (Y> 1)
A. B. C. D.
Refer to the given problem in answering numbers 9-10.
Two jars both contain the numbers 2, 4, 6, and 8. The following table shows the
probability distribution of the sum when one number from each jar is taken at a
time with replacement.
X 4 6 8 10 12 14 16
4 P(X) 1 1 1
9. 8 8
Find P(4) + P(6)
A. B. C. D.
10. Find the probability that X assumes a value greater than 10.
48
3
A. B. C. D.
8
11. Find the probability that X assumes a value between 6 and 14.
3
A. B. C. D.
8
Refer to the given problem in answering numbers 12-13.
The given table shows the probability distribution of a random variable Z which
represents the percentage of male students per section among Grade 11 senior
high school students.
Z 8 9 11 14 15 18
P(Z) 8% 8% 25% 17% 25% 17%
14. What is the probability that two or more copies will be demanded on a
particular day?
A. 0.30 B. 0.35 C. 0.65 D. 0.75
15. What is the probability that the demand will be at least one but not
more than four?
A.0.76 B. 0.71 C.0.51 D. 0.35
What’s In
In your previous lesson, you have learned how to illustrate a probability
distribution of a discrete random variable. To understand this lesson, your
knowledge of getting the probability of an event is very important as you have
49
learned from your previous lesson in Mathematics. To find out if you are ready to
learn in this new lesson, determine the probability of the following.
Find the probability of the following events.
Event (E) Probability P (E)
a. Getting an odd number in a single roll of a die
b. Getting an ace when a card is drawn from a
deck
c. Getting a number greater than 2 in a single
roll of a die
d. Getting a red queen when a card is drawn
from a deck
e. Getting doubles when two dice are rolled
f. Getting a sum of 5 or a sum of 9 when two
dice are rolled
The probability for each event will be:
What’s New
Golden’s bakery is known for its famous Filipino delicacies. Among these foods
which is a native delicious food called “kakanin” is a “leche puto”. The bakeshop
50
owner recorded the number of boxes of “leche puto” that were delivered each day.
The number of boxes delivered for 10 days is shown below.
Number of
Day
Boxes(X)
1 35
2 37
3 50
4 45
5 37
6 45
7 40
8 42
9 45
10 42
What is It
In the previous module, you have learned how to illustrate a probability
distribution for a discrete random variable and its properties. Some experiments
like tossing a coin, throwing a die, and other activities consist of one or more
outcomes. If each of the outcomes is equally likely to occur, then the probability is:
P (E)=
To answer the given question, you have to construct first the probability
distribution. Let X the value of the random variable represented by the number of
boxes of “leche puto”. The probability distribution is shown below.
Number of Boxes X Probability P(X)
35 1
10
37 1
5
40 1
10
42 1
5
45 3
10
50 1
10
a. The probability that 40 or more boxes will be sold in a particular day means
P (X≥ 40).
51
This means that you have to add P(X=40), P(X=42), P (X= 45), and (X= 50). P
(X≥ 40) = P(40) + P(42) + P(45) + P(50)
or 0.7
b. The probability that the number of boxes delivered will be at least 37 but not
more than 50 means P (37≤ X< 50).
Hence, the values included are P(37), P(40), P42) and P(45), then substitute
its corresponding probability
P (37≤ X< 50) = P(37) + P(40) +P(42) + P(45),
or 0.8
or 0.40
d. Find P (X≤ 45) = P(45) + P(42) + P(40) + P(37) +P(35)
or 0.90
Other solution:
or 0.10
52
What’s More
Independent Activity I
Number of Cell Phones
The number of cell phones sold per day at Gadgets Zone has the following probability
distribution.
Find:
1. P(X≥ 2)
2. P(x≤ 3)
3. P(1≤ 𝑋 ≤ 4)
4. P(2) + P(4)
Independent Activity II
Number of Absences
The monthly absences of a student based on his class adviser’s record are
presented in the probability distribution below.
X 0 1 2 3 4
1
P(X) 5
53
67 0.10
68 0.07
70 0.07
72 0.10
73 0.07
80 0.03
a. What is the probability that at least 65 years old senior citizen will receive
the SAP?
b. What is the probability that at most 70 years old senior citizen will
receive the SAP?
c. What is the probability that at least 60 years old but less than 70 years
old senior citizen will receive the SAP
What I Can Do
A pair of fair dice is rolled. Let X the random variable
representing the sum of the numbers that appear.
Assessment
Multiple Choice. Choose the letter of the best answer. Write the chosen letter on a
separate sheet of paper.
1. The given table represents a probability distribution. What is P(1) +
P(4)?
X 1 2 3 4
P(X) 1 1 1 1
3 6 6 3
A. C.
B. D.
Refer to the given table in answering numbers 2-3.
54
In the experiment of tossing a coin twice, the following table shows the probability
distribution. What is P (X≤ 2)?
X 0 1 2
P(X) 1 1 1
4 2 4
2. What is P (X≤ 2)? 1
A. C.
4
B.
D. 1
3. What is P(0) + P(2)?
A. 1
C.
B.
D.
Refer to the given table in answering numbers 4-5.
4. Find P (X ≤ 1)
A. 0.125
C. 0.625
B. 0.5
D. 0.875
5. Find P (X ≥ 1)
A. 0.125
C. 0.625
B. 0.5
D. 0.875
Refer to the given problem in answering numbers 6-8.
6. The random variable Y representing the number of nuts in a chocolate bar
has the following probability distribution. Find P (odd)
Y 0 1 2 3 4
P(Y)
A. C.
B. D.
7. Find P (0 )
A.
C.
10
B. D.
10
8. Find P (Y< 3 )
A. C.
55
B.
D.
Refer to the given problem in answering numbers 9-10.
Two jars both contain the numbers 2, 4, 6, and 8. The following table shows the
probability distribution of the sum when one number from each jar is taken at a
time with replacement.
X 4 6 8 10 12 14 16
P(X) 1 1 1
4
9. Find 8 8 P(6) +
P(8)
A.
C.
B. D.
10. Find the probability that X assumes a value greater than 12.
A. C.
B. D.
11. Find the probability that X assumes a value between 6 and 14.
A. C.
B. D.
Refer to the given problem in answering numbers 12-13.
The given table shows the probability distribution of a random variable Z which
represents the percentage of male students per section among Grade 11 senior
high school students.
Z 8 9 11 14 15 18
P(Z) 8% 8% 25% 17% 25% 17%
12. Find P(Z≥9)
A. 92% C. 59%
B.84% D.43 %
56
15. What is the probability that the demand will be at least one but not
more than five?
A.0.76 C.0.51
B. 0.71 D. 0.43
Additional Activities
Analyze and solve the following problems:
1. Box A and B contain numbers 1,2 3, and 4. The following is the probability
distribution of the sum when one number from each box is taken at a time
with replacement.
X 2 3 4 5 6 7 8
P(X) 1 1 1
8 4 8
a. Find P(even)
b. Find P (X≥ 5)
c. Find the probability that X assumes a value of less than 4.
d. Find the probability that X assumes a value greater than 6.
e. Find the probability that X assumes a value between 3 and 8.
57
Statistics and
Probability
Computing Probability
Corresponding to a Given
Random Variable
Statistics and Probability
Computing Probability Corresponding to a Given Random Variable First
Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
58
Development Team of the Module
Week Writer: Annabele L. Laraquel
Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Nenita N. De Leon and Ma. Theresa B. Yanto
2 Illustrator: Dhojie N. Yanto
Layout Artist: Edna E. Eclavea
Management Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Team:
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann
O. Briones Celestina M. Alba, Asuncion C. Ilao
Telefax: 02-8682-5773/8684-4914/8647-7487
What know
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
1. Which of the following terms is considered as a measure of the `central location'
of a random variable?
A. Probability Value
B. Mean Value
C. Numerical Value
59
D. Variance
2. Which of the following notations is equivalent to the mean of the probability
distribution?
A. 𝜎
B. ∞
C. 𝜇
D. None of the Above
3. Which of the following notations is equivalent to the expected value of the
probability distribution?
A. 𝐸(𝑋)
B. 𝜀
C. 𝜎
D. 𝑃(𝑥)
4. Which of the following statements is TRUE about the standard deviation of a
discrete random variable?
A. It shows the spread out or dispersion of discrete random variables.
B. It is obtained by multiplying the x values and their corresponding
probability.
C. It deals with the average or center of location of the probability distribution.
D. It is the summation of the product of the square of the difference of x and its
probability.
1 2 3 4 5 6 7 8 9 10
A. Probability Value
B. Expected Value
C. Mean Value
D. Variance
8. What formula is used to find the variance of a discrete random variable?
60
A. E(x) = μx = ∑ x ∙ p (x)
B. E(x) = μx = ∑ x + p (x)
C. E(x) = μx = ∑ x − p (x)
D. E(x) = μx = ∑ x ∙ p (x)2
For items 10 -11, refer to the diagram below.
1 2 3 4 5 6 7 8 9 10
0
1 2 3 4 5 6 7 8 9 10 11 12 13 14
12 24 36 48 60 72
61
B. 36
C. 48
D. 60
15. How dispersed are the elements from the mean?
A. 12
B. 24
C. 36
D. 48
What is In
Let’s have this first for a review.
Suppose you tossed a fair coin twice. Let X be the number of heads that are
observed.
a. Construct the probability distribution of X.
b. Find the probability that at least one head is observed.
Solution:
a. The possible values that X can take are 0, 1, and 2. Each of these numbers
corresponds to an event in the sample space S = {HH, HT, TH, TT} of equally
likely outcomes for this experiment:
For X = 0 the corresponding sample space will be {TT};
For X = 1 the corresponding sample space will be {HT, TH} and For
X = 2 to {HH}.
X 0 1 2
62
b. At least one head is the event X ≥ 1, which is the union of the mutually
exclusive events X = 1 and X = 2. Thus
P(X≥1) = P(1) + P(2) = 0.50 + 0.25 = 0.75
Therefore, the probability that at least one head is 0.75
What’s New
Let’s Roll a Die!
Read and analyze the situation given below:
During Town Fiesta, people used to go to Carnival that most folks call it
“Perya”. Mang Ben used to play “Beto–beto” hoping that he would win.
While he is thinking about what possible outcomes in every roll would be, he
is always hoping that his bet is right.
Let us help Mang Ben to know the probabilities of each outcome by filling the
table below.
X 1 2 3 4 5 6
Based on the data above, observe, analyze, and answer the following
questions:
63
4. The mean is 3.5
If you are wondering how we came up with the answer for questions 4 and 5, you
will understand everything as we go on in our discussion
What is It
If you roll a fair die several times, what will be the average outcome? Imagine
rolling it 6000 times. You would expect to roll about 1000 ones, 1000 twos, and so
on: about 1000 occurrences of each possible outcome. What would be the average
value of the outcomes obtained? Approximately, the mean or expected value would
be;
It only implies that the weighted average of the six possible outcomes 1, 2,…,
6 with weights provided by the relative frequencies. Remember that 3.5 isn’t a value
that we can observe if we will do the experiment.
By similarity with information and relative frequencies, we can characterize
the mean of a discrete random variable by utilizing probabilities from its dispersion,
as follows.
1. The mean is considered as a measure of the `central location' of a random
variable. It is the weighted average of the values that random variable X can
take, with weights provided by the probability distribution.
2. The Expected Value or Mean Value of a discrete random variable x is can be
computed by first multiplying each possible x value by the probability of
observing that value and then adding the resulting quantities. Symbolically,
𝑬(𝑿) = 𝝁𝒙 = 𝑋1 ∗ 𝑃(𝑋1) + 𝑋2 ∗ 𝑃(𝑋2) + ⋯ + 𝑋𝑛 ∗ 𝑃(𝑋𝑛) = ∑ 𝑿 ∗ 𝑷(𝑿)
Example1: Suppose a random variable X that has the discrete uniform distribution
with possible values 1, 2,…, n. The mean or expected value is presented by
Since the possible values are 1, 2,…, n when rolling a die a fair of die the
mean is . This 3.5 is the expectation only. That’s the reason why in
question 4, we come up to 3.5 as its mean value.
So, doing the experiment which is rolling a single die several times and
getting average or mean of the total possible outcomes, we, therefore, conclude that
the average tends to be close to 3.5. This also implies that the more rolls we do, the
closer the average will be to 3.5. Example 2.
0.1669500
0.1669000
P(X) 0.1668500
0.1668000
0.1667500
0.1667000
0.1666500
0.1666000
1 2 3 4 5 6 X
From the figure above, it shows the probability distribution for the possible
outcome in rolling a fair die. Notice that they have equal probabilities or chances to
64
occur. This distribution is symmetric and the mean is 3.5 which found in the
middle of the distribution, it is in the axis of symmetry.
We have noticed that the mean of a random variable X is a measure of the
central location of the distribution of X. If we are summarizing highlights of the
distribution of X, it is evident that location is not the only significant aspect
If the example above complicated you, a simpler one is provided for you.
Example 3. Suppose you want to find the average number of apples sold in any five
grocery stores as indicated by the figure below.
Analyzing the illustration above, then the mean or expected value is 11. If
you are wondering why this happens you may also follow with these processes:
1. Separate 8 elements on the left side of the mean then get the average of those a
(3, 4, 5, 6, 7, 8, 9, 10) which is equal to 6.5
2. Do the same on the right side of mean (12, 13, 14, 15, 16, 17, 18, and 19) which
is equal to 15.5.
3. Get the average value of the numbers 6.5, 11, and 15.5. It will be equal to 11
also
As a whole, we can generate a physical representation and interpretation of
the discrete random variable X with its probability P(X). Assume that the x-axis is
an unbounded see-saw in both directions, and we put weights corresponds to
probability P(X) at each associated value(x) of random variable X. Therefore, the
mean (𝝁𝒙) is the point where the see-saw balance. In other words, it is the center of
weights of the observation.
In the off chance, that value of X close to its mean 𝜇𝑥 are likely and values
further away from 𝜇𝑥 have small probability, at that point, the distribution of X will
be firmly concentrated around 𝜇𝑥. For this situation, the spread of the distribution
of X is little. On the contrary, if the expected value of X some distance from its
mean 𝜇𝑥 are likely, the spread of the distribution of X will be sizeable.
These concepts lead the way to the most important measure of spread, the
variance, and a closely related measure of variability, the standard deviation.
You probably have met the concepts of variance and standard deviation
when summarizing data in your Junior High School Mathematics. These were the
sample variance and the sample standard deviation. The difference here is that we
are referring to properties of the distribution of a discrete random variable.
The variance of a random variable displays the variability or the dispersions
of the random variables. It shows the distance of a random variable from its mean.
The Variance of a Discrete Random Variable X, denoted by 𝝈𝟐𝒙 is
computed by first subtracting the mean from each possible x value to obtain the
deviations, then squaring each deviation and multiplying the result by the
probability of the corresponding x value, and then finally adding these quantities.
Definition: If X is a random variable with mean 𝐸(𝑋) = µ𝑥, then the variance of X is
defined by,
65
𝜎𝑥 = √𝑽𝒂𝒓(𝑿) or 𝝈𝒙 = √𝝈𝒙𝟐
Example 4. Let us consider the same histogram above. Illustrate the variance and
standard deviation.
To be able to determine the element spread away from the mean, we can do
this by adding this procedure in the illustration given.
0.1669500 3 3
0.1669000
P(X) 7
0.1668500 4 10
0.1668000
0.1667500
0.1667000
0.1666500
0.1666000
1 2 3 4 5 6
X
Since we are comparing two elements this time (1 and 3 on the left side of
the mean and 4 and 6 in the right., the 𝜇𝑥 3.5 will be doubled which became 7.
Adding 1 and 3 elements gives us 4, similarly on the right side adding 4 and 6 gives
us 10. Get the average value of 4 and 10 which resulted in 7 which is our new 𝜇𝑥.
Subtract 7 by 4 which is 3, likewise subtract 10 by 7 which resulted in 3. This
implies that the distance of the element from the mean in either direction is 3
which describes the spread of the elements in the observation.
And this is how we come up with the answer in question under What’s More.
Example 5. Consider this figure.
3
0
1 2 3 4 5 6 7 8 9 10
66
b. To illustrate the variance and standard deviation without having
5 5
7 12 17
2 5 7 10
0
1 2 3 4 5 6 7 8 9 10
computation, study the diagram below.
Figure A. 𝜇𝑥
Since we are comparing two elements this time (2 and 5 on the left side of
the mean and 7 and 10 in the right, the 𝜇𝑥 is doubled which became 12. Adding 2
and 5 elements gives us 7, similarly on the right side adding 7 and 10 gives us 17.
Get the average value of 7 and 17 which resulted in 12 which is our new 𝜇𝑥.
Subtract 12 by 7 which is 5, likewise subtract 17 by 12 which also resulted in 5.
This implies that the distance of the element from the mean in either direction is 5
which describes the spread of the elements in the observation.
Example 6. Compare the variance and standard deviation of this distribution.
4 4
16
8 12
4
3 9
3
2 5 7
0
1 2 3 4 5 6 7 8 9 10
Figure B 𝜇𝑥
Following the same procedure just like in example 4, for figure A, the 𝜇𝑥 is 6.
Now let’s compare 3 and 5 on the left side of the mean and 7 and 9 in the right.
Adding 3 and 5 which is 8 and on the right side add 7 and 9 which results in 16.
The average of 8 and 16 is 12 which happens to be our new mean. Subtract 8 from
12 which is 4, similarly subtract 12 from 1 wherein the result is also 4. This shows
that the distance of the element from the mean in either direction is 4 which
expresses the spread of the elements in the observation. Example 7. Analyze and
observe the figure below
67
6 6
6 12 18
2
2 4 8 10
1
0
1 2 3 4 5 6 7 8 9 10
Figure C 𝜇𝑥
Again, since we are comparing two elements this time (2 and 4 on the left
side of the mean and 8 and 10 in the right, the 𝜇𝑥 is doubled which became 12.
Adding 2 and 4 elements give us 6, similarly, on the right side adding 8 and 10
gives us 17. Get the average of 6 and 18 which also results in 12 which is our new
𝜇𝑥. Subtract 12 by 6 which is 6, likewise subtract 18 by 12 which also results in 6.
This suggests that the distance of the element from the mean in either direction is 6
which describes the spread of the elements in the observation.
This time let us compare the variance and standard deviation of the three
given illustrations. Because Figure A has a spread value of 5, Figure B has 4 and
Figure C has spread value of 6 in either direction then we can have this
comparison;
𝜎𝐵 < 𝜎𝐴 < 𝜎𝐶
Which illustrations show less spread out? Then we can easily answer it by
simply observing the figure we generated. Figure B is less dispersion since it spread
by 4 in either direction. Moreover, which illustrations shows the farthest spread
out? Looking back to the three illustrations we have, then the farthest spread out is
Figure C since the element dispersed 6 away from the mean.
In general, unlike the mean, there is no simple direct interpretation of the
variance or standard deviation. The variance is undifferentiated to the moment of
inertia in physics, but that is not necessarily widely understood by students. What
is important to understand is that, in relative terms:
• a small value of standard deviation (or variance) means that the dispersion
of the random variable is narrowly concentrated around the mean.
• a large value of standard deviation (or variance) means that the distribution
is spread out, with some possibility of observing values at some distance
from the mean.
What’s More
Independent Activity: Study and analyze.
5 10 15 20 25 30 35 40
68
1. Figure above shows the number of polo shirt sold by 5 different RTW boutiques.
a. Illustrate the mean. (2 points)
b. Illustrate the variance and standard deviation. (5 points)
0
1 2 3 4 5 6 7 8 9 10 11 12 13 14
2. Compare the variance and standard deviation of the two figures provided.
0
1 2 3 4 5 6 7 8 9 10 11 12 13 14
Figure A
0
1 2 3 4 5 6 7 8 9 10 11 12 13 14
Figure B
69
1. The mean a measure of the `central location' of a random variable. It is the
weighted average of the values that random variable X can take, with
weights provided by the probability distribution.
2. The Expected Value or Mean Value of a discrete random variable x is can
be computed by first multiplying each possible x value by the probability of
observing that value and then adding the resulting quantities. In symbol,
𝑬(𝑿) = 𝝁𝒙 = 𝑋1 ∗ 𝑃(𝑋1) + 𝑋2 ∗ 𝑃(𝑋2) + ⋯ + 𝑋𝑛 ∗ 𝑃(𝑋𝑛) = ∑ 𝑿 ∗ 𝑷(𝑿)
3. The Variance of a Discrete Random Variable X, denoted by σ_x^2 is
computed by first subtracting the mean from each possible x value to obtain
the deviations, then squaring each deviation and multiplying the result by
the probability of the corresponding x value, and then finally adding these
quantities.
4. The formula in determining the variance of a discrete random variable is,
𝑽𝒂𝒓(𝑿) = 𝝈𝟐𝒙 = ∑ (𝒙 − 𝝁𝒙)𝟐 ∙ 𝑷(𝒙)
𝒂𝒍𝒍 𝒑𝒐𝒔𝒔𝒊𝒃𝒍𝒆
𝒗𝒂𝒍𝒖𝒆𝒔 𝒐𝒇 𝒙
What I Can Do
Family Budget
Things to do:
1. Create a Table of Expenses of your family in a week. (Ask help from your
Mother)
2. List the number of expenses for each day.
3. Create a graphical representation of your data gathered.
4. Base on the graphical representation you made, illustrate for the
following:
a. mean or expected value; and
b. variance and standard deviation.
5. Share your output to the Class Group Chat through Image or Video
Presentation.
TASK CRITERIA
Accuracy of the Data Gathered 50%
Clarity and content of the visual 25%
representation
Originality and creativity of the 25%
Presentation
TOTAL 100%
Assessment
70
Multiple Choice. Choose the letter of the best answer. Write the chosen letter on a
separate sheet of paper.
1. Which of the following use the formula of (𝑥) = ∑ 𝑥 ∙ 𝑝 (𝑥) ?
A. Probability Distribution
B. Variance of Discrete Probability Distribution
C. Standard Deviation of Discrete Probability Distribution D. Mean or Expected
Value of Discrete Probability Distribution
2. Among the notations below, which is equivalent to 𝐸(𝑥)?
A. 𝜎𝑥2
B. 𝜎𝑥
C. 𝜇𝑧
D. 𝑃 (𝑋)
3. Which of the following statements best describe the expected value of a discrete
random variable?
A. It is the simple average of all possible outcomes.
B. It is the geometric average of all possible outcomes.
C. It is the weighted average over all possible outcomes.
D. It is the complex average of all possible outcomes in the distribution.
4. To which of the following concepts refer this statement “the sum of the product
of each value of a discrete random variable and its corresponding probability”?
A. Probability Distribution
B. Variance of Discrete Probability Distribution
C. Standard Deviation of Discrete Probability Distribution
D. Mean or Expected Value of Discrete Probability Distribution 5. Use the
illustration below. What is the mean of the distribution?
10 20 30 40 50 60 70
A. 20
B. 30
C. 40
D. 50
71
B. P(X)
C. 𝑃 (𝑋)
D. Var(X)
9. If the variance of a probability distribution is 2.6 grams, then what proper way
to do to get the standard deviation?
A.
B. (2.6)2
C. 1.61 + 2.6
D. 1.61 – 2.6
For items 10 -11, refer to the diagram below.
4
0
1 2 3 4 5 6 7 8 9 10 11 12 13 14
3 6 9 12 15 18 21
72
3
0
1 2 3 4 5 6 7 8 9 10 11 12 13 14
Additional Activities
Study and analyze the figures below.
3 3
2 2
1 1
0 0
1 2 3 4 5 6 7 8 9 10 1 2 3 4 5 6 7 8 9 10
Figure A Figure B
0
1 2 3 4 5 6 7 8 9 10
Figure C
A. Compare the means of the three figures.
B. Show the relationship between the variance and standard deviation among
three given figures.
73
Statistics and
Probability
Calculating Mean and Variance
of a Discrete Random Variable
Statistics and Probability
Calculating Mean and Variance of a Discrete Random Variable
First Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
74
Development Team of the Module
Writers: Annabele L. Laraquel Week
Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Nenita N. De Leon and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto 2
Layout Artist: Ronnjemmele A. Rivera
Management Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Team:
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann
O. Briones Celestina M. Alba, Asuncion C. Ilao
Telefax: 02-8682-5773/8684-4914/8647-7487
Your patience in solving problems here in the module will help you upgrade your
computational skills as it tackles appropriate culture-based situated problems.
Your ability to explain, reason–out, and make a judgment or even decision out of
statistical measures will also be practiced here. The extent of this module permits it
to be used in many different learning situations. The language used recognizes the
diverse vocabulary level of students. The lessons are arranged to follow the
standard sequence of the course. But the order in which you read them can be
changed to correspond with the textbook you are now using.
The module focused on calculating the mean and variance of a discrete random
variable.
What I Know
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
75
1. Which of the following is NOT included in the process of calculating the mean
of the discrete random variable X?
A. Identify the correct probabilities for each x value.
B. Multiply each x value by its probability.
C. Get the summation of the product.
D. Get the square root of the product.
2. Which of the following is TRUE about the value of the mean of a discrete
random variable?
A. Mean Value is always equal to 1
B. Mean Value cannot be negative.
C. Men value is equal to the expected value
D. Mean, Variance, and Standard Deviation are equal.
3. To determine the expected value of the discrete random variable which
processes should be done?
A. Get the squared sum of the difference of each value of a discrete random
variable its probability.
B. Get the summation of the difference of each value of a discrete random
variable and its probability.
C. Get the summation of the product of each value of a discrete random
variable and its probability.
D. Get the square root of the summation of the product each value of a
discrete random variable and its probability.
4. What can we generate if we take the squares of standard deviation?
A. Expected Value
B. Mean Value
C. Probability value
D. Variance
5. In tossing a coin, what are the possible values of the random variable X?
A. 0 B. 0,1 C. 1, 2 D. 0, 1, 2
6. If the variance of a probability distribution is 2.6 grams, what is the standard
deviation?
A. 1.61 B. 1.16 C. 1.06 D. 1.01
For items 7-10, refer to the scenario provided below.
x 0 1 2
C. x 0 1 2
P(X)
P(X) 0.06250.0625
0.375 0.5625 0.0625
0.375
D. x 0 1 2
P(X) 0.5625 0.0625 0.375
76
A. 0.5 B. 0.4 C. 0.05 D. 0.04
10. What is the probability that Juan gets at least 1 correct answer?
A. 0.0625 B. 0.375 C. 0.4375 D. 0.5625
For items 11-12, refer to the scenario and table provided below.
In a recent Barangay Basketball League, each player went to free throws 2 times.
The number of free throws made by each player is described by the following
probability distribution.
1 0.45
2 0.35
11. What is the mean of the probability distribution?
A. 1.00 B. 1.15 C. 2.00 D. 2.25
12. What is the probability that both free throws will be out of the basket?
A. 0.20 B. 0.45 C. 0.35 D. 1.00 For items 13-15, refer to the
scenario and table provided below
The number of qualified voters living in the household on a randomly selected
Subdivision block is described by the following probability distribution.
Number of qualified voter/s, x 1 2 3 4
77
What’s In
Let’s have this first for a review.
The number of COVID -19 patients seen in the ER in any given hour is a random
variable represented by X. The probability distribution for X is presented below.
P(X)
0.5
COVID -19 Patients
0.4
0.3
0.2
0.1
0 X
15 16 17 18 19
3 3
31 34 37
P(X)
0.45 15
0.4
16
0.35
0.3 18 19
0.25
0.2
0.15
0.1
0.05
0 X
15 16 17 18 19
78
𝜇𝑥
What’s New
Time to Check
Read and analyze the situation given below:
Mr. Umali, a Mathematics teacher, regularly gives a formative assessment
composed of 5 multiple-choice items. After the assessment, he used to check the
probability distribution of the correct responses, and the data is presented below:
3. 𝐸(𝑋) = 𝜇𝑥 = 3.41
If you are wondering how we came up with the answer for questions 4 and 5, you
will understand everything as we go on in our discussion.
What is It
In the previous module, mean or expected value characteristics had been
presented, even variance and standard deviation of a discrete random variable.
Expected Value or Mean Value is the sum of the products of each possible value
of a random variable and that value’s probability. In symbol,
So, in what way can we use the concept of expected value to calculate the mean
and variance of a probability distribution?
Using the previously given data, multiply each x value by its probability and add
the results to get the mean (𝜇x ) or expected value(𝑬(𝑿)).
79
Test Item, Probability, 𝑿 ∗ 𝑷(𝑿)
𝑿 𝑷(𝑿)
0 0.03 0
1 0.05 0.05
2 0.12 0.24
3 0.30 0.9
4 0.28 1.12
5 0.22 1.1
𝐸(𝑋) = 𝜇𝑥 = ∑ 𝑥 ∙ 𝑃(𝑋)
𝑬(𝑿) = 𝝁𝒙 = 𝟑. 𝟒𝟏
So, the answer to question 3 can now be stated. It indicates that the expected value
or mean of the random variable is 3.41.
Now that you have calculated the mean or the average marks in the five tests of
Mathematics. It will be convenient this time to determine the difference in the score
of each item from the mean or expected value. This difference in marks shows the
variability of the possible values of the random variable.
Let’s recall the formula in finding the variance of a discrete random variable x,
Remember that if the value of the variance is small, then the values of the random
variable are close to the mean.
𝜎𝑥 = √𝑽𝒂𝒓(𝑿) or 𝝈𝒙 = √𝝈𝒙𝟐
Again, let’s use the same data above, variance and standard deviation of the given
random variable can be presented in this manner. The formula for 𝑉𝑎𝑟(𝑋) says to
take a weighted average of the squared distance to the mean. By squaring,
remember that we are averaging only non-negative values so that the spread to the
right of the mean won’t cancel that to the left. By using expected value, we are
weighting high probability values more than low probability values. So, the table
will now look like this.
Test Item, Probability 𝒙 ∙ 𝑷(𝑿) 𝒙−𝝁 (𝒙 − 𝝁)𝟐 (𝒙 − 𝝁)𝟐 ∙ 𝑷(𝑿)
𝒙 𝑷(𝑿)
0 0.03 0 -3.41 11.6281 0.3488
1 0.05 0.05 -2.41 5.8081 0.2904
2 0.12 0.24 -1.41 1.9881 0.2386
3 0.30 0.90 -0.41 0.1681 0.0504
4 0.28 1.12 0.59 0.3481 0.0975
5 0.22 1.10 1.59 2.5281 0.5562
𝑬(𝑿) ∑(𝒙 − 𝝁)𝟐 ∙ 𝑷(𝑿)
= 𝝁𝒙 = 𝟏. 𝟓𝟖𝟏𝟗
= 𝟑. 𝟒𝟏
80
𝑽𝒂𝒓(𝑿) = 𝝈𝟐𝒙 = ∑(𝒙 − 𝝁)𝟐 ∙ 𝑷(𝑿)
The Variance is 1.5819, and the Standard Deviation is √𝟏. 𝟓𝟖𝟏𝟗 , and it is
equivalent to 𝝈𝒙 = 1.26
Therefore, the correct answers for question 4 are 1.5819 and 1.26 respectively.
For uniformity of the answers, expected value or mean and standard deviation are
both expressed into two decimal places while variance is up to four decimal places
to lessen rounding error.
Take note on units:
1. 𝜎𝑥 has the same units as X.
Since 𝜎𝑥 and X have the same units, the standard deviation is a natural measure of
spread. In some ways, the standard deviation is the more tangible of the two
measures, since it is in the same units as X.
Suppose that a coin is tossed twice so that the sample space is S = {𝐻𝐻, 𝑇𝐻, 𝐻𝑇,
𝑇𝑇}. Let X represent the “number of heads that can come up”, Based on the
prepared discrete probability distributions of the random variable X below,
calculate the mean, variance, and standard deviation.
Outcome or Sample HH HT TH TT
Point
Applying 𝒙 2 1 1 0 the concepts
from the activity given
above, let us complete the table below.
We have discovered that the Expected Value E(X) is only the average or mean (µ𝑥)
of random variable X. It's occasionally called a "weighted average" because more
frequent values of X are weighted more highly in the average. It is additionally how
we anticipate that X should behave on-average as time goes on. The mean of a
random variable X is a measurement of the central location of the distribution of X.
If we are summarizing features of the distribution of X, it is clear that location is
not the only relevant feature. The second most important component is the spread
of the distribution.
81
Again, the variance of a discrete random variable X is defined by
where the sum is taken over all values of x for which P(x) > 0 but less than 1. So,
the variance of X is the weighted average of the squared deviations from the mean
μ, where the weights are given by the probability function P(x) of X.
In another way around, the standard deviation is the more tangible of the two
measures, since it is in the same units as X. For example, if X is a random variable
measuring length in cm, then the standard deviation is in centimeter(𝑐𝑚), while the
variance is in 𝑐𝑚2.
What’s More
Independent Activity 1: Study and analyze
The number of shoes sold per day at a retail store is shown in the table below.
Illustrate the mean, variance, and standard deviation of this distribution.
X 19 20 21 22 23
The number of patients seen in the Emergency Room in any given hour is a random
variable represented by x. The probability distribution for x is:
82
X 10 11 12 13 14
X 0 1 2 3 4
P(X)
What I can do
LET’S DO FAMILY PLANNING!
Things to do:
1. Create a short script as if you are with your dream partner in life and you both
talking about Family Planning.
2. During your planning, specify in your decision how many girls (G) and boys (B)
in the family you both want to have. Consider also your random variable X
stand for.
3. After your planning, construct a probability distribution of whatever decision
you had.
4. Based on the probability distribution table, illustrates the following:
a. Mean or Expected Value;
b. Variance; and
c. Standard Deviation.
83
5. Share your output to the Class Group Chat through Image or Video
Presentation.
TASK CRITERIA
Assessment
Multiple Choice. Choose the letter of the best answer. Write the chosen letter on a
separate sheet of paper.
1. What formula is appropriate to use in calculating the expected value?
A. 𝐸(𝑥) = ∑ 𝑥 ∙ 𝑝 (𝑥) B. 𝐸(𝑥) = ∑ 𝑥 − 𝑝 (𝑥)
C. 𝐸(𝑥) = ∑ 𝑥 ∙ 𝑝 (𝑥)2
D. 𝐸(𝑥) = ∑ 𝑥 /𝑝 (𝑥)
2. Which among the list of formulas should be used to solve for the variance
of discrete random variables of the given data above?
A. 𝜎𝑥2 = ∑(𝑥 + 𝜇)2 ∙ 𝑃(𝑥); for all possible values of x
B. ; for all possible values of x
C. 𝜎𝑥2 = ∑ 𝑥 ∙ 𝑃(𝑥); for all possible values of x
D. ; for all possible values of x
3. Which of the following statements does not describes the value of the
standard deviation?
A. A small standard deviation (or variance) means that the distribution
of the random variable is narrowly concentrated around the mean.
B. A large value of standard deviation (or variance) indicates that the
distribution is spread out, with some chance of observing values at
some distance from the mean.
C. Standard deviation is obtained by getting the square root of the
variance.
D. It is obtained by squaring the variance.
4. What can we generate if we take the summation of the product of each
value assigned to the random variable and its corresponding probability?
A. Expected Value
B. Probability value
C. Standard Deviation
D. Variance
5. Suppose that a coin is to be tossed four times, and let X represent “the
number of TAILS that can come up”, what are the possible values of the
random variable X?
A. 0 B. 0, 1 C. 1, 2,3 D. 0, 1, 2 ,3, 4
6. Which of the following is the correct value of standard deviation (𝜎) of a
discrete random variable X if the variance is 2.5754?
A. 1.61 B. 1.60 C. 1.06 D. 1.01
For items 7-9, refer to the scenario and table provided below.
Let X be a random variable defining number of students getting 95 above grade.
X 0 1 2 3
84
8. What is the variance (𝜎𝑥2) of the given probability distribution?
A. 1.16 B. 1.18 C. 1.20 D. 1.28
9. Which of the following value is appropriate for deviation (𝜎𝑥) of the given
probability distribution?
A 1.02 B. 1.04 C. 1.06 D. 1.08
10. A Grade 11 – HUMMS researcher surveyed the households in Brgy. Quipot,
Tiaong Quezon. The random variable X represents the number of college
graduates in the households. The probability distribution of X is shown
below.
X 0 1 2
P(X) 0.25 0.5 0.25
12. What is the probability that she made at least 9 flower arrangements?
A. 0.60 B. 0.50 C. 0.40 D. 0.30
13. What is the mean or expected value of the given probability distribution?
A. 8.5 B. 8.6 C. 8.7 D. 8.8
14. Which of the following is the appropriate value for the variance?
A. 1.55 B. 1.56 C. 1.57 D. 1.58
15. What value corresponds to the standard deviation?
A. 1.25 B. 1.26 C. 1.27 D. 1.28
Additional Activities
1. Willie works in an automotive tire factory. The number X of sound but
blemished tires that he produces on a random day has the probability
distribution.
x 2 3 4 5
P(X) 0.48 0.35 0.10 0.07
a. Find the probability that Leomar will produce more than three
blemished tires.
b. Find the probability that Leomar will produce at most four
blemished tires.
c. Calculate the mean, variance, and standard deviation of a discrete
random variable.
85
2. The Land Bank of the Philippines Manager claimed that each saving
account customer has several credit cards. The following distribution
showing the number of credits cards people own.
x 0 1 2 3 4
P(X) 0.18 0.44 0.27 0.08 0.03
Show the complete table of values in calculating the Mean, Variance, and
Standard Deviation.
Statistics and
Probability
Interpreting the Mean and the
Variance of a Discrete Random
Variable
Statistics and Probability
Interpreting the Mean and the Variance of a Discrete Random Variable First
Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
86
Layout Artist: Edna E. Eclavea
Management Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Week Team:
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann
O. Briones Celestina M. Alba, Asuncion C. Ilao
3
Department of Education – Region IV-A CALABARZON
Telefax: 02-8682-5773/8684-4914/8647-7487
This module has only one lesson which focuses on interpreting the mean and variance
of a discrete random variable. Your knowledge and skill on how to use the formula for
the mean, variance, and standard deviation that you learned from previous modules are
pre-requisite skills. Accurate computations will yield accurate interpretations. In case
that there will be mistakes in the calculations of mean, variance, and standard
deviation, interpretation of the data will be affected. And if we interpret the result based
on the miscalculations, our tendency to make a not so wise decision is higher. So, to be
successful in this module, always review your computations before you make
interpretations. Expect also that some of the practice exercises and problems included
here need your patience and determination because it requires long computations. It
will teach you the virtues of being patient and striving for excellence. Bear in mind that
you need to review first your solutions to arrive with accurate interpretations. So, are
you now ready? Let us start your journey today.
What I Know
Before studying this module, take this test to determine what you already know
about the topic covered.
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
1. What do you call a random variable with possible values that form a finite or
countable set?
A. continuous C. finite
B. discrete D. infinite
87
2. What term is used to describe the average value of a discrete random variable
over numerous trials of an experiment?
A. mean C. standard deviation
B. probability D. variance
3. Which of the following represents the amount of spread, dispersion, or
variability of the items in a distribution?
A. mean or expected value
B. median and mode
C. outcomes and probability distribution
D. variance and standard deviation
4. Which of the following is also equal to the square root of the variance?
A. mean C. probability
B. median D. standard deviation
5. How would you interpret a very small variance or standard deviation but not
equal to zero?
A. The values of the random variables are farther from the mean.
B. The values of the random variables are nearer to the mean.
C. The values of the random variables are equal to the mean.
D. The values of the random variables have no relationship with the mean.
6. Which of the following shows most likely the largest possible variance or
variability?
A. number of girls in a randomly selected three-child family
B. number of newborn babies per minute.
C. number of Oreo cookies inside a 133 grams pack from different branch
of 711 stores
D. number of patients who are positive with COVID-19 in different
hospitals
7. Which of the following data shows most likely the smallest possible variance or
variability?
A. number of books in different branch of National Bookstore B. number of
books inside a pack bag of grade 11 students
C. number of books inside the library of different universities
D. number of books inside the library of different households
8. What formula is described by 𝜎2 = ∑[𝑥2𝑃(𝑥)] − 𝑢2?
A. the mean of a discrete random variable
B. the variance of a discrete random variable
C. the standard deviation of a discrete random variable
D. the expected value of a discrete random variable
For numbers 9-12, refer to the probability distribution of rolling a single
unfair die.
𝑥 1 2 3 4 5 6
𝑃(𝑥) 0.1 0.1 0.1 0.5 0.1 0.1
9. What is the mean of the probability distribution?
A. 2.5 B. 3.7 C. 4.1 D. 5.7
10. What is the variance of the probability distribution?
A. 1.81 B. 2.34 C. 3.70 D. 4.26
11. What is the standard deviation of the probability distribution?
A. 1.07 B. 1.35 C. 1.92 D. 2.06
12. How would you interpret the mean value that you get in item number 9? A.
The mean value is the difference between each probable value of the
outcome when you roll the unfair die in numerous trials.
B. The mean value is the higher probable value of the outcome when you roll
the unfair die in numerous trials.
C. The mean value is the lowest probable value of the outcome when you roll
the unfair die in numerous trials.
88
D. The mean value is the closest value to the most probable value of the
outcome when you roll the unfair die in numerous trials.
13. Which of the following is NOT a property of the variance? A. The variance is not
equal to the standard deviation.
B. A small variance means that the distribution of the random variable is
narrowly concentrated around the mean.
C. A large variance means that the distribution is spread out, with some
chance of observing values at some distance from the mean. D. The value of
the variance is less than zero.
For numbers 14-15. The mean of the probability distribution below is equal to 18.2
with a variance of 5.86 and a standard deviation of 2.42.
Number of cellphones sold per
day in a retail store (𝑥) 15 18 19 20 22
15. How would you interpret the values of variance and standard deviation?
A. It gives the difference between the highest number of cellphones and the
least number of cellphones that can be sold.
B. It is the average number of cellphones that can be sold in a day.
C. It describes how the data or the number of cellphones sold in a day varies.
D. No interpretation can be made about the variance and standard deviation.
What’s In
What’s the formula?
The following formulas were introduced to you in the previous lessons, write the
correct formula in each box to complete this task.
89
Situation 2: I went to the grocery store and I noticed that there are five different
kinds of brown sugar at different prices, each brand has different probabilities that
they will be sold. What formula should I use if I want to determine how the prices
vary from the average price of the product that would be sold to the customers
repeatedly?
Solution:
Situation 2: The correct formula is 𝜎 = √[𝑥2 ∙ 𝑃(𝑥)] − 𝑢2, because it is the formula to
be used if you want to find the standard deviation of a discrete random variable.
Even though formulas 2 and 3 (formulas to find the variance) measures also the
variability of the values assumed by the random variable X (prices of brown sugar)
in repeated trials (sold to the customers repeatedly). The units of standard deviation
match those with X because standard deviation is a square root of the variance.
What’s New
An Unfair Die
Suppose an unfair die is rolled and let X be the random variable representing the
number of dots that would appear with a probability distribution below.
Outcome 1 2 3 4 5 6
(𝑥)
Probability 0.1 0.1 0.1 0.5 0.1 0.1
P(𝑥)
Questions:
What Is It
Recall that in your previous lesson, you learn how to calculate the mean, variance,
and standard deviation of a discrete random variable. So, with those skills, you will
be able to answer the questions in the previous activity.
First, let us find the mean of the probability distribution using the formula.
𝜇𝑥 = ∑[𝑥𝑖 ∙ 𝑃(𝑥)]
𝜇𝑥 = 3.7
Therefore, the mean of the probability distribution is 3.7. Since the mean is equal to
3.7, we can say that the average number of dots that would appear is a number
near to 3.7 which is 4. If we will repeat the experiment over and over, there is a
90
higher possibility the 4 dots will always appear since the average number of dots
that would appear is 4.
To determine the variability of the assumed values from the mean, use the formulas
for finding variance and standard deviation. But first, construct a table like the one
below:
Therefore, the variance of the random variable X (the number of dots appeared) is
equal to 1.81 while the standard deviation is equal to 1.35. Take note that, small
variance or standard deviation means that the assumed values or data points tend
to be very close to the mean, while higher variance or standard deviation means
that the assumed values or data points are spread out from the mean. Specifically,
the variance and standard deviation measures or describes how far a set of data
(assumed values of random variables) is spread out. Since the value of the standard
deviation is 1.35, we can say that the assumed values of each outcome are
somewhat close to the mean for about 1.35 units from the mean.
Knowing this information, you may now decide whether you will join a game using
an unfair die. Make sure you will think more critically and logically before you
make a decision and remember being fair and just is a virtue.
Bear in mind that the mean of a discrete random variable is just the average value
of a random variable over repeated trials of experiment and in the next lesson it will
be discussed further and its application to the real-life situation.
To understand more the concept of the variance and standard deviation, the
following are examples that will give you a deeper idea about the concept of
variance and standard deviation.
The first data in the following examples is somewhat expected to have a higher
variance and standard deviation or variability compare to the other data.
Example 1
The population size of 100 different countries around the world compare to the
population of 10 countries belong to the first world countries
Due to a large number of samples (100 different countries) and other factors such
as land area, economic status, etc., you may say that most likely you will get a
higher variance and standard deviation compared to the population of 10 countries
belong to the first world countries
Example 2
The number of students in different universities compares to the number of
students in a class in a particular school.
91
Considering different universities with different size and course offerings, you may
say that the difference between their enrolments will be much higher than the
difference of each class size in a particular school
Example 3
The number of people who love to play basketball in different barangays to the
number of basketball players from different teams who play in the first-quarter
game.
Basketball is a famous game and almost every guy loves to play this game. Imagine
that you will gather the number of people who love to play from different
barangays, it is expected that you will gather different results compared to the
number of basketball players from different teams who play in the first-quarter
game. Almost everybody is aware that there are only five players from each team to
play a game, therefore if you will compute the variance of the data from different
teams you will get a value of zero. The mean will be equal to 5 because each team
who play the game will should have 5 players only, at the same time if you will
compute the variance of distribution with equal inputs you will get a value of zero.
The mean and variance of a discrete random variable are easy to understand if you
know the concepts behind these two measurements. As you go on with the different
modules in Statistics and Probability, mean and variance will always be included in
the discussion. The mean, variance, and standard deviation are important in
statistics because they serve as the basis for other types of statistical calculations
or statistical tests.
What’s More
In the following activities, read each situation carefully to solve each problem.
Write your answer on a separate sheet of paper.
Practice Activity 1
The number of cellular phones sold per day at the E-Cell Retail Store with the
corresponding probabilities is shown in the table below. Compute the mean,
variance, and standard deviation and interpret the result.
Number of cellphones sold
per day in a retail store (𝑥) 15 18 19 20 22
92
𝜎2 = ∑[𝑥2 ∙ 𝑃(𝑥)] − 𝑢2 = _______ − ________ = ___________
4. Number of M & M’s peanuts inside a 1.69 0z bag from different retail
stores.
Number of kernels in corn of different sizes.
93
2. The variance and standard deviation of a discrete random variable measured or
described the ____________________ of the assumed values of the random variable
to the mean.
3. A small variance or standard deviation means that the assumed values or data
points tend to be ___________________to the mean.
4. A higher variance or standard deviation means that the assumed values or data
points are __________________ from the mean
What I Can do
Oh, what a game!
Things to do:
Device a game of chance using a fair die with corresponding winnings and losses.
For example, in rolling a die you may assign even outcomes as winning outcomes
and the odd outcomes as the losses or think of your own plan. Let X the amount
to be won by a player or the amount he will lose when he plays your game.
Compute for the mean or the average amount a player may win or lose, the
variance and the standard deviation, and interpret the results. Don’t forget that
each face of a die has a probability of . Be prepared to describe your game and
demonstrate it in our class next meeting or you may prepare a video of your
demonstration to be submitted online.
Rubrics for the task:
4 3 2 1
Categories Excellent Satisfactory Developing Beginning
95
𝑃(𝑥) 0.1 0. 0.4 0.2 0.1 0.1
1
9. What is the mean of the probability distribution?
A. 25 B. 29 C. 30 D. 32
10. How would you interpret the mean value that you get from item number 8?
A. It is the least number of books borrowed from the school library in a day.
B. It is the largest number of books borrowed from the school library in a day.
C. It is the average number of books borrowed from the school library in a day.
D. It is the difference between the largest and the least number of books
borrowed from the school library in a day.
11. What is the variance of the probability distribution?
A. 38 B. 40 C. 43 D. 46
12. What is the standard deviation of the probability distribution?
A. 6.16 B. 6.32 C. 6.56 D. 6.78
13. Which of the following is NOT a property of the variance?
A. A small variance means that the distribution of the random variable is
narrowly concentrated around the mean.
B. A large variance means that the distribution is spread out, with some chance
of observing values at some distance from the mean. C. The variance is a
value that is always positive.
D. The variance is a value that is always negative.
For numbers 14-15. The mean of the probability distribution below is equal to
37.05 with a variance of 36.75 and a standard deviation of 6.06.
Number of ice
candy sold per
30 32 36 40 42 45
day in a retail
store (𝑥)
Probability (𝑃(𝑥) 0.30 0.10 0.15 0.10 0.10 0.25
14. How would you interpret the mean value of 37.05?
A. The least number of ice candy that will be sold in a day is 37 pieces.
B. The highest number of ice candy that will be sold in a day is 37 pieces.
C. The average number of ice candy that will be sold in a day is 37 pieces.
D. No interpretation can be made about the mean value of 37.05.
15. If you are the owner of the retail store, how many ice candies will you
prepare to ensure that you can supply the demands of your customers
every day?
A. 10 pieces and below C. 21-30 pieces
B. 11-20 pieces D. 30 pieces and above
Additional Activities
This activity is design for you to gain a deeper understanding of the mean,
variance, and standard deviation. It will not just help you to practice your skill
but it will give you a chance to compare the results with theoretical
explanations.
Try it and enjoy it!
Things to do:
Roll a die 50 times. Record the result and construct a probability distribution to
find the mean, variance, and standard deviation. Compare the results with
theoretical results:
Take Note of This:
The probability distribution of rolling a die is given below, this will serve as your
theoretical result.
Outcome (𝑥) 1 2 3 4 5 6
96
Probability of the 1 1 1 1 1 1
Outcome
𝑃(𝑥) 6 6 6 6 6 6
Statistics and
Probability
Solving Problems Involving Mean
and Variance of Probability
Distributions
Statistics and Probability
Solving Problems Involving Mean and Variance of Probability Distributions
First Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
97
Development Team of the Module
Week Writer: Elizabeth D. Lalunio
Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Nenita N. De Leon and Ma. Theresa B. Yanto
3 Illustrator: Kelvin A. Ramintas and Dhojie N. Yanto
Layout Artist: Edna E. Eclavea
Management Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Team:
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann
O. Briones Celestina M. Alba, Asuncion C. Ilao
Telefax: 02-8682-5773/8684-4914/8647-7487
What I Know
Before studying this module, take this test to determine what you already know about
the topic covered.
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
1. Which of the following represents the mean value of a discrete random variable?
98
A. Expected value C. Standard deviation
B. Median D. Variance
For numbers 2-3, refer to the table below:
𝑥 2 4 6 8
𝑃(𝑥) 0.12 0.29 0.24 0.35
2. Which of the following is the correct formula to find the expected value of the
probability distribution?
5. If X denotes the net gain from the purchase of a randomly selected ticket.
Which of the following tables of values shows the probability distribution of
X?
x 675,000 -500
A. P(x) 1 1999 C. x 674,500 -500
P(x) 1 1999
2000 2000
x 2000
674,500 2000
500
x 675,000 500 P(x) 1 1999
B. V D.
P(x) 1 1999
2000 2000
6. What 2000 2000 is the mean or the expected value for those
who will buy the tickets?
A. -162.50 B. -162.25 C. 162.25
D. 162.50
For numbers 7-8, refer to the following:
One thousand tickets are sold for ₱10.00 each. One ticket will win ₱2,000.00, two
tickets will win ₱1,000.00 each and four tickets will win ₱500.00 each.
7. What is the probability of winning any amount in the purchase of two tickets?
A. C.
D.
99
A. x 195 -5 C x 50 .
-5
P(x) P(x)
B. x 5 -5 D. x 200 5
P(x) 1 4
P(x)
5 5
10. What is the mean or the expected value for those who take a bet?
A. – 1.8 B. -1 C. -0.95 D. -0.5
11. How much the organizer will earn if 100 games will be played such that each
number has a bet of ₱5.00?
A. ₱2,000.00 B. ₱3,000.00 C. ₱4,000.00 D. ₱5,000.00
For items 12-13, refer to the following:
Good Hands is a car insurance company who offers to pay ₱600,000.00 if a car is
destroyed beyond repair due to an accident. This insurance policy cost ₱25,000.00
per year and the probability that the company will need to pay the amount of
insurance is 0.003.
12. What is the probability that the company will NOT pay any amount to its
client within a year?
A. 0.003 B. 0.250 C. 0.500 D. 0.997
13. What is the expected value of the insurance to its buyer?
A. Each policyholder expects to lose ₱20,000.00 every year. B.
Each policyholder expects to lose ₱22,500.00 every year.
C. Each policyholder expects to lose ₱23,200.00 every year.
D. Each policyholder expects to lose ₱25,000.00 every year
100
What’s In
For you to begin let us recall first the previous lessons which are very
important for you to succeed. From your last module, you learned that the
expected value or mean value of a discrete random variable, denoted by x, is
computed by first multiplying each possible x value by the probability of observing
that value and then adding the resulting quantities. Symbolically,
𝐄(𝐱) = 𝐮𝐱 = ∑[𝐱𝐢 ∙ 𝐏(𝐱𝐢)]. where:
𝑥𝑖 is the value of the random variable for the outcome 𝑖;
𝑃(𝑥𝑖) is the probability that a random variable will have an outcome 𝑖; 𝑢𝑥 is
the mean of a random variable 𝑥; and 𝐸(𝑥) is the expected value of random
variable 𝑥.
Also, you already learned the equation for computing the variance of a discrete
random variable. To find the variance, remember the formula:
𝜎2 = ∑(𝑥 − 𝜇)2 ∙ 𝑃(𝑥)
𝜎2 = [∑ 𝑥2 ∙ 𝑃(𝑥)] − 𝑢2
where:
𝑥2 is the square value of a discrete random variable 𝑥;
𝑃(𝑥) is the probability of an outcome of a discrete random variable;
𝑢2 is the square of the mean of a discrete random variable; and
𝜎2 is the variance of a discrete random variable
Before you proceed to the next lesson, study the example below, and reflect whether
you are now ready to proceed to the new lesson.
Example
The table below shows the probability distribution of the number of girls in a family
of three children in Barangay Maligaya. Calculate the mean and variance of the
random variable with the given probability distribution.
Outcome or GBB,BGB, GGB, GBG,
BBB GGG
Sample Point BBG BGG
𝑥 0 1 2 3
101
Therefore, the mean or the expected value of the discrete random variable is
1.5.
To find the variance and standard deviation of the discrete random variable you
need to prepare a table like the one below.
𝑥 𝑃(𝑥) 𝑥2 𝑥2 ∙ 𝑃(𝑥)
0 1⁄8 0 0
1 3⁄8 1 3⁄8
2 3⁄8 4 12 3
⁄8 𝑜𝑟 ⁄2
3 1⁄8 9 9⁄8
From the table, the variance and standard deviation can be computed using:
Therefore, the variance of the discrete random variable is 0.75, while the
standard deviation is 0.87.
What’s New
GAME OF CHANCE
Read the conversation below and answer the questions that follow.
102
Oh no! I do understand but if
you want to take a chance I
Our television set is no have here 5 raffle tickets out of
Hey Romulo I longer functional and I 1000 released tickets wh ich
Ok, so what’s
have a problem. your problem need to buy a new one. cost ₱100.00 each. If you will
my friend? But the problem is I don’t be lucky enough, you’ll get a
have enough budget for 32-inch LED Television worth
that. ₱15,000.00.
Questions:
103
6. How would you describe Romulo as a friend?___________________________
What Is It
To answer question 1 in the previous activity, you have to understand first
the consequences of buying a ticket, will it give you advantages and disadvantages?
Can you afford to spend extra money to buy a ticket?
Look at this way, if you will buy a ticket you will have a chance to win the
prize worth ₱15,000.00 if you are lucky enough it is good for you. But on the other
hand, what if you are not lucky, then you will lose ₱500.00 for buying the five
tickets. Considering this amount (₱500.00), you are aware that you can buy more
essential needs like foods, school and medical supplies, and other miscellaneous
needs with this amount rather than using it to buy tickets. However, if you
allocated extra money for other unexpected expenses, buying a ticket is not bad at
all. The point of making the right decision is not difficult if you know how to
consider what is more important.
If you are Cardo, you need to understand the given details on the situation for you
to make the right decision. If you will win the prize, your net gain is ₱15,000.00
less ₱500.00 (amount spend for buying 5 tickets) or ₱14,500.00. The probability that
you will win the prize is five out of one thousand tickets or . But if you decide to
buy one ticket only, then your net gain is ₱15,000.00 less ₱100.00 (amount spend
for buying 1 ticket) or ₱14,900.00 and the probability that you will win the prize is
one out of one thousand tickets or . The more tickets you buy, the more
chances of winning. However, your chance of losing is higher than your chance of
winning because it is equal to 1 less than the probability of winning if you buy five
tickets the
995 probability that you will not win the
prize is or 0.995 while if you buy one ticket only, the probability is or
0.999.
What if 1000 tickets were purchased by different individuals, what is the
expected value of buying one ticket? To answer this question, you need to apply the
formula for the mean or expected value of the discrete probability distributions
which is equal to 𝐄(𝐱) = 𝐮𝐱 = ∑[𝐱𝐢 ∙ 𝐏(𝐱𝐢)]. The expected value is also defined as the
average value of a random variable over numerous trials of an experiment. The
table below is the probability distribution of the given situation.
104
insist on what he wants but let his friend decide on his own after giving a piece of
advice. You are blessed if you will find a true friend that you can rely on. Having a
true friend is wealth.
What’s More
In the following activities, read each situation carefully to solve each problem. Write
your answer on a separate sheet of paper.
Practice Activity 1
b. Find the probability of winning any amount in the purchase of one ticket.
Let 𝑃(𝑊) the probability of winning any amount
The expected value is equal to the product of net gain and the
probability of winning (in this case you have three chances of winning,
you may win the first, second, or third prize) plus the product of net loss
and the probability of losing.
𝐄(𝐱) = _______________
Interpretation:__________________________________________________________
________________________________________________________________________
________________________________________________________________________
________________________________________________
Independent Assessment 1
Practice Activity 2
105
Game of Chance Problem (A Roulette Wheel)
A roulette wheel in a fiesta carnival has the numbers 1 through 30. If you bet
₱5.00 you will have a chance to win a prepaid load worth ₱100.00. Let X denote the
net gain for a bet.
𝑥 95 -5
𝑃(𝑥)
b. Find the expectation if you play a bet and interpret the result.
𝐄(𝐱) = _______________
Interpretation:__________________________________________________________
________________________________________________________________________
______________________________________________________
c. How much the organizer will earn if 100 games will be played such that each
number has a bet of ₱5.00? ____________________________
(Hint: Compute for the net gain of the organizer per game and multiply it by
100)
Independent Assessment 2
Colored Wheel
A roulette wheel is divided into eight colors blue, red, yellow, green, orange,
violet, black, and white. Each time roulette is spun, you can have a bet of ₱10.00. If
the outcome is either black, blue, or violet you will lose the bet. But if the outcome
is red or yellow, your money will be doubled (you will win additional ₱10.00 plus
your bet), and if it is green or orange you will win nothing and your bet will be given
to you. But if the result is white, your money will be tripled (you will win an
additional ₱20.00 plus your bet. If X denotes the net gain of a player what is the
expected value of a ₱10.00 bet?
Practice Activity 3
𝑥 500 -249,500
𝑃(𝑥)
b. Find the expectation if you play a bet and interpret the result.
𝐄(𝐱) = _______________
Interpretation:_____________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
106
Independent Assessment 3
Compare your answers with those found in the Answer Key on the last part
of this module. Did you get all items correct? Great! If you did, you learned the
lesson well. If you were not able to get the correct answer, review your
computations.
_______________________________________________________________________________
_______________________________________________________________________________
_______________________________________________________________________________
What I Can Do
Raffle for a Cause!
Things to do:
Make a plan for a Christmas raffle bonanza in which the PWD (Person with
Disability) will be the beneficiaries. The plan should consist of the number of
tickets to be sold, the amount per ticket, and the number of winners and their
corresponding prizes. Let X be the random variable representing your net gain per
ticket, and compute the expected value of X and the net gain if all tickets will be
sold out. Also, using your creativity prepare a sample of your ticket. You may use
the sample guide to complete your task.
Name of Project
Brief Description
Expected Value of X:
107
(Please show the probability
distributions and your solution)
Net Gain of the Project:
Sample Ticket:
1. Which of the following represents the expected value of the discrete random
variable?
A. mean C. mode
B. median D.
variance
For numbers 2-3, refer to the table below:
𝑥 1 2 3 4
𝑃(𝑥) 1 2
108
5 5
2. What is the expected value of the given probability distribution?
A. C. 2
B. D.
A. 2 C. 2
1000
B. D. 998
1000
5. Which of the following tables of values shows the probability distribution of X?
A.
x 10,000 5,000 1,000 -50
P(x) 1 2 3 994
B. x 10,000
1000 5,000
1000 1,000
1000 -50
1000
P(x)
6. What is the mean or the expected value of a person who buys a ticket?
A. - 25 C. 27
B. - 27 D. 25
109
You buy three ₱1,000.00 raffle tickets for a prize of a new 20-passenger Sarao
jeepney valued at ₱800, 000.00. Two thousand tickets are sold.
2000
B. D. 4
2000
8. What is the probability that you will NOT win the prize? (refer to item 7)
A. C. 997
2000
B. D.
For numbers 9-11, refer to the following:
A roulette wheel in an amusement park has the numbers 1 through 60. If you bet
₱100.00 for a randomly chosen number from 1 to 60, you will have a chance to
win a cellular phone worth ₱5,000.00.
9. If X denotes the net gain from taking a chance to bet on a randomly
selected number. Which of the following tables of values shows the
probability distribution of X?
A. X 5000 -1000 C x 60 -1 .
P(x) 59 P(x)
x 4900 -1000
P(x) 59 x 100 -1
P(x)
60
B. D.
60
10. What is the mean or the expected value for those who take a bet?
A. – 10.12 C.
-16.67
B. -15.40 D.
-18.25
11. How much the organizer will earn if 100 games will be played?
A. ₱50,000.00 C.
₱150,000.00
B. ₱100, 000.00 D.
₱250,000.00
110
13. How much is the net loss of the insurance company from a policyholder if
his family will avail of the insurance benefits?
A. ₱1,000,000.00 C.
₱998,000.00
B. ₱999, 000.00 D.
₱995,000.00
14. Which of the following tables of values shows the probability distribution of X,
if each policyholder has a 99.95% chance of surviving within three years?
A. x 2,000 C.
-1,000,000 x 2000 -998,000
B. P(x) 0.9995 0.0005D. P(x) 0.9995 0.0005
x 2000 -998,000
x 2,000
1,000,00 P(x) 0.9995 0.0005
0
P(x) 0.9995 0.0005
Additional Activities
If you want to try more, these activities are for you. It will help you to practice your
skill in solving real-life problems involving mean or expected value and to make
wise decisions on matters like taking a chance in lottery, starting your own
business and considering investment.
Study and analyze each situation to solve the problem.
1. Game of Chance (Lottery)
A lotto works by picking 6 numbers from 1- 42 (Combinations of numbers from 1-
42 taken 6 at a time produce 5,245,786 number combinations). A ticket costs
₱24.00 to play the lottery. If you win today, you would win 5 million pesos after
taxes. If you play the lottery today, what would be your expected winnings or
losses?
2. Business Opportunity
The net gain for a day of Joy in a Cup Bread and Pastry Shop depends on the
number of cupcakes sold within a day. If X is the amount of income within a day
for five consecutive days. Compute the mean or expected earnings for a day and the
variance. The probability of X is shown below.
3. Investment
A person may earn ₱100,000.00 by investing in the stocks of an international
company with a probability of 0.40 or lose ₱35,000.00 over the same period with a
probability of 0.60. Let X denote the net gain of a person who will invest in the
company, construct the probability distribution of X, and compute for the expected
value of a person who will invest in the same company. Interpret the result
How well did you perform in this lesson?
111
Are you now ready for the next lesson?
If your answer is Yes, I am HAPPY for you. Good Luck and happy learning on your
next journey. 😊
Statistics and
Probability
Illustrating a Normal Random
Variable and its Characteristics
Statistics and Probability
Illustrating a Normal Random Variable and its Characteristics
First Edition, 2020
112
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
Telefax: 02-8682-5773/8684-4914/8647-7487
113
acquired about the percentage, mean and standard deviation is very useful to
attain the target objectives of this module
Week
After going through this module, you are expected to:
1. define a normal random variable;
2. illustrate a normal random variable and its characteristics; and
3
3. state and apply the concepts of the empirical rule or 68%-95%- 99.7% of the
distribution.
What I Know
Let us see how far you will go and how much you know about this pretest. Please
answer all questions with your best and utmost honesty so that we can gauge your
learning in the target competency of this lesson. Let’s get started!
Direction: Read and analyze each item carefully. Choose the letter of the best
answer.
Write the chosen letter on a separate sheet of paper.
1. The total area under the normal curve is _________.
A. -1 B. 0 C. 0.5 D. 1
2. The normal curve is bell-shaped.
A. False B. True C. Sometimes D. It depends
3. Which part of the normal curve is extended indefinitely in both directions along
the horizontal axis, approaching but never touching it?
A. center B. tail C. top D. spread
4. According to the property of a Normal Probability Distribution, the mean is
equal to what measures of central tendency?
A. median B. mode C. both A and B D. only A
5. Which of the following rules state that almost all data fall within the 1, 2, and 3
Standard Deviation of the Mean when the population is normally distributed?
A. Empirical rule
B. Lottery rule
C. Pascal’s triangle rule
D. Sampling rule
6. Under the curve, the area to the left of the mean is_________
A. 30% B. 47.72% C. 50% D. 68%
7. Using the empirical rule, the area under the normal distribution curve that lies
within three standard deviations from the mean is approximately 95%.
A. False
B. True
C. Sometimes
D. None of the Above
8. Which of the following statements is CORRECT about the shape of the normal
curve?
A. The shape of the normal curve is based on the given Mean and Standard
Deviation of the distribution.
B. The shape of the normal curve is based on the given Mean and Median of
the distribution.
C. The shape of the normal curve is based on the given area and standard
Deviation of the distribution.
D. All of the above
9. Which of the following graphs is the BEST example of the normal curve?
114
A. B. C. D.
10. Based on the empirical rule, the bell-shaped distribution will have
approximately 68% of the data within what number of Standard Deviations
from the mean?
A. 0 B. 1 C. 2 D. 3
11. If the average age of retirement for the population in the Philippines is 65 years,
with a standard deviation of 5 years, what is the approximate age range in
which 68% of people retire?
A. 60 – 70 years B. 55 – 65 years C. 55 – 60 years D. 60 -65 years
12. If the mean and the standard deviation of a continuous random variable that is
normally distributed are 10 and 3 respectively, find an interval that contains
68% of the distribution.
A. [7, 13] B. [4, 13] C. [7, 16] D. [3, 10]
13. According to the 68%-95%-99.7% rule or better known as an empirical rule, the
standard deviation and mean interval that covers approximately 95% of the
data from a frequency distribution is____
A. 2µ ± 2ơ B. µ ± ơ C. µ ± 2ơ D. µ ± 3ơ
14. The first quarterly assessment result of Grade 11- HUMSS students in their
Statistics and Probability subject had scored with a mean of 40 and a
standard deviation of 8. A histogram of the data shows that the distribution is
normal.
Use the Empirical Rule to find the percentage of scores between 24 and 32.
A. 13.5% B. 68% C. 81.5% D. 95%
15. Suppose the distribution of data about the recovered victims of COVID 19
daily has a mean of 145 and a standard deviation of 22. How many standard
deviations away from the mean is a value of 101?
A. It is one standard deviation above the mean.
B. It is one standard deviation below the mean.
C. It is two standard deviations above the mean.
D. It is two standard deviations below the mean.
How did you find this pretest? Did you encounter both familiar and
unfamiliar terms, symbols, and notations? Kindly compare your answer in the
Answer Key on the last part of this module.
If you obtain 100% or a perfect score, skip the module and immediately move
to the next module. While in the event you missed a point, please proceed with the
module as it will enrich your knowledge in the normal distribution, especially in
illustrating the normal random variables and its characteristics.
What’s In
Let’s have a review first. Read the following statements carefully and determine
whether it is TRUE or FALSE. Write the word TRUE if it is correct and FALSE if it is
incorrect, then modify the statement to make it true.
115
________________1. Probability is the value greater than or equal to zero but less
than or equal to one.
________________2. Discrete variables are the infinite numerical values like
heights, weights, distance and length ________________3. 34%
is also equal 0.34.
________________4. Mean, mode and standard deviation are the measures of central
tendency.
________________5. Mean is equal to the summation of scores divided by the
number of cases.
What’s New
A. The distribution of the height (X) in centimeter (cm) of the 16 teachers of
SCNHS was presented below. Construct a histogram for the random
variable (X).
X F
138 1
139 2
140 3
141 4
142 3
143 2
144 1
B. The following items show four pictures captured by the author during
his travel in the different places and constructed using GeoGebra
application which represents a common object. Analyze the group of
pictures carefully then identify the common object they represent.
Write your answer by filling in the empty boxes below each item. A
hint is also provided for you.
1. 2. 116
4. 5.
3.
Can you define or describe each word that you have obtained in this activity?
1. ____________________________________________________________
2. ____________________________________________________________
3. ____________________________________________________________
4. ____________________________________________________________
5. ____________________________________________________________
What is it
How do you find the activity? How can you describe the graph you have made in
Activity A? Are the terms or words you obtained in Activity B will describe the
graphs in Activity A?
In the activity above, the distribution of the height of the teachers is one of the
examples of the distribution of a normal random variable because the set of
numerical values is normally distributed. Height is also described as a continuous
random variable. You have also noticed that the mean of the distribution can be
found at the center which is 141 cm and the distribution from the center to the
right is a mirror of the distribution to the left as shown in the figure and table
below. See Figure 1.
117
Normal Probability Distribution is a probability distribution of continuous
random variables. It shows graphical representations of random variables obtained
through measurement like the height and weight of the students, the percentile
ranks of the A&E (Accreditation and Equivalency) Test result of the Alternative
Learning System students, or any data with infinite values. It is used to describe
the characteristics of populations and help us visualize the inferences we make
about the population. It also used to determine the probabilities and percentile of
the continuous random variables in the distribution. For example, your grades in
Mathematics is one of the scores in the distribution, you can predict the location of
that score in the distribution and interpret it with regards to the mean and
standard deviation.
Properties of Normal Curve
The graphical representation of the normal distribution is popularly known
as a normal curve. The normal curve is described clearly by the following
characteristics
1. The normal curve is bell-shaped.
2. The curve is symmetrical about its center. This means that, if we draw a
segment from the peak of the curve down to the horizontal axis, the segment
divides the normal curve into two equal parts or areas.
3. The mean, median, and mode coincide at the center. This also means that in
a normal distribution, or a distribution described by a normal curve, the
mean, median, and mode are equal.
4. The width of the curve is determined by the standard deviation of the
distribution.
5. The tails of the curve are plotted in both directions and flatten out
indefinitely along the horizontal axis. The tails are thus asymptotic to the
baseline. A portion of the graph that is asymptotic to a reference axis or
another graph is called an asymptote, always approaching another but never
touching it.
6. The total area under a normal curve is 1. This means that the normal curve
represents the probability, or the proportion, or the percentage associated
with specific sets of measurement values.
118
A normally distributed random variable with a mean µ = 0 and standard
deviation ơ = 1 is called a standard normal variable. It is presented using standard
normal distribution where the center of the curve is zero, which is mean and added
one unit from the center to the right and subtract one unit from the center to the
left. This is shown below in figure 3.
The shape of a normal curve is based on the two given parameters, the mean
and the standard deviation of the distribution. When comparing two distributions
each described by the normal curve, the following are the three situations based on
the said parameters
a. When the means are not equal, but the standard deviations are equal. (µ 1
≠ µ2 ; ơ1 =ơ2 ), the curves have a similar shape but centered at different
points, as shown in figure 4.
c. When the means are different and the standard deviations are also
different (µ1≠ µ2 ; ơ1 ≠ ơ2 ), the curves are centered at different points and
vary in shapes, as shown in figure 6.
119
Fig. 6: The Means are Different and the Standard
Deviations are also Different
EMPIRICAL RULE
Using the empirical rule, you can say that the distribution of the grades of
the Senior High School in Statistics and Probability are distributed normally
because the shape of the graph is bell-shaped and symmetric about its mean,
which is 87. This also indicates that most of the students’ grades fall close to the
mean and fewer of them are far below and/or above the mean
It can summarize the distribution in the following percentage:
• 68% of data lies within 1 standard deviation from the mean have a
grade of 83 to 91
• 95% of data lies within 2 standard deviations from the mean have a
grade of 79 to 95
120
• 99.70% of data lies within 3 standard deviations from the mean have
a grade of 83 to 9 Illustrative Example 1:
The scores of the Senior High School students in their Statistics and Probability
quarterly examination are normally distributed with a mean of 35 and a standard
deviation of 5.
Answer the following questions:
a. What percent of the scores are between 30 to 40?
b. What scores fall within 95% of the distribution?
Solution:
Draw a standard normal curve and plot the mean at the center. Then, add the
standard deviation to the mean once and mark it to the right of the mean. Add
twice the standard deviation to the mean and put it to the right of the first sum.
Then, add thrice the standard deviation to the mean and mark it to the right of the
second sum. Do the same to the left. This time, instead of adding, subtract the
standard deviation from the mean. An illustration is provided below in figure 7:
121
Fig. 8: Distribution of BMI of the Alternative Learning System Students
Solution: The mean of this problem is 160, it is subtracted by 7 to the left (e.g. 160
-7 = 153; 153-7 = 146; 146 – 7 = 139) and add 7 to the right. (e.g. 160 + 7 = 167;
167 + 7 = 174 + 7 = 181).
As stated in figure 8, 153 cm falls at 1 standard deviation from the mean to the left
and the height of 174 cm falls at 2 standard deviations from the mean to the right.
Therefore, it covers the whole 68% and 13.5%. of the distribution and the sum of it
is 81.5%
ANSWER: 81.5% of the male students have a height between 153 cm to 174 cm.
What’s More
Activity 1.
A. True or False. Write the word TRUE if the statement is correct and FALSE
if the statement is incorrect
_______1. The total area of the normal curve is 1.
_______2. Normal probability distribution is used to determine the
probabilities of a discrete random variable.
_______3. The mean, median and mode in a normal distribution have
the same value.
_______4. The distance of the number in the horizontal line or the x-axis
should always proportional.
_______5 The normal probability distribution has a mean µ = 1 and
standard deviation ơ = 0.
Activity 2
A. Use Empirical rule to complete the following table. Write on the respective
column the range or interval of the scores based on the given parameters.
122
The government launched Enhanced Community Quarantine on March 15,
2020, as a preventive measure to fight the spread of COVID-19 in the country.
Many of us experienced difficulties in getting our daily needs like food or getting
money to pay for our bills and rentals. Our government gave financial support and
food, but it is not enough for the people. Fortunately, we are blessed because the
spirit of love and generosity is still present in the heart of our fellow countrymen.
They donated money, food, time, and other essential things.
Suppose you and your friends conducted a donation-drive in your barangay.
Each of the donors gave a certain amount with an average of Php 320.00 and a
standard deviation of Php 60.00. Assuming, it is normally distributed,
1. What percentage of the distribution that the amount falls between Php.
260.00 and Php 380.00?
2. How much of the donations fall in 95% of the distributions?
3. How much of the donations fall in 99.70% of the distributions?
What I Can Do
MAKE A SURVEY
Materials Needed
1. Cellphone` 4. Calculator
2. Computer/Laptop 5. Short Bond Paper
3. Pen/pencil
Instructions
1. Contact at least 20 of your classmates through SMS or messenger apps.
2. Ask them for the result of their pre-test (What I Know) or pre-assessment
of this module
3. Compute the mean and standard deviation of the collected data includes
your own score.
4. Use the empirical rule, to answer the following questions
a. What is the interval or range of scores falls within 68%?
123
b. What is the interval or range of scores falls within 95%?
c. What is the interval or range of scores falls within 99.7%?
d. What is the interval or range of scores falls within 81.5%?
e. In which interval away from the mean does your score belong?
5. Illustrate the distribution of the data you obtained from this survey through a
diagram and put a mark on the interval where your score located.
RUBRICS
Assessment
Direction: Read and analyze each item carefully. Choose the letter of the best
answer.
Write the chosen letter on a separate sheet of paper.
1. What is half of the total area of the normal curve?
A. -1 B. 0 C. 0.5 D. 1
2. What is the shape of a Normal Probability Distributions?
A. bar B. bell C. circle D. line
3. Which part of a normal curve is asymptotic with the horizontal axis?
A. center B. curve C. tail D. top
4. Which of the following statements is the characteristic of a normal probability
distribution?
A. The three measures of central tendency such as mean, mode, and median
are equal.
B. The three measures of central tendency such as mean, mode, and median
are not equal
C. The two measures of central tendency such as mean and mode are equal
while the median is greater than zero
D. The two measures of central tendency such as mean and mode are equal
while the median is less than zero.
5. Which of the following statements describe the concepts of empirical rule?
A. It states that the data in the distribution lies within one (1), two (2), and
three (3) of the standard deviation from the mean are approximately 68%,
95% and 99.70%, respectively .
B. It states that the data in the distribution lies within 100%.
C. It states that the data in the distribution lies within one (1), two (2), and
three (3) of the standard deviation from the mean are exactly 68%, 95%
and 99.70%, respectively.
D. It states that the data is asymmetric.
6. Under the curve, the area to the right of the mean is_________
A. 30% B. 47.72% C. 50% D. 68%
7. Which of the following parameters is the basis of the shape of the normal
curve?
A. mean B. standard deviation C. Variance D. Both A and
B 8. What is the equivalent of the mean of the following graph?
A. 0 B. 1 C. 2 D. 3
9. Using the empirical rule, the area under the normal distribution that lies within
three standard deviations of the mean is approximately_____.
A. 68% B. 95% C. 97% D. 99.7%
10. Based on the empirical rule, the bell-shaped distribution will have
approximately 68% of the data within what number of standard deviations of
the mean?
A. 0 B. 1 C. 2 D, 3
124
11. If the average age of retirement for the population in the Philippines is 65
years and with a standard deviation of 5 years, what is the approximate age
range in which 95% of people retire?
A. 60 – 70 years B. 55 – 65 years C. 55 – 60 years D. 55-
75 years 12. If the mean and the standard deviation of a continuous random
variable that is normally distributed are 10 and 3, respectively, find an
interval that contains 99.7% of the distribution.
A. [7, 16] B. [4, 16] C. [7, 16] D. [1, 19] 13.
According to the 68%-95%-99.70% rule, the standard deviation and mean
interval that covers approximately 99.70% of the data from a frequency
distribution is____
A. 2µ ± 2ơ B. µ ± ơ C. µ ± 2ơ D. µ ± 3ơ
14. The first quarterly assessment result of Grade 11- STEM students in their
Statistics and Probability subject had scored with a mean of 40 and a standard
deviation of 8. A histogram of the data shows that the distribution is normal.
Use the Empirical Rule to find the percentage of scores between 24 and 48.
A. 13.5% B. 68% C. 81.5% D. 95%
15. Suppose that distribution of data about the number of deaths of COVID 19
positive has a mean of 45 and a standard deviation of 18. How many standard
deviations away from the mean is a value of 81?
A. It is one standard deviation above the mean
B. It is two standard deviations above the mean
C. It is one standard deviation below the mean
D. It is two standard deviations below the mean
Additional Activities
Read the following problems carefully. Apply the empirical rule and illustrate your
answer through a diagram.
1. IQ scores of the ALS students in the Division of Quezon are normally
distributed with a mean of 110 and a standard deviation of 10.
a. What percent of the distribution falls within the IQ scores of 100 to 130?
b. What percent of the distribution falls within the IQ scores of 90 to 140?
2. A normal distribution of data has a mean of 78 and a standard deviation of 9.
a. What scores fall within 68% of the distribution?
b. What scores fall within 95% of the distribution?
Statistics and
Probability
Identifying Regions Under
125
Normal Curve Corresponds to
Different Standard Normal
Values
Statistics and Probability
Identifying Regions Under Normal Curve Corresponds to Different Standard Normal Values First
Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
Telefax: 02-8682-5773/8684-4914/8647-7487
126
This module was designed to easily understand how to identify regions under
the normal curve corresponding to different standard normal values. It includes the
Week
steps/procedures on how to use the table of areas under the normal curve. Also,
this module will help you to accomplish the succeeding modules about normal
What I Know
Let us see how far you will go and how much you know about this pretest.
Let’s get started.
Direction: Read and analyze each item carefully. Choose the letter of the best
answer. Write the chosen letter on a separate sheet of paper.
1. What is the sum of the area that corresponds to the right of 0 and to the left of
0?
A. 0.5 B. 1 C. 1.5 D. 2
2. Where can we find the area of a specific region under the curve?
A. Table of Areas of Geometric Figure
B. Table of Areas Under Normal Curve
C. Table of Contents
D. Periodic Table of Elements
3. Which of the following does NOT belong to the group?
A. to the right B. less than C. to the left D. below
4. Which of the following figures is the best illustration of the area to the right of z
=1?
A.
C.
B D.
127
A. 0.0865 B. 0.4772 C. 0.8679 D. 0.8779
6. What will you do to find the area between the two z values?
A. Subtract the obtained value from the Z table to 1
B. Rewrite the value obtained from z table
C. Subtract the two obtained values from the z table.
D. Calculate the mean and standard deviation
7. Find the area above z= 2.56
A. 0.0052 B. 0.4948 C. 0.9946 D. 9948
8. Find the area of the shaded region of the given figure.
128
Lesson 1 Identifying Regions under Normal
Curve Corresponds to Different
Standard Normal Values
In your previous lessons in Mathematics, you learned how to solve for the
area of certain geometric figures or shapes. This time, you are going to identify the
areas or regions of a curve without using a formula or even a density function of
standard normal random variables because the z – table or Table of Areas under
the Normal Curve is already given on the last page of this module.
What’s In
Let’s have this first for a review.
A.Perform the indicated operation manually. Then check your answer using
calculator or any calculating device
1. 0.9983 – 0.4531 = ____________ 6. 0.8413 + 0.1313 =______________
2. 1 - 0.3413 = ____________ 7. 0.0017 + 0.056 =_______________
3. 0.4772 - .0033 =______________ 8. 0.5 + 0.5000 =______________
4. 0.5 – 0.1537 =_____________ 9. 0.4490 + 0.352 =_______________
5. 1 – 0.2318 =_____________ 10. 0.1001 + 0.0009=______________
B.Investigate all parts of the diagram below if the characteristics of a normal
curve is present.
What’s New
A. The following numbers are the number found at the left side and upper part of
the table. Identify the number that lies on the intersection between the given
set of numbers. Use the Table of the Area Under Normal Curve indicated at the
last page of this module 1. -1.6 and 0.09 = ________
2. 1.0 and 0.07 = _________ 3.
1.1 and 0.06 = _________ 4.
0.5 and 0.08 = _________
5. 1.0 and 0.00 = _________
B. Perform the indicated operation. Write your answer inside the box connected to
the shaded region of the following normal curve.
129
What is It
In the previous module, we discussed the properties of the normal
probability distribution, and one of the things that you learned is that the total area
under the curve is equal to one. Since the standard normal distribution is defined
as normally distributed random variable having a mean of zero ( µ= 0 ) and
standard deviation of one (ơ = 1), the fifty (50) percent of the region under the curve
is below the mean and the other 50% is above the mean. Figure 1, as shown below
illustrate the standard normal curve.
Illustration:
130
Answer: The area to the left of z= -1.69 is 0.0455.
STEPS ON HOW TO FIND THE AREA THAT CORRESPONDS TO Z –VALUE
1. Draw/sketch a normal curve and locate the given z-value on the normal
curve
2. Shade the region of the curve according to the condition of z-value whether it
is below, above, or between.
3. Use the table of the area under the normal curve to find the corresponding
area.
4. Choose the appropriate operation based on step 2 and 3
4.1. When the z-value is to the left or any related terms (e.g. below, less
than) just write the value we obtained in step 3
4.2. When the z-value is to the right or any related terms (e.g. above,
greater than), subtract 1 by the obtained value in step 3
4.3. When the shaded region is in between of the two z-value, subtract the
biggest by the smallest value obtained in step 3
5. Label the shaded region and draw a conclusion Example 1: Find the area
that corresponds below z = -1.35 Solution:
Step 1. Draw/sketch a normal curve and locate the given z-value.
Step 2: Shade the region of the curve according to the condition of z-value
whether it is below, above, or between.
131
Step 3. Use the table of the area under the normal curve to find the
corresponding area
132
Step 3. The intersection between -1.3 and 0.05 is 0.0885
Step 1:
Step 2:
Step 4. Since the shaded region is to the right of z-value, we will subtract
0.0885 from 1. The difference is 0.9915
Step 5
Step 2:
133
Step 3. The value that corresponds to z= 1.3 is 0.0968 and z= 2 is
0.4772.
Step 4. Since the shaded region is within the two z-value, subtract
0.4772 by 0.0968. The difference is 0.8804
Step 5 Therefore, the area between z = -1.30 and z = 2 is 0.8804.
What’s More
Activity 1.
Identify the regions under the normal curve of the following standard normal
values using the z table (refer to the last page of this module).
1. z = - 3.25 ______________ 6. z = -1.33 _______________
2. z = 1.25 ______________ 7. z = -2.5 _______________
3. z = 0.56 ______________ 8. z = 0.29 _______________
4. z = 3 ______________ 9. z = - 0.13 _______________
5. z = 1.96 ______________ 10. z = - 1.96 _______________
Activity 2
Find the area or region that corresponds to each of the following z values. Sketch
and label each curve.
1. to the right of z = 2.18
2. below z= -1.40
3. above z= 0.57
4. between z = -2.34 and z = 1.57
5. between z = 0.32 and z = 2.42
134
e. Label the shaded region and draw a conclusion.
4. When the z-value is to the left or any related terms (e.g. below, less than) just
write the value we obtained from the z table.
5. When the z-value is to the right or any related terms (e.g. above, greater than),
subtract 1 by the obtained value from the z table.
6. When the shaded region is in between of the two z-value, subtract the biggest
by the smallest value obtained from the z table.
What I Can Do
For instance, one of your friends from another school asks your help about his
problem in Statistics and Probability. He needs to answer his assignment but
unfortunately, he left his module in the school. He wants to know how to identify
the regions under the normal curve corresponding to different standard normal
values. Your friend will send a picture of a normal curve with the corresponding
area, and he asks you favor to explain each picture on how to obtain the value of
the shaded part of the normal curves. Here is the screenshot of the pictures:
Assessment
Direction: Read and analyze each item carefully. Choose the letter of the best
answer. Write the chosen letter on a separate sheet of paper.
1. How many percents is the total area that corresponds to the right of 0 and to the
left of 0?
A. 25% B. 50%
C. 75% D. 100%
2. What is another term of z table?
A. Table of Areas of Geometric Figure
135
B. Table of Areas Under Normal Curve
C. Table of Contents
D. Periodic Table of Elements
B. A. . C
B.
A. 0.1587 B. 0.3413
C. 0.3907 D. 0.8413
6. What mathematical operation to use in getting values from the z table to find the
area between the two z values?
A. Add B. Divide C. Multiply D. Subtract
7. What is the area between z = 0.23 and z = 1.99?
A. 0.3858 B. 0.3588 C. 0.8239 D. 0.8583
8. Find the area above z= 2.14
A. 0.052 B. 0.0162 C. 0.0324 D. 0.9838
9. Find the z value of the following graph that corresponds to the given area.
136
A. – 2.24 B. -2.25 C. 2.24 D. 2.25
137
Statistics and
Probability
Converting a Normal Random
Variable to a Standard Normal
Variable and Vice-versa
Statistics and Probability
Converting a Normal Random Variable to a Standard Normal Variable and Vice-versa First
Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
Writers:
Dhojie N. Yanto, Elson A. Valencia, and Victor M. Disilio
Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Team:
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann
O. Briones Celestina M. Alba, Asuncion C. Ilao
Telefax: 02-8682-5773/8684-4914/8647-7487
138
You will learn how to convert a normal random variable 𝑥 to standard normal
variable 𝑧 and vice-versa.
Week
In a previous module, you have learned about how to illustrate a normal random
variable and its characteristics. It also discussed how to identify regions under the
4 normal curve corresponding to different standard normal values. This time, you will
focus on converting Normal Random Variable to Standard Normal Random
Variable. A normal distribution may take any values. These values are called raw
scores. Raw scores came from different sources cannot be compared unless we
converted them to standard scores.
Meanwhile, the knowledge, skills, and attitude you acquired about the previous
modules of normal distribution such as – the mean, standard deviation, raw scores
(x) and z – scores – is very useful to attain the target objectives of this module
What I Know
Let us see how far you will go and how much you know about this pretest. Please
answer all questions with all your best and utmost honesty so that we can clearly
gauge your learning of the target competency of this lesson. Let’s get started.
Direction: Read and analyze each item carefully. Choose the letter of the
best answer. Write the chosen letter on a separate sheet of paper.
1. Which of the following is the measure of the number of standard
deviations (σ) a particular data value is away from the mean (μ)?
A. Standardization
B. Random Normal Variable
C. Standard Normal Variable
D. Probability Distribution
2. Given: x = 35 ; µ = 15 ; σ = 4. Find the z score.
A. -5
B. 5
C. 10
D. 15
3. Given: z = - 2.25 ; µ = 60 ; σ = 8. Find the value of x
A. 18
B. 42
C. 78
D. 102
For questions number 4 and 5
Suppose the provincial health officer of Quezon reported that the average COVID 19
positive for 30 days is 8 patients with the standard deviation of 4 patients.
139
D. 1
6. The mean score of the students on pre-assessment is 10 and the
standard deviation is 2. What was Elson’s raw score if his z – score
is 3?
A. 4
B. 12
C. 16
D. 20
7. Some DepEd public and private schools administered National
Achievement Test for Grade 6 and Grade 10. Suppose the test for Grade 10
has a mean score of 110 and a standard deviation of 15. If Juan, a Grade 10
student, has obtain a z-score of 1.27, what was his score on the test?
A. 91
B. 92
C. 129
D. 130
For questions number 8-10
The Enhance Community Quarantine (ECQ) on some places was lifted by our
President Rodrigo Roa Duterte last May 15, 2020. Suppose that after the said ECQ,
one of the dieticians of well-known company conducted the survey about the effect
of home quarantine on the health of the Filipino people. He found out that average
weight for women is 60 kg with a standard deviation of 4 kg.
8.If the weights are normally distributed, what is the z-score of a woman with
a weight of 70.
A. -2.5
B. -1.5
C. 1.5
D. 2.5
9. What is the z score of a woman with a weight of 50 kg?
A. -2.5 B. -1.5
C. 1.5
D. 2.5
10. How many kilograms will be corresponds to the z score of 0.5 of the
weight of woman?
A. 56kg B. 58kg C. 60 kg
D. 62 kg
11. If the scores in a Probability and Statistics test are normally distributed
with a mean of 25 and standard deviation of 3.5. What is the z-score for a
score of 30?
A. 1.43
B. 1.06
C. 0.72
D. -1.43
12. Gracia, an ALS student, obtained a z-score of 2.35 in an achievement test
that is normally distributed with a mean grade of 83 and standard deviation
of 5. What was Gracia’s grade in the achievement test?
A. 71.25
B. 80.65
C. 85.35
D. 94.75
13. In a normal distribution with a mean of 69 and standard deviation of 15,
what does a z-score of -2 indicates?
A. It indicates that the score is equal to 39.
B. It indicates that the score is above the mean.
C. It indicates that the score is 39 less than the mean
140
D. It indicates that the difference between the score and the mean is 39.
14. Francis Leo and Romy who are best friends took a college entrance
examination. The mean score and standard deviation of the
examination are 115 and 12, respectively. If Francis Leo and
Romy obtained z-scores of 1.5 and -0.95 respectively, what was
their scores in the examination?
A. 132 and 104
B. 133 and 104
C. 132.75 and 104
D. 133.75 and 103.6
15. The velocities of cars in express way are normally distributed with a
mean of 80 km/hr and standard deviation of 8 km/hr. Jessie’s
velocity has a z-score of 2.5, what was his velocity?
A. 100 km/hr
B. 105 km/hr C. 110 km/hr
D. 115 km/hr
How do you find this pre-test? Did you encounter both familiar and unfamiliar
terms, symbols and notation? Kindly compare your answer in the Answer Key on
the last part of this module.
If you obtain 100% or a perfect score, skip the module and immediately move to the
next module. While in the event you missed a point, please proceed on the module
as it will enrich your knowledge in normal distribution, especially in illustrating the
normal random variables and its characteristics
What’s In
Let’s review. Solve for x in terms of another variable.
1. ax = b
2. x + y =1
3. y = mx + b
5. z =
Answers:
1. x = b/a 2. x = 1-y 3. x = 4. x = a + b 5. x = zσ + μ
What’s New
Fill Me Up
Complete the table below by filling in the correct value of the required variable. Use
this formula and solve.
z x 𝜇 𝜎
___ 23 18 4
141
-1.17 ___ 125 30
What is It
Was the activity fun and interesting? How did you manage to derive the
formula? Are you successful in completing the table?
The activity allows you to manipulate the given data and the missing value
through derivation. In this case, you convert the raw score (x) into the standard
score (z) using the formula . You substitute the given values and uses
properties of equality and algebraic rules to obtain the needed data
In solving real-life problems involving the normal curve, it is particularly
important to convert a random variable x to a standard normal variable or z-score.
This procedure is known as “standardizing” or “standardization” of a random
variable, where a standardized value is called a z-score. A z-score is a measure of
the number of standard deviations (σ) a particular data value is away from the
mean (μ).
Suppose your score on a test in Probability and Statistics is 39 and the
scores are normally distributed with a mean of 33 and standard deviation of 3, then
your score is exactly 2 standard deviations above the mean. If you scored 30, then
it is exactly 1 standard deviation below the mean. All values that are above the
mean have positive z-scores and all values that are below the mean have negative z-
scores. If you obtained a z-score of -3, this means that your score is 3 standard
deviations below the mean, that is, 33 – 3(3) = 33 – 9 = 24.
Given any value x from a normal distribution with mean μ and standard
deviation σ, to convert x to a z-score (standard normal score), you need to; (a)
Subtract the mean μ from x.
(b) Divide this quantity, 𝑥 – 𝜇, by the standard deviation σ.
The formula used in converting a random variable x to a standard normal
variable z is:
Example 1.
Suppose IQ scores are normally distributed with a mean of 100 and
standard deviation of 10. If your IQ is 85, what is your z-score? (Round off your
answer to the nearest hundredths) Solution:
The z-score can be computed using the formula;
142
Example 2.
On a nationwide placement test that is normally distributed, the mean was
125 and standard deviation was 15. If you scored 149, what was your z-score?
(Round off your answer to the nearest hundredths) Solution:
Using the formula
Example 3.
The heights of teachers in Sta. Catalina National High School are normally
distributed with a mean of 150 cm and standard deviation of 15 cm. The height of
Sir Victor has a z-score of 3.25. What is the actual height of Sir Victor? (Round off
your answer to the nearest hundredths) Solution:
In this example, z-score is given while the normal random variable is unknown. To
solve for the normal random variable x, multiply the z-score (z) by the standard
deviation σ, then add the mean μ.
Therefore;
𝑥 = 𝑧𝜎 + 𝜇
= 3.25(15 cm) + 150 cm
= 48.75 cm + 150 cm
𝑥 = 198.75 𝑐𝑚
Example 4.
The time it takes for a cell to divide is normally distributed with an average
of 60 minutes and standard deviation of 5 minutes. How long will it take for a given
cell to divide if its “mitosis” has a z-score of -1.35?
Solution:
𝑥 = 𝑧𝜎 + 𝜇
= -1.35(5 min) + 60 min
= -6.75 min + 60 min
𝒙 = 𝟓𝟑. 𝟐𝟓 𝒎𝒊𝒏𝒖𝒕𝒆𝒔
Example 5.
Suppose you have the population values 40 and 70 and that their
corresponding z – scores are -1 and 2, respectively.
1. Is it possible to determine the population’s mean
and standard deviation?
2. If so, what are these values? Solution:
To begin, let us first identify the given. Let
𝑥1 = 40 𝑥2 = 70
𝑧1 = −2 𝑥2 = 1
143
30 = 3𝜎 the difference
1 multiply by
3
10 = 𝜎 simplify
The standard deviation of the population is 10. To obtain the mean, substitute it to
either equation 1 or 2.
40 = −2𝜎 + 𝜇 ⇒ 40 = −2(10) + 𝜇 ⇒ 60 = 𝜇
70 = 1𝜎 + 𝜇 ⇒ 70 = 1(10) + 𝜇 ⇒ 60 = 𝜇
Both agrees that the mean of the population is 60.
Therefore,
1. It is possible to determine the population’s mean and standard deviation
given only the raw score (x) and its corresponding standard score (z).
2. The mean and standard deviation of the population is 60 and 10,
respectively.
What’s More
Independent Activity
Find Me
Supply the needed information by solving the missing value.
1. x = 25, µ= 40, σ = 6, z = ?
2. z = -2.25, µ= 218, σ = 36, x = ?
3. x = 39, µ= 30, σ = 3, z = ?
4. x = 125, µ= 105, σ = 8, z = ?
5. z = 1.11, µ= 145, σ = 20, x =?
Independent Assessment
Solve Me
Solve the following problems.
1. The mean number of hours a Filipino worker spends on the computer is
3.1 hours per workday. Assume the standard deviation is 0.5 hour and is
normally distributed, how long does a worker spend on the computer if
his z-score is 1.2?
2. Each month, a Filipino household generates an average of 28 pounds of
newspaper for garbage or recycling. Assume the standard deviation is 2
pounds. Determine the z-score of a household that generates 22 pounds
of newspaper.
3. The Candelaria Automobile Association reports that the average time it
takes to respond to an emergency call is 30 minutes. Assume the variable
is normally distributed and the standard deviation is 4.5 minutes. How
long will a call be responded if it has a z-score of 0.75?
4. The average monthly salary for newly – hired teachers is P21,945. If the
distribution is approximately normal with a standard deviation of P3250.
How much will a teacher earn in a month if his salary has a z-score of
1.15?
144
2. _________________________________ is equal to the quotient between the
difference of normal random variable (x) or raw score and mean and
standard deviation. It is written as:
z=
3. __________________________________ is equal to the product of z score and
standard deviation plus the mean. It is written as:
𝑥 = 𝑧𝜎 + 𝜇
What Can I Do
Read and analyze the following problem carefully
Victor, a Grade 11 student of Valencia National High School is one of the
candidates for the scholarship program in one of the well-known companies in
their municipality. He was already submitted the different requirements like ITR of
his parents and G10 Report Card. The HR of the said company make a list of the
five priority subjects and he instructed his clerk to compute the mean and
standard deviation. Assuming that you are the assigned clerk, kindly complete the
following table:
SUBJECTS Grade
Filipino 85
English 83
Math 90
Social Science 88
Science 88
Mean
SD
Based on the above data, express each grades of the respective subjects into z-
scores. Show your complete solution and write your final answer on the following
table.
SUBJECTS Grade z- scores
Filipino 85
English 83
Math 90
Social Science 88
Science 88
Assessment
Direction: Read and analyze each item carefully. Choose the letter of the best
answer.
Write the chosen letter on a separate sheet of paper.
1. What is another term used for standard normal score?
A. z- score
B. mean score C. x score
D. t score
2. Given: x =125; µ = 105; = 16. Find the z score.
A. -1.25
B. -1 C. 1
D. 1.25
3. Given: z = 2.25; µ = 70; = 9. Find the value of x
A. -49.75
B. 35.11
C. 90.25
D. 148.5
For questions number 4 and 5
145
Suppose the provincial health officer of Batangas reported that the average
COVID19 positive for 40 days is 10 patients with the standard deviation of 6
patients.
4. What is the z score of 18 patients?
A. 1.33
B. 1.34
C. 4.66
D. 4.67
8. If the weights are normally distributed, what is the z-score of a with a weight of
63
A. -1.17
B. -1.16
C. 1.16
D. 1.17
9. What is the z score of a woman with a weight of 85 kg?
A. -2.5
B. -1.5 C. 1.5
D. 2.5
10. How many kilograms will be corresponds to the z score of 1.5 of the
weight of woman?
A. 76kg
B. 79 kg
C. 82 kg
D. 85 kg
11. If the scores in a Probability and Statistics test are normally distributed
with a mean of 31 and standard deviation of 4.5. What is the z-score for a score
of 40?
A. 1
B. 2
146
C. 2.5
D. 3
Additional Activities
Read the following problems carefully. Apply what you have learned in the module.
The Candelaria Automobile Association reports that the average time it takes to
respond to an emergency call is 30 minutes. Assume the variable is normally
distributed and the standard deviation is 4.5 minutes.
a. How long will a call be responded if it has a z-score of 0.75?
b. How long will a call be responded if it has a z-score of – 1.83?
Statistics and
Probability
Computing Probabilities and
Percentiles Using the Standard
147
Normal Table
Statistics and Probability
Computing Probabilities and Percentiles Using the Standard Normal Table
First Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
Writers:
Dhojie N. Yanto, Elson A. Valencia and Victor M. Disilio
Editors: Celestina M. Alba and Nestor N.Sandoval,
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Edna E. Eclavea
Management Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Team:
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann
O. Briones Celestina M. Alba, Asuncion C. Ilao
Telefax: 02-8682-5773/8684-4914/8647-7487
148
understand the uses of normal distribution because the examples and problems
provided are based on real-life situations.
Week
In the previous modules, you have learned about identifying regions under
the normal curve corresponding to different standard normal values and converts a
normal random variable to a standard normal variable and vice versa. This time,
you will focus on computing for probabilities and percentiles using the standard
4
normal table. A standard normal table or unit normal table or z – table, is a
mathematical table composed of a list of z – values and probabilities that arrange
cumulatively. Meanwhile, the knowledge, skills, and attitude you acquired about
the normal distribution, mean, standard deviation, raw score (x) and standard
scores (z) is extremely useful to achieve our target objectives on this module
What I Know
Let us see how far you will go and how much you know about this pre-test. Please
answer all questions with all your best and utmost honesty so that we can gauge
your learning of the target competency of this lesson. Let’s get started!
Direction: Read and analyze each item carefully. Choose the letter of the best
answer. Write the chosen letter on a separate sheet of paper.
1. Which of the following notations indicate the probability of a z value from the
left?
A. 𝑃(𝑍 > 𝑧) B. 𝑃(𝑍 < 𝑧) C. 𝑃(𝑋 < 𝑥) D. 𝑃(𝑋 > 𝑥)
2. What does P(a<Z<b) notation mean?
A. Probability of z that is in the left of a and b
B. Probability of z that is in the right of a and b.
C. Probability of z that is in the opposite of a and b.
D. Probability of z that is in between two other z values a and b.
3. What is the probability of z value indicated by 𝑃(𝑍 < −1.78)? A. 0.0375 B.
0.3075 C. 0.9625 D. 0.9633 4. Find the probability value of
𝑃(𝑍 > −1.00).
A. 15.87% B. 34.13% C. 84.13% D. 90.13%
5. Compute the probability value of 𝑃(−1.35 < 𝑍 < 2.75)
A. 90.85% B. 99.70% C. 0.9085 D. Both A and C
6. Find the z-scores that bound the middle 92.50% of the area under the
standard normal curve.
A. -1.44, 144 B. -1.44, -1.78 C. 1.44, 1.78 D. -1.78, 1.78
7. Which of the following graph is the correct illustration of the notation 𝑃(𝑎 < 𝑧
<
𝑏)?
A. B. C. D.
149
For items 10 to12, use the situation below.
Eight hundred fifty businessmen accepted the Mayaman Challenge of one of
the Philanthropic Filipinos who are willing to give relief goods for our fellow
Filipinos affected by the recent pandemic. They will be contributing an average of
120 sacks of rice with a standard deviation of 25 sacks of rice.
10. What is the probability that the businessman will contribute less than 90
sacks of rice?
A. 11.51% B. 76.99% C. 88.49% D. 89.45%
11. What is the probability that the businessman will contribute more than150
sacks of rice?
A. 11.51% B. 76.99% C. 88.49% D. 89.45%
12. How many businessmen will contribute a sack of rice between 100 sacks
and 145 sacks?
A. 534 B.535 C. 715 D. 716
13. The ages of ALS students enrolled at Sta. Catalina National High School are
normally distributed with a mean of 19 years and a standard deviation of 3.5
years. If a student is selected at random, what is the probability that his age is
under 22 years?
A. 11.67% B. 19.49% C. 80.51% D. 90.57%
14. Teaching the value of productive work, ABM students launched the “Go
Negosyo” project. If their weekly sales on this project is normally distributed
with mean of P3,200 and standard deviation of P500. What percent of their
weekly sales are over P4,000?
A. 94.52% B. 77.34% C. 22.57% D. 5.48%
15. In problem number 14, how many percents of their weekly sales are less
than P3,000?
A. 11.61% B.34.46% C. 65.54% D 88.39%
How do you find this pre-test? Did you encounter both familiar and
unfamiliar terms, symbols, and notations? Kindly compare your answer in the
Answer Key on the last part of this module.
If you obtain 100% or a perfect score, skip the module and immediately move
to the next module. While in the event you missed a point, please proceed with the
module as it will enrich your knowledge in normal distribution, especially in
illustrating the normal random variables and its characteristics.
What’s In
Let’s review. Supply the needed information by writing on the response area.
Questions Responses here
1. Convert 0.4783 into percent. _____________________
150
2. What is the equivalent of 38.45% in decimal? _____________________
3. What is the product of 450 and 0.2356? _____________________
(Round off your answer into nearest whole number)
4. What is the 25% of 100? _____________________
5. What is the total area of a normal curve? _____________________
What’s New
Find the area of the shaded region under the normal curve and make a brief and
concise interpretation of the following graph.
1. 2.
3. 4.
5. 6.
What is It
How did you find the activity? Was it fun and challenging at the same time?
How did you determine the area of the shaded region? What are the strategies you
commonly used?
A normal distribution curve can be used as a probability distribution curve
for normally distributed variables. The area under the standard normal distribution
curve can also be thought of as a probability. That is, if it's possible to select any
value at random, the probability of choosing one, say, below 1.45 would be the same
as the area under the curve at the left of 1.45. In this case, the area is 0.9265.
Therefore, the probability of randomly selecting a value below of 1.45 is 0.9265 or
92.65%. The problems involving probabilities and percentiles are solved in the same
manner as finding the areas under a normal curve.
In finding probabilities, the following notations will be used:
(a) – probability at the left of
(b) – probability at the right of
(c) – the probability of that is in between two other values,
say and
151
(d) – the probability of is in the opposite direction of two
values, say and
(e) – probability at the left of a normal random variable
(f) – probability at the right of a normal random variable
(g) – the probability of a normal random variable that is in
between two other normal random variables, say and .
(h) – the probability of is in the opposite direction of
two values, say and Example 1.
Find the probabilities for each of the following.
(a)
(b)
(c)
(d) )
Solution:
(a) The probability means to find the area under the normal
curve at the left of . From the table, find the intersection on row
1.3 and column 0.02. Thus, the probability is 0.9066 or 90.66%.
(c) The probability 𝑃(−0.75 < 𝑍 < 1.56) is the same as the area under the
normal curve between 𝑧 values -0.75 and 1.56. To get the area, subtract
the area of 𝑧 = −0.75 from the area of 𝑧 = 1.56. Thus, 𝑃(−0.75 < 𝑍 < 1.56)
= 𝑃(𝑍 < 1.56)– 𝑃(𝑍 < −0.75) = 0.9406 – 0.2266 = 𝟎. 𝟕𝟏𝟒𝟎 𝑜𝑟 𝟕𝟏. 𝟒𝟎%.
(d) The probability 𝑃(𝑍 > −0.88) is the same as the area under the normal
curve at the right of 𝑧 = −0.88. To get the area, subtract the area of 𝑧 =
−0.88 from 1. Thus, 𝑃(𝑍 > −0.88) = 1 – 𝑃(𝑍 < −0.88) = 1 – 0.1894 =
𝟎. 𝟖𝟏𝟎𝟔 𝑜𝑟 𝟖𝟏. 𝟎𝟔%.
152
Example 2.
Let 𝑋 be a normal random variable with mean 𝜇 = 15 and standard deviation
𝜎 = 3. Find the probabilities of the following:
(a) 𝑃(𝑋 < 19)
(b) 𝑃(10 < 𝑋 < 19) Solution:
(a) To find the probability, convert the normal random variable 19 to a z-score.
Thus, .
From the table, the probability at the left of 𝑧 = 1.33 is equal to 0.9082 or
90.82%.
(b) To find the probability, convert the normal random variables 10 and 19 to z
values. Thus,
1.33) = 𝑃(𝑍 < 1.33) – 𝑃(𝑍 < −1.67). From the table, we have
0.9082 – 0.0475 = 𝟎. 𝟖𝟔𝟎𝟕 or 𝟖𝟔. 𝟎𝟕%.
These probabilities can also be applied to real-life problems.
Example 3.
The mean number of hours a Filipino worker spends on the computer is 3.1 hours
per workday. Suppose the spending time is normally distributed with a standard
deviation of 0.5 hour, find the percentage of workers who spend less than 3.5 hours
on the computer assuming that the variable is normally distributed. Solution:
The percentage of workers who spend less than 3.5 hours is the same as the
area under the normal curve at the left of a normal random variable of 3.5 hours.
Convert 3.5 to a z value. That is, z = = 0.80. From the table, the area at the left
of z = 0.8 is 0.7881. Thus, the percentage of workers who spend less than 3.5 hours
is 78.81%.
Example 4.
Each month, a Filipino household generates an average of 28 pounds of newspaper
for garbage or recycling. Assume that the distribution is normal with a standard
deviation of 2 pounds. If a household is selected at random, find the probability of
its generating
a. between 27 and 31 pounds per month.
b. more than 30.2 pounds per month.
Solution:
(a) The probability of generating between 27 and 31 pounds is 𝑃(27 < 𝑋 <
31). This is equal to
153
1.50) – 𝑃(𝑍 < −0.5) = 0.9332 – 0.3085 = 0.6247. Thus, the probability of
generating between 27 and 31 pounds of newspaper per month is 0.6247
or
62.47%.
(b) The probability of generating more than 30.2 pounds per month is
equivalent to 𝑃(𝑋 > 30.2) which is equal to 1 – 𝑃(𝑋 < 30.2) since it is
righttailed.
Example 5.
The Mauban Automobile Association reports that the average time it takes to
respond to an emergency call is 35 minutes. Assume the variable is approximately
normal and the standard deviation is 5.5 minutes. If 200 calls are randomly
selected, approximately how many were responded in less than 25 minutes?
Solution:
The number of calls responded in less than 25 minutes is equal to the percentage
of 200 calls which were responded in less than 25 minutes. First, find
. From the table, this is
equal
to 0.0344. Thus, there are (0.0344)(200) = 6.88 or approximately 7 calls were
responded in less than 25 minutes.
Example 6.
To help students improve their reading, Mauban south district decides to
implement a reading program. It is to be administered to the bottom 5% of the
students in the district, based on the scores of a reading achievement test which
are normally distributed. If the average score for the students in the district is
122.6 and the standard deviation
is 18, find the cut-off score that
will make a student eligible for
the program.
Solution:
154
The problem is asking for a normal random variable that corresponds to a z value
and occupying an area of 5% or 0.05 from the left under the normal curve. To find
this normal random variable which is the cut-off score that will make a student
eligible for the reading program, look for the 𝑧 value on the table corresponding to
an area of 0.05 and that is equal to −1.645. Then, convert this 𝑧 value to a normal
random variable 𝑥, that is, 𝑥 = 𝑧𝜎 + 𝜇 = (−1.645)(18) + 122.6 = −29.61 + 122.6 = 92.99 ≈
93. Therefore, a student will be eligible in the reading program if he scored 93 and
below.
Example 7.
To pass in the accreditation and
equivalency (A&E) test, ALS students must
score in the top 15% in general ability
tests. The test has a mean of 200 and a
standard deviation of 20. Find the lowest
possible score to pass the test assuming
the test scores are normally distributed.
Solution:
The lowest possible score is the normal
random variable corresponding to a 𝑧 value
occupying an area of 0.15 from the right of
the normal curve. To solve for the required
variable, given that P(Z > z) = 0.15
NOTE: Remember 𝑃(𝑍 < 𝑧) + 𝑃(𝑍 > 𝑧) = 1
𝑃(𝑍 > 𝑧) = 0.15
1 – 𝑃(𝑍 > 𝑧) = 1 – 0.15
1 – 𝑃(𝑍 > 𝑧) = 1 – 0.15 since it is right tailed
= 0.85
from the table, look for the z value corresponding to this area.
Thus, 𝑧 = 1.03 then convert this to a normal random variable x 𝑥
= 𝑧𝜎 + 𝜇 = (1.03)(20) + 200 = 220.6 ≈ 221
What’s More
Independent Practice
Find Me
A. Use the Table of Standard Normal distribution to determine corresponding
probabilities of the following:
1. 𝑃(𝑍 < −1.45)
2. 𝑃(0.5 < 𝑍 < 2.33)
3. 𝑃(𝑍 > 1.78)
B. 𝑋 is a normally distributed random variable with a mean of 60 and a
standard deviation of 8. Find the probabilities indicated by using the table.
4. 𝑃(𝑋 < 52)
5. 𝑃(48 < 𝑋 < 64)
6. 𝑃(𝑋 > 57) Independent Assessment
Solve Me
Solve the following problems by supplying the needed information.
The average number of calories in a 1.8-ounce chocolate bar is 230. Suppose
that the distribution of calories is approximately normal with a standard deviation
of 10.
1. Find the probability that a randomly selected chocolate bar will have less than
200 calories.
155
2. Find the probability that a randomly selected chocolate bar will have greater
than 195 calories.
3. What is the percentage that a cholate bar is randomly selected between 200
calories and 250 calories?
What Can I Do
Read and analyze the following problem carefully
A. Suppose your section will participate in the field demonstration at the Sports
festival in your school. This field demonstration must be consisting of 20
learners per section. Your adviser assigned you to organize a team that is
willing to join in the said event. He also instructed you to collect information
about your classmates’ height because he will need to buy a costume. Since
your community is under the General Community Quarantine because of the
recent pandemic, you need to contact your classmates through SMS or
messenger to get their heights and write it on the table below.
1 11
2 12
3 13
4 14
5 15
6 16
156
7 17
8 18
9 19
10 20
B. Answer the following questions based on your collected data.
1. What is the mean? standard deviation?
2. Suppose the above data is normally distributed. What is the probability
that the height of your classmates picked randomly is a. less than 155
cm?
b. more than 143 cm?
c. between 140cm and 160 cm?
3. How many participants in your team have heights between 140 cm to
170 cm?
Assessment
Direction: Read and analyze each item carefully. Choose the letter of the best
answer. Write the chosen letter on a separate sheet of paper.
1. Which of the following notations indicate the probability of a z value from the
right?
A. P(Z > z) B. P(Z < z) C. P(X < x) D. P(X > x)
2. What does P(a<Z<b) notation mean?
A. Probability of x that is in the right of a and b.
B. Probability of x that is in the opposite of a and b.
C. Probability of X that is in between two other z values a and b.
D. Probability of a normal random variable X that is in between two other
normal random variables a and b
3. What is the probability of z value indicated by P(Z > -1.78)?
A. 0.0375 B. 0.3075 C. 0.9625 D. 0.9633
4. Find the probability value of P(Z< -1.0)
A. 15.87% B. 34.13% C. 84.13% D. 90.13%
5. Compute the probability value of P(1.35< Z< 2.75)
A. 8.55% B. 85.5% C. 85.85% D. 90.85%
6. Find the z-scores that bound the middle 85.02% of the area under the
standard normal curve.
A. (-1.44, 1.44) B. (-1.44, -1.78) C. (1.44, 1.78) D. (-1.78, 1.78) 7. Which of
the following graph is the correct illustration of the notation 𝑃(𝑍 < 𝑎) ∪ 𝑃(𝑍 >
𝑏)?
A. B. C. D.
8. The average score on Statistics and Probability Summative Test is 40 points
with a standard deviation of 4. What is the probability that Beth’s score is
25 points?
A. -3.75% B. -0.009% C. 0.009% D. 3.75%
9. A set of data is normally distributed with a mean of 40 and a standard
deviation of which interval of the variable defines an area under the curve
corresponding to a probability of approximately 95.44 %?
A. (37,40) B. (34,43) C. (34,46) D. (37,46)
157
Nine hundred thirty-two businessmen accepted the Mayaman Challenge of
one of the Philanthropic Filipinos who are willing to give relief goods for our fellow
Filipinos affected by the recent pandemic. They will be contributing an average of
135 sacks of rice with a standard deviation of 15 sacks of rice.
10. What is the probability that the businessman will contribute less than 105
sacks of rice?
A.98% B. 97.72% C. 2.28% D. 2.0%
11. What is the probability that the businessman will contribute more than
150 sacks of rice?
A. 15.87% B. 76.45% C. 80.78% D. 84.13%
12. How many businessmen will contribute a sack of rice between 105 sacks
to 145 sacks?
A. 534 B.653 C. 676 D. 726
13. The ages of Senior High students enrolled at Sta. Catalina National High
School are normally distributed with a mean of 17 years and a standard
deviation of 2.5 years. If a student is selected at random, what is the
probability that his age is under 16 years?
A. 15.54% B. 34.46% C. 68.92% D. 84.56%
14. The weekly sales of ABM students on their “Go Business” project are normally
distributed with a mean of P4,250 and a standard deviation of P300. How many
percent of their weekly sales are over P4,700?
A. 6.68% B. 13.36% C. 86.63% D. 93.32%
15. In problem number 14, how many percent of the ABM students' weekly sales
are less than P4,700?
A. 6.68% B. 13.36% C. 86.63% D. 93.32%
Additional Activities
Read the following problems carefully. Supply the needed information. Round off
your answer to the nearest hundredths (two decimal places) for the standard scores
and nearest ten thousandths (four decimal places) for the area of probabilities.
1. Use the Table of Standard Normal distribution to find the probabilities of the
following:
a. P(Z < -1.51)
b. P(Z < 1.37)
c. P(Z > 1.78)
d. P(Z > 1.51)
e. P(1.15 < Z < 2.33)
158
159
Statistics and
Probability
160
Identifying the Different
Random Sampling Techniques
Statistics and Probability
Identifying the Different Random Sampling Techniques
First Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
Writers:
Maria Cristina O. Rosas and Coren R. Manalo
Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Team:
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann
O. Briones Celestina M. Alba, Asuncion C. Ilao
Telefax: 02-8682-5773/8684-4914/8647-7487
161
in your future researches. This module was written and intended for senior high
students like you who are taking Statistics and Probability subject. Your adept at
analysis will help you appreciate the richness, and beauty of this subject which will
motivate you to apply to similar situations and develop your own statistical Week
procedures.
The module contains only one lesson which is illustrating random sampling. 5
It offers relevant problems or situations which help you to understand what is
being tackled in this module. Your ability to interpret, reason – out, and make a
judgment or even decision out of statistical measures will also be practiced here.
What I Know
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
1. What do you call to the set of all people, objects, events, or ideas
researchers want to investigate?
A. data C. sample
B. population D. statistics
2. Which of the following can be considered as the best sample?
A. conveniently chosen C. selected at random
B. purposefully selected D. self-selected
3. From the given populations, which of the following sample is most likely to
be representatives of the population from which it is drawn?
A. population: senior high school students sample: grade 10 students
B. population: online shoppers sample: market vendors
C. population: users of social media sample: students
D. population: SAP beneficiaries sample: government officials
4. Which of the following is NOT a distinguishing feature of simple random
sampling?
A. Each element in the population has an equal probability of being
selected
B. Random numbers selected to decide which elements are included as the
sample.
C. Each element has a unique identification.
D. Samples are selected based on the subjective judgment of the researcher.
5. Which of the following is a type of random sampling method?
A. convenience C. purposive
B. non-probability D. stratified
6. Which of the following is NOT a type of random sampling method?
A. cluster C. simple
B. convenience D. systematic
7. A researcher was given a random list of all graduating senior high school
students. He decided to conduct a survey by asking every seventh student on
the list regarding the courses that they want to take in college. What type of
sampling technique he is planning to use?
A. cluster C. stratified
B. random D. systematic
8. You are a researcher and you wish to do a comparative study between two
hospital's success rates in fighting COVID-19. Instead of mixing patients in the
two hospitals, you simply divided your samples into two and randomly select
162
samples based on the hospital that an individual attended for treatment. What
type of sampling technique are you using?
A. cluster C. stratified
B. random D. systematic
9. You wish to determine the effects of implementing the COVID-19 community
quarantine in the Philippines. Among the variables, the socio-economic status
of your respondents is your primary concern. What sampling technique would
best select your sample for this study?
A. cluster C. stratified
B. purposive D. systematic
10. You go to the streets of your barangay to conduct a study on how SAP
beneficiaries spend their money during the COVID-19 community quarantine.
You ask every by-passer to participate in your research. What type of sampling
technique are you using?
A. convenience C. stratified B. purposive
D. systematic
For numbers 11-15, refer to the situation below:
The Philippine High School caters 5,000 senior high students. Mr. Manalo, the
school principal, wants to obtain information about their plans after graduation.
11. What is the target population in his study?
A. senior high students in his school
B. senior high students in his district
C. parents of senior high students in his school
D. parents of senior high students in the nation
12. Mr. Manalo writes each name of all senior high school students on small pieces
of paper. He rolls and put them in a box and then draws 500 names as
respondents in his study. What random sampling technique does he apply?
A. cluster C. stratified
B. simple D. systematic
13. The principal creates a list of all senior high students. He decides to include
every tenth student in his samples. Which random sampling technique does he
apply?
A. cluster C. stratified
B. simple D. systematic
14. Mr. Manalo groups the senior high students according to their grade level and
strand. He proportionately and randomly chooses sample students from each
group. Which random sampling technique does he apply?
A. cluster C. stratified
B. simple D. systematic
15. The principal groups the senior high school students according to what
barangay they live. He randomly chooses 5 barangays and then includes all the
students living in the randomly chosen barangays in the sample. Which
random sampling technique does he apply?
A. cluster C. stratified
B. simple D. systematic
Lesson 1 Random Sampling
In research, collecting data can either be done in the entire population or the
subset of this population called sample. If a researcher opts to use sample rather
than a population, he must take considerations on the number of samples and how
these samples can be chosen out of his target population.
A population includes all of its elements from a set of data. The size of the
population is the number of observations in the population. For example, if
ABSCBN network has 11,000 employees having the required blood type in a certain
study, then we have a population of size 11,000.
Sample consists of one or more data drawn from the population. It is a
subset, or an incomplete set taken from a population of objects or observations.
163
Taking samples instead of the population is less time-consuming and cost-effective.
Although sampling has advantages, it can also be a source of bias and inaccuracy.
Random Sampling is a sampling method of choosing representatives from the
population wherein every sample has an equal chance of being selected. Accurate
data can be collected using random sampling techniques.
What’s In
Analyze the following scenarios then answer the questions that follow.
164
____________________________________________________________________________
____________________________________________________________________________
_______________________________________________
POPULATION SAMPLE
College
students in a
certain Nursing
University students
What’s New
You already knew that random sampling is a method of selecting a sample from a
population in such a way that every member of the population is equally likely to
be chosen as part of the sample. Do you know that we have four ways to randomly
select our sample? Yes, we have four types of random sampling which you will be
able to define and differentiate in the following activity.
II. You’ve just familiarized yourself with the keywords related to random sampling.
Now you are ready to distinguish its different types. Use your understanding of
the previous activity to identify whether the following illustrates simple,
systematic, stratified or cluster random sampling.
1. A researcher writes the name of each student on a piece of paper, mixes the
papers in a bowl, and draws 7 pieces of paper.
2. A researcher selects every 7th student from a random list.
3. A researcher tells the class to count and then selects those students who count
a multiple of 7 numbers.
4. A researcher separates the list of boys and girls, then draws 7 names by
gender.
5. A researcher surveys all students from 3 randomly selected classes out of 7
classes.
What is It
Let us analyze the situations given above.
1. A researcher writes the name of each student on a piece of paper, mixes the
papers in a bowl, and draws 7 pieces of paper.
Situation 1 illustrates simple random sampling. The pieces of paper
correspond to each student as elements of the population. All of them have an
equal chance of being selected as a sample by randomly picking 7 pieces of paper
in a bowl.
165
3. A researcher tells the class to count and then selects those students who
count a multiple of 7 numbers.
Situations 2 and 3 illustrate systematic random sampling because samples
are being selected based on the k th consistent intervals. Selecting every 7th student
on the random list of names creates an equal chance for all of the students. The
same thing happened in selecting students who count multiple of 7 or 7, 14, 21,
and so on.
4. A researcher separates the list of boys and girls, then draws 7 names by
gender.
Situation 4 illustrates stratified random sampling because the students were
divided into two different strata or groups, boys and girls. With a proportional
number for each group, samples will then be selected at random from these two
groups.
5. A researcher surveys all students from 3 randomly selected classes out of 7
classes.
Situation 5 illustrates cluster sampling since all students are divided into
clusters or classes, then 3 classes were selected at random out of the 7 classes. All
of the students of these three classes comprised the samples of the study. Take
note that each cluster is mutually homogeneous yet internally heterogeneous.
166
Simple Random Sampling
JUAN DE LA CRUZ
ANDRES KAPULE
MARIA PAYAPA
JOSEFA MABUHAY
MARCELA MABINI
b.
Systematic random sampling is a random sampling that uses a list of all the
elements in the population and then elements are being selected based on
th th
the k consistent intervals. To get the k interval, divide the population size
by the sample size.
c. Stratified random sampling is a random sampling wherein the population
is divided into different strata or divisions. The number of samples will be
proportionately picked in each stratum that is why all strata are
represented in the samples.
d.Cluster sampling is a random sampling wherein population is divided
into clusters or groups and then the clusters are randomly selected. All
elements of the clusters randomly selected are considered the samples of
the study.
167
Stratified Random Sampling
Cluster Sampling
The sampling techniques that involve random selection are called probability
sampling. Likewise, simple random, systematic, and stratified and cluster
sampling are all probability sampling techniques.
There are also sampling techniques that do not involve random selection of data.
They are called non-probability sampling. An example of this is convenience
sampling wherein the researcher gathers data from nearby sources of information
exerting minimal effort. Convenience is being used by persons giving questionnaires
on the streets to ask the passers-by.
Purposive sampling is also not considered a random sampling since the
respondents are being selected based on the goal of the studies of the researcher. If
the study is about the students who are children of OFW, the researcher will get
samples who are children of OFW. This excludes other students from being a
sample.
What’s More
Independent Practice
Identify the random sampling technique used in each item.
168
1. You are given a list of all graduating students in your school. You decide to
survey every 10th student on the list and ask them the organization that they
belong.
2. You wish to make a comparison of the gender differences in Mathematics
performance. You divide the population into two groups, male and female, and
randomly pick respondents from each of the group.
3. You assign numbers to the members of the population and then use draw lots
to obtain your samples to answer your survey on the most popular festivals in
the country.
4. You randomly pick five out of fifteen barangays to conduct your survey in
your municipality or city about their best environment-friendly practices.
5. You write the names of each student in pieces of paper, shuffles, and then
draw eight names to answer a survey on their ethical media practices.
Independent Assessment
Analyze the given situation below and then identify the type of random sampling
methods being illustrated in each item. Write the letter of your answer in a separate
answer sheet.
Each of the 30 basketball high school teams has 12 players. The organizer wants to
have a quick survey to know the average height of the players.
1. Each team will be asked to place papers with its players’ names into a
separate fishbowl and randomly drew out five names. The five names from each
team will be combined to make up the sample. Which of the following sampling
techniques is being used in this situation?
A. cluster C. stratified
B. simple D. systematic
2. The organizer lists all the players in a sheet of paper and then assigns a
unique number for each. Sixty numbers will be picked to get the samples. Which
random sampling technique did the organizer apply?
A. cluster C. stratified
B. simple D. systematic
3. All players will be grouped according to their age and will randomly choose
players from each group to measure their height. Which random sampling
technique did he apply?
A. cluster C. stratified
B. simple D. systematic
4. The organizer created a list of all players, decided to survey every sixth
name on the list, and later asked those players that were selected. Which random
sampling technique did he apply?
A. cluster C. stratified
B. simple D. systematic
5. A team is randomly selected to answer the question prior to the study.
Which random sampling technique did he apply?
A. cluster C. stratified
B. simple D. systematic
169
_________________________ is a part of population that is chosen as representatives
of the entire population.
What I Can Do
Decide a specific problem that you want to study about the COVID-19 pandemic.
1. Who are your target respondents or sources of data?
____________________________________________________________________________
____________________________________________________________________________
_____________________________________________________
2. How are you going to obtain your data?
____________________________________________________________________________
____________________________________________________________________________
_____________________________________________________
3. What sampling method would be best for your study? Why would you select
that method?
____________________________________________________________________________
____________________________________________________________________________
_____________________________________________________
4. What do you think would be the greatest description of a best sample?
Why?
____________________________________________________________________________
____________________________________________________________________________
_____________________________________________________
170
5. If you are to create your own research, will you use random sampling? Why
or why not? If yes, what kind of random sampling are you going to use? If
not, how will you obtain your samples?
____________________________________________________________________________
____________________________________________________________________________
______________________________________________________
Assessment
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
1. What do you call the set of all data that researchers want to investigate?
A. data C. random
B. population D. sample
2. Which of the following can be considered as the best sample?
A. conveniently chosen C. selected at random
B. purposefully selected D. self – selected
3. From the given populations, which of the following sample is most likely to be
representatives of the population from which it is drawn?
A. population: SAP beneficiaries sample: car owners
B. population: lawmakers sample: students
C. population: online shoppers sample: cellphone users
D. population: graduating students sample: grade 8 students
4. In random sampling, the sample should be_______ the population
A. as large as C. not taken from B. different
from D. representative of
5. Which of the following is NOT a type of random sampling?
A. cluster C. simple
B. purposive D. systematic
6. Which of the following is random sampling?
A. convenience C. purposive
B. non-probability D. stratified
7. As a researcher, you ask the people of your barangay on how they spent their
money during the community quarantine. You decided to group them according
to their socio-economic status and then proportionately and randomly selecting
samples from these groups. Which type of sampling are you using?
A. convenience C. stratified
B. purposive D. systematic
8. As a researcher, you want to know the plans of graduating senior high school
students. You ask all by-passers to participate in your research.
Which type of sampling are you using?
A. convenience C. stratified
B. purposive D. systematic
9. You wish to examine the effects of community quarantine on your fellow
students. You decided to borrow a list from the guidance office and select every
10th student on it. What sampling technique are you using?
A. convenience C. stratified
B. purposive D. systematic
10. You want to know the effects of the COVID-19 to the Filipino people and
you will ask the medical frontliners. Which type of sampling will you use?
A. convenience C. stratified
B. purposive D. systematic
171
11. What is the target population in her study?
A. all students in her school
B. parents of all students in her school
C. first year high school students in her school
D. parents of first year high school in her school
12. The principal creates a list of all first year high school students, decides to
survey every seventh student on the list. Which random sampling technique
does she apply?
A. cluster C. stratified
B. simple D. systematic
13. Mrs. Mogol writes each name of all first year high school students on small
pieces of paper, he then put them in a box and draws 300 names to participate
in the study. Which random sampling technique does she apply?
A. cluster C. stratified
B. simple D. systematic
14. The principal groups the first year high school students according to the
barangay where they live. She randomly picks a barangay and all of the
students living in that barangay answer the questionnaire. Which random
sampling technique does she apply?
A. cluster C. stratified
B. simple D. systematic
15. Mrs. Mogol groups the first year high school students according to their
school’s last attended. She proportionately and randomly chooses students
from each group. Which random sampling technique does she apply?
A. cluster C. stratified
B. simple D. systematic
Additional Activities
In the following situation, give the target population and identify possible samples
(should be taken from the target population).
The first one is done as an example for you.
3. The study habits of public senior high school students in selected schools in
Manila.
Population: ________________________________________________________________
Sample: ___________________________________________________________________
4. The degree of satisfaction of parents about the quality of education their children
will get from online classes in Quezon Province.
Population: ________________________________________________________________
Sample: ___________________________________________________________________
172
Population: ________________________________________________________________
Sample: ___________________________________________________________________
Statistics and
Probability
Computing for the Parameter
and Statistic
Statistics and Probability
Computing for the Parameter and Statistic
First Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
173
Development Team of the Module
Week Writers:
Coren R. Manalo
Editors: Celestina M. Alba and Nestor N. Sandoval
5 Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Team:
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann
O. Briones Celestina M. Alba, Asuncion C. Ilao
Telefax: 02-8682-5773/8684-4914/8647-7487
What I Know
For you to be able to have an insight regarding parameter and statistic, you must
first answer the pre-test before proceeding to the self-learning part of this module.
174
Read and analyze each question carefully then choose the letter of the best answer.
Write your chosen letter on a separate sheet of paper.
For numbers 12-15, the following are the six scores of the students above that are
randomly selected:
9, 7, 5, 8, 7, 8
What’s In
Can you still remember how to calculate for the mean, variance, and standard
deviation? Do the following exercise.
Teacher Jane randomly selected the grades in Statistics and Probability of 7
Grade 11 students. Calculate the mean grade, variance, and standard deviation
of the given set of data.
x 𝐱 − 𝒙̅ (𝐱 − 𝒙̅)𝟐
94
88
79
89
91
84
77
∑x = ∑(𝐱 − 𝒙̅)𝟐 =
Mean
Variance
Standard Deviation
What’s New
Solve for the mean, variance, and standard deviation of the given set of data of a
population.
X X-µ (X − µ)2
94
85
88
79
78
75
89
91
176
84
77
∑X = ∑(X − µ)2 =
Variance Standard
Deviation
Mean
µ=
2. On the second set of data, your computed values are what we call parameters.
What does the parameter mean?
____________________________________________________________________________
____________________________________________________________________________
____________________________________________________________________________
____________________________________________________________________
What is it
A parameter is a measure that is used to describe the population while statistic is a
measure that is used to describe the sample. To understand more, let us discuss
the two measures.
PARAMETER
Below are grades in Statistics of Grade 11 students during the third quarter.
94 85 88 79 78
75 89 91 84 77
Let us compute the population mean, population variance, and population
standard deviation.
POPULATION MEAN
The mean is the sum of the data divided by the number of data. The mean is used
to describe where the set of data tends to concentrate at a certain point.
Population mean is the mean computed based on the elements of the population or
data. The symbol µ (read as “mu”) is used to represent population mean. To
compute for the population mean, we simply add all the data (X) and then, divide it
by the number of elements in the population (N). We apply the formula: µ =
177
∑x = the summation of x (sum of the measures)
𝑁 = number of elements in the population
1 94
2 85
3 88
4 79
5 78
6 75
7 89
8 91
9 84
10 77
N = 10 ∑X = 840
In our case, adding all the 10 grades will give us a sum of 840.
We substitute the values obtained to the formula µ = therefore, µ =
Our computed population mean µ is 84.
178
Student’s Number Grade (X) X−µ (X − µ)2
1 94 94 – 84 = 10 102 = 100
2 85 85 – 84 = 1 12 = 1
3 88 88 – 84 = 4 42 = 16
4 79 79 – 84 = – 5 52 = 25
5 78 78 – 84 = – 6 62 = 36
6 75 75 – 84 = – 9 92 = 81
7 89 89 – 84 = 5 52 = 25
8 91 91 – 84 = 7 72 = 49
9 84 84 – 84 = 0 02 = 0
10 77 77 – 84 = 7 72 = 49
N = 10 ∑X = 840 ∑(X − µ)2 = 382
The third column is computed through subtracting the mean to the scores, while
the fourth column is computed by squaring the third column. Since there is a
symbol ∑ or summation in the formula, we need to add the computed values in the
fourth column.
Again, for the population mean, µ
=
For the population variance, we substitute the computed values to our formula,
thus
For the population standard deviation, we can also substitute the computed values
to the formula, or we can simply get the square root of the variance.
Population mean (µ), population variance ( 𝜎2) and population standard deviation (𝜎) are
what we called parameters.
STATISTIC
From the previous data of the population, suppose that we randomly select only 7
data out of the total 10 data in the population.
94 85 88
79 78
75 89 91 84 77
Compute the sample mean, sample variance, and sample standard deviation. Here
is the result:
SAMPLE MEAN
The sample mean is the average of all the data of the samples. The symbol 𝑥̅
(read as “x bar”) is used to represent the sample mean. To compute for the sample
179
mean, we simply add all the data and divide it by the number of elements in the
sample (n). We apply the formula:
In this example, there is a slight difference between the population mean and the
sample mean. But notice that there is no difference regarding the method in
determining the value of the population mean and the sample mean. For the
divisor, the population mean µ uses N (population size) while sample mean x
applies n (sample size).
SAMPLE VARIANCE AND SAMPLE STANDARD DEVIATION
Sample variance is the computed variance of the elements of the sample. s 2 is used
to represent sample variance.
180
Grade) Grade)
1 94 94 94 – 86 = 8 82 = 64
2 85
3 88 88 88 – 86 = 2 22 = 4
4 79 79 79 – 86 = – 7 – 72 = 49
5 78
6 75
7 89 89 89 – 86 = 3 32 = 9
8 91 91 91 – 86 = 5 52 = 25
9 84 84 84 – 86 = – 2 – 22 = 4
10 77 77 77 – 86 = – 9 – 92 = 81
∑x = 840 ∑x = 602 ∑(x − 𝑥̅)2 = 236
The fourth column is computed by subtracting the mean to the grades, while the
last column is computed by squaring the third column. Since there is a symbol ∑
or summation, we need to add the computed values.
Sample mean (𝑥̅), sample variance (s2) and sample standard deviation (s) are what we call
statistic.
Remember that parameters are for population while statistics are for sample.
Other examples of parameters and statistics are the population proportion and
correlation coefficient. For population proportion, we use “p” for sample and “P” for
the population. In correlation coefficient, we use “r” for the sample and “𝜌” (read as
rho) for the population. These will be discussed in the latter part of this course.
What’s More
Since the difference between parameter and statistic is already clear to you,
let us now apply the different parameters and statistics of the following data.
Believe me, you can do this!
Independent Practice 1A
Complete the table below by writing the correct symbols and formula in each
parameter and statistic.
Mean
181
Variance
Standard
Deviation
Independent Practice 1B
Complete the table below by computing for the different parameters and statistics.
Population: 1, 2, 3, 4 and 5
Sample: 2, 4 and 5
Mean
Variance
Standard
Deviation
Independent Assessment
I. The following are the scores of ten ABM students in their fourth-quarter
examination:
26 37 45 35 31
47 23 34 44 38
Complete the table and compute for the population mean, population variance, and
population standard deviation.
182
Write your computations here:
II. The scores of the 5 randomly selected HUMSS students are given below:
26, 45, 31, 34, 44
Complete the table and compute for the sample mean, sample variance, and sample
standard deviation.
183
Create a group with 8 members or ask any 8 people nearby. Inquire a specific
question that a possible answer is a number. Record those answers of each
member of the group or the people you have surveyed. Solve for the sample statistic
using the computations that we have discussed.
You may also choose from any of the following topics for survey.
Assessment
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
1. What do you call a number that describes a sample characteristic?
A. data C. sample
B. parameter D. statistic
2. What do you call a number that describes a population characteristic?
A. data C. sample
B. parameter D. statistic
3. Which of the following is a statistic?
A. 2 C.
B. µ D. s
For numbers 4-6, the following are scores of students in their quiz:
7, 11, 8, 8, 19, 15, 7, 9, 9, 20, 17, 14
184
B. 11 D. 14.4
5. What is the value of the population variance? (2 points)
A. 21 C. 23.43
B. 22.91 D. 27.33
6. What is the value of ? (2 points)
A. 4.58 C. 4.84
B. 4.79 D. 5.23
For numbers 13-15, the following are the seven scores of the students above that are
randomly selected:
7, 8, 8, 15, 9, 20, 17
Additional Activities
Given a set of data, which is more advantageous, computing the population
parameter or the sample statistic? Why?
What are the advantages and disadvantages of choosing parameter over statistic
and vice versa?
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
Statistics and
Probability
Identifying Sampling
185
Distribution of Statistics
Statistics and Probability
Identifying Sampling Distribution of Statistics
First Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
Writer:
Coren R. Manalo
Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Edna E. Eclavea
Management Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Team:
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann
O. Briones Celestina M. Alba, Asuncion C. Ilao
Telefax: 02-8682-5773/8684-4914/8647-7487
186
you to better understand them. Your adept at analysis will help you appreciate the
richness, and beauty of Statistics which will motivate you to apply to similar events
and create statistical measures of your own. This module was written and intended
Week
for senior high students like you who are taking Statistics and Probability subject.
What I Know
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
187
B. 5 D. 3
5. Which of the following sample mean has the greatest frequency in the
sampling distribution?
A. 1.5 C.
2.5
B. 2 D. 3
6. What is the frequency of sample mean 4?
A. 3 C. 1
B. 2 D. 0
7. Wat is the probability of the sample mean 3.5 in the sampling
distribution?
1 2
A. C.
6
3
3
B. D.
4
8. What is the lowest value of the sample mean in this sampling
distribution?
A. 0 C.
2
B. 1.5 D.
2.5
9. Which of the following histogram correctly shows the sampling
distribution of sample mean of size n = 2?
A. 2 C. 2
6 6
1 1
6 6
188
12.Which of the following sample mean has the least frequency in the
sampling distribution?
A. 5
C. 7
B. 6.33 D. 8.67
13.What is the probability of the sample mean 8.33 in the sampling
distribution of sample means?
2 1
A. C.
10 5
B.
D.
14.What is the lowest value of the sample means in this sampling
distribution?
A. 5
C. 7
B. 6.33 D. 8.67
15.Which of the following histogram correctly shows the sampling
distribution of sample means of size n = 3?
During your junior high school classes, you already discussed frequency
distribution and on the earlier part of this course, you also tackled the probability
distribution. In this module, we will discuss another distribution, the sampling
distribution of the mean. Sampling distribution shows all the possible samples, in a
189
given population, together with their mean. It describes the probability of the mean
of all sets of samples with the same sample size n. The idea of sampling distribution
helps us to recognize how we use one random sample to conclude for the
population. It defines the entire possible statistic we could conclude up with.
What’s In
In this lesson, you will learn how to construct a sampling distribution of
the sample mean. This part consists of three activities to prepare you for the
lesson. To begin with, the concept of combination tells us the number of all
possible random samples of population size N taken at sample size n.
2. 6C4
3. 5 C3
4. 3C2
5. 7C5
15 16.5 15 13 15.5
16 14.5 14 15.5 15
16 15 16.5 14.5 17
190
What’s New
A population of Senior High School consists of numbers 1, 2, 3, 4, and 5.
Let us create a sampling distribution of size 3 by following the steps.
A. First, let us evaluate the combination of N (population size) objects taken
n (sample size) at a time.
Therefore, we have
NCn = 5C3 = ____________________ samples
To get this, we apply the formula for a combination of N taken n at a
time, NCn, where N is the size of the population, and n is the sample size.
B. For the second step, write all the possible samples and compute for the
mean of each sample. To complete the table, compute for the sample
mean, samples are already given.
Observation Sample Mean
1 1, 2, 3 (1 + 2+ 3) ÷ 3 = 6 ÷ 3 = 2
2 1, 2, 4 2.33
3 1, 2, 5
4 1, 3, 4
5 1, 3, 5
6 1, 4, 5
7 2, 3, 4
8 2, 3, 5
9 2, 4, 5
10 3, 4, 5
C. Then, construct a frequency distribution of the sample mean.
Sample Mean Frequency
2
2.33
2.67
3
3.33
3.67
4
191
D. Lastly, add another column for the probability distribution. The
probability distribution of the sample means is also called the sampling
distribution of the sample mean. Since we have 10 possible outcomes, for
the probability, the denominator will be 10 and the number on the
numerator part will be the number of frequencies.
E.
Sample Means Frequency Probability P(x)
2 1
2.33 1
2.67 2
3 2
3.33 2
3.67 1
4 1
2
10
1
10
What is It
Sampling distribution of the sample means is a frequency distribution using
the computed sample mean from all the possible random samples of a particular
sample size taken from the given population.
NC n
1, 2, 3 2
192
1, 2, 4 2.33
1, 2, 5 2.67
1, 3, 4 2.67
1, 3, 5 3
1, 4, 5 3.33
2, 3, 4 3
2, 3, 5 3.33
2, 4, 5 3.67
3, 4, 5 4
What’s More
Independent Practice 1A
How many different samples of size n can be obtained from the following
population N sizes?
N N NCn
1. 4 2
2. 6 4
3. 8 3
4. 10 4
5. 15 5
Independent Practice 1B
I. A population consists of the values (1, 4, 3, 2). Consider samples of size 2 that
can be drawn from this population.
a. List down all the possible samples and corresponding sample mean
Sample Sample Means
193
b. Construct the sampling distribution of the sample means.
Sample Means Frequency Probability P(x)
Independent Assessment
Coren was able to sell several pairs of Marikina shoes that have sizes of 4, 5, 6, 7,
and 8. Consider samples of size 3 that can be drawn from this population. a. List
down all the possible samples and corresponding sample mean.
Sample Sample Mean
194
What I Have Learned
Let us see what you have learned in this module by completing the following
statements.
A _________________________ of the sample mean is a frequency distribution using
the computed sample mean from all the possible random samples of a particular
sample size taken from the given population.
What I Can Do
Since you already know how to construct the sampling distribution of the sample mean
and its corresponding histogram, let us apply your knowledge in the following data.
The ages of ALS learners in Bagong Pag-asa Community Learning Center (CLC) are
19, 20, 15, 17, 27, and 22. Consider samples of size 4 that can be drawn from this
population.
a. List down all the possible samples and corresponding sample mean.
195
c. Draw a histogram of the sampling distribution of the sample mean.
Assessment
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
1. What distribution pertains to the frequency distribution of the sample mean
from all the possible random samples of a particular sample size n taken
from the given population?
A. frequency B. normal C. population
D. sampling
2. Which of the following is NOT a step-in creating sampling distribution of the
sample mean?
A. Determine the number of sets of all possible random
samples.
B. Compute for the standard deviation and variance of the
samples.
C. Construct a frequency distribution table of the sample
means and probability.
D. List all the possible random samples and solve for the
sample mean of each set of samples.
3. Which of the following is the mean of sample 6, 10, 21, 25, and 28?
A. 15.17 B. 18 C. 21.2 D. 22
For numbers 4-9, refer to the following set of data of a population {11, 12, 13, 14}.
4. How many different samples of size n = 2 can be drawn from the
population?
A. 6 B. 5 C. 4 D. 3
5. Which of the following sample mean appears most frequent in the
sampling distribution?
A. 10 B. 11.5 C. 12
D. 12.5
6. What is the lowest value of the sample mean in this sampling
distribution?
A. 11.5 B. 12 C. 12.5
D. 13
7. What is the frequency of the sample mean 12.5?
A. 0 B. 1 C. 2 D. 3
196
2
2 6
6
1
1 6
6
5 5
15. Which of the following histogram correctly shows the sampling
distribution of sample means of size n = 3?
2 2
A. C.
10 10
1
1
10
10
2
2
B. 10 0 4 4.67 5.33 6 6.67 7.33 8 D.
10 0 4 4.67 5.33 6 6.67 7.33 8
1
1
10
10
Additional Activities
Ask a family with five members in your community. Now, consider samples of size 3
that can be drawn from the data.
197
2. List all the possible samples and compute the sample mean.
3. Construct the sampling distribution of the sample mean.
4. Draw a histogram corresponding to the sampling distribution of the sample
mean.
Statistics and
Probability
Finding the Mean and Variance
of the Sampling Distribution of
the Sample Mean
Statistics and Probability
Finding the Mean and Variance of the Sampling Distribution of the Sample Mean First
Edition, 2020
198
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
Author:
Hermenegildo R. Pineda
Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Edna E. Eclavea
Management Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Team:
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann
O. Briones Celestina M. Alba, Asuncion C. Ilao
Telefax: 02-8682-5773/8684-4914/8647-7487
199
At the end of this module, you are expected to find the mean and variance of the
Week
sampling distribution of the sample mean.
6 What I Know
Let us see how far you will go and how much you know about this pretest. Let’s get
started!
Direction: Read and analyze each item carefully. Shade the entire circle for each
statement that corresponds to your answer. The test has a total of 15 points.
A B C D 1. Which of the following measures the dispersion of the
O O O O sampling distribution of the sample mean and is
calculated using the relation
∑(𝑋 − 𝜇)2 • 𝑃(𝑋) or ∑[𝑋2 𝑃(𝑥)] − 𝜇2? (1 point).
B. D.
200
A B C DO 7. What is the mean of the sampling distribution of
O O O the sample means? (3 points)
A. 3.44 C. 4.92
B. 4.60 D. 5.20
What’s In
Let’s have this first for a review. Find out if you are ready to learn the next lesson by
answering the activity.
A. The following are sample data obtained from the Statistics quiz of 22
students in Math High School in Quezon Province. Compute the mean and
variance of the frequency distribution.
Scores F Scores f
61 1 84 2
68 1 86 2
71 1 88 2
72 1 89 1
75 1 90 1
81 1 91 3
201
82 1 95 1
83 2 96 1
Total 22
B. Mean and Variance of the Probability Distribution
Complete the table below and find the mean and variance of the given
probability distribution.
4 1 1
5 2 4
Σ(x- µ)2=10
Use the formula (3) NCn . Here N=5 and n=2.
! 3. _______
5C2 = 10
So, there are 10 possible samples of size 2 that can be drawn.
202
(4) Samples (4) Mean
1,2 1.5 4. _______
1,3 2.0
1,4 2.5
1,5 3.0
2,3 2.5
2,4 3.0
2,5 3.5
3,4 3.5
3,5 4.0
4,5 4.5
1.5 1 1/10
2.0 1 1/10
2.5 2 1/5
3.0 2 1/5
3.5 2 1/5
4.0 1 1/10
4.5 1 1/10
Total 10 1.00
(6)
6. _______
P. Mean)2 b._______
Mean)2
c._______
203
1.5 1/10 -1.5 2.25 0.225 d._______
Solution:
1. Compute the mean of the population using the formula µ = 𝛴𝑥/𝑁. The value
equals to 3.0.
2. Compute the variance of the population using the formula 𝜎2 = 𝛴(𝑥 − µ)2/ 𝑁.
a. Subtract each measurement by the computed population mean.
b. Square the results obtained in (a) then add. Divide the sum by the frequency
of measurement to get the value of the population variance. The value equals
to 2.0.
3. Determine the number of possible samples of size 2 (without replacement). Use
the combination formula NCn where N is the population size and n is the sample
size.
4. List all possible samples and compute the corresponding means.
5. Construct the sampling distribution of the sample means.
6. Compute the mean of the sampling distribution of the sample means. Follow
these steps:
a. Multiply each sample mean by the corresponding probability.
b. Add the results obtained in a. (The sum of the values corresponding
to that column is the mean of the sampling distribution of the sample
means.)
𝜇𝑥̅ = ∑̅[̅𝑋̅ • 𝑃(𝑋̅)]
7. Compute the variance of the sampling distribution of the sample means using
the formula 𝜎𝑥2̅ = ∑[𝑃(𝑋̅) • (𝑋̅ − 𝜇)2].
204
distribution of sample mean
What is It
Based on the previous activity, the mean of the population is equal to the
sample mean. This happens because all samples obtained of n size came from the
same population. The variance of the population is greater than the variance of the
sample means since a sample of n is less than the population N then the data vary
by a small amount as compared to the population. The activity also taught you the
steps in finding the mean and variance of the sampling distribution of the sample
mean.
Mean of the Sampling Distribution of the Sample Mean
∑[𝑋̅ • 𝑃(𝑋̅)] = sum of the products of the sample mean and the
probability of the sample mean
𝜇 = population mean
𝑛 = sample size
𝑁 =population size
𝑋̅ = sample mean
(𝑋̅ − 𝜇)2 = square of the difference between the sample mean and
population mean
𝑋̅2 • 𝑃(𝑋̅)= product of the square of the sample mean and the
probability of the sample mean
∑[𝑋̅2 • 𝑃(𝑋̅)] = sum of the product of the square of the sample mean
and the probability of the sample mean
What’s More
Independent Activity
205
MEAN AND VARIANCE OF THE SAMPLING DISTRIBUTION OF THE SAMPLE
MEAN.
Direction: Perform what is being asked. Follow the steps indicated in the first
column of the table to complete the activity. In the second column, write your
solution on the spaces provided.
Situation: A population of PWD learners in 5 schools in a certain municipality of
Quezon Province are (𝒙) 9, 5, 6, 12, and 15. Suppose that two schools were selected
as samples, determine the mean and variance of the sampling distribution of
sample mean.
Steps Illustration/Solution
1. Compute the mean Formula:
of the population µ. (1) µ = 𝛴𝑋/𝑁= __________
2. Compute the
variance of the 𝑿 (a) 𝑿 − µ (b) (𝑿 − µ)𝟐
population 𝜎2.
206
4. List all possible
samples and their Samples of size 2 (4) Mean (4)
corresponding
means.
5. Construct
the (5) Sampling Distribution of Sample Means
sampling distribution
Sample Mean Frequency Probability
of the sample means.
(𝑿̅) (𝒇) 𝑷(𝑿̅)
Total (f)
a. Multiply the
sample mean by
the corresponding
probability.
Total (b)
b. Add the results.
207
7. Compute the
variance of the Sampl Probability (a) (b) (c)
sampling distribution e Mean S. Mean (S. Mean P. Probability•(S.
of the sample mean. Mean) 2
Mean-P.
– P.Mean
Follow these steps: Mean)2
a. Subtract the
population mean from (𝑿̅) 𝑷(𝑿̅) (𝑿̅ − 𝝁) (𝑿̅ − 𝝁)𝟐
each sample mean. 𝑷(𝑿̅) • (𝑿̅ − 𝝁)𝟐
b. Square the
difference.
c. Multiply the
results by the
corresponding
probability.
Total (d)
d. Add the results. Note: S. Mean = Sample Mean, P. Mean = Population Mean (7)
Variance:
]= ____________
the sample means, you may use the formula 𝜎𝑥̅2 = ∑[𝑃(𝑋
̅ ) • (𝑋̅ − 𝜇 )2 ]or
2 = 2•
𝜎 𝑁−𝑛 for finite population (sampling without replacement) and
𝜎2
𝜎𝜎 2 =
𝑥̅ 𝑛 𝑁−1
𝜎𝑋̅ = √𝑛
𝑥̅ 𝑛 for infinite population (sampling with replacement). Also, the
𝜎
standard error of the mean is computed using the formula
Independent Assessment
Direction: Provide step-by-step solutions to the given problem below. Write your
answers on the spaces provided.
Consider all samples of size 2 from the population 2, 5, 6, 9, 11, and 13. Compute
the mean, and variance of the sampling distribution of the sample mean.
208
First step:
Second step
a)
b)
Third step
Fourth step
Fifth step
Sixth step a)
b)
Seventh step
a)
b)
c)
d)
1. What are the 7 step-rule in finding the mean and variance of the sampling
distribution of the sample mean?
2. What are the formulas that can be used to determine the mean and variance of
the sampling distribution of the sample mean for finite population (sampling
without replacement) and for infinite population (sampling with replacement)?
What I Can Do
Make use of your acquired skills in this lesson.
Direction: Perform what is being asked. Solve the problem by showing the step-
bystep process.
Situation: Joe, a DOH COVID-19 tracker, found out that the number of infected
persons of COVID-19 in seven low-risk provinces of the country are 1, 9, 2, 12, 8,
7, and 10. Suppose that 5 provinces are drawn as sample.
1. What is the mean (𝜇), variance (𝜎2) and standard deviation of the population (σ)?
209
2. How many different samples of size n =5 can be drawn from the population?
List them with their corresponding mean.
3. Construct the sampling distribution of the sample mean.
4. What is the mean (𝜇𝑥̅) of the sampling distribution of the sample mean?
Compare this to the mean of the population.
5. What is the variance of the (𝜎𝑥2̅ ) of the sampling distribution of the sample
mean?
Assessment
Direction: Read and analyze each item carefully. Shade the entire circle for each
statement that corresponds to your answer. This test is equivalent to 15 points.
A B C D 1. Which of the following is described as the quotient of
O O O O the population variance and the sample size n? (1
point)
A. mean of the sampling distribution of sample
mean
B. variance of the sampling distribution of the
sample mean
C. standard deviation of the sampling distribution of
the sample mean
D. average deviation of the sampling distribution of
the sample mean
B. D.
210
A B C D 7. What is the mean 𝑥̅ of the sampling distribution of the
O O O O sample mean? (3 points)
A. 4 C. 6
B. 5 D. 7
Equivalent Description
point/s
2 points Complete solution with the correct answer
1 point Last major step of the solution is incorrect
0 point Wrong solution or no answer
Additional Activities
Miss Mathica Kumpleto (MMK), a SHS Math teacher in Mathalino National
High School would like to find the mean and variance of the sampling
distribution of the sample mean of the height of her students by forming a group
consisting of 6 members. She also said that they need to determine the height (in
meters) of each member of the group and to get a list of the average height of
each group. She further instructed her students to submit their answer with the
following data.
Statistics and
Probability
211
Defining Sampling Distribution
of the Sample Mean for Normal
Population
Statistics and Probability
Defining Sampling Distribution of the Sample Mean for Normal Population
First Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
Writers:
Hermenegildo R. Pineda
Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Kelvin A. Ramintas and Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Team:
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann
O. Briones Celestina M. Alba, Asuncion C. Ilao
Telefax: 02-8682-5773/8684-4914/8647-7487
212
b) unknown
At the end of this module, you are expected to define the sampling
Week
distribution of the sample mean for the normal population when the variance is
known and unknown.
6 What I Know
Let us see how far you will go and how much you know about this pretest. Let’s get
started!
Direction: Read and analyze each item carefully. Shade the entire circle for each
statement that corresponds to your answer.
A. C.
B. D.
A B C D 4. The mean weight of two thousand (2000) 11-year-old children O
O O O is 35 kilograms with a standard deviation of 5 kilograms. Samples of 25
children were selected and the mean weight of this group was found to be 34
kilograms with a standard deviation of 3 kilograms. What condition of the
sampling
A B C DO A. Population variance is known
O O O B. Population variance is unknown
distribution of C. Sample standard deviation is small
the sample D. Sample standard deviation is large
mean for
normal
population
manifests in the
situation?
213
5. Which of the formulas is used to calculate the standard deviation of the sampling
distribution
mean when the population standard deviation is of the
unknown? sample
A. C.
B. D.
A B C D 6. A population consists 1, 9, 2, 12, 8, 7, and 10. Suppose samples
O O O O of size 5 are drawn from this population. This situation is an example
of the sampling distribution of the sample mean for a normal
population wherein population variance is __________.
A. unknown C. large
B. known D. small
A B C D 7. Suppose that the rice mill wants to test its product to find out
O O O O whether it meets the average weight of 50 kilograms. Twenty-five
sacks of rice are weighed and found to have an average
weight of 49.8 kg with a standard deviation of 0.12 kg. The
rice mill owner is in a situation of the sampling distribution
of the sample mean wherein the population variance is
__________.
A. unknown C. large
B. known D. small
A B C D 8. Suppose the manager of a Jollibee Corp. wishes to know
O O O O whether their chicken packaging machine still packs accurately
A B C DO A. variance is known
O O O after 6 B. variance is unknown
months of use. C. standard deviation is small
The machine D. standard deviation is large
was rated to 9. This distribution is used to determine the probability of an event by
pack 1,500- transforming the mean of the sample to an approximately standard normal
gram pack of variable if the population variance is known and sample size is greater than
chicken with a 30.
standard A. t-Distribution
deviation of 30
B. Z-Distribution
grams. The
C. Chi-Square Distribution
statistician
D. Pearson Correlation Coefficient
tests a sample
of 40 packs to
A B C DO O O O
test the
accuracy of the
machine. What
example of
sampling
distribution of
the sample A B C DO O O O
mean for
normal
population
portrays the
situation?
A B C DO O O O
214
A B C DO A. unknown
O O O B. known
C. large
D. small
11. Which of the following formulas is used to calculate the standard deviation of
A B C DO the sampling distribution of the sample mean when the population standard
O O O
deviation is given? A. C.
B. D.
What’s In
Let’s have a review first. Find out if you are ready to learn the next lesson by answering the
activity.
Mark is conducting a survey on Senior High School students who are active on environmental
activities on their community. Given a population consisting of the following measurements (𝑋) 2,
8, 11, 7, 𝑎𝑛𝑑 3. Suppose that samples of size 2 are drawn from the population. Compute the mean
215
and variance of the sampling distribution of the sample mean. Complete the table below to
accomplish this task.
Table 1: List of samples of size 2 and the corresponding sample mean.
Sample Frequency Probability
Mean (𝒇) 𝑷(𝑿̅)
(𝑿̅)
Total (f)
Total
216
Table 4: Variance of the Sampling Distribution of the Sample Mean
Sample Probability Sample Mean – (Sample. Probability• (Sample.
2
Mean Population. MeanPopulation. Mean-Pop. Mean)
2
Mean Mean)
(𝑋̅) 𝑃(𝑋̅) (𝑋̅ − 𝜇) (𝑋̅ − 𝜇)2 𝑃(𝑋̅) • (𝑋̅ − 𝜇)2
Total
What’s New
Activity 1: KNOWING MO NA?
Instruction: Read and analyze the following situations. Determine whether the following
statements have a known or unknown population variance. Write your answer on the space
provided. Identify also the formula to be used to estimate the standard error of the mean by
writing the symbol 𝜎𝑥̅, when the population variance is known and the symbol 𝑠𝑥̅ when the
population variance is unknown
Standard Error
Situation Answer
Formula
1. Because of Inclusive Education, learners with
disabilities are also part of the normal students.
Consider a population of the PWD learners
consisting of 1, 2, 3, 4, and 5. Samples of size 2
are drawn from this population.
2. Given 𝜇 = 15, 𝑠 = 6 𝑎𝑛𝑑 𝑛 = 16
3. IQ scores are normally distributed with standard
deviation 𝜎 = 15 𝑝𝑜𝑖𝑛𝑡𝑠. You wish to determine the
𝜇 IQ of all students in Matiyaga National High
School by taking a random sample of 16 students.
4. The general weighted average (GWA) of the Grade 7
applicants for admission in QSHS over the past 7
years has been consistently
𝜇 = 92.78. This year a sample data of the applicant’s
GWA are 98, 89, 92, 95, 96, 90, 91, 92, 97 and 93
5. A finite population composed of six items whose
values are 2, 3, 6, 7, 8, and 10. Samples of 3
items are drawn at random without replacement.
8. Known – This is a population data. Although the value of the variance is not given, you can
still determine the population mean and population variance of the data using the formulas
respectively. So, this situation is an example of the sampling
distribution when the variance is known. Since the population variance can be calculated,
used in the computation of the standard error of the mean.
9. Unknown- The population variance is unknown. The given values are the population mean 𝜇
= 15, the sample standard deviation 𝑠 = 6, and the sample size 𝑛 = 16. Use the formula
for the standard error of the mean.
217
10.Known- the population variance is given with the value of the square of the standard
deviation 𝜎2 = 152. Apply the formula in computing for the standard error of the mean.
11.Unknown- The population variance is unknown. The only given values are the population
mean 𝜇 = 92.78 and the sample size 𝑛 = 10. Use the formula for the standard error of
the mean.
12.Known – This is a population data, you can determine the population mean and variance of
the data using the formulas . So, this situation is an example of the
sampling distribution where the population variance can be computed. Apply the formula
to solve for the standard error of the mean.
What is It
Based on the previous activity, you learned from the situations presented that by analysis, you
can easily find out if the given problem provides the value of the population variance or if the
population variance is unknown. Also, when the population variance is known, the standard
deviation of the sampling distribution of the mean is computed using the formula , while
the formula is used to estimate the standard error of the mean when the population
variance is unknown.
Distribution of the Sample Mean for Normal Population
1. Population variance 𝜎2 is known
The population has a mean μ and variance of 𝜎2, the distribution of the sample
mean is (at least approximately) normal and standard error of the mean , where σ is
the population standard deviation and 𝑛 is the sample size. To determine the probability of
a certain event, we can use the 𝑧 − 𝑑𝑖𝑠𝑡𝑟𝑖𝑏𝑢𝑡𝑖𝑜𝑛 by transforming the mean of the sample
data to an approximately normal variable 𝑧 , using the relation . This distribution is
best applied
What’s More
Independent Activity
DEFINE THE SAMPLING DISTRIBUTION OF THE SAMPLE MEAN FOR NORMAL
POPULATION WHEN THE VARIANCE IS KNOWN AND UNKNOWN.
Instruction: Read and analyze the following situations and supply the values of the following
variables (if there is any). On the third column, write known on the space provided if the
situation gives or can compute the value of the variance, otherwise write unknown. Identify also
the formula to be used to estimate the standard error of the mean by writing the symbols 𝜎𝑥̅,
when the population variance is known and 𝑠𝑥̅ when the population variance is unknown.
218
Situation Given Answer Standard
error
formula
1. Consider a population 𝑁 = ____, 𝑛 = ______,
consisting of 1, 2, 3, 4, 5, 8,
9, and 11. Samples of size 3
are drawn from this
population.
2. Given the population mean 𝑛 = ______
of 12, and a sample
standard deviation of 3 in a 𝜇 = ______
sample size of 125 𝑠 = ______
3. A population composed of 𝑁 = ____, 𝑛 = ______,
11 items whose
measurements are 12, 7,
9,11,8, 20, 23, 18, 13, 22,
and 10. Samples of 5 items
are drawn at random
without replacement.
4. The owner of the drugstore 𝑁 = ____, 𝑛 = ______,
chain has 100 drugstores
𝜇 = ______,
that sell Flexcoslow, a
surgical mask that prevents 𝜎2 = ______,
a person from catching 𝑋̅ = _______
liquid droplets of viruses.
𝑠 = _______
His operation manager said
that the average drugstores
in his chain sell 12 boxes of
the Flexcoslow a day with a
standard deviation of 2
boxes. To find out if this is
true, the owner randomly
selected 10 stores and
found that the mean is 15
boxes with a standard
deviation of 1 box.
5. The mean height of the 𝑛 = _____,𝜇 = ______,
students at Maligaya
Elementary School is 𝑋̅ = _____, 𝑠 = ______
137.16 cm. To find out if
the statement is true, a
sample of 15 students is
taken from the population.
The mean of the sample is
139.1 cm with a standard
deviation of 2.23 cm.
4. How did you find the activity?
5. How does each situation differ from each other?
6. What is the difference between the two symbols 𝜎𝑥̅ and 𝑠𝑥̅ in finding the
standard error of the mean?
Independent Assessment
Instruction: Read and analyze the following situations and supply the values of the following
variable in the second column. Determine whether the following statements have a known or
unknown population variance and provide a brief explanation to support your answer. Also,
219
identify the formula to be used for the standard error of the mean by writing 𝜎𝑥̅, when the
population variance is known and 𝑠𝑥̅ when the population variance is unknown.
Standard
Situation Given Answer
Error Formula
𝑛 = ______,
2. Given the population mean of 98, and a
sample standard deviation 2 in a sample 𝜇 = ______,
size of 45.
𝑠 = ______
3. A population composed of 10 items whose
measurements are 12,11, 15, 8, 20, 23, 𝑁 = _____,
18, 13, 22, and 10. Samples of 6 items 𝑛 = ______,
are drawn at random without
replacement.
4. An SHS teacher claims that the average 𝑛 = _____
time it takes a group of students to 𝜇 = ______
complete the Mathematics examination is
50.5 minutes with a variance of 17.64 𝜎2 = ______
𝑚𝑖𝑛𝑢𝑡𝑒𝑠2. She randomly selected 45
𝑋̅ = _______
students and found to have a mean of 52
minutes and a standard deviation of 3.5
𝑠 = _______
minutes. She then used the z-distribution
to find out if the group can complete the
exam faster than the population.
5. A manufacturer of lightbulbs produces 𝜎 = ______ , ,,
bulbs that last a mean of 800 hours with
𝑛 = ______ ,
a standard deviation of 100 hours. To
assess the claim of the manufacturer, a 𝜇 = ______ ,
random sample of 12 of these bulbs was 𝜎2 = _____
tested and found to have a mean of 790 𝑋̅ = _______
hours. He then used the test variate 𝑡 =
𝑠 = _______
What I Can Do
Make use of your acquired skills in this lesson.
Instruction: Read and analyze the following situations. Determine whether the following
statements have a known (or can be computed) or unknown population variance. Write your
answer on the space provided. Also, identify the formula to be used for the standard error of the
mean by writing 𝜎𝑥̅, when the population variance is known and 𝑠𝑥̅ when the population variance
is unknown.
Assessment
Direction: Read and analyze each item carefully. Shade the entire circle for each statement that
corresponds to your answer.
A. C.
B. ) D.
A B C D 4. It is the distribution used to estimate the population O O O O parameters for
small sample size and when the population
A B C DO A. z-Distribution
O O O B. t-Distribution
variance is C. Chi-Square Distribution
unknown? D. Pearson Correlation Coefficient Distribution
221
5. Which of the
following
formulas is
used to
calculate
the
standard
deviation of
the
sampling
distribution
of the
sample
mean (for
infinite
population)
when the
population
standard
deviation is
given?
C. 𝜎 𝑁−𝑛
𝜎𝑥̅ = √𝑛 • √ 𝑁−1
𝜎
𝜎𝑥̅ = √𝑛
D.
A B C DO
O O O
222
6. A population 11, 19, 22, 2, 8, 9, and 15. Suppose samples of 2 are drawn from this
population.
is an example of the sampling distribution of the sample This
mean for a normal population with population variance is situation
__________.
A. unknown C. large
B. known D. small
A B C D 7. Suppose that the rice mill wants to test its product to find out
O O O O whether it meets the average weight of rice of 50 kilograms with a
standard deviation of 0.18 kg. Thirty sacks of rice are weighed
and found to have an average weight of 50.8 kg and a
standard deviation of 0.15kg. The rice mill owner is in a
situation of the sampling distribution of the sample mean
with population variance is __________.
A. unknown C. large
B. known D. small
A B C D 8. The average family income in the Philippines in 2019 was
O O O O P22,250 with a standard deviation of P1,250. In a certain
municipality, a sample of 50 families had an average income
of P25,000. This situation is about the sampling distribution
of the sample mean where the population ________.
A. Variance is known
B. Variance is unknown
C. Standard deviation is small
D. Standard deviation is large
A B C D 9. This is used to determine the probability of an event by
O O O O transforming the mean of the sample to a variable of this
distribution where the population variance is unknown, and
the sample size is less than 30.
A. Z-Distribution
B. t-Distribution
C. Chi-Square Distribution
D. Pearson Correlation Coefficient
A B C D 10. Given 𝜇 = 15, 𝜎 = 5, and 𝑛 = 50. This is an example of the
O O O O sampling distribution of the sample mean for a normal population
when the population variance is _______.
A. unknown C. large
B. known D. small
A B C D 11. Which of the following formulas is used to calculate the O O O
O population standard deviation?
consists
A B C DO
B. D. 𝜎2 = 𝛴(𝑋 − µ)2/ 𝑁
O O O
12. The Department of Health has the following data of the number of infected
A.
persons of Covid-19 in five high-risk areas in the country, 51, 89, 129, 215,
and 297. Suppose samples of size 2 high-risk areas are drawn from this
population, what example of the sampling distribution of the sample mean for
normal population portrays the situation?
C A. variance is known
. B. variance is unknown
C. standard deviation is small
D. standard deviation is large
A B C DO O O O
A B C DO O O O
223
A B C D sample mean for a normal population when the population variance is
O O O O _______.
A. unknown C. large
13. Given 𝜇 = B. known D. small
88, 𝑋̅ = 90 𝑠 14. A researcher used to develop a problem-solving test to randomly selected 100
= 2, and 𝑛 = Grade 11 students. From this sample, the mean and standard deviation is 90
15. This is and 4 respectively. The mean of the population is 88. The researcher is in a
an example situation of the sampling distribution of the sample mean wherein the
of the population variance is __________.
sampling A. unknown C. large
distribution B. known D. small
of the 15. Given 𝜇 = 54.2, 𝜎 = 7.8, 𝑋̅ =60.2 and 𝑛 = 76. This is an example of the
sampling distribution of the sample mean for a normal population when the
population variance is _______.
A. unknown C. large B. known
D. smal
Additional Activities
Instruction: Determine whether the following statements has a known or unknown
population variance and write your answer on the second column. Identify also the
formula to be used to estimate the standard error of the mean by writing the
symbol 𝜎𝑥̅, when the population variance is known and the symbol 𝑠𝑥̅ when the
population variance is unknown.
Statistics and
Probability
224
Illustrating the Central Limit
Theorem
Statistics and Probability
Illustrating the Central Limit Theorem
First Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
Writers:
Rolando S. Merle
Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Team:
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann
O. Briones Celestina M. Alba, Asuncion C. Ilao
Telefax: 02-8682-5773/8684-4914/8647-7487
225
that you can do it, and understand this module on your own. This module
discusses the context of our current situation right now.
Week
After going through this module, you are expected to:
What I Know
Let us see how far you will go and how much you know about this pre-test. Let’s get
started!
Directions: Read and analyze each item carefully. On a separate paper, write the
letter of the correct answer for each statement that corresponds to your answer.
The perfect score is 15 points. Use the rubric provided after the test.
1. If the mean of the sampling distribution of the mean is 6.5, which of the following
statements best describe the population mean? A. The population decreases by
6.5.
B. The population is greater than 6.5.
C. The population mean is also equal to 6.5.
D. The population mean and mean of the sampling distribution of the means
cannot be compared.
2. What does the Central Limit Theorem state?
A. the sample size is large
B. all possible samples are selected
C. the standard error of the sampling distribution is small
D. the standard deviation is sufficiently large than the normal
3. The Central Limit Theorem states that the mean of the sampling distribution of
the sample mean is
A. larger than the population mean.
B. exactly equal to the population mean.
C. close to the population means if the sample size is large.
D. equal to the population mean divided by the square root of the sample size.
4. The _________ states: “As the sample size n increases, the shape of the
distribution of the sample means taken without replacement from a population
with mean 𝜇 and the standard deviation 𝜎 will approach a normal distribution.”
A. Z distribution
B. Central limit theorem
C. chi distribution
D. Pythagorean theorem
5. If a population has a mean of 12.8, what is the mean of the sampling
distribution?
A. less than 12.8
B. larger than 12.8
C. closer to 12.8
D. exactly the same as 12.8
226
7. Which of the following descriptions does NOT illustrate the Central Limit
Theorem? (2 points)
A. The Central Limit Theorem is used to approximate the distribution of the
sample mean over the distribution of the population mean.
B. If the sample size n, where n is sufficiently large is drawn from any
population with mean 𝜇 and a standard deviation 𝜎, then the sampling
distribution of sample means approximates a normal distribution.
C. Whenever the population is not normally distributed, or if we do not know
distribution, the Central Limit Theorem allows us to conclude that the
distribution of sample means will be normal if the sample size is
sufficiently large.
D. Given a random variable X with mean 𝜇 and variance 𝜎2, then regardless of
whether the population distribution of X is normally distributed or not, the
shape of the distribution of the sample means taken from the population
approaches a normal distribution.
8. Which of the following statements is NOT true about Central Limit Theorem? (2
points)
A. The population mean and the mean of the sampling distribution of
the mean are equal.
B. The variance of the sampling distribution of the mean and the population
variance is exactly the same.
C. The central limit theorem tells us exactly what the shape of the
distribution of the mean will be when we draw repeated samples from a
given population.
D. If you take repeatedly independent random samples of size n from any
population, then when n is large, the distribution of the sample mean will
approach a normal distribution.
9. Consider the population consisting of values {2, 4, 6}. List all the possible
samples of size 2 which can be drawn with replacement. (2 points)
A. {2,4,6}
B. {(4,2),(4,4),(4,6),(6,2),(6,4),(6,6)
C. {(2,2),(2,4),(2,6) ,(4,2),(4,4),(4,6)}
D. {(2,2),(2,4),(2,6),(4,2),(4,4),(4,6),(6,2),(6,4),(6,6)}
10. If the population has a variance of 4.6, what is the variance of the
sampling distribution of its means if the sampling distribution was derived with
sample
size n = 2 and all possible samples are drawn with replacements? (2 points)
A. 21.16
B. 19.22
C. 10.58
D. 2.3
227
lesson about parameter and statistic and the sampling distribution of the sample
mean which were previously discussed. A portion of this module will help you recall
these topics.
As a start, this module presents the current concerns of our community. I
believed that you are updated on the news about the impact of COVID-19 in many
of our fellow Filipino people and to our community. Here is a glimpse of how it was
started.
To visualize the content of this module, situations and data set used here are
based on this unfortunate event that devastated the lives of many people. Statistics
is a tool that somehow helps you to appreciate the importance of data analysis.
Some situations and images placed here were based on the personal
experience of the writer of this module to help you understand and illustrate the
Central Limit Theorem.
What’s In
Activity 1
Consider a population of Senior High School consisting of the values 1, 2, 3,
4, 5, and 6.
Compute the following:
1. population mean
2. population variance
3. population standard deviation
4. illustrate the probability histogram of the sampling distribution of the means
Activity 2
Consider all the possible samples of size 2 that can be drawn with
replacement from the population 1, 2, 3, 4, 5, and 6. Create a sampling distribution
of the sample mean.
Compute the following:
1. Mean of the sampling distribution of the sample mean
2. Variance of the sampling distribution of the sample mean
3. Standard deviation of the sampling distribution of the sample mean
4. Illustrate the probability histogram of the sampling distribution of the mean.
What’s New
From Activities 1 and 2, compare the following:
1. population mean and the mean of the sampling distribution of the sample
mean
2. population variance and the variance of the sampling distribution of the
sample mean
3. population standard deviation and the standard deviation of the sampling
distribution of the sample mean
4. probability histogram of the population and the sampling distribution of the
mean with size 2
Show your comparisons
What is It
In Activity 1, we get the population mean using the formula where:
𝜇 – population mean
𝛴𝑋 − sum of all data in terms of X
N - population size Therefore, we have:
Substitute
Add each data divided by the population size
Simplify
228
𝜇 = 3.5 (population mean)
To solve for the population variance, subtract each data to the population mean
and square the result.
The table below shows the summary of the difference between each data and the
computed population mean in the second column while the square of each is on the
third column. While the fourth column shows the probabilities of each data.
σ2 ≈ 2.92
≈ √2.92
σ ≈ 1.71
0 1 2 3 4 5 6
229
For Activity 2, the table shows and summarizes the information needed to compute
the mean, variance, and standard deviation of the sampling distribution of the
sample mean.
Samp Samp
Obser Samp le (𝑿̅ Obser Sampl le (𝑿̅ (𝑿̅
vation le Mean − 𝝁vation e Mean − 𝝁𝑿̅) − 𝝁𝑿̅)𝟐
( 𝑋̅) ( 𝑋̅)
1 (1,1) 1 19 (4,1) 2.5 -1 1
2 (1,2) 1.5 20 (4,2) 3 -0.5 0.25
3 (1,3) 2 21 (4,3) 3.5 0 0
4 (1,4) 2.5 22 (4,4) 4 0.5 0.25
5 (1,5) 3 23 (4,5) 4.5 1 1
6 (1,6) 3.5 24 (4,6) 5 1.5 2.25
7 (2,1) 1.5 25 (5,1) 3 -0.5 0.25
8 (2,2) 2 26 (5,2) 3.5 0 0
9 (2,3) 2.5 27 (5,3) 4 0.5 0.25
10 (2,4) 3 28 (5,4) 4.5 1 1
11 (2,5) 3.5 29 (5,5) 5 1.5 2.25
12 (2,6) 4 30 (5,6) 5.5 2 4
13 (3,1) 2 31 (6,1) 3.5 0 0
14 (3,2) 2.5 32 (6,2) 4 0.5 0.25
15 (3,3) 3 33 (6,3) 4.5 1 1
16 (3,4) 3.5 34 (6,4) 5 1.5 2.25
17 (3,5) 4 35 (6,5) 5.5 2 4
18 (3,6) 4.5 36 (6,6) 6 2.5 6.25
∑(𝑿̅ − 𝝁𝑿̅)𝟐 =
∑ 𝑋̅ = 126
52.5
Let us now solve for the mean, variance, and standard deviation of the sampling
distribution with sample size 2 (with replacement).
= 3.5
2. Variance of the sampling distribution of the sample mean
≈ 1.46
≈ 1.21
5. To illustrate the probability histogram of the sampling distribution of the
sample, the following table is needed:
230
Sampling Distribution of the sample mean with replacement
𝑋̅ f Probability P(𝑿̅)
1 1
1.5 2
2 3
2.5 4
3 5
3.5 6
4 5
4.5 4
5 3
5.5 2
6 1
In What’s New part, we compared the values solved through the data in the
population and the sampling distribution.
1. The population mean 𝜇 = 3.5 while the mean of the sampling distribution
𝜇𝑋̅ = 3.5. Notice that the population mean is equal to the mean of the sampling
distribution. That is 𝜇 = 𝜇𝑋̅.
2. The population variance σ2 ≈ 2.92 while the variance of the sampling distribution
𝜎𝑋2̅ ≈ 1.46. Notice that if we divide the population variance by the sample size of
2, we have 1.46 which equal to the variance of the sampling distribution of the
2
sample mean. That is,
therefore
3. The population standard deviation σ ≈ 1.71 while the standard deviation of the
sampling distribution 𝜎𝑋̅ ≈ 1.21. Notice that if we divide the population standard
231
deviation by the square root of the sample size of 2, we have 1.21 which is equal
to the standard deviation of the sampling distribution. That is,
1.21. Therefore, .
4. The probability histogram of the population
wherein n=1 looks like this
P(x)
0 1 2 3 4 5 6
X
While the probability histogram of the sampling distribution of the sample mean
with size n = 2 looks like this.
The illustration above is more likely to have a curve of a normal distribution while
the probability histogram of the population is not normally distributed.
Based on our activity, it was evident in the Central Limit Theorem that the
sampling distribution will be approximately normally distributed for large sample
sizes regardless of the distribution from which we are sampling. Also, it is
important to recall that the mean of the sampling distribution of the sample mean
is equal to the population mean: 𝜇𝑋̅ = 𝜇 and the standard deviation of the sampling
distribution of the sample mean (sampling with replacement) is equal to: .
As discussed previously, if the population is normally distributed, the
sampling distribution of the sample mean is also normally distributed. But what if
the population is not normal?
That is where the Central Limit Theorem addressed this question. The
distribution of the sample mean tends toward the normal distribution as the
sample size increases, regardless of the distribution from which we are sampling.
As a simple guideline, the sample mean can be considered approximately normally
distributed if the sample size is at least 30 (n ≥ 30 ).
If the sample size is sufficiently large, the Central Limit Theorem can be used
to answer the sample mean in the same manner that a normal distribution can be
used to answer questions about individual samples. This also means that even if
the population is not normally distributed, or if we don’t know of its distribution,
232
the Central Limit Theorem allows us to conclude that the distribution of the sample
mean will be normal if the sample size is sufficiently large. It is generally accepted
that a sample size of at least 30 is large enough to conclude that the Central Limit
Theorem will ensure a normal distribution in the sampling process regardless of the
distribution of the original population. Further, we can continue to use the z
conversion formula in our calculations. This time we will use the formula,
z=
Z=
Step 3: Use the Z table to find P (Z < 2.5).
Suppose that the average age of the people living in a Barangay is 34 with a
standard deviation of 4. One hundred (100) residents of a certain Barangay decided
to take summer outing after COVID-19 pandemic and Enhanced Community
Quarantine has been lifted for bonding and relaxation.
If we make a relative histogram of samples with various sample sizes, it
would look like the histograms below.
233
POPULATION (individuals) Distribution of ages of
individuals living in a Barangay. This is an
example of uniform distribution whenever the
sample size n =1. Each individual has an equal
probability of selecting as a sample. The
histogram will look like on the left illustration not
so interesting!
What’s More
A. YOU COMPLETE ME. Choose from the pool of words/mathematical formulas
inside the box to complete the following paragraph on the important properties of
the Sampling Distribution of the Sample Mean. Then write the corresponding
words below each term to complete the sentence.
Recall on the Properties of the Sampling Distribution of Sample Mean
If all possible sample size n are drawn from a population of size N with mean
𝜇 and (1)_________𝜎2, then the sampling distribution of the sample mean has the
following properties: First, the (2)__________of the sampling distribution of the
(3)__________ is (4)__________to the (5)_____________ 𝜇. That is, 𝜇𝑋̅= 𝜇. Second, the
variance of the sampling distribution of the sample mean 𝜎𝑋2̅ is given by: for
234
SERVE JOY GLADNESS
DECODE:
Answer:
(1) (2) (3) (4) (5)
C.Samples of size n were randomly selected from populations with means (with
replacements) and variances given below. In each case, find the mean and
standard deviation of the sampling distribution of the means.
1. n = 4 𝜇 = 4.7 σ2 = 3
2. n = 7 𝜇=4 σ2 = 2
3. n = 16 𝜇=7 σ2 = 6
4. n = 22 𝜇 = 22 σ2 = 4.5
5. n = 44 𝜇 = 40 σ2 = 9.6
What I Have Learned
Let us see what you have learned in this module by completing the following
statements.
The Central Limit Theorem implies the following important ideas in statistics:
1. When the sample size tends to infinity (a very large sample) the
distribution of the sample mean X will be ______________ distributed.
2. If the sample size tends to infinity, the sample mean 𝜇𝑋̅ _____________ the
population mean 𝜇.
What I Can Do
A. Consider a population consisting of values (1,3,5).
1. List all the possible samples of size 2 that can be drawn from the population with
replacement.
235
2. Compute for the mean of the sampling distribution of the sample means.
3. Compute for the variance of the sampling distribution of the sample means.
𝑋̅ f Probability
̅
P(𝑿̅)
236
4. Which of the following descriptions about Central Limit Theorem is NOT
essential? A. The larger the sample, the better approximation will be.
B. The smaller the sample, the bigger the approximation will be.
C. When the original variable is normally distributed, the distribution of the
sample means will be normally distributed for any sample size n.
D. When the distribution of the original variable might not be normal, a sample
size of 30 or more is needed to use a normal distribution to approximate the
distribution of the sample means.
5. Suppose the average age of family members is 34 with a standard deviation of
4. If 100 members of the community decided to have a summer outing bonding
and relaxation, find the probability that the average of these members is less
than 35?
A. 91.32% C. 95%
B. 94.45% D. 99.38%
6. If a population has a mean of 12.8, what is the mean of the sampling
distribution?
A. less than 12.8 C. closer to 12.8
B. larger than 12.8 D. exactly the same as 12.8
7. If the mean of the sampling distribution of the means is 6.5, which of the
following statements best describe the population mean? A. The
population decreases by 6.5.
B. The population is greater than 6.5.
C. The population mean is also equal to 6.5.
D. The population mean and mean of the sampling distribution of the means
cannot be compared.
8. Which of the following statements is NOT true about Central Limit Theorem?
A. The population mean and the mean of the sampling distribution of the
means are equal.
B. If you take repeatedly independent random samples of size n from any
population, then when n is large, the distribution of the sample means will
approach a normal distribution.
C. The central limit theorem tells us exactly what the shape of the distribution
of the means will be when we draw repeated samples from a given
population.
D. The mean of the sampling distributions of the means, the standard deviation
of the sampling distribution of the means, and variance is the same as the
population means, variance of the population, and standard deviation.
9. Consider the population consisting of values {2, 4, 6}. List all the possible
samples of size 2 which can be drawn with replacement. (2 Points)
A. {2,4,6}
B. {(4,2),(4,4),(4,6),(6,2),(6,4),(6,6)
C. {(2,2),(2,4),(2,6) ,(4,2),(4,4),(4,6)}
D. {(2,2),(2,4),(2,6),(4,2),(4,4),(4,6),(6,2),(6,4),(6,6)}
10. If the population has a variance of 4.6, what is the variance of the sampling
distribution of its means if the sampling distribution was derived with sample
size n = 2 and all possible samples are drawn with replacements?
A. 21.16 C. 10.58
B. 19.22 D. 2.3
11. If a population has a mean of 16.7, what is the mean of the sampling
distribution?
A. less than 16.7 C. closer to 16.7
B. larger than 16.7 D. exactly the same as 16.7
12. If the mean of the sampling distribution of the means is 29.2, which of the
following statements best describes the population mean?
A. The population decreases by 29.2
237
B. The population is greater than 29.2
C. The population mean is also equal to 29.2
D. The population mean and mean of the sampling distribution of the means
cannot be compared.
13. As the sample size n increases, the mean of the sampling distribution
_____________
A. increases C. stays the same
B. decreases D. not enough information is given
14. Consider the population consisting of values (1, 3, 5). List all the possible
samples of size 2 which can be drawn without replacement.
A. {1,3,5}
B. {(3,1), (5,1), (5,3)}
C. {(1,2), (1,3), (1,5), (3,1), (3,3), (3,5)}
D. {(1,1), (1,3), (1,5), (3,1), (3,3), 3,5), (5,1), (5,3), (5,5)}
15. If the variance of a certain probability distribution is 16, find its standard
deviation.
A. 1 C. 3
B. 2 D. 4
Additional Activities
Complete the following table and solve for the unknowns.
A population of a certain ethnic group is consisting of three numbers (3, 5, 7). If
samples of size 2 will be taken with replacements, complete the following tables,
and answer the questions that follow.
(
Sample
Observation )2
Sample
s Mean
( )
1
2
3
4
5
6
7
8
9
Total
f Probability
P( )
238
1. What is the value of the following?
a. population mean
b. population variance
c. population standard deviation
d. mean of the sampling distribution of the sample mean
e. variance of the sampling distribution of the sample mean
f. standard deviation of the sampling distribution of the sample
mean
2. Illustrate the probability histogram of the sampling distribution of the
means.
Statistics and
Probability
Solving Problems Involving
Sampling Distribution of the
Sample Mean
Statistics and Probability
Solving Problems Involving Sampling Distribution of the Sample Mean First
Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
239
Development Team of the Module
Week
Writer: Maria Cristina O. Rosas
Telefax: 02-8682-5773/8684-4914/8647-7487
Your patience in solving offered problems here in the module will help you
improve your computational skills as it tackles relevant culture – based situated
problems. Your ability to interpret, reason – out and make judgement or even
decision out of statistical measures will also be practiced here. The scope of this
module permits it to be used in many different learning situations. The language
used recognizes the diverse vocabulary level of students. The lessons are arranged
to follow the standard sequence of the course but the order in which you read them
can be changed to correspond with the textbook you are now using.
What I Know
Let us see how far you will go and how much you know about the topic. Let’s get
started.
Directions: Choose the letter of the best answer. Write the chosen letter on a
separate sheet of paper.
For numbers 1-4, refer to the problem below.
240
from different private distributors. He found out that the mean pore size of all
face masks is 40 𝜇𝑚 and standard deviation of 5 𝜇𝑚. Assume that the distribution
is normally distributed, and 64 samples face masks were taken.
1. What is the mean of the sampling distribution?
A. 64 B. 40 C. 8
D. 5
2. Which of the following is the standard deviation of mean sampling
distribution?
A. 12.8
B. 1.6
C. 0.625
D. 0.078
3. The sampling distribution of 64 sample face masks falls under what kind of
distribution?
A. exponential
B. normal
C. uniform
D. straight
4. Which of the following shows the computed standard error of the mean?
A. 95
B. 8 C. 0.625
D. 0.078
5. The population of Senior High School learners in Mapayapa Integrated
School has a mean of 54.5 and a standard deviation of 8.6. If a sample of size
12 is taken at random from this population, which of the following statements
is correct?
A. The mean of the sampling distribution of the sample mean is equal to 54.5.
B. The mean of the sampling distribution of the sample mean is less than 54.5
C. The standard deviation of the sampling distribution of the sampling
mean is 8.6.
D. The standard deviation of the sampling distribution of the sampling
mean is 54.5.
6. Taken from the same population, which sample size will give a smaller
standard error of the mean?
A. 35
B. 18
C. 12
D. 9
7. Standard error is always non-negative.
A. true B. false
C. maybe
D. not sure
8. Which is NOT true about the standard error?
A. The standard error is also the sampling error.
B. The standard error is the approximate standard deviation of a statistical
sample population.
C. The more data points involved in the calculations of the mean, the smaller
the standard error tends to be.
D. The standard error can include the variation between the calculated mean
of the population and one which is considered known, or accepted as
accurate.
241
A certain condominium in Manila has 35 floors. Each floor has 2 elevators. In
every elevator there is a sign inside stating a limit of 16 persons and a weight limit
of 2500 pounds. The average weight of the condominium residents is 150 pounds
with a standard deviation of 27 pounds. Assume that the weight distribution is
approximately normal and 16 persons from the condominium are to be taken as
samples.
9. What is the expected value of the weight distribution of the sample mean?
A. 27 B.
35
C. 150
D. 2500
10. What is the standard deviation of the weight distribution of the sample
mean?
A. 16.7 B.
14.5 C. 6.75
D. 0.06
11. Which of the following average weight of 16 sample residents will reach the
limit of an elevator’s carrying capacity of 2500 pounds?
A. 156.25 B.
16.67 C. 200.05
D. 180.12
12. What is the probability that a random sample of 16 people will exceed the
weight limit capacity of an elevator?
A. 21%
B. 17%
C. 13%
D. 14%
13. If a sample came from a given population, then the mean of the sampling
distribution of the sample must be equal to _____
A. mean of population
B. sample of population
C. variance of population
D. standard deviation of population
14. Which of the following cases is the sample size considered large?
A. 𝑛 ≥ 30
B. 𝑛 ≥ 50 C. 𝑛 ≤ 30
D. 𝑛 ≤ 50
15. A population has a mean of 50 and a standard deviation of 6. What are the
mean and the standard deviation of the sampling distribution of the mean
for n = 16?
A. 𝑋̅ = 50, s= 25
B. 𝑋̅ = 50, s =1.5
C. 𝑋̅ = 25, s = 6
D. 𝑋̅ = 25, s = 1.
242
matter what the shape of the population distribution is, the sampling distribution
of the mean is closely normally distributed whenever n is large. This fact holds true
for sample sizes greater than or equal to 30.
What’s In
In the previous topics, you have learned how to use the normal
distribution to calculate the probability of a random variable X at a given value.
In this lesson, you will use the sampling distribution of the mean to get the
probability of the given sample mean taken from the population.
Sampling distribution of sample mean is a frequency distribution of the
mean computed from all possible random samples of a specific size taken from a
population. The probability distribution of the sample mean is also called the
sampling distribution of the sample mean.
a. z = , b. z =
__________________________________________________________________________________
____________________________________________________
3. How will you solve problems involving sampling distribution of the sample
mean with “less than”, “greater than” and “between” cases? Is the idea applicable
to a single observation (x)?
__________________________________________________________________________________
____________________________________________________
What’s New
Probable Time
The mean time it takes a group of senior high students to complete a
certain examination is 50.6 minutes. The standard deviation is 6 minutes.
Assume that the variable is normally distributed.
___________________________________________________________________________ 2.
If 49 randomly selected senior high school students take the
examination, what is the probability that the mean time it takes the group to
complete the test will be less than 48 minutes?
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________ 3.
If 49 randomly selected senior high school students take the
examination, what is the probability that the mean time it takes the group to
complete the test will be more than 51 minutes?
___________________________________________________________________________
___________________________________________________________________________
243
___________________________________________________________________________ 4.
If 49 randomly selected senior high students take the examination, what is
the probability that the mean time it takes the group to complete the test is
between 47.8 and 53 minutes?
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________ 5.
If you are one of the selected senior high school students to take the test, how
long would it take you to complete the examination?
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________ 6.
What other factors can you give that contribute to the length of examination?
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
Rubric for numbers 5 and 6 questions for 3-point score.
244
z = - 0.43
Find P(X< 48) by getting the area under the normal curve. P(X˂
48) = P(z ˂ -0.43)
= 0.3336
Find P(𝑋̅ ˂ 48) by getting the area under the normal curve.
P(𝑋̅ ˂ 48) = P(z ˂ -3.03)
= 0.0012
The probability that 49 randomly selected senior high school students will
complete the test in less than 48 minutes is 0.0012 or 0.12%
3. If 49 randomly selected senior high school students take the examination,
what is the probability that the mean time it takes the group to complete the
test will be more than 51 minutes?
Step 1: Identify the given information:
𝜇= 50.6
245
𝜎=6
𝑋̅= 51
𝑛= 49
Step 2: Identify what is asked.
P(𝑋̅ > 51)
= 0.47
Find P(𝑋̅ > 51) by getting the area under the normal curve.
The probability that 49 randomly selected senior high students will complete
the test in more than 51 minutes is 0.3192 or 31.92%
4. If 49 randomly selected senior high school students take the examination,
what is the probability that the mean time it takes the group to complete the
test is between 47.8 and 53 minutes?
Step 1: Identify the given information:
𝜇= 50.6
𝜎=6
𝑋̅= 47.8 and 53, 𝑛 = 49 Step
2: Identify what is asked.
P(47.8 < 𝑋̅ < 53)
Step 3: Identify the formula to be used.
The problem is dealing with data about the sample mean or n
observations, so the formula to be used to standardize 47.8 and 53
is z =
(4.a) P( 𝑋̅ <47.8)
246
= - 3.27 = 2.8
Find P(𝑋̅ > 47.8) by getting the Find P(𝑥̅ < 53) by getting the
area under the normal curve. area under the normal curve
P(𝑥̅ > 47.8) = P(z > - 3.27) = 0.0005 P(𝑥̅ < 53) = P(z < 2.8) = 0.9974
To find the probability that 49 randomly selected senior high school
students will complete the test between 47.8 and 53 minutes, subtract the
smaller area from the bigger area under the normal curve.
What’s More
Below are problems you need to ponder. Read each statement carefully and answer
the questions. Use separate sheet/s of paper for your answers.
Independent Activity 1:
My Favorite Breed!
The shih tzu’s mean length is 50 inches with standard deviation 26 inches. If
a sample of 169 shih tzu are randomly selected, what is the chance that the
sample mean is:
247
_______________________________________________________________________
_______________________________________________________________________ Rubric for
question (d) for 3-point score.
Equivalent Points Description
Coherent content with correct use of grammar. It
3 points
includes very timely and interesting ideas.
2 points Inconsistent content with correct use of grammar.
1 point Limited ideas. Grammatical error.
0 point No answer at all.
Independent Assessment 1:
Solve each of the following problems. All answers should be in two-decimal places.
IQ Level
measured and found to be normally distributed with a mean of 98 and a standard deviation
of 8.
248
*standard deviation of passing rate in licensure exam is 4.6
*standard deviation of faculty qualification with MS/MA degree/s is 5.1
*standard deviation of faculty qualification with Ph.D. degree/s is 4.8.
Independent Activity 2
Is It Worth It?
The mean annual salary of all the frontlines (nurses, medical technologists,
radiologic technologists, phlebotomists) in the Philippines is Php 42,500.
Assume that this is normally distributed with standard deviation Php 5,600.
A random sample of 25 health workers is drawn from this population, find
the probability that the mean salary of the sample is:
________________________________________________________________________
249
________________________________________________________________________
c. between Php 40,400 and Php 45,000?
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
d. As a senior high school student, Maria and her classmates made a move
on how they can show their love and concerns to the front liners. They
create a website called “ENCOURAGEMENT WALL” where netizens can
post their words for the front liners.
Find words that best describe our front liners in the grid below running in
one of TEN possible directions, horizontally, vertically, diagonally or
backward. Write the answers on the blank below the grid.
D E T T I M M O C P F
E N G L S H E R O R O
T I G H A D G E N O R
E B R A V E E W F A F
R R F T L K T A I C L
M A V E U L T N D T E
I M J D J M A S E I X
N A O S I O W N N V I
E N L U N F E D T E B
D S S S E L F L E S L
R E S P O N S I B L E
Independent Assessment 2
In this part, you need to solve the problem then shade the region under the
normal curve that corresponds to the required area.
Harvest More!
Jose who lives in Zambales harvested mangoes in her plantation for export.
The average weight of the harvested fruits is 2 kilos with a standard deviation
of 0.4 kilo. Assume that the variable is normally distributed.
a. What is the probability that a randomly selected mango will weigh less
than 1.5 kilos?
_____________________________________________________________________
_____________________________________________________________________
250
b. A potential importer intends to take a sample of 4 mangoes and will not
place an order if the sample mean is less than 1.5 kilos. What is the
Independent Assessment 3
Given the situation or problem, determine what is being asked.
a. Suppose a random sample of size n = 25 taken from a very large
population with mean 𝜇 = 90 and a standard deviation 𝜎 = 6.4. Describe
the sampling distribution of the mean for n = 25, in terms of its
expected value and the standard deviation.
_____________________________________________________________________
_____________________________________________________________________
c. Find the standard error of the sampling distribution of the mean when
a random sample of size 20 is chosen from a population of size 167
having a mean of 75 and a standard deviation of 3.9.
_____________________________________________________________________
_____________________________________________________________________
251
What I Can Do
Now let’s apply the knowledge you have gained in this module by answering the
problem below. Use separate sheet of paper for your answer.
Online Testing
Because of the immediate suspension of classes nationwide due to the COVID-19
pandemic, many schools opted to give the final examination online.
The scores on a certain exam are normally distributed with a mean of 65 and
a. less than 54
_______________________________________________________________________________
_______________________________________________________________________________
_______________________________________________________________________________
b. at least 80
_______________________________________________________________________________
_______________________________________________________________________________
_______________________________________________________________________________
c. The average age of the ABS CBN employees is known to be 32 years and the
standard deviation is 5 years. Find the probability that a random sample of 36
employees will have an average age of less than 30 years.
________________________________________________________________________________
________________________________________________________________________________
________________________________________________________________________________
d. As a response to the personal, social and psychological impact of COVID-19,
Ateneo De Manila University President Fr. Jett Villarin, S.J. declared in a
memorandum, the suspension of online classes during the Enhanced Community
Quarantine measures of the government. With this memorandum, some schools
reacted positively and some the other way around.
If you are the president of a certain university, will you do the same thing? Cite
three reasons or less for your answer.
________________________________________________________________________________
________________________________________________________________________________
________________________________________________________________________________
Rubric for question (c) for 3-point score.
Equivalent Points Description
Coherent content with correct use of grammar. It
3 points
includes very timely and interesting ideas.
2 points Inconsistent content with correct use of grammar.
1 point Limited ideas. Grammatical error.
0 point No answer at all.
Assessment
Multiple Choice: Choose the best answer in each question. Write the letter of your
choice on a separate pad paper.
252
A. 5
B. 25
C. 30
D. 40
2. What is the standard deviation of the sampling distribution?
A. 0.625
B.
2.10
C. 21.5
D. 22.5
3. What is the approximate probability that 𝑥̅ will be within 0.5 of the population
mean 𝜇?
A. 31.5%
B. 32.80%
C. 57.62%
D. 75.57%
4. What is the approximate probability that 𝑥̅ will differ from 𝜇 by more than
0.7?
A. 26.27%
B. 34.23% C. 39.3%
D. 40.7%
The GPAs of all students enrolled at a large university have an approximately normal
distribution with a mean of 3.05 and a standard deviation of 0.29. Find the probability that the
mean GPA of a random sample of 20 students selected from this university is:
253
For numbers 11 - 13, refer to the problem below.
Assume that woman’s heights are normally distributed with a mean given
11. If a woman is randomly selected, find the probability that her height is between
61.7 inches and 62.7 inches?
A. 20.75%
B. 72%
C. 21.64%
D. 86.65%
12. If a sample of 16 women is selected, what is the probability that the mean of
the sample will be larger than 63.2 inches?
A. 19.32%
B. 15.56% C. 4.75%
D. 2.28%
13. If a sample of 16 women is selected, what is the probability that the mean of
the sample will be less than 63.2 inches?
A. 97.72%
B. 41.56%
C. 28.74%
D. 16.67%
14. Which is NOT true about the standard error?
A. The standard error is also the sampling error.
B. The standard error is the approximate standard deviation of a statistical
sample population.
C. The more data points involved in the calculations of the mean, the
smaller the standard error tends to be.
D. The standard error can include the variation between the calculated
mean of the population and one which is considered known, or accepted as
accurate.
15. The number of total carbohydrates in an order of camote fries in a native
restaurant is approximately normally distributed with a mean of 240 and a
standard deviation of 20. What is the probability that a randomly selected
order has at least 265 total carbohydrates?
A. 54.13%
B. 34.67%
C. 21.56%
D. 10.56%
Additional Activities
1. The mean income of a certain government employee is Php 53,200 with a
standard deviation Php 6750. Assume that their income is normally distributed.
254
e. If a random sample of 49 employees is selected, what is the probability that
the sample mean had an income of at least Php 50,000?
1. The length of time in hours it takes an “over 40” group of men to play one
basketball game is normally distributed with a mean of 2 hours and a standard
deviation of 0.5 hours. A sample of size n = 30 is drawn randomly from the
population.
Answer the following questions:
a. 𝜇 = __________, 𝜎 = ____________ and n = ___________
d. Find the probability that the sample mean is between1.8 hours and 2.1
hours hours.
e. Shade the area under the normal curve that corresponds to the required
value.
Statistics and
Probability
255
Illustrating the t-Distribution
Statistics and Probability
Illustrating the t-Distribution
First Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
Telefax: 02-8682-5773/8684-4914/8647-7487
256
Your patience in solving offered problems here in the module will help you
improve your computational skills as it tackles relevant culture-based situated
problems. Your ability to interpret, reason–out, and make a judgment or even
Week
decision out of statistical measures will also be practiced here. The scope of this
module permits it to be used in many different learning situations. The language
used recognizes the diverse vocabulary level of students. The lessons are arranged 8
to follow the standard sequence of the course. But the order in which you read
them can be changed to correspond with the textbook you are now using.
After going through this module, you are expected to:
What I Know
Direction: Read and analyze each item carefully. Choose the letter of the best answer.
Write the chosen letter on a separate sheet of paper.
1. In the absence of the population variance and/or if the sample size is small,
which sampling distribution is being used?
A. chi distribution
B. z-distribution
C. t-distribution
D. p distribution
2. Who developed the t-distribution in 1908?
A. William S. Gosset
B. Wilhem G. Student
C. Ronald A. Fisher
D. a student from Gosset Academy
3. When do we say that the sample size is sufficiently large? A. When it is greater
than or equal to 10. B. When it is greater than or equal to 20. C. When it is
greater than or equal to 30.
D. When it is greater than or equal to 40.
4. In Student’s t-distribution, if the sample size is 25, what is the degree of
freedom?
A. 5
B. 24
C. 25
D. 26
5. How does t distribution differ from a normal distribution? A. The t-distribution
has a thicker tail.
B. The t-distribution has a higher peak.
C. The t-distribution is centered at 0.
D. The t-distribution is symmetric in the middle.
6. When do you use t-distribution instead of normal distribution? A. When the
sample size is less than 30.
B. If the sample standard deviation is unknown
C. If the population standard deviation is known
D. When the sample size is greater than or equal to 30
257
8. The t-distribution curve has thicker tails than the normal curve. What does it
imply?
A. The normal distribution has a greater mean than the t-distribution.
B. The t-distribution has lesser variability than the normal distribution.
C. The t-distribution has a greater chance for extreme values than the
normal distribution.
D. Estimation of the parameter using the z-distribution is more accurate
than using the t-distribution.
What’s In
Let us review your lesson on the properties of z-distribution. Write “YES” if the
statement is true about the z-distribution and “NO” if it’s not. Then check your
answer by going back to the previous module on z-distribution.
What’s New
ACTIVITY 1. “To z or not to z, that is the question”
In everything you do, your decision matters. And in every decision you make, you
must be responsible for its consequences or outcomes. In the illustration below, fill
in the missing boxes with “to z” if you think a z-table is required in the distribution
and “not to z” if not.
259
Is the sample
YES NO
_____________ ____________
YES NO
_____________ ____________
Answer the following questions.
1. How did you find the activity?
2. Was it hard for you to fill in each box?
3. Based on the activity, when do you use the z-distribution? When not to use
it? 4. If the sample size is less than 30 and the population standard deviation
is unknown, what do you use instead of the z-distribution?
What is It
According to the Central Limit Theorem, the sampling distribution of a
statistic (like a sample mean, 𝑥̅) will follow a normal distribution, as long as the
sample size (𝑛) is sufficiently large. Therefore, when we know the standard deviation
of the population, we can compute a z-score and use the normal distribution to
evaluate probabilities with the sample mean.
But sample sizes are sometimes small, and often we do not know the
standard deviation of the population. When either of these problems occurs, the
solution is to use a different distribution.
Student’s t-distribution
260
The Student’s t-distribution is a probability distribution that is used to
estimate population parameters when the sample size is small (𝑖. 𝑒. 𝑠𝑎𝑚𝑝𝑙𝑒 𝑠𝑖𝑧𝑒 < 30)
and/or when the population variance is unknown. It was developed by William
Sealy Gosset in 1908. He used the pseudonym or pen name “Student” when he
published his paper which describes the distribution. That is why it is called
“Student’s tdistribution”. He worked at a brewery and was interested in the
problems of small samples, for example, the chemical properties of barley. In the
problem he analyzed, the sample size might be as low as three.
Suppose you are about to draw a random sample of n observations from a
normally distributed population, you previously learned that,
where 𝑧 is the z-score, 𝑥 is the sample mean, 𝜇 is the population mean, 𝜎 is the
population standard deviation and 𝑛 is the sample size, have the standard normal
distribution. (Note that if we are standardizing a single observation, the value of n
is 1. Hence, the formula becomes . You can use this concept to construct a
confidence interval for the population mean, 𝜇. But in practice, you encounter a
problem, and that problem is that you don’t know the value of the population
standard deviation, 𝜎. The standard deviation for the entire population 𝜎 is a
parameter and you don’t typically know its value, so you can’t use that in your
formula. If that happens, you could do the next best thing, instead of using the
“population” standard deviation, 𝜎; you are going to use your 𝑥−𝜇 “sample”
standard deviation s, to estimate it. And instead of , 𝑠⁄ you are
√𝑛
going to have where s is
𝑛
your sample standard deviation.
You must take note of the change in the formula. The quantity 𝜎 is a constant but
you don’t know its value, so you used s which is a statistic and this statistic s has
a sampling distribution and its value would vary from sample to sample. And so,
the quantity would no longer have the standard normal distribution. This
quantity is labeled as t because it has a t-distribution. When you are sampling from
a normally distributed population, the quantity
has the t-distribution with n-1 degrees of freedom. Note that the number of degrees
of freedom is one less than the sample size. So, if the sample size n is 25, the
number of degrees of freedom is 24. Similarly, at t distribution having 16 degrees of
freedom, the sample size is 17.
What does the t-distribution look like? If you look at the statistic , it looks
like a z-statistic which has standard normal distribution except that you replaced
the population standard deviation, 𝜎, by the sample standard deviation s. You are
estimating a parameter with a statistic, so there is a greater variability. Hence, your
t-distribution is going to look like the normal distribution except with greater
variance.
261
You have here a plot of standard normal distribution in black and
tdistributions with 3, 5, 20, and 30 degrees of freedom in red, green, violet, and
blue respectively. You can see that both the z-distribution and t-distributions are
symmetric about 0 and bell-shaped. But the t-distributions have heavier tails (more
area in the tails) and lower peaks.
The exact shape of the t-distribution depends on the degrees of freedom. The
figure above tells you that as the degrees of freedom increase, the t-distribution
tends toward the standard normal distribution. At 30 degrees of freedom, the blue
curve might look very close to the normal curve. But if you look very closely, you
would see that the t-distribution still has slightly heavier tails and slightly lower
peak. But if you let those degrees of freedom continue to increase, the t-distribution
is going to get closer and closer to the standard normal distribution.
Properties of t-distribution
The t-distribution has the following properties:
1. The t-distribution is symmetrical about 0. That means if you draw a segment
from the peak of the curve down to the 0 mark on the horizontal axis,
the curve is divided into two equal parts or areas. The t- scores on the
horizontal axis will be divided also with half of the t-scores being positive
and half negative.
262
3. The mean, median, and mode of the t-distribution are all equal to zero.
4. The variance is always greater than 1. It is equal to where 𝑣 is the
number of degrees of freedom. As the number of degrees of freedom
increases and approaches infinity, the variance approaches 1. Using the
formula, if the number of degrees of freedom is 10, the variance is
5. As the degrees of freedom increase, the t-distribution curve looks more and
more like the normal distribution. With infinite degrees of freedom, t distribution
is the same as the normal distribution.
6. The standard deviation of the t-distribution varies with the sample size. It is
always greater than 1. Unlike the normal distribution, which has a standard
deviation of 1.
7. The total area under a t-distribution curve is 1 or 100%. One can say that the
area under the t-distribution curve represents the probability or the percentage
associated with specific sets of t-values.
What’s More
ACTIVITY 2. “Oh, Is That for Real?”
Most of us hate fake news, fake information, and even fake friends. We need
to develop our ability to distinguish what is real from what is not. Write “REAL” if
the statement is true about the t-distribution and “FAKE” if it’s not.
263
_________1. The t-distribution is used to estimate population parameters when the
sample size is small and/or the population variance is unknown.
_________2. The mean, median and mode are all equal to zero.
_________3. The variance is equal to 1.
_________4. The t-distribution curve is bell-shaped.
_________5. The standard deviation is always greater than 1.
_________6. Half of the total area under the t-distribution curve is equal to 1.
_________7. The curve is symmetrical about its zero.
_________8. The shape of the t-distribution curve depends on the sample mean.
_________9. The tails of the t-distribution curve approach the horizontal axis but
never touch it.
_________10. As the degrees of freedom increase, the t-distribution curve
looks more and more like the normal distribution.
What I Can Do
Tell whether each of the following statements describes a t-distribution, a normal
distribution, or both.
_________________1. The variance is equal to 1.
_________________2. The probability distribution curve is bell-shaped.
_________________3. The probability distribution curve symmetrical about
zero.
_________________4. The standard deviation is equal to 1.
_________________5. The shape of the distribution curve depends on the
degrees of freedom
_________________6. It has the total area under its curve equal to 1 or 100%.
_________________7. It has lower peak and heavier tails.
_________________8. Sample size is small and population
variance is unknown.
_________________9. It has tails that are asymptotic to the horizontal axis.
_________________10. Its mean, median, and mode are equal to 0.
Assessment
Direction: Read and analyze each item carefully. Choose the letter of the best
answer. Write the chosen letter on a separate sheet of paper.
1. It is the probability distribution used when the population variance is unknown
and/or if the sample size is small.
264
A. chi distribution
B. z-distribution
C. t-distribution
D. p-distribution
2. The t-distribution was developed by W.S. Gosset in 1908. Why was it called
“Student’s t-distribution”?
A. because it was first introduced to senior high school students in 1908
B. because he was still a college student when he first introduces the
t-distribution
C. because he used the pen name “Student” when he introduces his paper
about the t distribution
D. because he used the students as samples in his study about the
t-distribution
3. When do we consider that the sample size is small?
A. when it is less than 30
B. when it is between 20 to 40
C. when it is greater than 25 but less than 40
D. when it is only 1% of the total population
4. In Student’s t-distribution, what is the sample size if the degree of freedom is 25?
A. 5 B. 24
C. 25
D. 26
5. What is the difference between the normal distribution and the t-distribution? A.
The t-distribution has a lower peak.
B. The t-distribution is centered at 0.
C. The t-distribution is symmetric in the middle.
D. The t-distribution has tails that asymptotic to the horizontal axis
6. When do we use t-distribution instead of normal distribution?
A. If the sample variance is unknown
B. If the population variance is known C. When the sample size is
considered small.
D. When the sample size is sufficiently large.
7. In the estimation of a parameter using the t-distribution, is it possible to replace
the population standard deviation, by the sample standard deviation, s?
A. No, because the given sample size is small
B. No, because and s differ by a significant amount C. Yes, because is
unknown and s is a good estimator of .
D. Yes, because is equal to s when we are estimating parameters 8.
What will be the effect on the t distribution curve when we replace by s?
A. The t-distribution curve will become bell-shaped.
B. The t-distribution curve will be symmetrical to zero.
C. The t-distribution curve will be the same as the normal curve.
D. The t-distribution curve will have a lower peak and thicker tails.
For numbers 9 to 11,
The principal of Mapayapa Integrated National High School claimed that the
average salary of their teachers is ₱24, 000 per month. A random sample of 15
teachers in the school has a mean of ₱23, 220, and a standard deviation of ₱400.
9. Identify the population mean.
A. ₱24,000 B. ₱23,220 C. ₱400
D. 15
10. Identify the sample mean.
A. ₱24,000 B. ₱23,220 C. ₱400
D. 15
11. In testing the principal’s claim, can we use the t-distribution?
A. Yes, because the population standard deviation is unknown, and the
sample size is small.
265
B. Yes, because the sample standard deviation is small, and variance is
unknown.
C. No, because the population standard deviation is greater than the sample
standard deviation.
D. No, because the sample size is too small for the given sample standard
deviation.
12. Which property of t-distribution is NOT a property of normal distribution?
A. The t-distribution curve is bell-shaped.
B. The t-distribution curve is symmetrical about zero.
C. In t distribution, the variance is always greater than 1.
D. The total area under the t distribution curve equal to 1.
13. The following statements tell about the t-distribution EXCEPT _________________
A. The standard deviation of the t-distribution is equal to 1.
C. The t-distribution has thicker tails than the normal distribution.
D. The exact shape of a t-distribution depends on the degrees of freedom.
B. As the degrees of freedom increase, the t-distribution tends to normal
distribution.
14. Which of the following probability distribution curves has the lowest peak?
A. the standard normal distribution
B. a t-distribution with 5 degrees of freedom
C. a t-distribution with 7 degrees of freedom
D. a t-distribution with 9 degrees of freedom
15. Which of the following probability distribution curves has the thickest tails?
A. a t-distribution with 25 degrees of freedom
B. a t-distribution with 23 degrees of freedom
C. a t-distribution with 21 degrees of freedom D. a t-distribution with 19
degrees of freedom
Additional Activities
Make a Venn diagram showing the comparison of the t-distribution and normal
distribution. List down the characteristics or properties of the two distributions on
the spaces provided.
266
t-distribution has Normal distribution
these characteristics has these
Both distributions characteristics
have these
__________________ characteristics ___________________
__________________ ___________________ ___________________
__________________ ___________________ ___________________
__________________ ___________________ __________________
___________________ ___________________ __________________
___________________ _________________ ____________________
___________________ _____________ __________________
Statistics and
Probability
Identifying Percentiles Using
the t-Table
Statistics and Probability
Identifying Percentiles Using the t-Table
First Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
267
Published by the Department of Education
Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio Week
Telefax: 02-8682-5773/8684-4914/8647-7487
What I Know
Direction: Read and analyze each item carefully. Choose the letter of the best answer.
Write the chosen letter on a separate sheet of paper.
1. If the area of the shaded part of the t-distribution below is 0.15, what is the area of
the unshaded region?
268
A. 0.15 B. 0.75 C. 15% D. 0.85
2. Using the t-table, if the t-value is -1.708 and the shaded region is on the left of
the t-distribution, what is the area of the unshaded region?
A. 0.05 B. 0.10 C. 0.90 D. 0.95
3. Which t-distribution corresponds to t-value of 2.567 and whose shaded area on
the right tail of the t-distribution is 0.01? A. a t distribution with 7 degrees of
freedom
B. a t distribution with 17 degrees of freedom
C. a t distribution with 38 degrees of freedom
D. a t distribution with infinite degrees of freedom
4. Using the t table, if the t-value is 2.093 and the shaded area is on the right of
the t-distribution, what are the degrees of freedom?
A. 0.025 B. 1.9 C. 19 D. 20
5. The shaded region is on the left tail of the t-distribution. If the degree of
freedom is 25 and the area of the shaded region is 0.05, what is the value of t?
A. 2.059 B. 1.708 C.1.316 D. -1.708
6. Using the t-table, find the area to the right of 1.83 under the t- distribution
with 18 degrees of freedom?
A. between 0.01 to 0.02 C. between 0.025 to 0.05
B. between 0.02 to 0.025 D. between 0.05 to 0.10
7. It is the value on the t distribution whose left tail probability is 93% and whose
right-tail probability is 7%.
A. left t-value C. 93rd percentile
B. right t-value D. 7th percentile
8. In identifying a percentile using the t-table, what information should be
identified first?
A. degrees of freedom only
B. degrees of freedom and the given percentage
C. degrees of freedom and the area to the left tail of the t-distribution
D. degrees of freedom and the area to the right tail of the t-distribution
9. Which of the following values is the 90 th percentile of the t-distribution with 10
degrees of freedom?
A. 1.363 B. 1.372 C. 1.383 D. 1.812
th
10. What is the 95 percentile of the t-distribution if the sample size is 15?
A. 1.771 B. 1.761 C. 1.753 D. 1.345
th
11. If the degree of freedom is 24, what is the 95 percentile of the t-
distribution?
A. 1.711 B. 1.714 C 1.717 D. 2.064
th
12. If the 99 percentile of the t-distribution is 2.528, what is the degree of
freedom?
A. 17 B. 18 C. 19 D. 20
13. To what percentile of the t-distribution does 1.337 belong if the sample size is
17?
A. 90th B. 93rd C. 95th D. 99th
14. What is the 50th percentile of a t distribution with 45 degrees of freedom?
A. 1.684 B. 0.50 C. 0 D. -1.684
15. Using the t-table, find the area to the left of 2.15 under the t-distribution with
21 degrees of freedom?
A. between 0.98 to 0.99 C. between 0.95 to 0.975
B. between 0.975 to 0.98 D. between 0.90 to 0.95
269
How do you find this pre-test? Did you encounter both familiar and
unfamiliar terms? Kindly compare your answer in the Answer Key on the last part
of this module
If you obtain 100% or a perfect score, skip the module and immediately move
to the next module. But if you missed a point, please proceed with the module as it
will enrich your knowledge in t-distribution.
What’s In
BLUE RIBBON
“It’s more beautiful with a .”
Make your life more beautiful while reviewing your lesson on the properties
of t-distribution. Which of the ribbons below contain the characteristics of a
tdistribution? Color them blue.
Variance is always greater
than 1.
What’s New
“Fif-t Shades of Gray”
Given below are five t-distributions in the shades of gray. The given percentage
represents the unshaded portion of the t-distribution. Your task is to find the area
of the remaining portion/s of the t-distribution. Remember that the total area
under the t-distribution curve is equal to 1 or 100%.
1. 3.
270
2. 4.
5.
What is It
You have learned from the previous module the different properties of
tdistribution, some of which are very essential in this lesson. What is the total area
under the t-distribution curve? Yes, it should be equal to 1 or 100%. The area
under the t-distribution curve also represents the probability associated with
specific sets of t-values. That means given the t-value, you can compute for the
area or probability with the use of a table or software. A t-value or t-statistic tells us
how many standard deviations from the mean is the given score. The set of t-values
are usually written below the horizontal axis of the t-distribution curve.
Another property of t-distribution is that the exact shape of the t-distribution
depends on the degrees of freedom. Remember that the lesser the degree of
freedom, the lower is its peak and the thicker is its tails. As the degree of freedom
increases, the tails become flatter, and the peak becomes higher. That means, given
the area or the probability, the t-value depends on the number of degrees of
freedom. For example, with the given area of 0.05 on the right tail of t-distribution,
the t-value is 2.015 with 5 degrees of freedom. But with 20 degrees of freedom, the
t-value is equal to 1.725.
The t -Table
271
In finding the areas and percentiles for a t-distribution you need to
familiarize yourself with the t-table. You are going to use a table that is different
from the ztable you used in finding the area under the normal curve.
Below is an example of a t-table. It is a right-tailed t-table because the given
areas in this table are areas on the right tail of the t-distribution. Some t-tables are
slightly different in format. Look at the t-table below. In the first column in the
leftmost part, you have the degrees of freedom. It ranges from 1 down to . While
the first row in the upper part of the t-table represents the area under the right tail
of the t-distribution. Some of the given areas are from 0.25 down to 0.0005. The
rest of the entries in the body of the table are the values of the variable t (t-values).
272
By looking at the table, you can see
that the t -value for an area of 0.10 in the
right tail of the t-distribution with 10
degrees of freedom is 1.372 . This is the
intersection of the row containing the 10
degrees of freedom and the column
containing the area of 0.10.
273
Since the area under the t-distribution curve also represents the probability, the
90th percentile of the t-distribution is the t-value whose area on its left tail is 0.90
and whose area on its right tail is 0.10.
Illustrative Example 1
274
From the figure above, you can see that the value that you need is 1.943. Hence
the 95th percentile is 1.943. That means the t-value of 1.943 has 95% of the area to
the left of it, or 0.95. Also, you can say that the t-value of 1.943 has an area of
0.05 to its right. And so, using the t-table, you will find that the 95 th percentile is
1.943.
Illustrative Example 2
275
At this point, you need to recall one of the properties of the t-distribution
that it is symmetric about zero. That means the right tail of the distribution is
exactly the mirror image of its left tail. So, you can easily find the values in the left
tail by relying on this “symmetry–about–zero” property. Hence, if you are going to
find the value of t such that the area to the left of it is 0.05, recall that the area to
the right of 1.943 is also 0.05 (See Illustrative Example 1).
Therefore, you can say that since the t-distribution is symmetric about 0, the
t-value with an area to the left of 0.05 must be -1.943. So, you will find that the 5th
percentile is –1.943.
Illustrative Example 3
What is the area to the right of 2.4 under a t-distribution with 7 degrees of
freedom?
Remember that in the previous example, you found t-values using the given
areas under the t-distribution curve. But in this example, you will be doing the
opposite because in this problem you are given a t-value and you need to find the
area to the right of the t-distribution with 7 degrees of freedom.
You can illustrate the problem with the figure shown below. The t-value of
2.4 is somewhere between 2 and 3, and you are going to find the area to the right of
it.
So, looking back at the table, you need to focus on the 7 degrees of freedom
line. You will observe that the t-value of 2.4 cannot be found in this row but you do
276
find these two values 2.365 and 2.517 that surround 2.4 (The t-value 2.4 is
between 2.365 and 2.517).
The table tells you that the area to the right of 2.365 is 0.025 and the area
to the right of 2.517 is 0.02. You figure out earlier that our t-value of 2.4 falls in
between two values 2.365 and 2.517 and it tends to reason then, that the area to
the right of 2.4 must be between those two values 0.025 and 0.02.
So, using the table you found that the area to the right of 2.4 under the
tdistribution with 7 degrees of freedom lies somewhere between 0.02 and 0.025.
If you need to get the exact value, you need to use software that easily
calculates the area under the t-distribution curve with the given t-value and
number of degrees of freedom. Using such software, you could find that the area to
five decimal places is 0.02373.
What if you needed to use the t-table to find the area to the left of 2.4?
Since the area under the entire curve is 1, the area to the left of 2.4 is equal
to 1 minus the area to the right of 2.4. So, based on the table the area to the left of
2.4 under the t distribution with 7 degrees of freedom must lie somewhere between
0.98 and 0.975 (1 – 0.02 = 0.98 and 1 – 0.025 = 0.975).
But since you already knew that the area to the right of 2.4 is 0.02373, you
could find the exact area to the left of 2.4 to five decimal places as 1 minus 0.02373
or 0.97627.
What’s More
INDEPENDENT PRACTICE: “BLACKPINK in your Area”
Use the t-table to identify the area under the t-distribution given the t-value and
the number of degrees of freedom. Draw a t-distribution for each number then
choose from BLACK or PINK to shade the required area. (The first one is done for
you).
Example:
Area to the right of 1.746 under a t-distribution with 16 degrees of freedom
277
START HERE
1. Area to the right of 1.323 under a t-distribution with 21 degrees of freedom
2. Area to the right of 2.821 under a t-distribution with 9 degrees of freedom
3. Area to the left of -2.179 under a t-distribution with 12 degrees of freedom
4. Area to the left of 0.889 under a t-distribution with 8 degrees of freedom 5. Area
to the right of 1.25 under a t-distribution with 25 degrees of freedom?
What I Can Do
Answer the following problems.
1. Find the values of t for which the area on the right tail of the t-distribution is
0.05 and the number of degrees of freedom is equal to:
a. 15 b. 28 c. 100
th
2. Find the 99 percentile of the t-distribution with 18 degrees of freedom.
3. Find the 90th percentile of the t-distribution if the sample size is 25.
Assessment
Direction: Read and analyze each item carefully. Choose the letter of the best
answer. Write the chosen letter on a separate sheet of paper.
1. It is a value in the t-distribution that is less than the probability of the given
percentage.
278
A. degrees of freedom C. t-value
B. percentile D. right-tailed value
2. If the area of the unshaded part of the t-distribution is 0.85, what is the area of
the shaded region?
A. 0.15 B. 0.75 C. 0.85 D. 85%
3. If the given t-value is 2.056 and the shaded region is on the right of the
tdistribution with 26 degrees of freedom, what is the area of the unshaded
region?
A. 0.025 B. 0.15 C. 0.95 D. 0.975
4. What is the area on the left tail of the t-distribution with 14 degrees of freedom
corresponding to t-value of 3.787?
A. 0.001 B. 0.01 C. 0.975 D. 0.999
5. Which t distribution has an area of 0.10 to the right of 1.333?
A. A t-distribution with 5 degrees of freedom
B. A t-distribution with 11 degrees of freedom
C. A t-distribution with 17 degrees of freedom
D. A t-distribution with 23 degrees of freedom
6. The shaded region is on the right tail of the t-distribution. If the degree of
freedom is 22 and the area of the shaded region is 0.005, what is the value of t?
A. -2.819 B. -2.831 C. 2.819 D. 2.831
7. The shaded region is on the left tail of the t-distribution. If the degree of
freedom is 16 and the area of the shaded region is 0.01, what is the value of t?
A. 2.583 B. 2.120 C.-2. 120 D. -2. 583
8. What is the area to the right of 1.8 under the t-distribution with 5 degrees of
freedom?
A. between 0.20 and 0.15 C. between 0.15 and 0.10
B. between 0.10 and 0.05 D. between 0.05 and 0.025
9. Which of the following values is the 90th percentile of the t-distribution with 15
degrees of freedom?
A. 1.341 B. 1.345 C. 1.753 D. 1.837
10. What is the 95th percentile of the t-distribution when the sample size is 23?
A. 1.721 B. 1.717 C. 1.714 D. 1.703
11. If the degree of freedom is 29, what is the 95th percentile of the t-distribution?
A. 2.048 B. 1.701 C 1.699 D. 1.547
12. If the 99th percentile of the t-distribution is 2.650, what is the degree of
freedom?
A. 16 B. 15 C. 14 D. 13
13. To what percentile of the t-distribution does 1.397 belong if the sample size is
9?
A. 90th B. 95th C. 97th D. 99th
14. What is needed in order to identify a percentile using the t table?
A. degrees of freedom only
B. degrees of freedom and the given percentage
C. degrees of freedom and the area to the left tail of the t-distribution
D. degrees of freedom and the area to the right tail of the t-distribution 15. Why
does the right-tailed t-table display only positive values of t? A. Because the area
cannot be negative. B. Because t-values are always positive.
C. Because t-values on the left tail are positive
D. Because t-values on the right tail are positive
Additional Activities
Browse the internet and look for a site, a program or software that easily computes
for t-values and areas under the t-distribution. Answer the following questions
using the t-table then compare your answer using the software.
279
1. What is the area to the right of 1.5 under the t-distribution with 14 degrees of
freedom?
2. What is the area to the right of 2.75 under the t-distribution with 28 degrees of
freedom?
3. What is the area to the left of 2.8 under the t-distribution with 5 degrees of
freedom?
4. If the degree of freedom is 20, what is the 96th percentile of the t-distribution?
Statistics and
Probability
Identifying the Length of a
Confidence Interval
Statistics and Probability
Identifying the Length of a Confidence Interval
First Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
280
Development Team of the Module
Week
Writers: Maryquenn F. Reyes
Telefax: 02-8682-5773/8684-4914/8647-7487
What I Know
Before studying this module, take this test to determine what you already know
about the topic covered.
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
1. Which of the following values and percents illustrates the confidence level?
A. 32.8
B. 30-39
C. 90%
D. 90% - 94%
2. Which of the following symbols is used to get the mean?
A. E
B. μ or 𝑋̅ C. n
D.
3. Which of the following symbol stands for the margin of error?
A. E
281
B. Z𝛼/2
C. n
D.
4. For a 95% confidence level, the confidence coefficient is ± ____?
A. 1.645
B. 1.96
C. 2.33
D.2.58
5. What do you call the value resulting from subtracting the margin of error
from the sample mean?
A. point estimate
B. interval estimate
C. lower limit of the interval
D. upper limit of the interval
6. What do you call the value resulting from adding the margin of error from
the sample mean?
A. point estimate
B. interval estimate
C. lower limit of the interval
D. upper limit of the interval
For nos. 7 – 10
A survey on 200 HUMSS students of Bumasa Integrated High School shows that
they read on an average of 15. 7 hours per week. The margin of error is 2.2 hours
at a 95% confidence level.
282
A. 36
B. 67.18
C. 68
D. 68.82
12. Which of the following values is the margin of error?
A. 0.90
B. 0.82
C. 0.68
D. 0.05
13. What is the length of the confidence interval in the given problem?
A. from 67 to 68
B. from 671.8 to 688.2 C. from 67.18 to 68.82
D. from 6782 to 6882
14. Which of the following terms describes the probability that the interval
estimate contains parameters and what percentage of intervals from many
different samples contains the unknown population parameter?
A. margin of error
B. confidence level
C. confidence interval
D. level of significance
15. Which of the following terms pertains to a range of values that is used to
estimate a parameter which may or may not contain the true parameter
value?
A. margin of error
B. confidence level
C. confidence interval
D. level of significance
What’s In
For you to begin let us recall first the previous lessons which are very important for
you to succeed.
Rearranged the jumbled letters to get the correct answer.
What’s New
FILL ME IN FILIPINO CITIZEN!
Complete the Preamble by answering the number that corresponds to the blank.
You may choose your answer from the given box.
We, the sovereign ____(1)_____ people, imploring the aid of Almighty God, in
order to build a ___(2)____ and humane society, and establish a ____(3)____ that
shall embody our ideals and aspirations, promote the common good, conserve and
develop our ______(4)_____, and secure to ourselves and our _____(5)____, the
blessings of ____(6)____and _____(7)____ under the rule of law and a regime of truth,
283
justice, freedom, love, equality, and peace, do ordain and promulgate this
Constitution.
2. Based on the given problem, the difference of 35% and 30% is the
____________.
4. If 35% of the vote is the average vote that an independent candidate will
receive, and a limit of 30% and 40% where does the average (also known as
mean) lies? ______
7. If a problem has 95% confidence level and an average of 45 with a lower limit
of 40 and upper limit of 50 and margin of error of 5, the confidence interval
is (40,50) or 40<𝑋̅<50. Following this premise, what would be the confidence
interval of the given problem above?
Questions to ponder:
1. Are you able to complete the preamble?
_________________________________________________________________________
_________________________________________________________________________ 2.
What do you think is the symbol for the margin of error?
_________________________________________________________________________
_________________________________________________________________________
284
5. If you are one of the independent candidates, what will be your reaction to the
result of the survey?
_________________________________________________________________________
_________________________________________________________________________
What is It
What is the difference between the Confidence Level and Confidence interval?
285
77.18 = 78 – E E = 78-77.18 E = 0.82
Let’s see if we can get the same value of E if we use the formula for the
upper limit.
78.82 = 𝑿̅ + E
The Lower limit and mean are given. So, we have
78.82 = 78 + E E = 78.82-78 E = 0.82
What’s More
In the following activities, read each situation carefully to solve each problem. Write
your answer on a separate sheet of paper.
Practice Activity 1
Online selling
An online seller of yema cake, which is very popular in Tayabas Quezon surveyed
several customers. Fifty-two percent (52) of the customers were satisfied with the
services that were offered with a 3.98% margin of error. Determine the confidence
interval using this information.
The average percentage of people who are satisfied with the product is
_______________.
The lower limit of confidence interval is _________________.
The upper limit of confidence interval is _________________. The
confidence interval is _______________________.
Independent Assessment 1
Online Schooling
You are planning to buy a cellphone with a good quality in order to attend
the online class. The average price of 50 cellphones is Php13,500 with a margin of
error of Php 273.65 and a confidence level of 99%.
Independent Assessment 2
Colored Wheel
286
A survey on 200 Senior High School learners on how much time they read per
week has a confidence interval of (13.5, 17.9) in hours at 95% confidence level.
What I Can Do
Make your own story!
Things to do:
Construct your own problem but put a little story on it using the data indicated
below, then answer it.
𝑿̅ = 𝟏𝟒𝟖 𝑬
= 𝟎. 𝟗𝟏
95% Level of Confidence Rubrics
for the task:
CRITERIA
TOTAL 100%
Assessment
Multiple Choice. Choose the letter of the best answer. Write the chosen letter on a
separate sheet of paper.
1. Which of the following values and percents illustrates the confidence
interval?
A. 30.8
B. (35,39)
C. 92%
D. 92% and 94%
287
2. Which of the following symbols is used for the population standard
deviation?
A. E
B. μ or 𝑋̅ C. n
D.
3. Which of the following symbol stands for the margin of error?
A. E
B. Za/2
C. n D.
4. For a 95% confidence level, the confidence coefficient is ± ____?
A. 1.645
B. 1.96
C. 2.33
D.2.58
5. What do you call the value resulting from adding the margin of error from
the sample mean?
A. point estimate
B. interval estimate
C. lower limit of the interval
D. upper limit of the interval
6. What do you call the value resulting from subtracting the margin of error
from the sample mean?
A. point estimate
B. interval estimate
C. lower limit of the interval
D. upper limit of the interval
288
11. Which of the following values is the 𝑥̅?
A. 364.18
B. 67.18
C. 65
D. 64.82
12. Which of the following is the margin of error?
A. 0.65
B. 0.68
C. 0.82
D. 0.90
13. What is the length of the confidence interval?
A. (65, 68)
B. (641.8, 658.2) C. (64.18, 65.82)
D. (6482, 6582)
14. It is a range of values that is used to estimate a parameter. This
estimate may or may not contain the true parameter value. It is called
A. confidence level
B. margin of error
C. confidence interval
D. level of significance
15. It is the probability that the interval estimate contains a parameter, and it
describes what percentage of intervals from many different samples
contains the unknown population parameter.
A. confidence level
B. margin of error
C. confidence interval
D. level of significance
Additional Activities
Martina reads that the average age of graduate students is 33 years old. She
wanted to estimate the mean age of the graduate students at her university,
so she took a random sample of 30 graduate students. She found that their
mean age was 𝑋̅=31.8 at a 95 % confidence level with 1.54 margin of error.
Questions:
Statistics and
Probability
289
Computing for the Length of the
Confidence Interval
Statistics and Probability
Computing for the Length of the Confidence Interval
First Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
Telefax: 02-8682-5773/8684-4914/8647-7487
1. apply the formula in determining the margin of error; and 2. compute for the
length of the confidence interval.
What I Know
Before studying this module, take this test to determine what you already know
about the topic covered.
Direction: Read and analyze each item carefully. Shade the entire circle for each
statement that corresponds to your answer.
B. D.
291
For items 7 – 9:
A sample size of 50 has a mean of 20. The population standard deviation is 18 and
the confidence level to be used is 95%.
A random sample of 150 stores of Atimonan Mall has an average daily income of
Php 25,000. The population standard deviation is Php 12,000 and a 95%
confidence level is to be used.
292
O O O O confidence level?
A. 1.44 C. 1.75
B. 1.645 D. 1.96
What’s In
Let us have a recall!
Use a calculator to simplify the following numbers, then round off your answer:
What’s New
Business Matters
Read and analyze carefully the problem and answer the questions that
follow.
293
Businessperson Jennifer Velasco uses an analog weighing scale in her fruit
store. She earns ₧45.00 per kilo in selling lanzones. One time, she managed to
use a digital scale and found out that she is giving a 1.1 kilo instead of 1 kilo. She
also found out that the digital scale is more accurate than the analog weighing
scale.
Questions to ponder:
1. If you are Jennifer, would you rather use an analog than digital scale?
Why? ________________________________________________________________________
2. Out of 40 boxes that she has, 15 boxes weighs using analog scale and the rest
is by using the digital scale, how many boxes of lanzones she weighs correctly?
______________________________________________________________________________
3. How much kilo of lanzones did she lost in all?
___________________________________
4. How much money did she earn for the correct weight of lanzones?
________________
5. How much money did she lost for weighing lanzones incorrectly?
_________________
6. How much should be the earnings of Jennifer if she weighs all the 40 packs
correctly?
_____________________________________________________________________
What is It
Confidence level is the likelihood measure of the confidence interval that is
represented by a percentage that refers to all possible samples that can be
estimated to contain the true population parameter. The most common values of
the level of confidence are shown in the Z c table below. Confidence coefficient or
critical values expressed as zc.
Confidence interval is also called the interval estimate. It is a range of values
that is used to have an approximate boundary or parameter. However, this
approximate may or may not contain the correct or true parameter value. Level of
Confidence and Zc Table
Confidence 99% 98% 96% 95% 92% 90% 85% 80% 70%
Level 0.99 0.98 0.96 0.95 0.92 0.90 0.85 0.80 0.70
Zc 2.58 2.33 2.05 1.96 1.75 1.645 1.44 1.28 1.04
The margin of error is the range of values above and below the given
statistical number or sample in a confidence interval. To compute for the margin of
error, use the formula given below:
E= )
n = 160 = ₱75
95% confidence level where zc = 1.96
294
Step 2: Apply the formula and substitute the given data.
E=
E = (1.96)( )
E = (1.96)(5.929) (use three decimal places for partial answer)
E = 11.62 (round off the final answer in two decimal places)
COMPUTING THE LENGTH OF CONFIDENCE INTERVAL
Computing for the margin of error and confidence interval were discussed in
the previous module. We will use the formulas presented in the previous module to
compute the length of the confidence interval.
Recall that the confidence interval can be written in the form of
The lower limit is obtained by using the formula LL= 𝑿̅ − 𝑬, while the upper limit
is obtained by using the formula UL= 𝑿̅ + 𝑬, where E is the Margin of Error and 𝑿̅
is the sample mean.
Meanwhile, the margin of error is obtained using the formula:
where, n = sample size
𝑧𝛼/2= confidence coefficient
where E is the margin of error, 𝑧𝛼/2 is the confidence coefficient, 𝜎 is the population
standard deviation, n is the sample size and 𝑛 0
Example 2: The population of Sulu Hornbill (one of the endangered bird species in
the Philippines) has a standard deviation of 40. Compute for the length of the
confidence interval for a 90% confidence level having a sample size of 150 and a
sample mean of 65.
Solution:
E= )
)
E= (1.645)(3.266) (use three decimal places for partial answer)
E = 5.37(round off final answer to two decimal places)
Step 3: Compute for the upper and lower limit of the confidence interval.
UL = 𝑿̅ + 𝑬 LL = 𝑿̅ − 𝑬
= 65 + 5.37 = 65 − 5.37
295
= 𝟕𝟎. 𝟑𝟕 = 𝟓𝟗. 𝟔𝟑
Step 4: Compute for the length of the confidence interval.
L = UL – LL
= 70.37-59.63
= 10.74
Knowing that the length of the confidence interval is just twice the margin of error,
a shorter solution can be used using the formula: L = ) = 2E. Therefore, L =
2 (5.37) =10.74
Example 3: Jennifer wanted to know the average price of shoes that her customer
purchased. She sampled 160 pairs of shoes that were sold and found out that the
mean average price is ₱800 with a standard deviation of ₱75. Construct a 95%
confidence interval for the mean price of all shoes that were sold. Compute for the
length of the confidence interval.
Solution:
Step 1: Write the given data,
E= )
E = (1.96)( )
Step 3: Compute for the upper and lower limit of the confidence interval.
UL =
LL =
.
Or since
What’s More
296
In the following activities, read each situation carefully to solve each problem. Write
your answer on a separate sheet of paper.
A random sample of size 70 has a standard deviation of 15. What is the
margin of error for a 92% confidence level?
Write all the given data based on the problem statement and show the complete
solution.
n = __________ formula to be used:
________________ = __________ solution: zc =
__________
Independent Activity 2:
Compute the length of the confidence interval for estimating the
population mean using a sample size of 300 and with a standard deviation 84.
Use a 92% confidence level.
Write all the given data based on the problem statement and show the
complete solution.
n = __________ formula to be used: ________________
= __________ solution: zc = __________
Independent Assessment 2:
Compute the length of the confidence interval with a sample size of 240 and
with a standard deviation 60 having an 80% confidence level.
Independent Activity 3:
The average price of 350 cellphones is ₧13,500 with a sample standard deviation
of ₧750.
a. Find the 99% confidence interval of the true average price of the
cellphones. Interpret your answer in the context of this problem.
b. Find the length of the confidence interval.
Write all the given data based on the problem statement and show the complete
solution. n = __________ formula to be used: a. ________ b. _________ =
__________ solution:
𝑥̅=___________
zc = __________
Compare your answers with those found in the Answer Key on the last part
of this module. Did you get all items correct? Great! If you did, you learned the
lesson well. If you were not able to get the correct answer, review your
computations.
297
1. ________________ is the likelihood measure of the confidence interval that is
represented by a percentage that refers to all possible samples that can be
estimated to contain the true population parameter.
2. Confidence coefficient or ______________ is expressed as zc.
3. ___________________ is a range of values that is used to estimate a parameter.
4. To compute for the margin of error, use the formula ______________________
5. 𝑧𝛼⁄2means the ____________________ or _____________________.
Confidence 99% 98% 96% 95% 92% 90% 85% 80% 70%
Level
Zc 2.58 2.05 1.75 1.44 1.04
What I Can Do
HAVE CONFIDENCE AND MAKE A MATH VLOG!
Materials needed:
Cellphone
Short bond paper
Calculator
Pen or pencil
Things to do:
VLOG CRITERIA
Accuracy of the solution 50%
Clarity and content of the problem 25%
298
Originality and creativity 25%
TOTAL 100%
Assessment
Direction: Read and analyze each item carefully. Shade the entire circle for each
statement that corresponds to your answer.
B. ) D. L= 2( )
(for items, 7 – 8) Edward owns a meat store and he wants to find out the average
amount of his daily sale. He randomly recorded his sales for 16 days and found the
average to be ₱ 8,500. It is known from last year’s record that the standard
deviation of his sales is ₱150 and a 99% level of confidence is to be used.
299
A B C D 9. Given: n= 200, = 35, and a 96% confidence level. Which
O O O O of the following values is the length of the confidence
interval?
A. 10.15 B. 14.54 C. 15.23 D. 18.25
(For items, 11 – 12) Given a sample size of 200 with a mean of 80 and the
population standard deviation is found to be 35 having a 96% confidence level.
(For items, 13 – 14) The average amount by purchasing products of 255 customers
in JYEL store is ₧150. The population standard deviation is ₧65 with an 85%
level of confidence.
300
1. In a sample of 100 cars of a population having a standard deviation on the speed
of 25km/h, find the margin of error if the level of confidence to be used is 90%.
2. Jennifer has sampled 300 persons for her survey project. If she uses a 99%
confidence level with a population standard deviation of 400, help her to
compute for the length of the confidence interval that she needs.
3. Mr. Velasco wanted to know the average weight (in kilos) of students at LAO
University. He took a random sample of 300 students and discovered that their
Statistics and
Probability
Computing Appropriate Sample
Size Using the Length of the
Interval
Statistics and Probability
Computing Appropriate Sample Size Using the Length of the Interval First
Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
301
Development Team of the Module
Week
Writers: Dhojie N. Yanto, Jerson Jolo and Reynante M. Velasco
Telefax: 02-8682-5773/8684-4914/8647-7487
What I Know
Let us see how far you will go and how much you know about this pretest. Please
answer all questions with all your best and utmost honesty so that we can gauge your
learning in the target competency of this lesson.
If you obtain 100% or a perfect score, you may skip the module and
immediately move to the next module. While in the event you missed a point, please
proceed with the module as it will enrich your knowledge in computing for an
appropriate sample size using the length of the interval. Let’s get started!
Direction: Read and analyze each item carefully. Choose the letter of the best
answer. Write the chosen letter on a separate sheet of paper.
1. Which of the following coefficients are the 90% confidence coefficients?
302
A. ±1.645
B. ±1.96
C. ±2.40
D. ±2.58
2. What do you call a subset of a population?
A. estimation
B. interval
C. proportion
D. sample
3. Which of the following values illustrates confidence level?
A. 90 - 100
B. 65.0
C. 90%
D. 35.75
4. When alpha is 0.05, confidence level is equal to _________.
A. 10%
B. 95%
C. 90%
D. 99%
5. Confidence interval is also known as _____________________.
A. interval estimate
B. confidence coefficient
C. interval coefficient
D. confidence interval
For numbers 6-11, refer to the following values:
90% confidence level, 𝜎 = 3.5 and 𝐸 = 0.5.
6. What is the given confidence level?
A. 90%
B. 0.10
C. 0.01
D. 0.05
7. What is 𝛼?
A. 95%
B. 0.10
C. 0.01
D. 0.05
8. What are the confidence coefficients?
A. ±1.645
B. ±1.96
C. ±2.20
D. ±2.58
9. Which of the following values is the given population standard deviation?
A. 0.01
B. 3.5
C. 0.5
D. 90%
10. What is the margin of error?
A. 0.01
B. 3.5
C. 0.5
D. 90%
11. Compute for the appropriate sample size.
A. 131
B. 132
C. 133
D. 134
303
12. What is the sample size needed if the margin of error is 30 and the standard
deviation of 45 with a 98% confidence level?
C. 10
D. 11
E. 12
F. 13
13. Calculate the sample size of the given data: 𝐸 = 5, = 20, with a 95%
confidence level.
A. 59
B. 60
C. 61
D. 62
14. Which of the following variables is NOT needed in computing the minimum
sample size for the estimation of the population mean?
304
A. confidence level
B. sample mean
C. population standard deviation
D. margin of error
15. Which of the following formulas is used is used to get the sample size of the
population?
A.
B.
C.
𝑍
D.
What’s In
LET’S REVIEW
Direction: Use your calculator to evaluate the following, then round off your answer
as indicated.
How was the review? Did you answer all the items? I believe you got it all correctly.
You are now ready to learn new concepts. Good luck!
305
What’s New
Priming Activity:
Analyze the steps provided in the example below to solve for the variable x in terms
of y. Fill in the blanks with the correct mathematical expressions in the next part to
derive the formula for computing the appropriate sample size.
where
How was the activity? Where you able to get the correct expression in each
step? If yes, what helps you solve each required expression? If no, what hinders
you? What difficulty did you encounter during the activity? What mathematical
concepts have you applied to do the activity? The formula you just derived will be
used in computing for appropriate sample size using the length of the interval.
What is It
To make an accurate estimate of the parameter, it is vital to determine the
appropriate size of the sample. The factors that affect the sample size are the
maximum error of estimate, the population standard deviation, and the confidence
level.
You must also consider the table below in computing for an appropriate
sample size.
EXAMPLE 1.
306
be the minimum sample size that the student-researchers need to estimate
the population mean?
SOLUTION:
Let us identify the given in the problem:
Confidence level is 99% then
𝐸=4
𝜎=9
𝑍
Use the formula:
Therefore, to be 99% confident that the estimate differs from the true mean by
4, the minimum sample size needed is at least 34.
Note: When determining sample size, we always round off the resulting value to the
next whole number.
EXAMPLE 2.
we want to be 95% confident of finding whether the true mean differs from the
sample mean by 1.5 hour?
SOLUTION:
To answer the question, let us identify the given:
For a 95% confidence level,
𝐸 = 1.5
𝜎 = 2.6
𝑍
Use the formula:
Substitute the given:
Evaluate: 𝑛 ≈ 11.54
Round off to the nearest whole number: 𝑛 = 𝟏𝟐
Therefore, to be 95% confident that the estimate differs from the true mean by
1.5, we need a sample size of at least 12. EXAMPLE 3.
𝐸=6
𝜎 = 25
307
Use the formula:
Substitute the given:
Evaluate: 𝑛 ≈ 46.98
Round off to the nearest whole number: 𝑛 = 𝟒𝟕
Therefore, to be 90% confident that the estimate differs from the true mean by 6,
we need a sample size of at least 47.
EXAMPLE 4.
Compute for the appropriate sample size if the margin of error is 6, with a
sample standard deviation of 25 and a 92% level of confidence.
What’s More
Independent Activity 26.1
Complete Me
Complete the table below by supplying the needed information and solve for the
appropriate sample size n.
1. Confidence level 99%,
Population standard deviation = 30,
margin of error = 2.5
308
5. Confidence level 98%,
Population standard deviation , margin
of error = 2.5
No. Confidence E n
Level
1
Solve Me
Determine the minimum sample size for each of the following:
a. 95% confidence level, E = 4.8 ,
Using the formula for computing the appropriate sample size, derive the
formula for the margin of error (E) and population standard deviation .
2.
Compute for
the
appropriate sample size needed if the margin of error is 15, with a population
standard deviation of 43 and a 98% level of confidence.
__________
__________
__________
309
What I Have Learned
Let us summarize what you have learned by completing the following table. Write
the correct answer in a separate sheet of paper.
where:
________________________
________________________
________________________
________________________
What I Can Do
MAKE YOUR SURVEY NOW!
Materials needed:
Bond paper, calculator, pen and pencil
Things to do:
6. Make a problem statement anchored in any of the following issues:
a. Public Health like COVID – 19 pandemic
b. Climate change
c. Overpopulation
d. Gender Inequality
e. Bullying
f. Poverty
7. Identify a random variable related to your selected issue. For example, the
number of facemasks used for Public Health issues, volume of waste
disposal for climate change, the number of children in a family for
overpopulation, etc.
8. Survey all of your classmates.
9. Compute for the population mean, standard deviation, and the margin of
error. Use the formula: and a confidence level of 90%, 95%, and
99%.
10.Compute for the appropriate sample size using the computed values in
SURVEY CRITERIA
310
Accuracy of the solution 50%
TOTAL 100%
Assessment
Direction: Read and analyze each item carefully. Choose the letter of the best
answer. Write the chosen letter on a separate sheet of paper.
311
14. Which of the following is NOT needed in computing the minimum sample
size for the estimation of the population mean?
A. sample mean C. margin of error
B. confidence level D. population standard deviation
15. Which of the following formulas is used to get the sample size of the
population?
A. C.
B. D.
Additional Activities
To deepen your knowledge on computing for the appropriate sample size using the
length of interval you can visit the following websites,
https://youtu.be/XO9NtHvRQTQ and http://youtu.be/uMILjv6Fs94.
Statistics and
Probability
Solving Problems Involving
Sample Size Determination
Statistics and Probability
Solving Problems Involving Sample Size Determination
First Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines.
However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are
owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these
materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them.
312
Development Team of the Module
Week Author:
Elmer C. Potes, Rolly Q. Suaverdez and Reynante M. Velasco
Editors: Celestina M. Alba and Nestor N. Sandoval
8 Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Edna E. Eclavea
Management Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Team:
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann
O. Briones Celestina M. Alba, Asuncion C. Ilao
Telefax: 02-8682-5773/8684-4914/8647-7487
Your patience in solving offered problems here in the module will help you
improve your computational skills as it tackles relevant culture-based situated
problems. Your ability to interpret, reason–out, and make a judgment or even decision
out of statistical measures will also be practiced here. The scope of this module permits
it to be used in many different learning situations. The language used recognizes the
diverse vocabulary level of students. The lessons are arranged to follow the standard
sequence of the course. But the order in which you read them can be changed to
correspond with the textbook you are now using.
In previous modules, you learned that sample statistics can estimate the
population parameter. You also learned how to compute for an appropriate sample size
using the length of the interval. This time, you will focus on solving problems involving
sample size determination for you to better estimate the population parameter.
What I Know
Let us see how far you will go and how much you know about this pretest.
Please answer all questions with all your best and utmost honesty so that we can
gauge your learning in the target competency of this lesson. Let’s get started!
313
NOTE: For students with online connections and want to have a paperless
assessment, please go to the link given by your teacher.
Direction: Choose the letter of the correct answer for each item. Shade/blacken the
circle that corresponds to your chosen answer.
A B C DO 1. Which of the following factors is not considered in the
O O O determination of sample size?
A. margin of error
B. level of confidence
C. confidence interval
D. variance or standard Deviation
A B C DO
O O O
2. The value of confidence coefficient ( z ) of 1.645 is used
2
A.
z (
nx 2 n) C.
𝐸
B.
n = 𝑋̅+ E D.
A B C DO 5. How does the margin of error and sample size affect one
O O O another? (Assume that the other factors are constant) A. A
smaller sample size reduces the margin of error.
B. A higher margin of error requires a larger sample.
C. A lower margin of error requires a larger sample size.
D. A small sample size results in a lower margin of error.
314
A B C DO 7. A researcher wants his estimate with a minimum margin
O O O of error. Which of the following is suggested? A. Reduce
the sample size.
B. Use a sample size of 30.
C. Increase the sample size.
D. Use a sample size of less than 25.
315
How do you find the pre-test? Did you encounter both familiar and
unfamiliar terms, symbols, and notations? Kindly compare your answer in the
Answer Key on the last part of this module.
If you obtain 100% or a perfect score, skip the module and immediately move
to the next module. While in the event you missed a point, please proceed with the
module as it will enrich your knowledge in the normal distribution, especially in
solving problems involving sample size determination
What’s In
Let’s have this first for a review.
Use the given information to compute the sample size.
1. 𝑍𝛼⁄2= 2.576, 𝜎 = 12 𝐸=3 n=?
What’s New
In each of the situations below, identify the values of the following:
A. confidence level C. standard deviation, (𝜎)
B. critical value, (𝑍𝛼⁄2) D. margin of error, (𝐸)
Situation 1
A researcher would like to estimate the mean weight loss of people one year after
using a specific weight loss program. How many people on the weight loss program
must be surveyed in order to be 95% confident that the sample mean weight loss is
within 0.75 pounds of the true population mean? Assume the population standard
deviation is known to be 10.4 pounds.
Situation 2
An investigator wants to estimate the mean birth weight of infants born fullterm to
mothers who are 19 years of age and under. The mean birth weight of infants born
full-term to mothers 20 years of age and older is 3,510 grams with a standard
deviation of 385 grams. How many women 19 years of age and under must be
enrolled in the study to ensure that a 90% confidence on interval estimate of the
mean birth weight of their infants has a margin of error not exceeding 100 grams?
Situation 3
316
A medical practitioner wants to estimate the mean systolic blood pressure in
preschool children with inborn heart disease. He plans on using a 99% confidence
interval and a margin of error of 5 units. The standard deviation of systolic blood
pressures in pre-school children with inborn heart disease is 20. How many
children should be enrolled in the study?
What is It
In the previous module, you equipped yourself with a formula in determining
the sample size. It is expected that you have learned very well the steps in using the
formula because you will still use it in this module. Also, in the previous lesson, the
values needed in the computation were already given. But since you are dealing
with worded problems in this module, you are going to identify first the given values
from the problem before you can use the formula. Hence, it is important to correctly
identify those values.
The Sample Size Determination in Estimating Population Mean
Let us imagine that you own a fast-food company. You are doing a survey to
get an idea of how long customers stay in a restaurant. In the past two years, it was
observed that the standard deviation of the time they stay in a restaurant is 12
minutes. How many people do you need to observe so that the desired margin of
error will not be more than three (3) minutes with a confidence level of 95%?
Let us identify first the given in the problem
Confidence Level = 95%
Critical Value (𝑍𝛼⁄2) = 1.96
Margin of Error (𝐸) =3
Standard Deviation (𝜎) = 12
The formula always gives you the minimum number of sample sizes needed to
ensure the specified precision. Hence, if you round down 61.47 to 61 (like what we
usually do in rounding off numbers), it will be less than the minimum. Therefore,
you should always round up whatever the decimal part is. Thus, the required
sample size is 62.
What if you wish to minimize the error to at most 2 (E ≤ 2)? Will it affect the
sample size? Let’s find out. Your given values will be
Confidence Level = 95%
Critical Value (𝑍𝛼⁄2) = 1.96
Margin of Error (𝐸) =2
Standard Deviation (𝜎) = 12
317
What does it imply? As the margin of error gets smaller, the sample size gets
larger. Or you can say that one way to minimize the error is by increasing the
sample size. The same variation will be applied if the margin of error increases.
Moreover, if you reduce the level of confidence to 90% it will surely affect the
sample size. Upon changing the level of confidence to 90%, the critical value will
change also.
Critical Value (𝑍𝛼⁄2) = 1.645
Margin of Error (𝐸) =3
Standard Deviation (𝜎) = 12
Then using the formula to compute for the sample size, you have
If we further reduce the level of confidence to 80%, the sample size will be
computed as follows:
Confidence Level = 80%
Critical Value (𝑍𝛼⁄2) = 1.28
Margin of Error (𝐸) =3
Standard Deviation (𝜎) = 12
The changes in the level of confidence in the computation above tell you
that as the sample size gets smaller as your level of confidence decreases.
Factors Affecting Sample size Determination
There are three factors that influence the sample size determination; (1) the
level of confidence, (2) the population standard deviation, and (3) the margin of error.
Among these factors, only two are under the researcher’s control.
These are:
• Margin of Error. Since no sample will be perfect, you need to decide how much
error to allow. The confidence interval determines this. The less error you’re willing
to accept, the bigger the sample size needs to be.
• Confidence Level. This determines how certain you want to be that the
parameter falls within your confidence interval. Most common confidence intervals
are 90%, 95% or 99%. The more confident you want to be, the bigger the sample
size needs to be.
Table 1:
The table below shows the confidence coefficient (critical values, 𝑍𝛼⁄2)
for a different level of confidence.
Level of
Confiden 99.73 99 98 96 95.45 95 90% 80 68.27 50%
ce % % % % % % % %
z 3.00
2.5 2.3 2.0 2.00 1.9 1.64 1.2 1.00 0.674
2
8 3 5 6 5 8 5
Illustrative Example 1
An investigator wants to estimate the caffeine consumption of senior high
school students. How many students would be required to ensure that a 95%
318
confidence interval estimate for the mean caffeine intake (measured in mg) is within
15mg of the true mean? Assume that the standard deviation in caffeine intake is
68mg
Let us identify first the given in the problem
What’s More
Apply the knowledge you obtained from the discussion by solving the given
problems below.
1. A 99% confidence interval has to be created to estimate the mean volume of
contents in the bottles of Honey-Z. Assume that the population standard
deviation is 20 ml. Calculate the sample size needed if the margin of error is no
more than:
a. 6 ml
b. 5 ml
c. 4 ml
d. 3 ml
2. How large sample size is required to determine the mean IQ of seventh graders
of Luis Palma National High School within +3 points, population standard
deviation of 15, and with the following confidence level?
319
a. 98%
b. 95%.
c. 90%.
d. 80%.
3. In the formula/equation for the sample size, n is the required sample size, z
is
2 the_____________________, represents the __________________,
and ___________ is the margin of error.
4. The factors which influence the sample size
determination are
____________________, ____________________ and ____________________.
5. As the sample size gets larger, the margin of error gets __________________.
6. The level of confidence determines how certain you want to be that the
parameter fall within your confidence interval. The more confident you want to
be, the ______________ the sample size needs to be.
What I Can Do
Read the following problems carefully. Apply your skills in solving the following
cases.
1. A researcher found that the IQ scores of the ALS students in the Division of
Quezon Province are normally distributed with a mean of 110 and a standard
deviation of 10.
a. How many ALS students are needed to test so that the estimate will not be more
than 5 from the population mean with a 99% level of confidence?
b. Due to limited contact with ALS students, the researcher tested a small number
of ALS students. Explain the effect of this small sample on the estimate of the
parameter?
2. A confidence interval estimate has a maximum error of 2.50. If the distribution of
the data where the estimate is obtained is normal with a standard deviation of 9,
a. How many samples are considered in 98% confidence interval estimate for the
mean?
b. What must be done to the sample size to reduce the margin of error of the
confident interval estimate? Support your answer.
Assessment
Direction: Choose the letter of the correct answer for each item. Shade/blacken the
circle that corresponds to your chosen answer.
320
A B C DO
O O O
2. The value of confidence coefficient ( z ) of 1.96 is used in
2
B. n = X + E D. n = df + 1
A B C DO 5. How does the margin of error and sample size affect one
O O O another? (Assume that the other factors are constant) A. A
smaller sample size reduces the margin of error.
B. A higher margin of error requires a larger sample.
C. A lower margin of error requires a larger sample size.
D. A small sample size results in a lower margin of error.
A B C DO 6. How does the standard deviation (𝜎) affect the sample size?
O O O (Assume that the other factors are constant)
A. The sample size is constant for all values of 𝜎.
B. Lower 𝜎 reduces the required sample size.
C. Lower 𝜎increases the required sample size.
D. Higher 𝜎 decreases the required sample size.
A B C DO 7. A researcher wants his/her estimate with a minimum
O O O margin of error. Which of the following is suggested? A.
reduce the sample size.
B. use a sample size of 30.
C. increase the sample size.
D. use a sample size of less than 25.
321
A B C DO 9. If a 90% level of confidence is applied by the researcher
O O O in the condition given in item # 8, what sample size shall
be obtained?
A. 8 B. 17 C. 27 D.
34
A B C DO 12. What is the desired margin of error for the estimate of the
O O O population parameter?
A. 2 B. 3 C. 4 D. 16
Additional Activities
322
How well did you perform in this lesson? Are you ready for the next lesson?
If you are, then answer the following situation.
As a researcher, you were hired by MNG Clinic to estimate the average BMI
of the adult population of Lucena City. A previous survey indicates a standard
deviation of 15.50. How many adults should be visited as samples in order to be
95% confident that the confidence interval estimate will not differ from the true
population mean by more than 3? What type of random technique will you use to
get your sample? State your procedure in conducting the survey.
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
___________________
Answer Key
Module 1
323
Module 2
324
Module 3
325
326
327
328
Module 5
Additional Activities
1.
Value of the Random Variable X
Possible Outcomes
( No. of Tails )
HHHH 0
HHHT 1
HHTH 1
HTHH 1
THHH 1
HHTT 2
HTHT 2
HTTH 2
THHT 2
THTH 2
TTHH 2
HTTT 3
THTT 3
TTHT 3
TTTH 3
TTTT 4
Probability Distribution
Value of Random Variable X 0 1 2 3 4
P(X) 1 / 164 / 166 / 164 / 161 / 16
2.
a. Sample Space
{(1,2) ,(2,1),
(1 ,3),(3,1),
(2.3),(3,2) ,
(1 ,1),(2,2),
(3,3)}
b. 1+1=2
1+2=3
2+1=3
2+2=4
1+3=4
3+1=4
2+3=5
3+2=5
3+3=6
The values of the random variable are 2,3,5 and 6
c.
Sum X 2 3 4 5 6
P(X) 1 2 3 2 1
9 9 9 9 9
Module 6
329
330
Additional Activities
1. Since 𝜇𝐴 = 6 ; 𝜇𝐵 = 10 and 𝜇𝐶 = 4
Therefore 𝜇𝐶 < 𝜇𝐴 < 𝜇𝐵
2 . 𝝈𝒂 < 𝝈𝑩 𝒂𝒏𝒅 𝝈𝒂 < 𝝈𝑪
Because 𝝈𝑩 = 𝝈𝑪
Figure C
Figure B
Figure A
What I Know What's More
1. B
2. C
3. A
4. C
5. B
6. A Assessment
7. D
8. A 1. D
9. B 2. C
10. C 3. C
11. B 4. D
12. C 5. C
13. D 6. A
14. B 7. C
15. C 8. D
9. A
10. C
11. D
12. C
13. B
14. C
15. B
Module 9
331
332
333
334
Module 11
Additional Activities
1 . A. 81.50%
B. 97.35%
2 . A. 69 to 87
b. 60 to 96
335
What I Know What's More Assessment
1 . B Activity 1 1 .D
2 . B 1. 0.0006 2 .B
2. 0.8944
3 .A 3 .C
3. 0.7123
4 .C 4. 0. 9987 4 .A
5 . 0.9750
5 .C 6. 0.0918 5 .A
7. 0.0062
6 .C 8.. 06141 6 .D
9. 0.4483
7 .A 7 .A
10. 0.0250
8 .A 8 .B
Activity 2
9 .B 9 .B
10 .C 10 .C
11 .D 1 . 11 .A
12 .B 12 .C
13 .A 13 .A
14 .B 14 .A
2 .
15 .C 15 .D
3
4.
5
336
What’s New Additional Activities
1. 0.0455
2. 0.8577
3. 0.8770
4 . 0.7190
1 . 6 .
5. 0.8413
B.
1. 0.6587
2. 0.9941
3. 0.0985 2. 7 .
4. 0.5960
5. 0.1359
3 . 8 .
4 . 9 .
5 . 10 .
Module 14
337
Module 15
338
339
340
341
342
343
What’s In (cont….)
2
B. 𝜇 = 3.89, 𝜎 = 4.38
𝑿 𝑷(𝑿) X • P(X) ( X -μ) ( μ)
X - ^ 2μ) ( X - ^2 •P(X )
1 0.13 0.134 -2.89 8.36 1.121
2 0.23 0.462 -1.89 3.58 0.827
3 0.09 0.27 -0.89 0.80 0.072
4 0.17 0.66 0.11 0.01 0.002
5 0.11 0.56 1.11 1.23 0.137
6 0.07 0.42 2.11 4.44 0.311
7 0.2 1.386 3.11 9.66 1.913
3.89 4.38
344
( What’s More Independent Activity )
19. 𝜇 = 9.4
2 a
2 b
𝑿
𝑿−µ (𝑿 − µ)𝟐
5 -4.4 19.36
6 -3.4 11.56
9 -0.4 0.16
12 2.6 6.76
15 5.6 31.36
∑ (𝑋 − 𝜇)2
69.2
=
20. 𝜎2 = 13.84
21. 10
22.
Samples Mean
5, 6 5.5
5, 9 7
5 , 12 8.5
5 , 15 10
6, 9 7.5
6 , 12 9
6 , 15 10.5
9 , 12 10.5
9 , 15 12
12 , 15 13.5
345
346
( What’s More
Independent Assessment
)
First Step:Compute the mean of the �=7.6
population � 7
: Second Step Compute the variance of the
population
a.Subtract each measurement by the computed population
mean
b.Square the results obtained in (a)Divide
then add.
the sum by the
frequency of measurements to get the value of the population
variance.
��2̅ =1 .8
�� 3 9
Third Step:Determine the number of possible samples of size n = 2,
���
using formula of combination.
���
15 -number of possible samples of size
2
Fourth Step:List all possible samples and their corresponding means.
SamplesMea
n
2, 53.50
2, 64.00
2, 95.50
2, 116.50
2, 137.50
5, 65.50
5, 97.00
5, 118.00
5, 139.00
6, 97.50
6, 118.50
6, 139.50
9, 11
10.00
9, 13
11.00
11, 13
12.00
347
348
Module 18
349
350
351
352
( What’s More Independent Activity )
Given Answer Standard
Error
Formula
a. 𝑁 = 8, 𝑛 = 3 Known 𝜎𝑥̅
b. 𝜇 = 12, 𝑠 = 3, 𝑛 = 125 Unknown 𝑠𝑥̅
c. 𝑁 = 11, 𝑛 = 5 Known 𝜎𝑥̅
d. 𝑁 = 100 Known 𝜎𝑥̅
𝜇 = 12, 𝜎 = 2
2
𝜎 = 4, 𝑛 = 10
𝑋̅ = 15, , 𝑠2 = 𝑠 = 1
e. 𝜇 = 137.16 Unknown 𝑠𝑥̅
𝑛 = 15
𝑋̅ = 139.1
𝑠 = 2.23
353
354
Module 19
Additional Activity
Answer Standard Error
Formula
1. k nown 𝜎𝑥̅
2. known 𝜎𝑥̅
3. un known 𝑠𝑥̅
4. known 𝜎𝑥̅
5. known 𝜎𝑥̅
Post Assessment
4. B 11 .D What I Can Do
5. A 12 .A
Answer Standard Error
6. C 13 .A Formula
7. B 14 .A
4. known 𝜎𝑥̅
8. B 15 .B
9. B 5. unknown 𝑠𝑥̅
10. B 𝜎𝑥̅
6. known
11. A
7. unknown 𝑠𝑥̅
12. B
10 .B 8. unknown 𝑠𝑥̅
355
356
Module 20
357
358
359
360
Assessment
1. D
2. A
3. D
4. A
5. B
6. D
7. B
8. A
9. C
10. A
11. C
12. B
13. A
14. A
15. D
What I Know
1 .B
2 .C
3 .B
4 .C
5 .A
6 .A
7 .A
8 .A
9 .C
10 .C
11 .A
12 .D
13 .A
14 .A
15 .B
361
What I have Learned What I can do
1 . small, population variance/ 1 . normal
standard deviation distribution
2 . William S. Gosset 2 . both
3 . bell, zero, 1 3 . both
4 . horizontal axis 4 . normal
5 . zero distribution
6 . degrees of freedom 5 . t-distribution
7 . lower, thicker 6 . both
8 . normal distribution 7 . t-distribution
9 . greater 8 . t-distribution
10 . v/v-2 9 . both
10 . both
Assessment What's More What I Know
1 .C 1 . REAL
1 .C
2 .C 2 . REAL
2 .A
3 .A 3 . FAKE
3 .C
4 .D 4 . REAL
4 .B
5 .A 5 . REAL
5 .A
6 .C 6 . FAKE
6 .A
7 .C 7 . REAL
7 .C
8 .D 8 . FAKE
8 .C
9 .A 9 . REAL
9 .A
10 .B 10 . REAL
10 .D
11 .A
11 .A
12 .C
12 .C
13 .A
13 .A
14 .B
14 .D
15 .D
15 .C
Module 21
362
What I Know
1 .D
What’ More 2 .D
Assessment ACTIVITY 1
3 .B
1 .B
4 .C
2 .A
5 .D
3 .D 1.
6 .C
4 .D
7 .C
5 .C
8 .D
6 .C
2 . 9 .B
7 .D
10 . C
8 .B
11 . A
9 .A
12 . D
10 . B
3 . 13 . A
11 . C
14 . C
12.D
15 . B
13 . A
14 . D 4.
What I can do
15 . D
1 .a 1.753
b. 1.701
Additional Activities
5 . c. 1.660
1 . 0.0779 or 7.79%
ACTIVITY 2 2. 2.552
2 . 0.0052 or 0.52%
1 . 1.321 3. 1.318
3 . 0.9810 or 98.10%2 . 0.700
3 . -2.150
4. 1.844
4 . -1.108
5 . -0.312 5 . -0.861
6 . 1.740
7 . 2.528
8 . 2.145
9 . -0.684
Module 22
363
What's More
What I Know
Assessment Practice Activity 1
The average percentage of people
1. C
1. B who are satisfied with the
product is 52 %
The lower limit of confidence
2. B
2. D
48.02
interval is% or 0.4802 .
The upper limit of confidence
3. A
3. A
55.98
interval is % or 0.5598 .
The confidence interval is 4. B
4. B
(48.02%,55.98%) or
(0.4802,0.5598 ). 5. C
5. D
6. D
6. C Independent Assessment 1
What is the lower limit? 7. C
7. B 13226.35
8. B
8. B What is the upper limit?
13773.65 9. A
9. B
What is the length of confidence
interval?
(13 226.35,13773.65)
If you
13 have P ,100.00, is that
amount lies within the
confidence interval?
NO.
Independent Assessment 2
What is the margin of error? 2.2
What is the average time they
read per week? 15.7 hours
Additional Activities
What is the lower limit? 30.26
What is the upper limit? 33.34
What is the confidence interval?
(30.26,33.34)
Based on the interval you get,
is it plausible that the mean
age of all graduate students at
her university is
also 33 years? Yes
Module 23
364
365
366
modules.
random
sampling discussed in the previous
, The students may from different types of
choose 103
Activities
Additional
smaller.
margin of error getslarger,
samples.
the
b. increases the numberAs ofthe sample size gets
2 a. 71
estimate.
increases It also
Quezon.the margin of error of the
size too ll estimate
IQscores of IQ students the
in the Division of
b.Asample that is smareduces the power to
1 .a. 27
do
What I can
References
Module 1
Pierce, Rod. (3 Feb 2020). "Random Variables". Math Is Fun. Retrieved 24 May
2020 from http://www.mathsisfun.com/data/random-variables.html
Malate, Jose S. Statistics and Probability for Senior High School. Vicarisg
Publications and Trading, Inc, 2017.
Lim, Yvette F., Nocon, R., Nocon, E., Ruivivar, L. Math for Engaged Learning
Statistics and Probability. Sibs Publishing House, Inc. 2016.
Module 2
Belencia, Rene R., Baccay, Elisa S., and Mateo, Efren B. Statistics and Probability.
Rex Book Store, Inc. (RBSI). 2016
367
Melosantos, Luis Allan B., Janice E. Antonio, Susan S. Robles and Ryan M.
Bruce. Math Connections in the Digital Age. Sibs Publishing House, Inc. 2016
Malate, Jose S. “Statistics and Probability”. Vicarish Publications and
Trading, Inc. (2017). ISBN 978-421-8030-17-6, 17-26.
http://www.stat.yale.edu/Courses/1997-98/101/ranvar.htm,
https://www.mathsisfun.com/data/random-variables.html
Module 3
Belencia, Rene R., Baccay, Elisa S., and Mateo, Efren B. Statistics and
Probability. Manila: Rex Book Store, Inc. (RBSI). 2016.
Melosantos, Luis Allan B., Antonio, Janice E., Robles, Susan S. and
Bruce, Ryan M. Math Connections in the Digital Age. Quezon City: Sibs
Publishing House, Inc. 2016.
http://www.stat.yale.edu/Courses/1997-98/101/ranvar.htm,
https://www.mathsisfun.com/data/random-variables.html
Module 4
Belecina, Rene R., Baccay, Elisa S. and Mateo, Efren B. “Statistics and Probability”.
Manila: Rex Book Store, Inc. (RBSI).2016. pages 9-18
Malate, Jose S. “Statistics and Probability for Senior High School.” Vicarish
Publications and Trading, Inc, 2017. pages 5-9
https://en.wikipedia.org/wiki/Timeline_of_the_COVID19_pandemic_in_the_Philipp
ines
Module 5
Belecina,Rene R., Baccay,Elisa S. and Mateo, Efren B. “Statistics And Probability”.
Manila: Rex Book Store, Inc. (RBSI), 2016.
Malate, Jose S. “Statistics and Probability for Senior High School.” Vicarish
Publications and Trading, Inc, 2017.
Module 6
Textbooks:
Gordon, Ian. Probability and statistics: Discrete Probability Distributions – A guide
for Teachers (Years 11–12). Education Services Australia, 2013.
Moore, David S. and McCabe, George P. Introduction to the Practice of Statistics -
Second Edition. W.H. Freeman and Company, 1993.
Malate, Jose S. Statistics and Probability. Vicarish Publication and Trading. Inc.,
2018.
Alferez, Merle S. and Duro, Ma. Cecilia A. MSA Statistics and Probability. MSA
Publishing House, 2012
Websites:
https://www.mathsisfun.com/data/random-variables-mean-variance.html
https://www.cabarrus.k12.nc.us/cms/lib/NC01910456/Centricity/Domain/4633
/Random%20Variables%20Worksheet%202.pdf
https://www.coconino.edu/resources/files/pdfs/academics/sabbaticalreports/kat
e-kozak/chapter_5.pdf
368
https://www.youtube.com/watch?v=NH6oiaV4iWA Comparing standard deviations
without calculation.mp4
Module 7
Gordon, Ian. Probability and statistics: Discrete Probability Distributions – A guide
for Teachers (Years 11–12). Education Services Australia, 2013.
Moore, David S. and McCabe, George P. Introduction to the Practice of
Statistics - Second Edition. W.H. Freeman and Company, 1993.
Malate, Jose S. Statistics and Probability. Vicarish Publication and Trading. Inc.,
2018.
Alferez, Merle S. and Duro, Ma. Cecilia A. MSA Statistics and Probability. MSA
Publishing House, 2012.
Websites:
https://ocw.mit.edu/courses/mathematics/18-05-introduction-to-probability-
andstatistics-spring-2014/readings/MIT18_05S14_Reading5a.pdf
https://www.mathsisfun.com/data/random-variables-mean-variance.html
https://www.cabarrus.k12.nc.us/cms/lib/NC01910456/Centricity/Domain/4633
/Random%20Variables%20Worksheet%202.pdf
https://www.coconino.edu/resources/files/pdfs/academics/sabbaticalreports/kat
e-kozak/chapter_5.pdf
Module 8
Belecina, Rene R, Baccay, Elisa S., Mateo, Efren B. Statistics and Probability. First
Edition. Manila, Philippines: Rex Book Store. 2016.
Lim, Yvette F., Nocon, Rizaldi C., Nocon, Ederlina G., Ruivivar, Leonor A.
Math for Engaged Learning Statistics and Probability. Quezon City: Sibs
Publishing House, Inc. 2016.
Malate, Jose S. Statistics, and Probability for Senior High School. Sta. Ana, Manila,
Philippines: Vicarish Publications and Trading, Inc. 2017.
Module 9
Lim, Yvette F., Nocon, Rizaldi C., Nocon, Ederlina G., Ruivivar, Leonor A. Math for
Engaged Learning Statistics and Probability. Quezon City: Sibs Publishing
House, Inc. 2016.
Malate, Jose S. Statistics and Probability for Senior High School. Sta. Ana Manila:
Vicarish Publications and Trading, Inc. 2017. Module 10
Books
Mahmot, Millard R. and Mamhot, Alice A. “Statistics and Probability”. Unlimited
Books Library Service & Publishing Inc., 2016.
Malate, Jose S. “Statistics and Probability”. Vicarish Publications and Trading, Inc.,
2017.
Ocampo, Jose M. and Marquez, Wimer G. Senior High Conceptual Math & Beyond,
Statistics and Probability. Brilliant Creations Publishing Inc., 2016. Websites
http://davidmlane.com/hyperstat/z_table.html https://www.khanacademy.org/
Module 11
A. Textbooks
369
Mahmot, Millard R. and Mamhot, Alice A. “Statistics and Probability”. Unlimited
Books Library Service & Publishing Inc., (2016). ISBN 978-971-9654-55-1,
44-50.
Malate, Jose S. “Statistics and Probability”. Vicarish Publications and Trading, Inc.,
2017. ISBN 978-421-8030-17-6, 45-54.
Ocampo, Jose M. and Marquez, Wimer G. Senior High Conceptual Math & Beyond,
Statistics and Probability. Brilliant Creations Publishing Inc., 2016. ISBN 978621-
8006-27-0, 39-42. B. Websites: http://davidmlane.com/hyperstat/z_table.html
https://www.khanacademy.org/
Module 12
Books
Mahmot, Millard R. Ph.D. and Mamhot, Alice A. M.S.. “Statistics and
Probability”. Unlimited Books Library Service & Publishing Inc. (2016), ISBN
978-971-9654-55-1, 41 -43.
Ocampo, Jose M. PhD and Marquez, Wimer G. MA.. Senior High Conceptual
Math & Beyond, Statistics and Probability. Brilliant
Creations Publishing Inc. (2016), ISBN 978-621-8006-27-0, 31-42.
Websites
http://davidmlane.com/hyperstat/z_table.html https://www.khanacademy.org/
Module 13
Books
Mahmot, Millard R. and Mamhot, Alice A. Statistics and Probability. Unlimited
Books Library Service & Publishing Inc., 2016.
Malate, Jose S. Statistics and Probability. Vicarish Publications and Trading, Inc.,
(2017).
Ocampo, Jose M. and Marquez, Wimer G. Senior High Conceptual Math & Beyond,
Statistics and Probability. Brilliant Creations Publishing Inc., (2016). Websites:
http://davidmlane.com/hyperstat/z_table.html https://www.khanacademy.org/
Module 14
Books
Belecina, Rene S., Baccay, Elisa S., and Mateo, Efren B. Statistics and Probability.
Manila: REX Book Store, 2016.
Mercado, Jesus P. and Orines, Fernando B. Next Century Mathematics Statistics &
Probability. Quezon City: Phoenix Publishing House, 2016.
370
Mercado, Jesus P. and Orines, Fernando B. Next Century Mathematics Statistics &
Probability. Quezon City: Phoenix Publishing House, 2016.
371
Books:
Belecina, R., Baccay, E., Mateo, E. Statistics and Probability. Manila: REX Book
Store, 2016.
Blay, B. Elementary Statistics. Pasig City: ANVIL Publishing, Inc., 2007.
Websites:
The Sampling Distribution of the Sample Mean.
https://saylordotorg.github.io/text_introductory-statistics/s10-02-
thesampling-distribution-of-t.html.
Libretexts. "7.2: The Central Limit Theorem for Sample Means (Averages)." Statistics
LibreTexts. November 10, 2019.
https://stats.libretexts.org/Bookshelves/Introductory_Statistics/Book:_Introd
uctory_Statistics_(OpenStax)/07:_The_Central_Limit_Theorem/7.02:_The_Centr
al_Limit_Theorem_for_Sample_Means_(Averages).
Module 21
Book
Malate, Jose S. Statistics and Probability. Vicarish Publications and Trading, Inc.,
2017. Websites:
https://www.statisticshowto.com/probability-and-statistics/t-distribution/
https://stattrek.com/probability-distributions/t-distribution.aspx
https://stepupanalytics.com/t-test-distribution-and-its-application/
https://www.youtube.com/watch?v=Uv6nGIgZMVw
Module 22
Book
Malate, Jose S. Statistics and Probability. Vicarish Publications and Trading, Inc.,
2017. Websites:
https://www.statisticshowto.com/probability-and-statistics/t-distribution/
https://stattrek.com/probability-distributions/t-distribution.aspx
https://stattrek.com/online-calculator/t-distribution.aspx
https://m.youtube.com/watch?v=qH7QZoMbB80
https://stepupanalytics.com/t-test-distribution-and-its-application/ Module
23
Baccay, Elisa S., Belecina, Rene S., Mateo, Efren B. Statistics and Probability.
Manila: Rex Book Store Inc., 2016.
Malate, Jose S. Statistics and Probability for Senior High School. Vicarish
Publications and Trading, Inc., 2017. Websites
https://www.statisticshowto.com/probability-and-
statistics/hypothesistesting/margin-of-error/
https://mathbitsnotebook.com/Algebra2/Statistics/STMarginPractice.html
https://www.khanacademy.org/math/ap-statistics/estimating-confidence-
372
ap/onesample-t-interval-mean/e/sample-size-margin-error-one-sample-t-interval-
mean
Module 24
Textbooks
Malate, Jose S. Statistics and Probability for Senior High School. Vicarish
Publications and Trading, Inc, 2017.
Websites: https://www.mathsisfun.com/data/confidence-interval-calculator.html
https://www.insightsassociation.org/article/importance-confidenceintervals
https://online.stat.psu.edu/stat414/node/198/
https://www.nku.edu/~statistics/212_Confidence_Iinterval_using_z.htm
Module 25 Books:
Mahmot, Millard R. and Mamhot, Alice A. Statistics and Probability.
Unlimited Books Library Service & Publishing Inc., 2016.
Malate, Jose S. Statistics and Probability. Vicarish Publications and Trading,
Inc., 2017.
Belecina, Rene R. et.al. Statistics and Probability, 1st Edition. Sampaloc Manila:
Manila: Rex Book Store, Inc., 2016.
Lim, Yvette, et.al. Math for Engaged Learning Statistics and Probability. Quezon
City: SIBS Publishing House, 2016.
Orines, Fernando B. Next Century Mathematics Statistics and Probability. Quezon
City: Phoenix Publishing House, 2016.
Internet Source: https://www.mathsisfun.com/data/confidence-interval-
calculator.html https://www.insightsassociation.org/article/importance-
confidence-intervals https://youtu.be/XO9NtHvRQTQ
http://youtu.be/uMILjv6Fs94
Module 26
Textbooks
Malate, Jose S. Statistics and Probability. Vicarish Publication and Trading Inc.,
2017.
Febre, Francisco A. Jr. Introduction to Statistic. Phoenix Publishing House Inc.,
2000.
Belecena, Rene R. Statistics and Probability. Manila: Rex Book Store, Inc., 2016.
Websites https://study.com/learn/sample-size-determination-questions-and-
answers html
http://sphweb.bumc.bu.edu/otlt/MPHModules/BS/BS704_Power/BS704_Power3.
html
373
For inquiries or feedback, please write or call:
374