Beruflich Dokumente
Kultur Dokumente
Objectives
At the end of this module, the students will be able to:
1. Describe statistics;
2. Appreciate the significance of the use of statistics; and
3. Apply the concepts of the different types of statistical data.
Statistics is the science of collecting, organizing, presenting, analyzing, and interpreting numerical data to
assist in making more effective decisions. In its more common usage, refers to numerical information. It can appear
in graphic forms as well as in sentence form (Lind, Marchal, Wathen, 2006 and Kvantli, Guynes, Pavur, 1986).
Collection: This refers to the gathering of information using scientific methods (e.g.
experimentation, survey, interviews).
Organization: This is the simplification and presentation of the information which have been
collected. In organizing information, three methods can be done—tabular, graphical or
numerical.
Analysis: This is working out the information which were collected to find out values, trend,
differences, associations or predictions. In doing so, investigators apply various tools and
techniques in Statistics. There are three types of analysis depending on the type of number of
variables that are to be used—univariate, bivariate and multivariate.
Interpretation: This is seeing beyond what the information have revealed. A sound
interpretation is characterized as having scientific rigor and evidence.
Statistic is a characteristic of a sample (mean, standard deviation, variance, or any other measure based on a
sample data). A collection of more than one figure is called statistics [plural] (Lind, et al, 2006).
(1) Statistics helps in providing a better understanding and exact description of a phenomenon of nature.
(2) Statistical helps in proper and efficient planning of a statistical inquiry in any field of study.
(3) Statistical helps in collecting an appropriate quantitative data.
(4) Statistics helps in presenting complex data in a suitable tabular, diagrammatic and graphic form for an
easy and clear comprehension of the data.
(5) Statistics helps in understanding the nature and pattern of variability of a phenomenon through
quantitative observations.
(6) Statistics helps in drawing valid inference, along with a measure of their reliability about the population
parameters from the sample data.
(7) To scientifically measure conditions of any given problem and assess existing relationship(s).
(8)To show the laws underlying facts and events that cannot be determined by individual observations.
(9)To reveal cause and effect relations that otherwise may remain unknown
(10)To uncover ambiguous trends and behavior in related conditions.
A. Descriptive Statistics is the method of organizing, summarizing, and providing a description of the sample
data in an informative way. It includes presenting data in percentage, ranks, standard units, frequency distribution,
measures of location, measures of dispersion, among others.
For Example: Industrial statistics, population statistics, trade statistics etc… Such as businessman
make to use descriptive statistics in presenting their annual reports, final accounts, bank statements
Examples 1: A poll found that 49% of the people in a survey knew the name of the first book of the Bible. The
statistic 49 describes the percentage (proportion) of persons who knew the first book of the Bible.
2: According to consumer reports, Sharp washing machine owners reported 9 problems per 100
machines in 2007. The statistic 9 describes the number of problems out of every 100 reported
machines.
B. Inferential Statistics is used to infer the truth or falsity of a hypothesis. It includes making a decision, estimate,
prediction, or generalization about a population, based on a sample. The ultimate goal is to gain information
about the sample drawn from a population rather than on the population itself. Inferential statistics allow us to
make accurate inferences about the population itself on the basis of the sample data.
For Example: Suppose we want to have an idea about the percentage of illiterates in our country. We take a
sample from the population and find the proportion of illiterates in the sample. This sample proportion with the
help of probability enables us to make some inferences about the population proportion. This study belongs to
inferential statistics.
A sample is a portion, or part, of the population of interest. It is a subgroup of the target population which
the researcher plans to study for the purpose of making generalization about the entire population.
Examples 1: TV networks constantly monitor the popularity of their programs by hiring private research firms
and other organizations to sample the preferences of TV viewers.
2: The accounting department of a large firm will select a sample of the invoices to check for
accuracy for all the invoices of the company.
3: Wine tasters sip a few drops of wine to make a decision with respect to all the wine waiting to
be released for sale.
2. Parametric statistics are used when interval measurement can be assumed and the sample size is
appropriate.
Examples: Gender, religious affiliation, type of automobile owned, place of birth, hair color.
2. Quantitative variable: the variable can be reported numerically. Quantitative variables can be classified
as either discrete or continuous.
a. Discrete variables can only assume certain values and there are usually “gaps” between values.
Typically, discrete variables result from counting.
b. Continuous variables: can assume any value within a specific range. A variable which can
assume each and every value within a given range. It can occur in decimals.
For Example: Heights and Weights of students, Speed of a bus, the age of a Shopkeeper, the life time of
a T.V etc…
the time it takes to fly from Cebu to Manila.
air pressure in a tire
weight of a shipment of grains (15.01 tons, 15.45 tons, etc.)
distance between Cebu and Bohol, etc.
balance in your checking account
minutes remaining in class
Constant: A quantity which can be assuming only one value. It is usually denoted by the first letters of
alphabets .
Data are information, known facts, figures, observation, statistics, records, and reports. They can be
classified in different ways.
Classification of Data: The process of arranging data into homogenous group or classes according to some
common characteristics present in the data
For Example: The process of sorting letters in a post office, the letters are classified according to the cities
and further arranged according to streets.
Bases of Classification:
There are four important bases of classification:
(1) Qualitative Base (2) Quantitative Base (3)Geographical Base (4) Chronological or Temporal Base
(1) Qualitative Base:
When the data are classified according to some quality or attributes such as sex, religion, literacy, intelligence
etc…
(2) Quantitative Base:
When the data are classified by quantitative characteristics like heights, weights, ages, income etc…
(3) Geographical Base:
When the data are classified by geographical regions or location, like states, provinces, cities, countries etc…
Types of Classification:
(1) One -way Classification:
If we classify observed data keeping in view single characteristic, this type of classification is known as one-
way classification.
For Example: The population of world may be classified by religion as Muslim, Christians etc…
b. Grouped Data are data presented in a frequency distribution table, organized or processed data.
Example: Rating Frequency
Superior 6
Good 28
Average 25
Poor 12
Inferior 3
DATA
Discrete Continuous
(number of children) (time taken for an exam)
A. Nominal Data: The “lowest” level of the most primitive measurement. Classification has no natural order.
There is no measurement involved, only counts. There is no particular order to the categories. Data categories are
mutually exclusive and exhaustive, so an object belongs to one and only one category. Data categories have no logical
order.
Nominal scales may be further subdivided into two groups: Renaming and Categorical.
1. Nominal-Renaming occurs when each object in the set is assigned a different number (i.e. renamed
with a number). Examples of nominal-renaming are social security numbers or numbers on the back of player’s
jersey. The former is necessary because different individuals have the same name, i.e. Mary Smith, and because
computers have an easier time dealing with numbers rather than alpha-numeric characters.
2. Nominal-Categorical occurs when objects are grouped into subgroups and each object within a
subgroup is given the same number. The subgroups must be mutually exclusive, that is, an object may not belong to
more than one category or subgroup. An example of nominal-categorical measurement is grouping people into
categories based upon stated political party preference (Nacionalista, LAKAS-KAMPI, or Laban,) or upon sex
(Male or Female.) In the political party preference system Nacionalistas might be assigned the number "1", LAKAS’
"2", and Laban "3", while in the latter females might be assigned the number "1" and males "2".
In general it is meaningless to find means, standard deviations, correlation coefficients, etc., when the data
is nominal. This does not mean, however, that such systems of measurement are useless, for in combination with
other measures they can provide a great deal of information.
Properties
1. Mutually exclusive. An individual or item that, by virtue of being included in one category, must be
excluded from any other category. example: eye color.
2. Exhaustive. Each person, object, or item must be classified in at least one category.
Example: religious affiliation.
B. Ordinal Level Data: may be arranged in some order, but differences between data values cannot be
determined or are meaningless. One category is “higher” or “better” than the next one. Data categories are mutually
exclusive and exhaustive and are ranked according to the particular trait they posses.
Example: Rating of a Student Teacher. During a taste test of 4 colas, cola C was ranked number 1, cola
B was ranked number 2, cola A was ranked number 3, and cola D was ranked number 4.
Cola C B A D
Rank 1st 2nd 3rd 4th
Properties
1. Data classifications are mutually exclusive and exhaustive.
2. Data classifications are ranked or ordered according to the particular trait they posses.
C. Interval Level Data: the next highest level of measurement. It includes all the characteristics of the ordinal
level, moreover, the difference between values is a constant size. There is no natural zero point.
Example: Temperature on the Celsius scale on a certain winter time in Canberra City Australia: (12 oC,
0oC, -5oC). Note that the zero value is just a point on the scale and does not represent the absence of the
condition.
Properties
1. Data categories are mutually exclusive and exhaustive
2. Data categories are scaled according to the amount of the characteristics they possess.
3. Equal differences in the characteristic are represented by equal differences in the numbers assigned
to the categories.
D. Ratio Level Data: the “highest” level of measurement, has all the characteristics of the interval level, in
addition, the zero (0) point is meaningful and the ratio between two numbers is meaningful. examples: money, units
of production, weight, income
Properties
1. Data categories are mutually exclusive and exhaustive
2. Data categories are scaled according to the amount of the characteristics they possess.
3. Equal differences in the characteristic are represented by equal differences in the numbers assigned
to the categories.