INTRODUCTION TO STATISTICS
1.1 WHAT IS STATISTICS?
• The word statistics derives from the classical Latin root status, which means
state.
• As potential users of statistics, we need to master both the “science” and the
“art” of using statistical methodology correctly.
Specific definition:
• Statistics is a collection of procedures and principles for gathering data and
analyzing information to help people make decisions when faced with uncertainty.
Example applications of Statistics
Nowadays statistics is used in almost all fields of human effort, such as:
• Sports
A statistician may keep records of the number of hits a baseball player gets in
a season.
• Financial
A financial advisor uses statistical information to make reliable predictions
about investments.
• Public Health
An administrator would be concerned with the number of residents who
contract a new strain of flu virus during a certain year.
• Others
Any Idea?…..
2. Applied Statistics
o Involves the application of those theorems, formulas, rules and laws to
solve real-world problems.
o Applied Statistics can be divided into two main areas, depending on how
data are used. The two main areas are:
ASPECTS OF STATISTICS
• Theoretical/Mathematical Statistics
• Applied Statistics
  o Descriptive Statistics
  o Inferential Statistics
Example
Determine which of the following statements is descriptive in nature and which is
inferential.
a. Of all U.S. kindergarten teachers, 32% say that “knowing the alphabet” is an
essential skill. (Inferential)
b. Of the 800 U.S. kindergarten teachers polled, 32% say that “knowing the
alphabet” is an essential skill. (Descriptive)
Population
Sample
Inference
Statistic
Parameter
                      Population (parameter)   Sample (statistic)
Average/Mean          µ                        x̄
Standard deviation    σ                        s
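The distinction between a parameter and a statistic can be illustrated with a short Python sketch. The population below is simulated purely for illustration (the heights, the population size of 1,000, and the sample size of 25 are assumptions, not values from these notes). Parameters are computed from every element of the population, while statistics are computed from the sample only.

```python
import random
import statistics

# Hypothetical population: heights (cm) of 1,000 students, simulated for illustration.
random.seed(0)
population = [random.gauss(165, 8) for _ in range(1000)]

# Parameters describe the whole population.
mu = statistics.mean(population)        # population mean, µ
sigma = statistics.pstdev(population)   # population standard deviation, σ

# Statistics describe a sample drawn from that population.
sample = random.sample(population, 25)
x_bar = statistics.mean(sample)         # sample mean, x̄
s = statistics.stdev(sample)            # sample standard deviation, s

print(f"parameters: mu = {mu:.2f}, sigma = {sigma:.2f}")
print(f"statistics: x_bar = {x_bar:.2f}, s = {s:.2f}")
```

In practice the population is usually too large to measure in full, so µ and σ stay unknown and x̄ and s are used to estimate them.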
• Variable
A characteristic of interest about each individual element of a population or
sample.
e.g. : A student’s age at entrance into college, the color of a student’s hair.
• Data value
The value of the variable associated with one element of a population or
sample. This value may be a number, a word, or a symbol.
e.g. : Farah entered college at age “23”, her hair is “brown”.
• Data
The set of values collected for the variable from each of the elements that
belong to the sample.
e.g. : The set of 25 heights collected from 25 students.
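The three terms above can be sketched as a small Python data structure. Only Farah's age (23) and hair color (brown) come from the example in these notes. The other two students and the field names are hypothetical, added so that the sample has more than one element.

```python
# Each dictionary is one element (a student) of the sample.
# "Farah" is taken from the notes; "Ali" and "Mei" are hypothetical.
students = [
    {"name": "Farah", "age_at_entrance": 23, "hair_color": "brown"},
    {"name": "Ali", "age_at_entrance": 19, "hair_color": "black"},
    {"name": "Mei", "age_at_entrance": 20, "hair_color": "black"},
]

# Variable: the characteristic of interest about each element.
variable = "age_at_entrance"

# Data value: the value of the variable for one element.
data_value = students[0][variable]

# Data: the set of values of the variable for all elements in the sample.
data = [student[variable] for student in students]

print(data_value)  # 23
print(data)        # [23, 19, 20]
```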
Example
A statistics student is interested in finding out something about the average ringgit
value of cars owned by the faculty members of our university. Each of the seven
terms just described can be identified in this situation.
i) Population : the collection of all cars owned by all faculty members at our
university.
ii) Sample : any subset of that population. For example, the cars owned by
members of the statistics department.
iv) Data value : one data value is the ringgit value of a particular car. Ali’s
car, for example, is valued at RM 45 000.
vi) Parameter : the parameter about which we are seeking information is the
“average” value of all cars in the population.
vii) Statistic : the statistic that will be found is the “average” value of the
cars in the sample.
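As a rough illustration of the statistic in this example, the sketch below averages the ringgit values of a small sample of cars. Only Ali's RM 45 000 comes from the example. The other owners and values are hypothetical, and the sample is assumed to be the cars owned by members of the statistics department.

```python
import statistics

# Ringgit values (RM) of the cars in the sample.
# Only Ali's value is from the notes; the rest are hypothetical.
sample_values = {
    "Ali": 45_000,
    "Owner B": 62_000,
    "Owner C": 38_500,
    "Owner D": 51_000,
}

# Data: the set of values collected from the sample.
data = list(sample_values.values())

# Statistic: the "average" value of the cars in the sample.
sample_mean = statistics.mean(data)

print(f"Sample mean value: RM {sample_mean:,.0f}")
```

The parameter, the average value of all cars owned by all faculty members, stays unknown unless every car in the population is valued. The sample mean serves as an estimate of it.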
e.g. Number of courses for which you are currently registered.
e.g. Weight of books and supplies you are carrying as you attend class today.
EXERCISE 1
1. Of the adult U.S. population, 36% has an allergy. A sample of 1200 randomly selected
adults resulted in 33.2% reporting an allergy.
a. Describe the population.
b. What is the sample?
c. Describe the variable.
d. Identify the statistic and give its value.
e. Identify the parameter and give its value.
2. The faculty members at Universiti Utara Malaysia were surveyed on the question
“How satisfied were you with this semester’s schedule?” Their responses were to be
categorized as “very satisfied,” “somewhat satisfied,” “neither satisfied nor
dissatisfied,” “somewhat dissatisfied,” or “very dissatisfied.”
a. Name the variable of interest.
b. Identify the type of variable.
3. A study was conducted by Aventis Pharmaceuticals Inc. to measure the adverse side
effects of Allegra, a drug used for treatment of seasonal allergies. A sample of 679
allergy sufferers in the United States was given 60 mg of the drug twice a day. The
patients were to report whether they experienced relief from their allergies as well as
any adverse side effects (viral infection, nausea, drowsiness, etc.).
a. What is the population being studied?
b. What is the sample?
c. What are the characteristics of interest about each element in the population?
d. Are the data being collected qualitative or quantitative?
Types of Data
Primary data
3. Postal questionnaire
A set of questions used to obtain information related to the
study being conducted.
Questionnaires are posted to every respondent.
Advantages:
Wider respondent coverage.
Respondents have enough time to answer
questions.
Interviewer influence can be avoided.
Lower cost.
Disadvantages:
One-way interaction.
Low response rate.
QQS1013 Elementary Statistics
Any Idea?.......
Another technique to collect primary data is
observation. List the advantages and
disadvantages of this technique.
Levels of Measurement
EXERCISE 2
1) Classify each as nominal-level, ordinal-level, interval-level or ratio-level.
a. variables.
b. observations.
c. samples.
d. none of the above answers is correct.
4) The scale of measurement that is simply a label for the purpose of identifying
the attribute of an element is the
a. ratio scale.
b. nominal scale.
c. ordinal scale.
d. interval scale.
5) Some hotels ask their guests to rate the hotel’s services as excellent, very
good, good, and poor. This is an example of the
a. ordinal scale.
b. ratio scale.
c. nominal scale.
d. interval scale.
10) The summaries of data, which may be tabular, graphical, or numerical, are
referred to as
a. inferential statistics.
b. descriptive statistics.
c. statistical inference.
d. report generation.
EXERCISE 3
4. At Sintok Community College, 150 students are randomly selected and asked the
distance from their house to campus. From this group a mean of 5.2 km is
computed.
ANSWER EXERCISE 1
2) a. satisfaction
b. ordinal
4) a. quantitative
b. qualitative
c. quantitative
d. qualitative
e. quantitative
f. quantitative
ANSWER EXERCISE 2
ANSWER EXERCISE 3
a) a. Descriptive
   b. Inferential
   c. Descriptive
   d. Inferential
   e. Inferential
c) a. Nominal
   b. Ratio
   c. Ordinal
   d. Interval
   e. Ratio
TUTORIAL CHAPTER 1
1. You asked five of your classmates about their height. On the basis of this
information, you stated that the average height of all students in your
university or college is 65 inches. This is an example of:
a. descriptive statistics
b. statistical inference
c. parameter
d. population
2. A company has developed a new computer sound card, but the average
lifetime is unknown. In order to estimate this average, 200 sound cards are
randomly selected from a large production line and tested and the average
lifetime is found to be 5 years. The 200 sound cards represent the:
a. parameter
b. statistic
c. sample
d. population
5. When data are collected in a statistical study for only a portion or subset of
all elements of interest, we are using a:
a. sample
b. parameter
c. population
d. statistic
10. A company has developed a new battery, but the average lifetime is
unknown. In order to estimate this average, a sample of 500 batteries is
tested and the average lifetime of this sample is found to be 225 hours. The
225 hours is the value of a:
a. parameter
b. statistic
c. sample
d. population
11. The process of using sample statistics to draw conclusions about true
population parameters is called
a. inferential statistics
b. the scientific method
c. sampling method
d. descriptive statistics
14. The collection and summarization of the graduate degrees and research areas
of interest of the faculty of a particular academic institution, such as the
University of Michigan, is an example of
a. inferential statistics
b. descriptive statistics
c. a parameter
d. a statistic
17. A study is under way in a national forest to determine the adult height of pine
trees. Specifically, the study is attempting to determine what factors aid a
tree in reaching heights greater than 50 feet tall. It is estimated that the forest
contains 32,000 pine trees. The study involves collecting heights from 500
randomly selected adult pine trees and analyzing the results. The sample in
the study is
a. the 500 randomly selected adult pine trees
b. the 32,000 adult pine trees in the forest
c. all the adult pine trees taller than 50 feet
d. all pine trees, of any age in the forest
20. For each of the following examples, identify the data type as nominal,
ordinal, or interval.
a. The letter grades received by students in a computer science class
________________
b. The number of students in a statistics course
________________
c. The starting salaries of new Ph.D. graduates from a statistics program
________________
d. The size of fries (small, medium, large) ordered by a sample of Burger
King customers. _____________________
e. The college you are enrolled in (Arts and Science, Business, Education,
etc.)
_________________