Beruflich Dokumente
Kultur Dokumente
INTRODUCTION TO
STATISTICS
Scope of the Course
2
Population and Sample and Processes
Engineers and Scientists are constantly exposed to the
collection of facts, or data.
4
Population and Sample and Processes
(cont...)
Usually census is impractical and infeasible: Why?
Constraints on time, money and other scarce resources
Instead, a subset of population – a sample is selected in
some prescribed manner (e.g. a randomly selected 50
students out of 500 graduates)
In order to draw inferences/ conclusions about a
population, certain characteristics of the objects of
population are investigated: (e.g. age, gender, GPA – a
categorical or numerical variable)
Variable is any characteristic whose value may change
from one object to another or from one place to another.
Uni-variate , bi-variate and Multivariate data set
5
Univariate, Bivariate, and Multivariate
Data
Depending on how many variables we are
measuring on the individuals or objects in our
sample, we will have one of the three following
types of data sets
◦ Univariate: Measurements made on only one variable
per observation.
◦ Bivariate: Measurements made on two variables per
observation.
◦ Multivariate: Measurements made on more than two
variables per observation.
6
Population and Sample
Population: The entire collection of individuals or
measurement objects about which information is desired
e.g.Average height of 5-year old children in Pakistan
7
Population and Sample (Cont…)
8
Census and Inference
Census: Complete enumeration of population units.
9
Parameter and Statistic
Parameter: Any statistical characteristic of a
population. Population mean, population median,
population standard deviation are examples of
parameters.
10
Some Differences between Population and Sample
POPULATION SAMPLE
Size Large Small
Size Notation N n
Easy to collect data? No Yes
Term used to describe A “parameter” A “statistic”
its nature
e.g., μ, σ e.g., x, s
11
Some Differences between Population and Sample
(Cont’d)
POPULATION SAMPLE
Mean (notation) μ x
Std Deviation σ s
(notation)
Mean (formula)
µ= ∑ x
x= ∑ x
N n
Variance (formula)
∑ (x − µ ) 2
=
∑ (x − x) 2
σ 2
= s2
n −1
N
12
Statistics!
What is it? What does it involve?
The art or science of making confident conclusions about the
attributes of a system or collection of systems
Involves:
-taking a small sample from a larger set (Sampling)
-analyzing data from the small sample (Data analysis)
-testing the hypotheses to ascertain if true (Hypothesis
Testing)
-making conclusions about the larger set (Statistical
Inference)
-presenting your findings to an audience (Information
Delivery)
13
Prelude to Statistics
14
Some of such questions we may be required to answer as
civil engineer :
15
-What is the strength of concrete being used in
constructing a certain structure?
(Construction/Materials Engineering)
16
-
-How many of the steel I-sections provided by a certain
supplier have a lower-than-specified strength?
(Structural Engineering)
17
Therefore:
where
in order to
18
Because we draw the sample from the population, the
sample is called a subset of the population (Recall
Set Theory)
Sample
Population
19
Ideally, we seek a sample that is a miniature copy of
the population.
20
Important Questions …
21
Every engineer involved in statistic analysis of his/her system hopes
that:
his/her sample is a good representative of the population.
POPULATION SAMPLE
Parameters: μ, σ Statistics: x, s
22
Back to “Important Questions, #1”
24
Methods of Random Sampling
There are 4 major ways by which a sample can be
carried out to ensure that it is random and yet
represents a true miniature copy of the population:
26
Systematic Random Sampling
28
Stratified Random Sampling
MAIN POPULATION
SAMPLE SAMPLE
SAMPLE SAMPLE
Any Example?
31
In Summary ...
-We can afford to take only a small sample
from a large population of systems or system
components in order to investigate the
population.
- Our sample must as much as possible reflect
the population from which it is drawn.
Sample
Population
34
Introduction to Statistics
Descriptive Inferential
Graphical Non-graphical
Central Tendency Point Estimation
Dot Plots Dispersion/ Variance Hypothesis Testing
Scatter Plots Range Confidence Interval
Box Plots Shape Statistical Regression
Stem-and-leaf Plots
Bar Charts/Histograms
35