Beruflich Dokumente
Kultur Dokumente
1-2
1-3
1-4
Descriptive
esc p ve Statistics
S s cs: Methods of organizing,
g
g,
summarizing, and presenting data in an informative way.
EXAMPLE : Election Result
Types of Statistics
1-5
Inferential Statistics:
Statistics: A decision, estimate,
prediction, or generalization about a population,
based on a sample.
A Population
i a Collection
is
of all possible
individuals,
objects, or
measurements of
i
interest
.
A Sample is a
portion,
i or part,
of the population
of interest
Types of Statistics
1-6
Sampling
Populasi
Sampel
Parameter
Statistics
Pengujian Hipotesa
Ho:; H1:
Statistik Inferensi
1-7
DATA
Qualitative or attribute
(type of car owned)
Quantitative or numerical
discrete
(number of children)
continuous
(time taken for an exam)
1-8
1-9
1-10 3- 10
1-11 3- 11
It is calculated by
summing
i the
th values
l
and dividing by the
number off values.
Average
Joe
It
1-12 3- 12
Th Median is
i the
th
The
midpoint of the values after
they have been ordered
from the smallest to the
largest.
g
The Median
1-13 3- 13
1-14 3- 14
Dispersion
p
refers to the
spread or
variability in
the data.
30
25
20
15
10
5
0
0
10
12
range
range,
mean deviation, variance, and standard
deviation.
M
Measures
off di
dispersion
i include
i l d the
th following:
f ll i
Measures of Dispersion
1-15 3- 15
2
s
(X n1
n-1
2
X)
s= s
Chapter Four
1-16
4-17
1-17
Dot Plot
Dot plots:
Report the details of each observation
Are useful for comparing two or more data sets
Dot Plot
4-18
1-18
Stem-and-leaff Displays
p y
Stem-and-leaff
display: A statistical
technique for
di l i a sett off
displaying
data. Each
numerical value is
divided into two
parts: the leading
digits become the
stem and the
trailing digits the
leaf.
Note: an advantage
g
of the stem-and-leaf
display over a
f
frequency
distribution is we do
not lose the identityy
of each observation.
Stem-and-leaf Displays
4-19
1-19
A box
b plot
l is a graphical
display, based on quartiles,
that helps to picture a set of
data.
Five pieces of data
are needed to
construct a box plot:
the Minimum Value,
the First Quartile,
the Median,
Median the
Third Quartile, and
the Maximum
V l
Value.
Box Plots
4-20
Min Q
1
12
14
16
Median
18
20
Max
Q3
22
1-20
24
26
28
30
32
4-21
1-21
Skewness is the
measurement of the
lack of symmetry of
the distribution.
The coefficient of
skewness
k
can range
from -3.00 up to 3.00
when using the following
formula:
3 X Median
sk =
s
)
Movie
4-22
Scatter
diagram: A
technique
q
used to show
the
relationship
between
variables
variables.
1-22
Example
The twelve days of stock prices and the overall market
index on each day are given as follows:
Scatter diagram
1-23
Estimation
i
i andd Confidence
C fid
Intervals
l
1-24
A point estimate is
a single value
(statistic) used to
e s t i m a t e a
population value
(parameter).
An Interval Estimate
states the range
within which a
population parameter
probably lies.
A confidence
interval
f
is a range of values
within which the
population parameter
is expected to occur.
The two confidence
i t
intervals
l that
th t are usedd
extensively are the
95% and the 99%.
99%
Point and Interval Estimates
1-25
If the population
standard deviation is
unknown, the
underlying population
is approximately
normal,, and the sample
p
size is less than 30 we
use the t distribution.
X t
s
n
1-26
s
n
95% CI for the population mean
X 1 . 96
s
n
X 2 .5 8
s
n
1-27
Ekonometrik
Statistika
Ekonomi
Matematika
1-28
Linear
i
Regression
i andd Correlation
C
l i
Draw a scatter diagram.
Understand and interpret the terms dependent variable and independent
variable.
Calculate and interpret the coefficient of correlation, the coefficient of
determination, and the standard error of estimate.
Conduct a test of hypothesis to determine if the population coefficient of
correlation is different from zero.
Calculate the least squares regression line and interpret the slope and intercept
values.
Construct and interpret a confidence interval and prediction interval for the
dependent variable.
Set up and interpret an ANOVA table.
1-29
Correlation Analysis
to
y is a ggroup
p off statistical techniques
q
measure the association between two variables.
Advertising Minutes and $ Sales
The Dependent
30
Sa
ales ($thousands)
A Scatter Diagram
is a chart that portrays
the relationship
between two variables.
25
20
15
10
5
0
70
90
110
130
150
170
190
Advertising Minutes
The Independent
1-30
The Coefficient
ff
off Correlation (r) is a measure off the
strength of the relationship between two variables.
Also called Pearsons r and
It requires
q
interval or
P
Pearsons
product
d
moment
ratio-scaled data.
correlation coefficient.
Pearson's r
It can range from
-1.00 to 1.00.
Values of -1.00
-1 00 or 1.00
1 00
indicate perfect and
-1
strong
g correlation.
0
1
Negative values indicate an
Values close to 0.0 indicate
inverse relationship and
weak correlation.
positive values indicate a
The Coefficient of Correlation, r
direct relationship.
1-31
10
9
8
7
6
5
4
3
2
1
0
10
1-32
10
9
8
7
6
5
4
3
2
1
0
10
1-33
10
9
8
7
6
5
4
3
2
1
0
10
Zero Correlation
1-34
10
9
8
7
6
5
4
3
2
1
0
10
1-35
Coefficient of Determination
1-36
Both variables
must be at least
interval scale.
1-37
1-38
Terimakasih
T i k ih