Beruflich Dokumente
Kultur Dokumente
Stats Overview
2. Inferential Statistics
• hypothesis testing and errors
3. Humanistic
- Health related quality of life (HRQOL)
- Patient satisfaction
- (Valuing these are difficult & contentious)
• more on this later…
Scales of Measurement
• Four levels of measurement
- Nominal variable→ no implied rank or order.
• Example: presence or absence of a disease
• Example: temperature
www.snr.missouri.edu/natr211/examples/sample1.png
Measure of central tendency
90%
66.5%
80%
70% 54.1%
60%
Percent of US
50%
Households
40%
30%
20%
10%
0%
1
$100,000 + 20.1%
$75,000 - $99,999 13.4%
$50,000 - $74,999 20.6%
$25,000 - $49,999 25.7%
< $25,000 20.2%
Source: URL: http://pubdb3.census.gov/macro/032005/faminc/new07_000.htm <accessed 2006 Jan 18>
A “skewed” distribution
mode
median
mean
mode
median
mean
Measures of Variability
• Range
- Different between the highest data value and the lowest data
value
- Ordinal, interval and ratio data
• Inter-quartile range
- Data values within the 25th and 75th quartiles
• Directly related to the median
• Variation:
- ex: the standard deviation
sd = N ∑ x i − ( ∑ xi )
2 2
N ( N − 1)
"Reality"
There is NO There IS a
I Decide: Difference; Difference;
Ho is true Ho is false
There is NO
Correct
difference; Error!
Decision
Do not reject the Ho
Hypothesis Testing: Inferential Statistics
"Reality"
There is NO There IS a
I Decide: Difference; Difference;
Ho is true Ho is false
2
1 Z (α / 2 )
n=
4 d
Detecting effects of different sizes
Statistical Methods: Sample Sizes
2
• Ex: 1 Z (α / 2 )
n=
You want to be 95% sure that 4 d
2
n = 384.16 or 385 Pts. 1 1.96
n=
4 0.05
Inappropriate Sample Size Risks
• Design Phase
- Specification
- Matching
• Analysis Phase
- Stratification
- Multivariate adjustment
Statistical Methods: Linear Regression
Yi = Β0 + Β1 X 1 + Β2 X 2 + ... + Βn X n + e
25
Dependent Variable: "Y"
20
15
10
0
13
1
11
15
17
19
Independent Variable(s): "X"
Statistical Methods: Linear Regression
• Key assumptions:
• the “independent” variables really are!
• the relationship is linear; not curved
• the variables are normally distributed
• Key output:
• “Beta weights” or Parameter Estimates
• Confidence intervals
• R2 value
- % of variation explained (higher is better)
2
R = 0.13