Beruflich Dokumente
Kultur Dokumente
Coombs
Objects
Unfolding
Scales
Dispersion
References
March, 2016
1 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Outline
Science as Model fitting
Data and scaling
Assigning Numbers to Observations
Coombs Theory of Data
Ordering people,
Proximity rather than order
Ordering objects
Thurstonian scaling
MDS
Unfolding
Types of scales and how to describe data
Describing data graphically
Central Tendency
Shape
Measures of dispersion
What is the fundamental scale?
2 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Fit = f ( (DataModel)
)
Data2
data. The residual left over after we remove the mean is the
variance.
3 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Rxy = xy
x
4. Reliability xx =
2
X2
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
X1
X2
Latent X
Latent Y
Error
Y1
1
Y2
2
X3
Y3
3
X4
Y4
4
X5
Y5
5
X6
Y6
6
X7
Y7
7
X8
Y8
8
X9
1
1
2
2
3
5 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
2.7182818284590450908
24
37
365.25
31,557,600
3,412.1416
299,792,458
42
3.141592653589793116
86,400
98.7
365.25636305
31,558,150
.4046856422
6.022141 * 1023
X
6 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
7 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
0.8
0.6
0.2
0.4
0.6
0.4
0.2
0.0
0.0
probability A > B
0.8
1.0
Proximity
1.0
Order
8 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Number
of Dyads
1
1
1
1
1
1
2
2
2
2
2
2
2
Comparison
Name
Order
Proximity
Order
Proximity
Order
Proximity
Order
Proximity
Order
Proximity
Order
Proximity
Proximity
Tournament rankings
Social Networks
Scaling
Similarities
Ability Measurement
Attitude Measurement
Tournament rankings
Social Networks
Scaling
Multidimensional scaling
Ability Comparisons
Preferential Choice
Individual Differences in
Multidimensional Scaling
9 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
World Cup
NCAA basketball
Is the winner really the best?
Can you predict other matches
10 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
-p %+% t( p )
4. Convert to
probabilities (using a
logit model)
> colnames (tournament) <- rownames(match) <- paste("P",1:n,sep="")
> tournament <- t(upper.tri(match) * (1-match))
+ upper.tri(match)*match
5. Convert probabilities
to outcomes
6. Show the results
11 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
> chess.df
P1
P2
P3
P4
P5
P6
P7
P8
P9
P10
P11
P12
P13
P14
P15
P16
latent observed
-1.5
0.2500
-1.3
0.1250
-1.1
0.1875
-0.9
0.2500
-0.7
0.3125
-0.5
0.3125
-0.3
0.4375
-0.1
0.5625
0.1
0.4375
0.3
0.5000
0.5
0.7500
0.7
0.5000
0.9
0.5625
1.1
0.7500
1.3
0.6875
1.5
0.8750
normed
-0.6744898
-1.1503494
-0.8871466
-0.6744898
-0.4887764
-0.4887764
-0.1573107
0.1573107
-0.1573107
0.0000000
0.6744898
0.0000000
0.1573107
0.6744898
0.4887764
1.1503494
logi
-1.0986123
-1.9459101
-1.4663371
-1.0986123
-0.7884574
-0.7884574
-0.2513144
0.2513144
-0.2513144
0.0000000
1.0986123
0.0000000
0.2513144
1.0986123
0.7884574
1.9459101
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
0.6
0.8
-2
-1
-0.5
-1.5
0.5
1.5
0.2
latent
normed
1.00
0.0
1.0
1.00 1.00
-1.0
0.2
0.6
observed
-2
-1
logi
-1.5
-0.5
0.5
1.5
-1.0
0.0
0.5
1.0
14 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
So why bother with normal or logitistic modeling, why not just use
total score?
1. How to predict wins and losses from the prior scores
What is the likelihood that player P16 will beat player 1 if they
p(A)
.
p(A) + p(B)
(1)
(2)
1
1 + e (logitB logitA )
(3)
15 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
2. Can do it by hand
Take the scale model, model the data
Find residuals
Find a goodness of fit
data)
16 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
> score
P1
P2
P3
P4
P5
P6
P7
P8
P9
P10
P11
P12
P13
P14
P15
0.2500 0.1250 0.1875 0.2500 0.3125 0.3125 0.4375 0.5625 0.4375 0.5000 0.7500 0.5000 0.5625 0.7500 0.6875 0.
> btl <- score/(score %+% t(score))
> round(btl,2)
[,1] [,2]
[1,] 0.50 0.67
[2,] 0.33 0.50
[3,] 0.43 0.60
[4,] 0.50 0.67
[5,] 0.56 0.71
[6,] 0.56 0.71
[7,] 0.64 0.78
[8,] 0.69 0.82
[9,] 0.64 0.78
[10,] 0.67 0.80
[11,] 0.75 0.86
[12,] 0.67 0.80
[13,] 0.69 0.82
[14,] 0.75 0.86
[15,] 0.73 0.85
[16,] 0.78 0.88
[,3]
0.57
0.40
0.50
0.57
0.62
0.62
0.70
0.75
0.70
0.73
0.80
0.73
0.75
0.80
0.79
0.82
[,4]
0.50
0.33
0.43
0.50
0.56
0.56
0.64
0.69
0.64
0.67
0.75
0.67
0.69
0.75
0.73
0.78
[,5]
0.44
0.29
0.38
0.44
0.50
0.50
0.58
0.64
0.58
0.62
0.71
0.62
0.64
0.71
0.69
0.74
[,6]
0.44
0.29
0.38
0.44
0.50
0.50
0.58
0.64
0.58
0.62
0.71
0.62
0.64
0.71
0.69
0.74
[,7]
0.36
0.22
0.30
0.36
0.42
0.42
0.50
0.56
0.50
0.53
0.63
0.53
0.56
0.63
0.61
0.67
[,8]
0.31
0.18
0.25
0.31
0.36
0.36
0.44
0.50
0.44
0.47
0.57
0.47
0.50
0.57
0.55
0.61
17 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
> round(resid,2)
P1
P2
P3
P1
NA 0.33 -0.57
P2 -0.33
NA 0.60
P3
0.57 -0.60
NA
P4 -0.50 -0.67 0.43
P5 -0.56 0.29 0.38
P6
0.44 0.29 0.38
P7
0.36 0.22 -0.70
P8
0.31 0.18 0.25
P9
0.36 0.22 -0.70
P10 0.33 0.20 0.27
P11 -0.75 0.14 0.20
P12 0.33 0.20 0.27
P13 0.31 0.18 0.25
P14 0.25 0.14 0.20
P15 0.27 0.15 0.21
P16 0.22 0.12 0.18
P4
0.50
0.67
-0.43
NA
-0.56
-0.56
0.36
0.31
0.36
-0.67
0.25
0.33
0.31
0.25
0.27
0.22
P5
0.56
-0.29
-0.38
0.56
NA
-0.50
-0.58
0.36
0.42
0.38
0.29
-0.62
0.36
0.29
0.31
0.26
P6
-0.44
-0.29
-0.38
0.56
0.50
NA
0.42
0.36
-0.58
0.38
0.29
-0.62
0.36
0.29
0.31
0.26
P7
-0.36
-0.22
0.70
-0.36
0.58
-0.42
NA
-0.56
-0.50
-0.53
0.37
0.47
0.44
0.37
0.39
0.33
P8
-0.31
-0.18
-0.25
-0.31
-0.36
-0.36
0.56
NA
-0.44
0.53
0.43
0.53
-0.50
-0.57
0.45
0.39
P9
-0.36
-0.22
0.70
-0.36
-0.42
0.58
0.50
0.44
NA
-0.53
0.37
-0.53
-0.56
0.37
0.39
0.33
P10
-0.33
-0.20
-0.27
0.67
-0.38
-0.38
0.53
-0.53
0.53
NA
0.40
-0.50
0.47
0.40
0.42
-0.64
P11
0.75
-0.14
-0.20
-0.25
-0.29
-0.29
-0.37
-0.43
-0.37
-0.40
NA
-0.40
-0.43
0.50
-0.48
0.46
P12
-0.33
-0.20
-0.27
-0.33
0.62
0.62
-0.47
-0.53
0.53
0.50
0.40
NA
0.47
-0.60
-0.58
0.36
P13
-0.31
-0.18
-0.25
-0.31
-0.36
-0.36
-0.44
0.50
0.56
-0.47
0.43
-0.47
NA
0.43
0.45
0.39
P14
-0.25
-0.14
-0.20
-0.25
-0.29
-0.29
-0.37
0.57
-0.37
-0.40
-0.50
0.60
-0.43
NA
-0.48
0.46
P15
-0.27
-0.15
-0.21
-0.27
-0.31
-0.31
-0.39
-0.45
-0.39
-0.42
0.48
0.58
-0.45
0.48
NA
0.44
P16
-0.22
-0.12
-0.18
-0.22
-0.26
-0.26
-0.33
-0.39
-0.33
0.64
-0.46
-0.36
-0.39
-0.46
-0.44
NA
18 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
1. Find model
2. Find Residual =
Model - Data
3. Goodness of Fit is
1 Residual 2 /Data2
19 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
20 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
21 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Friendship as proximity
1. Chess or football provides a ranking based upon an ordering
relationship (pi > pj ).
2. Alternatively, friendship groups are based upon closeness
(|pi pj | < )
2.1 Do you know person j?
2.2 Do you like person j? or as an alternative:
2.3 Please list all your friends in this class (and is j included on the
list)
2.4 Would you be interested in having a date with person j?
2.5 Would you like to have sex with person j?
2.6 Would you marry person j?
22 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Mineral
Talc
Gypsum
Calcite
Fluorite
Apaptite
Orthoclase Feldspar
Quartz
Topaz
Corundum
Diamond
Scratch hardness
.59
.61
3.44
3.05
5.2
37.2
100
121
949
85,300
23 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Wind (Knots)
Less than 1
1-3
4-6
7-10
11-16
17-21
22-27
28-33
34-40
WMO Classification
Calm
Light Air
Light Breeze
Gentle Breeze
Moderate Breeze
Fresh Breeze
Strong Breeze
Near Gale
Gale Moderately
41-47
Strong Gale
10
48-55
Storm
11
56-63
Violent Storm
12
64+
Hurricane
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
1. Assume each object (a, b, ...z) has a scale value (A, B, ...Z )
with some noise for each measurement.
2. Probability of A > B increases with difference between a and b
3. P(A > B) = f (a b)
4. Can we find a function, f, such that equal differences in the
latent variable (a, b, c) lead to equal differences in the
observed variable?
5. Several alternatives
Direct scaling on some attribute dimension (simple but flawed)
Indirect scaling by paired comparisons (more complicated but
probably better)
25 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Grant Proposals: 1 to 5
On a scale from 1 to 10 this [object] is a X?
If A is 1 and B is 10, then what is C?
College rankings based upon selectivity
College rankings based upon yield
Zagat ratings of restaurants
A - F grading of papers
26 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
27 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
28 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
29 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
30 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Intensity
= constant.
Intensity
(4)
31 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Po
Pref
(5)
32 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
33 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
34 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
35 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
0.3
0.1
0.2
probability of choice
0.3
0.2
0.0
0.0
0.1
response strength
0.4
0.5
0.4
Figure: Thurstones model of paired discrimination. Left panel: three items differ in
their mean level as well as their variance. Right panel: choice between two items with
equal variance reflects the relative strength of the two items. The shaded section
represents choosing item 2 over item 1.
36 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
package
Asp
0.81
0.72
0.56
0.50
0.44
0.41
0.32
0.40
0.27
in R
Car
0.88
0.74
0.74
0.56
0.50
0.51
0.43
0.29
0.24
Spin
0.89
0.74
0.68
0.59
0.49
0.50
0.37
0.32
0.37
S.Beans
0.90
0.81
0.84
0.68
0.57
0.63
0.50
0.47
0.36
Peas
0.89
0.84
0.80
0.60
0.71
0.68
0.53
0.50
0.37
Corn
0.93
0.86
0.82
0.73
0.76
0.63
0.64
0.63
0.50
#show the data from the veg data set from the psych package
veg
37 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Some simple R
> veg
Turn
Cab Beet
Asp
Car Spin S.Beans Peas Corn
Turn
0.500 0.818 0.770 0.811 0.878 0.892
0.899 0.892 0.926
Cab
0.182 0.500 0.601 0.723 0.743 0.736
0.811 0.845 0.858
Beet
0.230 0.399 0.500 0.561 0.736 0.676
0.845 0.797 0.818
Asp
0.189 0.277 0.439 0.500 0.561 0.588
0.676 0.601 0.730
Car
0.122 0.257 0.264 0.439 0.500 0.493
0.574 0.709 0.764
Spin
0.108 0.264 0.324 0.412 0.507 0.500
0.628 0.682 0.628
S.Beans 0.101 0.189 0.155 0.324 0.426 0.372
0.500 0.527 0.642
Peas
0.108 0.155 0.203 0.399 0.291 0.318
0.473 0.500 0.628
Corn
0.074 0.142 0.182 0.270 0.236 0.372
0.358 0.372 0.500
> colMeans(veg) #show the means (but too many decimals)
Turn
Cab
Beet
Asp
Car
Spin
S.Beans
Peas
0.1793333 0.3334444 0.3820000 0.4932222 0.5420000 0.5496667 0.6404444 0.6583333
> round(colMeans(veg))
#round off, but not enough decimals
Turn
Cab
Beet
Asp
Car
Spin S.Beans
Peas
Corn
0
0
0
0
1
1
1
1
1
> round(colMeans(veg),2) #this looks pretty good
Turn
Cab
Beet
Asp
Car
Spin S.Beans
Peas
Corn
0.18
0.33
0.38
0.49
0.54
0.55
0.64
0.66
0.72
38 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
0.7
Corn
0.6
Peas
0.5
Spin
Car
0.4
colMeans(veg)
S.Beans
Asp
0.3
Beet
Cab
0.2
Turn
0.2
0.3
0.4
0.5
0.6
0.7
Index
>
>
>
>
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
0.4
0.0
0.1
0.2
0.3
Preference
0.5
0.6
0.7
Turn
Cab
Beet
Asp
Car
Spin
S.Beans
Peas
Corn
Vegetables
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Naive scaling
1. Show the data
> round(veg,2)
Turn
Cab
Beet
Asp
Car
Spin
S.Beans
Peas
Corn
Turn
0.50
0.18
0.23
0.19
0.12
0.11
0.10
0.11
0.07
Cab
0.82
0.50
0.40
0.28
0.26
0.26
0.19
0.16
0.14
Beet
0.77
0.60
0.50
0.44
0.26
0.32
0.16
0.20
0.18
Asp
0.81
0.72
0.56
0.50
0.44
0.41
0.32
0.40
0.27
> round(colMeans(veg),2)
Turn
Cab
Beet
0.18
0.33
0.38
Car
0.88
0.74
0.74
0.56
0.50
0.51
0.43
0.29
0.24
Asp
0.49
Car
0.54
Spin S.Beans
0.55
0.64
Peas
0.66
Corn
0.72
Cab
0.15
Beet
0.20
Asp
0.31
Car
0.36
Spin S.Beans
0.37
0.46
Peas
0.48
Corn
0.54
3. Subtract the
mean for the
first column
from the
means
4. But these are
not really
useful scale
values.
41 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Turn
Cab
Beet
Asp
Car
Spin
S.Beans
Peas
Corn
Turn
0.00
-0.91
-0.74
-0.88
-1.17
-1.24
-1.28
-1.24
-1.45
Cab
0.91
0.00
-0.26
-0.59
-0.65
-0.63
-0.88
-1.02
-1.07
Beet
0.74
0.26
0.00
-0.15
-0.63
-0.46
-1.02
-0.83
-0.91
Asp
Car Spin S.Beans Peas
0.88 1.17 1.24
1.28 1.24
0.59 0.65 0.63
0.88 1.02
0.15 0.63 0.46
1.02 0.83
0.00 0.15 0.22
0.46 0.26
-0.15 0.00 -0.02
0.19 0.55
-0.22 0.02 0.00
0.33 0.47
-0.46 -0.19 -0.33
0.00 0.07
-0.26 -0.55 -0.47
-0.07 0.00
-0.61 -0.72 -0.33
-0.36 -0.33
Corn
1.45
1.07
0.91
0.61
0.72
0.33
0.36
0.33
0.00
Peas
0.46
Corn
0.64
Spin S.Beans
1.14
1.40
Peas
1.44
Corn
1.63
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Turn
Cab
Beet
Asp
Car
Spin
S.Beans
Peas
Corn
Turn
0.00
-0.52
-0.65
-0.98
-1.12
-1.14
-1.40
-1.44
-1.63
Cab
0.52
0.00
-0.13
-0.46
-0.60
-0.62
-0.88
-0.92
-1.11
Beet
0.65
0.13
0.00
-0.33
-0.46
-0.49
-0.75
-0.79
-0.98
Asp
0.98
0.46
0.33
0.00
-0.14
-0.16
-0.42
-0.46
-0.65
Car
1.12
0.60
0.46
0.14
0.00
-0.03
-0.28
-0.33
-0.51
Turn
Cab
Beet
Asp
Car
Spin
S.Beans
Peas
Corn
Turn
0.50
0.30
0.26
0.16
0.13
0.13
0.08
0.07
0.05
Cab
0.70
0.50
0.45
0.32
0.28
0.27
0.19
0.18
0.13
Beet
0.74
0.55
0.50
0.37
0.32
0.31
0.23
0.21
0.16
Asp
0.84
0.68
0.63
0.50
0.45
0.43
0.34
0.32
0.26
Car
0.87
0.72
0.68
0.55
0.50
0.49
0.39
0.37
0.30
43 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
2. Possible functions
f = A - B (simple difference)
A
A+B
Luce choice rule
Thurstonian scaling
logistic scaling
44 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Turn
Cab
Beet
Asp
Car
Spin
S.Beans
Peas
Corn
Turn
0.00
-0.12
-0.03
0.03
-0.01
-0.02
0.02
0.03
0.02
Cab Beet
Asp
Car
0.12 0.03 -0.03 0.01
0.00 0.05 0.05 0.02
-0.05 0.00 -0.07 0.06
-0.05 0.07 0.00 0.01
-0.02 -0.06 -0.01 0.00
0.00 0.01 -0.02 0.02
0.00 -0.07 -0.01 0.04
-0.02 -0.01 0.08 -0.08
0.01 0.02 0.01 -0.07
> sum(resid)
[1] 3.816392e-16
> sum(resid^2)
[1] 0.1416574
> sum(resid^2)/sum(veg^2)
[1] 0.005697482
1-sum(resid^2)/sum(veg^2)
[1] 0.9943025
45 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Constant
Equal
Squared
Reversed
Raw
Thurstone
raw thurstone
0.00
0.00
0.15
0.52
0.20
0.65
0.31
0.98
0.36
1.12
0.37
1.14
0.46
1.40
0.48
1.44
0.54
1.63
46 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
47 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Distancexy
v
u n
uX
=t
(xi yi )2 .
(6)
i=1
ATL
BOS
ORD
DCA
DEN
LAX
MIA
JFK
SEA
SFO
MSY
ATL
0
934
585
542
1209
1942
605
751
2181
2139
424
BOS
934
0
853
392
1769
2601
1252
183
2492
2700
1356
ORD
585
853
0
598
918
1748
1187
720
1736
1857
830
DCA
542
392
598
0
1493
2305
922
209
2328
2442
964
DEN
1209
1769
918
1493
0
836
1723
1636
1023
951
1079
LAX
1942
2601
1748
2305
836
0
2345
2461
957
341
1679
MIA
605
1252
1187
922
1723
2345
0
1092
2733
2594
669
JFK
751
183
720
209
1636
2461
1092
0
2412
2577
1173
SEA
2181
2492
1736
2328
1023
957
2733
2412
0
681
2101
SFO
2139
2700
1857
2442
951
341
2594
2577
681
0
1925
MSY
424
1356
830
964
1079
1679
669
1173
2101
1925
0
48 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
MSY
LAX
ATL
SFO
DEN
-200
MIA
DCA
ORD
JFK
-600
Dimension 2
BOS
-1000
SEA
-500
500
Dimension 1
1000
1500
50 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
SEA
BOS
JFK
ORD
DCA
DEN
SFO
ATL
LAX
MSY
MIA
51 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
prob(correct|, ) = f ( )
53 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
>
>
>
>
>
1. Create a matrix of 0s
2. Add 1s below the
diagonal
3. Give the rows and
columns names
4. Show it
5. score it
55 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
0.8
0.6
0.4
0.2
0.0
Probability of correct/endoresed
1.0
-3
-2
-1
latent attribute
56 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
0.3
0.2
0.1
0.0
Probability of endorsement
0.4
-3
-2
-1
latent attribute
57 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Basic operations
equality
xi = xj
Transformations
Permutations
Invariant statistic
Counts
Mode
2 and () correlation
Examples
Detection
Species classification
Taxons
Ordinal
order
xi > xj
Monotonic
(homeomorphic)
x =f(x)
f is monotonic
Median
Percentiles
Spearman correlations*
differences
Linear
(Affine)
x = a + bx
Mean ()
Standard Deviation ()
Pearson correlation (r)
Regression ()
Temperature (F, C)
Beaufort Wind (velocity)
Multiplication
(Similiarity)
x = bx
Coefficient of variation (
)
Interval
ratios
xi
xj
>
xk
xl
The Beaufort wind speed scale is interval with respect to the velocity of the wind, but only ordinal with respect to
58 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Maximum
75th percentile
Median (50th percentile)
25th percentile
Minimum
median
59 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
>
summary(sat.act)
gender
Min.
:1.000
1st Qu.:1.000
Median :2.000
Mean
:1.647
3rd Qu.:2.000
Max.
:2.000
education
Min.
:0.000
1st Qu.:3.000
Median :3.000
Mean
:3.164
3rd Qu.:4.000
Max.
:5.000
age
Min.
:13.00
1st Qu.:19.00
Median :22.00
Mean
:25.59
3rd Qu.:29.00
Max.
:65.00
ACT
Min.
: 3.00
1st Qu.:25.00
Median :29.00
Mean
:28.55
3rd Qu.:32.00
Max.
:36.00
SATV
Min.
:200.0
1st Qu.:550.0
Median :620.0
Mean
:612.2
3rd Qu.:700.0
Max.
:800.0
60 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
10
20
30
40
50
60
A Tukey Boxplot
gender
education
age
ACT
61 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
10
5
0
Observed
15
20
Density plot
epiE
epiS
epiImp
epilie
epiNeur
62 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
> describe(sat.act)
gender
education
age
ACT
SATV
SATQ
vars
1
2
3
4
5
6
n
mean
sd median trimmed
mad min max range skew kurtosis
se
700
1.65
0.48
2
1.68
0.00
1
2
1 -0.61
-1.62 0.02
700
3.16
1.43
3
3.31
1.48
0
5
5 -0.68
-0.07 0.05
700 25.59
9.50
22
23.86
5.93 13 65
52 1.64
2.42 0.36
700 28.55
4.82
29
28.84
4.45
3 36
33 -0.66
0.53 0.18
700 612.23 112.90
620 619.45 118.61 200 800
600 -0.64
0.33 4.27
687 610.22 115.64
620 617.25 118.61 200 800
600 -0.59
-0.02 4.41
63 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
64 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
1/Xi
reciprocal). harmonic.mean(x)
P
Circular Mean xcircular = tan1 P cos(x)
circular.mean(x)
sin(x)
(where x is in radians)
circadian.mean circular.mean(x) (where x is in hours)
65 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Circular statistics
Table: Hypothetical mood data from six subjects for four mood variables.
The values reflect the time of day that each scale achieves its maximum
value for each subject. Each mood variable is just the previous one
shifted by 5 hours. Note how this structure is preserved for the circular
mean but not for the arithmetic mean.
Subject
1
2
3
4
5
6
Arithmetic Mean
Circular Mean
Energetic Arousal
9
11
13
15
17
19
14
14
Positive Affect
14
16
18
20
22
24
19
19
Tense Arousal
19
21
23
1
3
5
12
24
Negative Affect
24
2
4
6
8
10
9
5
66 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Gender
Male
Female
Female
Female
Male
Male
Male
1
3
7
6
4
5
2
X
12
14
18
17
15
16
13
Y
2
6
14
12
8
10
4
Z
1
4
64
32
8
16
2
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
68 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
69 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
8.00000 18.14286
> apply(my.data[4:7],2,mean,trim=.2)
theta
4.0
X
15.0
Y
8.0
Z
12.4
> apply(my.data[4:7],2,winsor.mean,trim=.2)
theta
X
4.00000 15.00000
Y
Z
8.00000 12.91429
> apply(my.data[4:7],2,harmonic.mean)
theta
X
2.699725 14.729687
Y
5.399449
Z
3.527559
> apply(my.data[4:7],2,geometric.mean)
theta
X
3.380015 14.865151
Y
6.760030
Z
8.000000
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Condition
Pretest (Control)
Average
Posttest (Movie)
Average
Subject
1
2
1
2
Skin Conductance
2
2
2
1
4
2.5
Skin Resistance
.50
.50
.50
1.00
.25
.61
71 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
1.0
Movie S1
Mean Movie
0.5
Resistance
1.5
2.0
Mean Control
Movie S2
1
72 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Faculty
Member
A
B
C
D
E
Total
Mean
Median
Freshman/
Sophmore
20
20
20
20
200
Junior
Senior
Graduate
Mean
Median
10
10
10
100
100
10
10
10
10
400
10
10
10
10
10
12.5
12.5
12.5
35.0
177.5
10
10
10
15
150
56
20
46
10
110
10
10
10
50.0
12.5
39
10
73 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Table: Class size from the students point of view. Most students are in
large classes; the median class size is 200 with a mean of 223.
Class size
10
20
100
200
400
Number of classes
12
4
2
1
1
number of students
120
80
200
200
400
74 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Time in therapy
A psychotherapist is asked what is the average length of time that
a patient is in therapy. This seems to be an easy question, for of
the 20 patients, 19 have been in therapy for between 6 and 18
months (with a median of 12) and one has just started. Thus, the
median client is in therapy for 52 weeks with an average (in weeks)
(1 * 1 + 19 * 52)/20 or 49.4.
However, a more careful analysis examines the case load over a
year and discovers that indeed, 19 patients have a median time in
treatment of 52 weeks, but that each week the therapist is also
seeing a new client for just one session. That is, over the year, the
therapist sees 52 patients for 1 week and 19 for a median of 52
weeks. Thus, the median client is in therapy for 1 week and the
average client is in therapy of ( 52 * 1 + 19 * 52 )/(52+19) =
14.6 weeks.
75 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
76 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
School
Junior College
Non-selective university
Selective university
Pretest
1
5
27
Posttest
5
27
73
Change
4
22
45
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
School
Junior College
Non-selective university
Selective university
Pretest
27
73
95
Posttest
73
95
99
Change
45
22
4
They concluded that the teaching at the junior college was far
superior to that of the select university. What is wrong with this
conclusion?
78 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
1.0
Math
1.0
Writing
0.8
0.6
Performance
JC
0.2
0.2
TC
TC
0.4
0.6
0.4
Ivy
0.0
JC
0.0
Performance
0.8
Ivy
4
79 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Average difference x 2
80 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
0.4
0.2
0.1
0.0
probability
0.3
N(0,1)
Cauchy
logistic
N(0,2)
-4
-2
0
x
4
81 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
0.2
0.4
0.6
N(0,1)
N(1,.5)
N(0,2)
0.0
Probability density
0.8
-4
-2
0
x
4
82 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
0.2
0.1
0.0
Probability of X
0.3
Normal(0,1)
Contaminated
Normal(.3,3.3)
-10
-5
10
83 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
0.3
0.2
0.1
0.0
Probability of z
0.4
z
-3
percentile
-2
-1
0.1
IQ
16
50
84
98
99.9
55
SAT
70
85
100
115
130
145
200
stanine
300
400
500
600
700
800
Galton/Tukey
lower fence
Q1
Median
Q3
Upper Fence
84 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
85 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
-1
-3
-2
-1
difference = .5
difference=.5
3
2
Probability x1 > x
0.2
0.1
0.0
0.3
0.4
-3
probability of x
1.0
Probability x1 > x
0.3
0.2
0.1
0.0
probability of x
0.4
difference = .25
-3
-2
-1
-3
-2
-1
86 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
1.8
1.6
1.4
1.0
1.2
Probability x1 > x
0.3
0.2
0.1
0.0
probability of x
0.4
2.0
sigma = 1.1
-2
-1
-3
-2
-1
sigma = 1.2
sigma = 1.2
2.5
2.0
1.0
1.5
Probability x1 > x
0.3
0.2
0.1
0.0
probability of x
3.0
0.4
-3
-3
-2
-1
-3
-2
-1
87 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Transformation
x3
x2
x
x
-1/x
log(x)
1/x 2
1/x 3
effect
emphasize large numbers
emphasize large numbers
the basic data
emphasize smaller numbers
emphasize smaller numbers
emphasize smaller numbers
emphasize smaller numbers
emphasize smaller numbers
positive
positive
positive
positive
positive
skew
skew
skew
skew
skew
88 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
0
1
x^3
x^2
x
sqrt(x)
1/x
log(x)
1/x^2
1/x^3
transformed
0.0
0.5
1.0
original
1.5
2.0
89 / 90
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
Model Fitting
Coombs
Objects
Unfolding
Scales
Dispersion
References
90 / 90