Beruflich Dokumente
Kultur Dokumente
4 major objectives
Finding linear composites of predictor variables that enable the analyst to separate the
groups by maximizing among group variances relative to within group variance
Establish procedures for assigning new individuals, whose profiles but not group identities are
known, to one of the identified groups
Testing whether significant differences exist between mean predictor variable profiles of the
groups
Determining which variable accounts for most intergroup differences in mean profiles
Types of Discriminant Analysis?
When the dependent variable has 2 groups
Two Group Discriminant Analysis
Would Purchase
We need to estimate D such that the ratio of between group sum of squares
to within group sum of squares for the discriminant function is maximum
Steps of Discriminant Analysis
Formulate the problem
Estimate the Discriminant Function Coefficients
Determine the significance of the Discriminating Function
Interpret the results
Assess Validity of the Discriminant Analysis
Terminology: Cutting Score
The critical value of D also known as optimal cutting score is used as a
benchmark for determining in which group an object is classified
For equal group sizes
+
=
2
For unequal group sizes
+
=
+
Find which one is lower Da or Db
Calculate Di.
If Di < Dc : classify into group having lower cutting score
If Di > Dc : classify into group having higher cutting score
Terminology: Hit Ratio and Chance Criterion
Hit Ratio :
Is the ratio between correctly classified cases by total number of cases
Chance Criterion
= 2 + (1 )2
Where p proportion of individual in group I and (1-p) is the proportion of individual in group II
= 1
SPSS is really
screwed up
when it
comes to
proper
analysis
Total correct
prediction is on the
main diagonal (22+22
= 44)
Percent Correctly
Predicted = 44/50 =
88%
How do I classify new cases?
Look at this table to get an idea of the discriminant function
D = 0.930 gpa 0.876 majorgpa 4.972 grearea + 0.432 grequant + 4.964 greverbal
Multivariate Normality
Discriminant analysis does not make the strong normality assumptions
Outliers:
Can cause severe problems that discriminant analysis will not overcome