Beruflich Dokumente
Kultur Dokumente
X
=
3.2 - 3.0
.4 / 25
=
.20
.08
= 2.5
=<> :nter Table C / - !ith -; degrees of freedom.
)e find a critical value =one tailed test> of 1.D11 for
alpha S .H<.
=B> 5ince the calculated value -.<H _ critical value
=1.D11> the null hypothesis is re,ected.
The question also as4s8 2.t !hat alpha level !ould
it be significant700 6hec4ing te(t :(hibit K/- again
!e find that the critical value of -.;G- is found for
an alpha of .H1.
E. =1> The null hypothesis !ould be that there is no
difference bet!een the degree of conservatism of
professors and students and that the differences
found are due to sampling variations only.
=-> This is a case of t!o independent samples
involving proportions. The appropriate test is
parametric. Qsing the t test.
=9> The desired level of significance is again set at a
S .H<
=;> 6alculated value8
t =
p
1
- p
2
p1 - p2
=
.5 - .3
.25
20
+
.21
20
=
.200
.152
= 1.32
<> 6ritical value from Table C/- !ith 9E d.f. is 1.BG
for one/tailed test and -.H9 for t!o/tailed test.
=B> 6alculated values of 1.9- ` either critical value
so null hypotheses is not re,ected. That is the
differences found bet!een professors and students
could be due to sampling errors only.
G. =1> +ull hypothesis. "
H
8 There is no difference
bet!een average annual starting salaries of the
graduates at the t!o universities. "
.
8 Kraduates of
:astern Qniversity received higher starting
salaries than graduates of )estern Qniversity.
=-> 5tatistical test. The t test is chosen because the
data are at least interval in form and the samples are
independent in a t!o/ test situation.
=9> 5ignificance level. alpha S .H< =one/tailed testJ
the alternate hypothesis states direction>
=;> 6alculated value8
t =
(18800 - 18000) - 0
36 (1000)
2
+ 40 (1000)
2
(36 + 40) - 2
(
1
36
+
1
40
)
=
800
232.8179
= 3.436
=<> 6ritical test value S 1.BB. d.f. S D; one/tailed
test alpha S .H< .
=B> &ecision. 5ince the calculated value is larger
than the critical value =9.;; _ 1.BB> re,ect the null
hypothesis and conclude that graduates of :astern
Qniversity secured higher average annual starting
salaries.
B. Random samples from normal populations !ith the
same variance.
1H.
+umber Crequencies8 F
i
=:
i
>
#ntervie!ed 1ercent
Cavorable +eutral
Qnfavorable
=-> ( -DH =->(--H
=->(91H
Creshmen --H .-D< 1HH=D;.-<>
<H=BH.<> DH=E<.-<>
5ophomores -1H .-B-< EH=DH.EE>
BH=<D.D<>DH=E1.9E>
Iuniors 1EH .--< <H=BH.D<>
<H=;G.<> EH=BG.D<>
5eniors 1GH .-9D< ;H=B;.1->
BH=<-.-<>GH=D9.B->
EHH 1.HHH -DH --H
91H
H
i
S observed :
i
S e(pected
=1> +ull hypothesis. "
H
8 F
i
S :
i
. That is the
attitudes to!ard corporations in the population are
independent of college class. "
.
8 F
i
a :
i
. That is
the attitudes to!ard corporations in the population
do vary by class.
=-> 5tatistical test. 6hoose the k sample chi/square
test to compare the observed distribution to a
hypothesi*ed distribution. The chi/square test is
used because the responses are classified into
nominal categories and there are sufficient
observations in each cell.
=9> 5ignificance level. ?et alpha S .H<.
=;> 6alculated value8
2
=
O
i
-E
i
2
E
i
i=1
k
2
=
25.75
2
74.25
+
9.12
2
70.88
+
-10.75
2
60.75
+
-24.12
2
64.12
+ . . . +
16.38
2
73.62
2
= 33.6
=<> 6ritical test value S 1-.<G. alpha S .H< d.f. S B
=B> &ecision. The calculated value is greater than
the critical value so re,ect the null hypothesis and
conclude that there is a significant difference among
the classes as to attitude to!ard corporations.
11. =1> +ull hypothesis. "
H
8 There is no difference in
readership rates bet!een business school and liberal
arts students. "
.
8 Readership rates differ bet!een
business and liberal arts students.
=-> 5tatistical test. Qse the t test of t!o independent
samples.
=9> 5ignificance level. alpha S .H< t!o/ tailed test
=alternative hypothesis is not directional>.
=;> 6alculated value8
t =
(5.6 - 4.5) - 0
100 (2.0)
2
+ 100 (1.5)
2
(100 +100) - 2
(
1
100
+
1
100
)
=
1.1
.251
= 4.378
=<> 6ritical value S 1.GB. alpha S .H< t!o/ tailed
test d.f. S 1GE. 6onsult Table K/ -.
=B> 6alculated value of ;.; _ critical value =1.GB>.
Re,ect the null hypothesis.
1-. =1> +ull hypothesis. "
H
8 There are no durability
differences among the three products. "
.
8 There
are durability differences among the three products.
=-> 5tatistical test. )e use the ! test !ith one/!ay
analysis of variance because the data are ratio
scales. There are k independent samples and !e
accept the assumptions underlying this parameter
test.
=9> 5ignificance level. alpha S .H<.
=;> 6alculated value8 5ee :(hibit 1D/1.
=<> 6ritical test value for ! S 9.EG alpha S .H<
d.f. S =-1->
=B> &ecision. 5ince the calculated value is greater
than the critical value =<.;G _ 9.EG> !e re,ect the
null hypothesis.
#n the continuation of this e(hibit a post hoc
comparison =multiple comparison test> of the three
paints using the 5cheffe procedure sho!s that the
significant differences are bet!een Fne/Ooat and
6ompetitor B =groups 1 and 9>.
19. =1> +ull hypothesis. "
H
8 There are no
differences among the three sales strategies =as
represented by store types>.
=-> 5tatistical test. )e use the ! test !ith one/!ay
analysis of variance because the data are ratio
scales. There are k independent samples and !e
accept the assumptions underlying this parameter
test.
=9> 5ignificance level. alpha S .H<.
=;> 6alculated value8 5ee :(hibit 1D/-.
=<> 6ritical test value for ! S 9.EG alphaS .H<
d.f. S = - 1- >
=B> &ecision. 5ince the calculated value is greater
than the critical value =<.;9 _ 9.EG> !e re,ect the
null hypothesis. . post hoc comparison of the three
store types using the 5cheffe procedure sho!s that
the significant differences are bet!een the
electronics store and the department store =groups 1
and ->.
1;. . test for related samples should be used since the
data are collected on the same firms and are paired
for the one/year interval.
=1> +ull hypothesis. "
H
8 There is no difference
bet!een the profits in of the utility companies. "
.
8
The profits are higher in the second year.
=-> 5tatistical test. The related samples t test is
chosen because the data are ratio and the samples
are dependent.
=9> 5ignificance level. alpha S .H<
=;> 6alculated value8 5ee :(hibit 1D/9.
=<> 6ritical test value S 1.E1. d.f. S1H one/tailed
test. alpha S .H< .
=B> &ecision. 5ince the calculated value is smaller
than the critical value
=1.<9 ` 1.E1 !e fail to re,ect the null hypothesis and
conclude that there are no differences bet!een
profits of the t!o years.
1<. . 5tudents might use the Cortune categories
from the article8 1 S soft!areAservices - S
entertainmentAsports 9 S
hard!areAsemiconductors ; S
communications equipment < S
biotechnology B S food D S venture capital.
Ra
nk
+ame
Compan+
1ro
+et
)o
rth
7< in
mi''ions
9
- 1ierre
Fmidyar
eBay 1 ;9GH
9 Ieff 54oll eBay 1 -B9H
< Ieff Be*os .ma*on.6om #nc. 1 1-9H
B Rinny 5mith 'uest 5oft!are 1 DEH
D &avid Cilo Mahoo 1 D9H
E Ierry Mang Mahoo 1 D-1
G Rob Klaser Real+et!or4s 1 B9<
11 Kreg Reyes Brocade
6ommunications
5ystems
1 <1E
1- Ien/"sun
"uang
+vidia 1 <HD
1; Ioe
?iemandt
Trilogy 5oft!are 1 9GH
1E 5an,ay
Oumar
6omputer .ssociates
#nternational #nc.
1 -DH
-1 Iames T.
&emetriades
5eeBeyond
Technology
1 -9G
-9 Ierry
Kreenberg
5apient 1 --<
-; I. 5tuart
Moore
5apient 1 --;
-D &avid "it* +et!or4 .ppliance 1 -H-
-E Iohn ?.
MacCarlane
Fpen!ave 5ystems
#nc.
1 1GE
9H Raul
Cernande*
&imension &ata
+orth .merica
1 1EE
9- 6hris Olaus #nternet 5ecurity
5ystems
1 1ED
9< Michael
5aylor
Microstrategy 1 1EH
9E Marc
.ndreessen
?oudcloud 1 1BB
1H &an 5nyder )ashington Reds4ins - BH;
19 Michael
Iordan
)ashington )i*ards - 9GE
1G Tom 6ruise 6ruiseA)agner
1roductions
- -<1
-H 1ercy Miller
=Master 1>
+o ?imit - -;G
-- 5ean 6ombs
=1. &iddy>
Bad Boy
:ntertainment
- -91
9B Iim 6arrey 1it Bull 1roductions - 1D1
;H Tiger )oods :T) - 1BH
1 Michael &ell &ell 6omputer 9 1B9HH
; Ted )aitt Kate!ay #nc. 9 1EDH
1B 1antas
5utard,a
Marvell Technology 9 9B9
9G +av 5ooch 5ilicon ?aboratories 9 1B-
1< Ieanette
5ymons
Nhone Technologies ; 9D;
-< 5udha4ar
Ravi
5onic).?? ; -1G
-B 5ree4anth
Ravi
5onic).?? ; -1G
-G Ieffrey
6itron
Ronage ; 1G;
99 .nousheh
.nsari
5onus ; 1EH
9D Ionathan M.
Rothberg
6uraKen < 1BE
1D Iohn
5chnatter
1apa Iohn0s
#nternational
B -G9
91 :ric
Kreenberg
#nnovation
#nvestments
D 1ED
9; "alsey
Minor
1- :ntrepreneuring D 1EH
B. )ith one/!ay analysis of variance !e test the
hypothesis that there are no differences in the net
!orths among the groups. 6ategory values < B and D
are omitted because there is insufficient sample to run
.+FR.. The Fne/)ay ! test is significant at .H9;.
The null hypothesis of no differences is re,ected. )e
conclude from the post hoc test that the
hard!areAsemiconductor net !orth is significantly
different from the other three groups.
=et ;orth
#!ke+ B
+5ubset
for
alpha
S .H<
Kroup 1 -
;.HH <-9D.-H
HH
-.HH D-G;.E<
D1
1.HH -HD9H.<H
HH
9.HH ; ;BD9.D
<HH
Means for groups in homogeneous subsets are
displayed.
a Qses "armonic Mean 5ample 5i*e S B.---.
b The group si*es are unequal. The harmonic
mean of the group si*es is used. Type # error
levels are not guaranteed.
1B. =1> +ull hypothesis. "
H
8p
1
S p
-
the proportion
removed by .nti/R is equal to the proportion
removed by '/6ure. "
.
8 p
1
a p
-
=-> 5tatistical test. The Mc+emar test is chosen
because the groups have been matched according to
the control characteristic// standard number of
viruses that each anti/virus product is presumed to
remove. .lthough nominal measurements !ere
used the chi/square test of independence is not
appropriate.
=9> 5ignificance level. alpha S .H< =t!o/tailed
test>
=;> 6alculated value8
Z =
33 - 58
33 + 58
=
-25
91
= - 2.62
=<> 6ritical test value S
Z
1.GB alpha S .H< .
=B> &ecision. 5ince the =absolute value> calculated
value is larger than the critical value =b/-.B-b _ b
Z
1.GBb> re,ect the null hypothesis and conclude that
the proportion removed by .nti/R is not equal to
the proportion removed by '/6ure.
+ote8 the other approach to Mc+emar a modified
chi/square that !as illustrated in the te(t produced
a chi/square value of B.EB. )ith 1 d.f. at alpha S .
H< the critical value is 9.E;. )e re,ect the null.
=The square root of B.E; is the 8 result of -.B-.>
1D. .. Test of hypothesis8
=1> +ull hypothesis. "
H
8 F
i
S :
i
. That is the styling
preferences are independent of buyer behavior. "
.
8
F
i
a :
i
.
=-> 5tatistical test. 6hoose the t!o independent
sample chi/square test to compare the observed
distribution to a hypothesi*ed distribution. The chi/
square test is used because the responses are
classified into nominal categories and there are
sufficient observations in each cell.
=9> 5ignificance level. ?et alpha S .H<.
=;> 6alculated value8 5ee :(hibit 1D/;.
=<> 6ritical test value S 9.E;. alpha S .H< d.f. S 1
=B> &ecision. The calculated value =-H.G;> is
greater than the critical value =9.E;> so re,ect the
null hypothesis and conclude that styling preference
is not independent of buyer characteristic.
B. Many analysts !ould apply the correction for
continuity since the sample si*e is larger than ;H
and a - ( - table is used. #n this case the calculated
value drops only slightly to 1G.11 and !e also re,ect
the null hypothesis.
From Concept to Practice
1E. =1> +ull hypothesis. "
H
8 There are no differences
bet!een the means of the t!o groups relative to
flight service ratings.
=-> This is a case of t!o independent samples. The
appropriate test is parametric because the data are
interval. Qse the t test.
=9> The desired level of significance is again set at
alpha S .H<
=;> 6alculated value8 5ee :(hibit 1D/<.
=<> 6ritical value from te(t :(hibit K/- !ith 9E d.f.
is -.H9 for a t!o/tailed test. The ! test on the left of
the e(hibit indicates that !e cannot re,ect the
hypothesis of equality of meansJ thus !e interpret
the pooled variance estimate section.
=B> 6alculated values =absolute values> of b/-.1Gb _
critical value= b-.H9b>. The null hypothesis is
re,ected. That is there is a significant difference
found bet!een the t!o groups !ith respect to the
service rating.
1. .. . regression coefficient is the value !hich
e(presses ho! much of a dependent variable0s value
is estimated to change !ith the change of one unit
in the independent variable. . correlation
coefficient r is a measure of the association bet!een
one or more independent variables and a dependent
variable. The correlation coefficient can range from
/1.H to Z1.H.
B. Both r S H and p S H state that there is no
relationship bet!een the L and M variables. The
inde( r represents the sample data !hereas p
represents the population correlation.
6. The test of the slope = b
1
S H> is a very important test
in bivariate linear regression. #f the true slope is
found to be *ero there is no linear relationship
bet!een the L and M variables. The test of the
intercept is only to determine if the regression line
goes through the origin. The test of r
-
S H is
similar to the test of the slope !ith a slightly
different interpretation. .s a goodness of fit test r
-
tells us ho! !ell the regression line fits the data. By
partitioning the sum of squares in the dependent
variable !e discover the proportion of variation in
the dependent variable e(plained by the model.
&. The coefficient of correlation r describes the
relationship bet!een the t!o measured variables.
The coefficient of determination is r
-.
#t sho!s the
degree to !hich the variables in question share
common variance. #f r is found to be .GH r
-
is equal
to .E1J that is E1P of the variance in L is e(plained
by M and vice versa.
:. . slope of *ero is represented by a hori*ontal line in
bivariate linear regression and a slope of infinity is
represented by a vertical line. The meaning of a
*ero slope may be based on several conditions8 =1>
M is completely unrelated to L and no systematic
pattern is evident =-> there are constant values of M
for every value of L or =9> the data are related but
nonlinear.
C. #n bivariate regression a test of goodness of fit uses
t. #n multiple regression the C test is used to chec4
the entire model and individual t tests evaluate each
independent variable separately. The C test !hen
applied to bivariate regression produces the same
result as t
-
.
From Concept to Practice
-. The interpretation of the four plots is as follo!s8
=.> +o relationship =B> 1ositive relationship =6>
+onlinear relationship =&> +egative relationship.
Making Research Decisions
9. .. Kamma is found to be .EEB / as per computer
calculation in :(hibit 1E/1.
B. Oendall0s Tau
b
and Tau
c
are .DHD and .DH
respectively.
6. Kamma is based on a comparison of concordant and
discordant pairs of scores. #t treats variables
symmetrically in calculation8 no independent or
dependent variable is designated. Both versions of
tau are refinements of gamma that considers that not
all pairs of variables are concordant or discordantJ
some observations may be tied on variable L M or
both. ?i4e gamma the range for tau is /1.H to Z1.H
and larger coefficients imply stronger relationships
bet!een the variables. Technically tau
b
can only
obtain this range for a square table if no marginal
frequency is equal to H. Tau
c
better approaches the
outer limits of the range for tables of any ro! by
column configuration.
&. . large value for gamma often occurs !hen a more
appropriate measure of the variables !ould produce
a smaller coefficient. #n this sense gamma is often
inflated. 5ince all the coefficients !ould be
appro(imately the same if there !ere no tied pairs
it appears prudent to consider this effect and the
greater stability of tau across different situations.
Cor this data either tau !ould be suitable although
tau
c
!ould be the more conservative choice.
;. 5ee :(hibit 1E/1. 5omers0 d symmetric is .DHD and
asymmetric !ith opinion dependent is .DH. The
symmetric coefficient results from averaging the
t!o asymmetric coefficients. 5omers0 is e(tension
of gamma that considers the number of pairs not
tied on the independent variable. The magnitude of
the coefficients is quite similar to the tau results. )e
!ould assume that education influences opinion on
the ta( and that the asymmetric coefficient !ith
opinion dependent is the most appropriate one to
interpret. )e conclude from this problem that there
is a relatively strong positive relationship bet!een
education and opinion about a ta( on stoc4 and
bond transactions.
<. )e have computed four measures of association in
:(hibit 1E/-. Ff the four phi is the most
appropriate because the table in question is - ( -.
.s noted else!here 6ramer0s R simplifies to phi
for - ( - tables. The 1earson contingency
coefficient 6 is problematic because of its upper
limit and comparability to other measures. .ll of
these three are based on the chi/square. ?ambda on
the other hand offers a 1R: interpretation. Kiven
the information in the problem it !ould be difficult
to identify a predictor variable for lambda in that it
could be argued that one0s position on ta(es in
general is ,ust as good a predictor of party
affiliation as party affiliation !ould be for
predicting favorableness on a specific position.
B. .. 5ee :(hibit 1E/9
B. M S ;.E Z 1.HBD L
6. 5ee :(hibit 1E/9
&. #f L S 1H M S 1<.<J L S 1D M S --.G
D. .. (+mmetric Meas!res
Ralu
e
.ppro
(. 5ig.
1hi.E-D .HHH
6ramer0s
R
.E-D .HHH
6ontinge
ncy
6oefficie
nt
.B9D .HHH
+ of BE<
Ralid
6ases
1hi is appropriate for this - ( - table. #t is identical to
6ramers R in this case. The 6ontingency 6oefficients
upper range restriction ma4es it unsuitable for
comparison. The relationship bet!een the t!o variables
is relatively strong. #t is also statistically significant at .
HHH<. The 6hi/square !as significant and since the three
measures are based on 6hi/square !e !ould e(pect
them to have the same confidence level.
B. The value for lambda is .BE !ith the clinical condition
as the designated dependent variable. This means that
BEP of the error in predicting the clinical condition
=pregnancy> is eliminated by 4no!ledge of the test
result.
E. 5ee :(hibit 1E/;.
.. )ith d.f. =1E> the critical value of ! is <.9-. #n this
table the calculated value is G<.D< =found from the
mean squares once the student fills in the d.f. and
sum of squares by subtraction>. )e re,ect the null
hypothesis b S H.
B. The t value =G.DG> is the square root of !, #t is the
primary test of the slope for bivariate regression.
G.
1H. The largest 1earson coefficient is .GDH; the
relationship bet!een cash flo! and mar4et value.
The 5pearman ran4 order correlation is found to be .
GD<E. +ormally !e !ould e(pect the 5pearman to
display a lo!er coefficient since the ratio data is
lost !hen each data point is converted to a ran4. #n
this case the order preserved the relationship !ith
precision.
11. The variables selected for illustration !ere net
profits and mar4et value. The regression appears in
:(hibit 1E/B. The R
-
is .G- and a t value for the
slope is G.DG. The null hypothesis !as re,ected at
the .HHHH< level.
The follo!ing assumptions !ere e(amined8
linearity equality of variance independence of
error and normality. #n addition residuals !ere
e(amined for outliers.
Linearity8 The scatterplot graphs the relationship of
the L and M variables. There is little doubt that the
function is linear and that a straight line fits the
data.
Euality of Nariance8 The assumption of equality of
variance may be chec4ed !ith the plot sho!n in
:(hibit 1E/D. There is some spread of the residuals
!ith increases in predicted values. :ven !ith this
small sample si*e !e should be cautious about this
assumption.
&ndependence of Error8 The collection and
recording of data !as a computer generated random
sample. There is no reason to suspect sequential
values. .s sho!n at the bottom of :(hibit 1E/B no
case!ise plot !as produced for this e(amination
because no outliers !ere found.
Residuals8 The summary of residual statistics in
:(hibits 1E/E and 1E/G does not seem to indicate
that the ranges =in the /1.B to 1.D range for
standardi*ed values> !ould not be a cause for
concern.
0ormality8 Reinforcing !hat !e 4no! from prior
chec4s the histogram of studenti*ed residuals in
:(hibit 1E/1H does not reveal any residuals beyond
Z
9.1B. Cor a sample of 1H the :(pected + conforms
reasonably !ell to the observed +. .lthough there
are some of departures from a strictly normal
histogram !e are dealing !ith residuals and
nonconstant variance and therefore e(pect that
normality !ill sho! some effects. The normal
probability plot of standardi*ed residuals in :(hibit
1E/11 also provides some indication of departures
from normality but not in a patterned !ay.
Recommendations8 Kenerally the diagnostics
suggest that this small random sample from Corbes
<HH data conforms rather !ell to regression
assumptions. :quality of variance could be
improved !ith transformation and that !ould be our
ne(t step in this problem. Fther!ise the departures
are relatively minor.
1-. .. The relationship bet!een L and M is sho!n as / .E;
in :(hibit 1E/1-.
B. The sign of the correlation is negativeJ they vary in
opposition.
6. The square of the correlation r
-
is equal to .D1J
that is D1P of L is e(plained by M and vice
versa.
&. The equation is8 M S 91.B; / 1.H; L. The line is
plotted as sho!n8
:. Refer to :(hibit 1E/1-. The test of the slope =b S H>
!as re,ected belo! our conventional a S .H< !ith a
t value of / ;.1B and significance of .HH;-. The test
for the significance of the correlation coefficient
!as determined by a t!o/tailed t2 test as sho!n in
the top of the e(hibit. This null !as re,ected at .H1.
Cinally the ! test of the regression model !as equal
to t
-
=1D.9>.
1. .. )ultidimensional 1caling uses a spatial map to
describe perceptions. Qsually participants rate or
ran4 ob,ects =or people places constructs etc.> !ith
the smallest value applying to ob,ects that are
perceived to be most similar and the largest applying
to ob,ects that are perceived to be least similar.
Fften the participant is instructed on the criteria to
use in ,udging similarity. The researcher decides on
the number of a(es used to display the spatial map.
T!o to three dimensions are most useful. The map
sho!s the most similar pair as closest to each other
and the least similar pair as furthest a!ay from each
other.
B. #n contrast to M&5 !hich spatially sho!s
similarities and dissimilarities cluster analysis
groups ob,ects by similarities. )hereas M&5 is
used mainly for perceptions cluster analysis is used
for any set of attributes. #ts algorithm is defined to
ma(imi*e membership in mutually e(clusive
clusters that are homogeneous. Both techniques use
a graphical presentationJ cluster analysis puts the
ob,ects into distinct groups and M&5 does not.
6. Cactor analysis li4e cluster analysis is a grouping
technique. "o!ever !hereas cluster analysis
requires members to be in one group only variables
can load on more than one factor in varying
amounts. )hereas cluster analysis groups similar
members together into a cluster factor analysis
attempts to e(plain =or describe> latent constructs by
creating ne! factors to replace the original
variables.
-. &ependency and interdependency identify the t!o types
of interrelationships among variables. #n the
dependency situation the concern is !ith the 2effect2
that independent variables have on a dependent
variable. #n the interdependency situation there is no
designated dependent variable. Rather the
interrelations among variables are studied.
. dependency technique !ould be chosen !hen one or
more variables can be classified as dependent.
Making Research Decisions
9. #f a M.+FR. problem has t!o factor levels and
several dependent variables the position of the
variables could be reversed and treated !ith
discriminant analysis. The t!o levels of the factor
already a technically nominal classification become the
dependent variable in the discriminant equation !ith
the several dependent variables =from the M.+FR.>
becoming the predictors. #n this !ay the degree to
!hich each dependent contributed to a linear equation
predicting the correct classification could be assessed.
;. There are a variety of possibilities here about !hich
your students !ill quite possibly have some e(pertise.
5ome of the factors and levels that could be considered
for are8
Cactor ?evels
Brands 9 =required>
"ardness of shoc4s 9 =softAmediumAhard>
Kearing - =lo!Ahigh>
)eight - =lightAheavy>
Tire si*e 9 =carAtruc4Aother>
Tread type 9 =high!ayAoff/roadAother>
+umber of seats 9 =1A-A;>
#f !e use the first < factors of the list !e have a 9 ( 9 (
- ( - ( - design or a D-/option full concept design.
<. .. 1robably a discriminant analysis. The problem
involves a dependency relationship in !hich the
dependent =or criterion> variable is nonmetric =i.e..
applicants either choose to attend or not attend>.
)e are not sure about the measurement scales of the
independent =predictor> variables but it is assumed
that they are metric. .nother possibility is multiple
classification analysis !ith a H/1 dependent
variable.
B. This situation also involves the dependency
relationships and presents a metric dependent
variable and at least some independent variables
that are metric. The appropriate multivariate
technique is multiple regression perhaps !ith some
dummy =H/1> independent variables.
6. . multiple regression equation for the reasons of
dependency plus metric predictor and criterion
variables.
&. This appears to call for a statistical technique !hich
relates the various test results and e(tracts a fe!er
number of latent variables or dimensions !hich
2e(plain2 sales success. This !ould suggest factor
analysis.
B. ?et us adopt the follo!ing symbols for the variables8
5ales8 5 =U>
5alesperson0s level of education8 : =Mears of
education>.
Kender8 )e use a single dummy variable for t!o
categories. #f the observation is male then the
variable MS1 if female then H.
6onsumer income8 # =U>
)ealth8 ) =U>
:thnicity8 ?et us consider three categories
6aucasian "ispanic and .sian. Cor the three
categories !e need t!o dummy variables.
.sian /. !hich ta4es the value 1 if it is an .sian
and H other!ise.
"ispanic /" !hich ta4es the value 1 if it is a
"ispanic and H other!ise.
bH is a constantJ b1 b- b9 etc. are coefficients e is
the residual error term.
The general model !ill be8
5 S bH Z b1: Z b-M Z b9# Z b;) Z b<. Z bB" Z e
The instructor may observe and e(plain ho!A !hen the
values of the dummy variables M . and " are each
*ero then the predicted value of 5 is for a Cemale
salesperson !ith the consumer being 6aucasian.
D. .. This particular e(ercise illustrates the sort of
problems one encounters almost routinely in real
life situations and ho! the form and scale in !hich
data is collected restricts the analysis and
determines the techniques that can be used.
The follo!ing points are !orth observing at the
outset8
1. The data for all the variables i.e. ,ob
satisfaction promotions and departments are
non/metric =non/metric refers to nominal or
ordinal>.
-. +o dependent variables are specified.
:(hibit 1G/1 =5electing the Most 6ommon
Multivariate Techniques> in the te(tboo4 indicates
that if there are no dependent variables and the
variables are non/metric the possibly usable
techniques are non metric factor analysis latent
structure analysis non metric cluster analysis and
non metric multidimensional scaling.
Ff these cluster analysis groups similar ob,ects or
people. "ere cluster analysis !ould e(amine
2!hether and !here is there a clustering2 of people
in terms of ,ob satisfaction promotions and
departments. &o people high on ,ob satisfaction
seem to be those !ho also have many promotions7
&o these seem to be from one department alone7
These are the nature of insights that cluster analysis
may provide.
The instructor can point out that the data on
promotions could easily have been collected as
2number of promotions2 for each participant =ratio
scaled> and ,ob satisfaction data could be collected
on a semantic differential seven point scale etc.
!hich !ould allo! us to use methods appropriate
for metric data. #f this !ere done then only
2&epartments2 !ould be nominally scaled.
Ta4ing the data as it is an e(ploratory e(ercise to
test the hypothesis that 2there is no relation bet!een
departments promotions and ,ob satisfaction2 can
be done using the 6hi 5quare Test =refer to 6hapter
1B>. Curther if hypotheses !ere formulated and a
dependent variable specified then depending on the
manner in !hich the data is collected =metric or non
metric> discriminant analysis =non metric dependent
variable> or multiple regression =metric dependent
variable> could have been used.
B. Brand choice =9 brands> is a nominally scaled non/
metric variable income levels and e(tent of
advertising =say advertising budgets or the number
of advertisements> are ratio scaled metric variables.
)ith the dependent variable for study being brand
choice the appropriate methodology !ould be
discriminant analysis.
6. 6luster analysis is recommended here. #f there
are substantive cluster formations for dar4 fabric
buyers in particular incomeAethnic groups and
particular temperature conte(ts then those cluster
groupings indicate the temperature income
ethnicity 2combinations2 !hich should be focused
on. 6luster analysis also reveals the 2bases2 that are
not important to the formation of clusters =for
instance !hile the ma,ority of people !ho are .sian
may buy dar4 fabric income may sho! no relation
to the clustering>. The cluster identification and
corresponding mar4et segmentation principle
minimi*es !ithin cluster distance and ma(imi*es
bet!een cluster distance. The cluster characteristics
of dar4 fabric buyers can then be compared to the
demographic data available for different areas and
those areas !here the chosen cluster characteristics
are salient may be chosen as mar4ets for dar4
fabric.
.nother !ay of approaching the issue !ould be to
treat fabric color choice as the dependent variable in
a discriminant analysis !ith income temperature
and ethnicity being the e(planatory or independent
variables. :thnicity is a nominal variable and to use
it in a discriminant analysis procedure !ould
require the use of dummy variables in a manner
similar to that used in regression analysis. #n this
case the discriminant function can be used to
e(trapolate and 2predict2 the number of dar4 color
fabric users !hen the data on income temperature
and ethnicity is 4no!n.
From Concept to Practice
E. The R- value indicates that the four independent
variables statistically account for G- percent of the
variation in the annual sales. The standard error of
estimate is a measure of the precision of the M
estimates. This value of 11.G million dollars indicates
that t!o out of three times the equation estimates of
company sales !ere !ithin a plus or minus 11.G million
of the true sales figure.
The regression coefficients indicate that8
1. M S ;G.E< million dollars !hen all the #Rs are
*ero.
-. M is lo!er by UBEHHH for each increase of a
million annual marriages. This is illogical and
suggests that the estimating equation might be
improved by substituting another variable. .ll
other regression coefficients sho! the effects on M
of a change in one unit of the independent
variable.
G. Fne !ay to ,udge the predictive value of a discriminant
analysis is to determine the percent of correctly
classified dependent variable cases. #n this problem
-1H of -EH or D< percent of the cases !ere correctly
classified. Crom the detailed data !e see that the best
record !as in predicting !hich persons !ould ta4e
alternative ..
1H. The data are found on the !ebsite
=!!!.mhhe.comAbusinessAcooperE> in :(cel
spreadsheet form. &ata source8 "air Ioseph C. Rolph
:. .nderson Ronald ?. Tatham and )illiam 6. Blac4.
Multivariate &ata .nalysis !ith Readings. <
th
ed. Qpper
5addle River +I8 1rentice/"all 1GGE pp. D-</-B. L1S
delivery speed L- S price level L9 S price fle(ibility
L;S manufacturers image L< S overall service LB S
salesforce image. LD S product quality.
Qsing the procedure Cactor in 5155J principal
components varima( rotation. :ighty/three percent of
the variance is e(plained by three factors. The rotated
solution is sho!n belo!.
Rotate) Component Matri.
6omponent
1 - 9
L9 /.EHB
LD .DBH
L- .D<; .<B1
L1 /.D<- .<BH
LB .G;<
L; .G-1
L< .GEH
:(traction Method8 1rincipal 6omponent
.nalysis.
Rotation Method8 Rarima( !ith Oaiser
+ormali*ation.
a Rotation converged in < iterations.
#erms in Re:ie5
1. .. . speaker2centered presentation is one !here the
focus =generally resulting from the spea4ers sense
of inadequacy> relies on memori*ation of the
manuscript. The style of delivery reflects a
preoccupation !ith the memori*ed message to the
detriment of establishing rapport !ith and adapting
to the needs of the audience. #t is considered self or
spea4er/ centered because it is strictly one/!ay
communication. #n contrast an extemporaneous
presentation replaces a script !ith an organi*ed set
of ideas that may be presented from notes or an
outline. This approach ta4es into consideration the
need for adaptability to the occasion fle(ible
response to audience feedbac4 and a conversational
delivery of the message.
B. /echnical reports include both a full presentation of
the analysis as !ell as sufficient procedural
information to permit another researcher to replicate
the study. The report structure follo!s the steps of
the research study itself8 prefatory items
introduction methodology findings conclusions
appendi(es and bibliography. The appendi(es
contain detailed information such as
instrumentation data analysis methods and
instructions for the field personnel.
)anagement reports are !ritten for the
nontechnical client. They minimi*e methodological
details. 6onclusions are presented before the
findings that support them. Kraphics are used to
enhance comprehension. The appendi( is short
compared to the technical report and the
bibliographic references are often omitted.
6. The topic outline is a format in !hich only a 4ey
!ord or t!o are used for each item. 1entence
outlines e(press the essential thoughts using brief
sentences associated !ith the specific topic.
Making Research Decisions
-. .. #t is important to include a balanced and thoughtful
but fair e(pression of the conceptual and
implementation limits that the reader should 4eep in
mind !hen dra!ing inferences from a report0s
findings. #t is a mar4 of the true professional. #n
student reports one sometimes sees an e(tensive
listing of faults !hich may be ,ustifiably true but
detract from the report.
B. Tables appearing in the body of the report or
presentation should be 4ept as simple as possible in
order to allo! the readerAlistener to grasp one or
t!o specific points. #n place of tables use charts
and graphs !henever possible. #f comple( tables are
necessary in the report an appropriate place is in
the appendi(.
6. The physical presentation of the report is critical to
its being read. #t is especially important to design
the report to fit its audience and to deal adequately
!ith the 4no!ledge gap that may e(ist bet!een the
!riter and reader. The particular form of the report
should be appropriate to the audience occasion and
importance of the sub,ect.
&. 21ace2 concerns the problem of ho! quic4ly
concepts are developed and ho! deeply they are
e(plained. The pace tends to be slo! and the depth
limited if the topic is comple( and the audience
unsophisticated. #f the topic is comple( but the
audience is sophisticated the !riter can assume a
certain depth of 4no!ledge and move quic4ly and
deeply into the sub,ect.
9. .. . short report informational in nature most li4ely a
memo. 6osts reimbursement amounts and
employee satisfaction !ith the health plan are
comparative aspects that may be included in
graphical form.
B. . technical report !hich follo!s the suggested
format of the ,ournal in question.
6. . short report8 memorandum or letter style
informational in nature.
&. . long report. . technical report !ould be
mandatory a management report may also be
presented to supplement the technical report.
;. 5everal different graphic forms are acceptable for each
of the cases mentioned.
.. #f the intent is to sho! yearly data for the decade it
is probably !ise to use a line diagram on a semilog
chart because semilog scales sho! rates of relative
change much better than do arithmetic scales. #f
one is sho!ing only the percentage change bet!een
annual income in 1GGH and -HHH then an arithmetic
line scale diagram may be used if 1GGH for each
country is set at 1HH and -HHH is e(pressed as a ratio
to 1GGH. . better choice !ould be to use a multiple
variable bar chart !ith both countries listed above
the years 1GGH and -HHH. The ra! data and the
percentage change can thus be sho!n on the same
chart using numbers to augment the graphic.
B. &ata are delivered several !ays depending on the
message being delivered. #f there !ere a desire to
only compare percentages of income spent on items
it !ould probably be best to use a hori*ontal bar
chart of the 1HHP component type. This is the best
!ay to sho! comparable allocations among
subparts. . vertical version of the same type of
chart is often found and is also acceptable. #f on
the other hand the absolute dollar comparisons
!ere desired a vertical stac4ed bar chart !ould be a
good choice. Fther possibilities include a set of pie
charts =for percentages only> a 9/& column chart if
the data has been collected for more than one year
or a stratum chart.
6. This could be the t!o/!ay percent change
hori*ontal bar chart. #t is an ideal !ay for
comparing percent changes bet!een t!o points for
a limited number of entities especially !hen one or
more may be negative percent changes.
.lternatively a multiple variable vertical bar chart
could be used to graphically sho! the changes from
year to year. This !ould be best done !ith the
years 1GGB and 1GGD as the variables and the si(
firms as the heading for the groups of bars. . third
possibility is to use a line chart !ith each firms0
price sho!n as a different line.
From Concept to Practice
<. This can be a good e(ercise to have students
prepare in advance and then have several sho!n on
chal4board or via transparencies. Fne suggested
outline for the first assignment is given belo!.
.ns!ers to parts B and 6 depend on the
circumstances at the time.
#. Pre5riting Consi)erations
.. 1urpose of report
B. &efine audience
6. 6ircumstances and limitations
&. "o! !ill it be used7
##. Report Choice
.. #nformational or research7
B. ?ong or short7
6. Management or technical7
&. Cormat7
###. Draft &o)+ of report
.. .nalysis of data
B. 5elect important findings
6. 5tatistical tests
&. &raft conclusions
:. &raft recommendations
C. Revie! and revise
1. by self
-. by others
#R. ;riting report
.. Futline body of report
B. "o! to present data
1. draft tables
-. set up charts
9. develop graphics
6. #ntroduction
&. .ppendi(
:. 5ynopsis
C. Cirst draft / total report
K. Revie! and revise
1. by self
-. by others
". Cinal printing and finishing touches
R. (!&mit report
6. There are hundreds of #nternet presentations most in
1o!er1oint .dobe .crobat or real/time that may be
found using a search engine li4e Koogle. )e selected a
topic on secondary data !ith a 1o!er1oint slide sho!
as the delivery medium
=http8AA!!!.dtic.milAcendiAactivitiesA1-X11XH1XoaisXpr
ogram.html> $5etting the Cederal 5tage8 &eveloping a
+ational &igital #nformation #nfrastructure for
1reservation.% This presentation !as made in &ecember
-HH1 at the conference on Managing and 1reserving
:lectronic Resources at the ?ibrary of 6ongress.
1resenter8 Martha .nderson Technical 6oordinator
Fffice of 5trategic #nitiatives ?ibrary of 6ongress.
D. 5ome li4ely requirements for statistical materials are8
.. 5tatistical data should usually be presented in small
doses.
B. .ll tables should be labeled adequately to ensure
that the reader could understand their contents.
6. )riters should seldom put more than several
statistical numbers in the body of te(t. )hen more
need to be presented the statistics should go into
tables or semi/tabular presentations.
&. )henever possible salient points should be
emphasi*ed by means of a po!erful graphic. Cor
time series data bar charts and stratumAarea charts
are good choices. Cor percentage data a pie chart
can be used. Cor distributions consider the
histogram or bo( plot.
:. )riters should recogni*e that charts and graphs
represent visual comparisons that have more impact
than the labels on the data. Cor e(ample the
omission of a *ero base line may give an incorrect
visual impression even though the scales may be
labeled correctly.
C. 5tatistical data from !hich findings are dra!n
should be ad,acent to those findings.