Beruflich Dokumente
Kultur Dokumente
1) INTRODUCTIONa) Before we go into operation first we will have three type of data
set1) Master data set which contains all the sets combined
2) E-grocery subset which contains basic details of subjects
belonging to e- grocery and contains questions 20 till 32 only
3) Non E-grocery subset which contains basic details of subjects and
contains questions 33 till 42
b) If we try to get data on the subjects
2) OBJECTIVEwe need to do analyses in different perspectives but two main
objectives area. To analyse people who are under e- grocery their satisfaction
level, is there any need for development in the service by these
e-groceries firm?
b. To analyse people who are under non e- grocery -compare with
people who are in e-grocery, reason(factors) affecting the people
for not going into e- grocery thereby detecting the areas to be
developed and people understanding on e-grocery.
3) THINGS TO BE KNOWNA. We will denote from question from 1 till 19 as X independent
variable (here within these question we can make analyses but
for a broader perspective will denote it as like this
B. From question 20 till 32 as dependent variable Y1 with respect
to first sub data set and
from questions 33 till 42 as dependent variable Y2 with respect
to second sub data set
4) OPERATION At initial level we shall start by three approach a) we can try factor analysis using R language just to identify
what are all the variables in data turned out to be quite
significant and narrow down our analysis (but this would be
informal approach)
b) Either from former step or we can start dependency test like Chisquare test X2, G2 test and linear trend analysis variables on
ordinal variables, for this dependency test we should take Y1 and
Y2 as dependent factors and X as independent factors thus there
will n number of dependency test which will give us insight that
for what are all the factors which influence decision made by
subjects on particular question (note these test can be
conducted only for two categorical variable)
c) Similarly we can try with odds and odds ratio (here dependent
variable should have only two options).
Odds ,odds ratio1) At basic level we can find odds for a person having specific
quality shop at e-grocery or not eg- take e- grocery as
success consider gender i.e. we can find odds for male to
shop online and odds for female to shop online.
Odds ratio- here we can find odds for female to shop online
compared to a male to shop online.
2) Marginal odds ratio, conditional odds ratio
Above example for odds ratio is marginal odds ratio. For the
example we add another factor(x) say sector i.e. we are
conditioning on factor sector
Therefore this would be a 3 dimensional table- here we can
find odds for a woman from IT sector choosing e-grocery
compare to a man from IT sector choosing e-grocery,
similarly for other sector.
This helps us on who are all your potential customer and
what are customer remaining to be covered. Thus this will
narrow down on marketing strategy also.
Similarly we can use different type factors affecting people
to choose e-grocery.
Binary logistic regression model1) We can take y as people choosing e- grocery as success and
consider independent variables like Q1-12, Q14-18 and
build binary logistic regression model. This model will help
us interpreting the factors X affecting probability of
success i.e. here people choosing e- grocery for e.g. if slope
coefficient for age is negative means as age increases the
odds for people choosing e-grocery reduces
Consider if slope coefficient for income is positive means as
income increases the odds for people choosing e-grocery
increases- high income prefer e-grocery.
Multinomial logistic regression model1) Here consider y which has more than 2 categories e.g.consider Q31 medium of online shopping as Y and consider
suitable X and build multinomial logistic regression model
here there three categories-pc, tablet ,mobile thus masking
one category we will get coefficients for other two i.e.
probabilities for person to choose tablet or mobile can be
find using coefficients and interpretations are quite similar
to above
Note- the data sets should be altered suitable to the tools that is being
used for analysis.