Beruflich Dokumente
Kultur Dokumente
HTTPS://SITES.GOOGLE.COM/SITE/JOURNALOFCOMPUTING/
WWW.JOURNALOFCOMPUTING.ORG 12
Abstract—Markov chains are stochastic processes the knowledge of the present state uniquely determines its future stochastic
behaviour, and this behaviour does not depend on the past of the process. Markov Chains are widely used in areas such as
finance, education, production, marketing and brand addiction. A Hidden Markov Model is a stochastic process which is formed
by adding some properties to a Markov chain. Applications are developed by using the solution algorithms of the three main
problems of Hidden Markov Model. This study is an application about product preferences and the reasons for the preferences
by using Hidden Markov Model based on Markov Chains. The aim of this study is to develop an estimation method about
product preferences. The toolbox inside Matlab software is used for the numerical solutions of the model which is handled in the
application.
—————————— ——————————
1 INTRODUCTION
a
j 1
ij 1 , i (4) ditions below.
b j ( k ) 0 , 1≤ j ≤ N , 1≤ k ≤ M (9)
since they obey standard stochastic constraints. M
t (i) (i , j) (27) M
b (k) 1 ,
t
j 1 j 1≤ j ≤ N (30c)
If we sum t (i ) over the time index t ,we get a quanti- k 1
ty which can be interpreted as the expected (over time)
number of times that state Si is visited, or equivalently,
4 APPLICATION
the expected number of transitions made from state Si (if
we exclude the time slot t T from the summation). Si- In this study, the goal is to estimate which brand the
milarly, summation of t (i , j ) over t (from t 1 to notebook computer users in Turkey will prefer and the
t T 1 ) can be interpreted as the expected number of reason behind the preference of any brand, thus develop-
transitions from state Si to state S j . That is ing a new application area for HMM. The data that will
T 1 be used in determining the research parameters are ga-
(i) expected number of transitions from S
t 1
t i (28a) thered with the survey technique. In the making of the
survey which is used as the research tool, the appropriate
T 1 questions are selected from the brand addiction and
(i , j) expected
t 1
t
S .
number of transitions from S to
(28b)
i
brand preference surveys that are used frequently in the
literature.
j
In the preparation of the questions in the survey, the
Using the above formulas (and the concept of counting information that is gathered from the literature scan and
event occurrences) we can give a method for reestimation the opinions and suggestions of the authorities in the
of the parameters of an HMM. A set of reasonable reesti- leading companies of the industry were taken into con-
mation formulas for , A and B are sideration. There are four category questions in the sur-
vey form. The first question is, choosing the currently
i expected frequency (number of times) in state Si used notebook computer brand, and the second question
at time ( t 1 ) = 1 (i ) (29a) is, choosing the reason behind the preference of the used
brand. The third question is, choosing the notebook com-
T 1 puter brand that will be used next and the fourth question
(i , j) t is, choosing the reason behind the preference of the brand
aij t 1 (29b) that will be used the next.
T 1
(i)
t 1
t Estimation of the Model Parameters
In a given time { t }, the reason behind preferring any
T notebook computer brand is taken as the “state” of the
t 1
t ( j) HMM { q t D i }. When any notebook computer brand is
preferred, the brand of the computer { V } is only known
Ot vk
bj(k) (29c) by the person who uses a notebook computer or the buy-
T
t 1
t ( j)
er but the reason behind preferring that brand is not
known by the others. Therefore, the reasons of preference
are “hidden” since they are not known by other people
If we define the current model as A ,B , and use than the buyer. Estimation of unknown reasons of prefe-
that to compute the right-hand sides of (40a)-(40c), and rences is very critical for the firms that produce notebook
we define the reestimated model as A ,B, . computers or the leading companies in the notebook
Based on the above procedure, if we iteratively use computer market.
in place of and repeat the reestimation calculation, we The transition probabilities matrix is formed by using
then can improve the probability of O being observed the reasons behind the current and the next notebook
from the model until some limiting point is reached. The computer brand preferences since, in the Markov Chain
JOURNAL OF COMPUTING, VOLUME 2, ISSUE 10, OCTOBER 2010, ISSN 2151-9617
HTTPS://SITES.GOOGLE.COM/SITE/JOURNALOFCOMPUTING/
WWW.JOURNALOFCOMPUTING.ORG 16
theory, the state of the system { q t 1 } in the next step In the light of these data, the transition probabilities and
{ t 1} is only dependent on the current state { qt } and is observation probabilities matrices are formed. Since none
independent from the past. The transition probabilities of the preference reasons of notebook computers were of
matrix includes the transition probabilities of the hidden higher priority than any other, initially, their initial state
states whose rows and columns are represented as probabilities were taken equally as { i Pq1 Di ,
S {S1 , S 2 , , S6 } , so to say the transition probabilities 1≤ i ≤6}.
{ A a ij } from the reason behind the preference of the By using the First Problem of the HMM, P(O ) refers
currently used notebook computer to the reason behind to the probability of the notebook computer brand that
the preference of the notebook computer that will be used will be used next from the currently used and the note-
next. book computer brand that will be preferred next for the
TABLE 1 model { i , aij , b j (k ) }. The observations v1 , v 2 , , v16
REASONS OF PREFERENCES THAT ARE THE HIDDEN STATES FOR were used respectively for the O value here. In other
HMM words, the O value is used to show every notebook com-
puter brand individually { O v1 (Acer), O v 2 (Apple)
etc.}. The forward algorithm given in (13), (14) and (15) is
used in calculating the P(O ) probability.
Finding out what will be the reason behind the prefe-
rence of the notebook computer brand that will be used
next will be done by using the Second Problem of HMM.
The reason that lies beneath the preference actually refers
to the hidden state in the model . Finding out the hid-
den states for the observations v1 , v 2 , , v16 is done by
The answers given to the questions of currently used using the Viterbi algorithm. The steps of this algorithm
notebook computer preference { v k } and the preference of are given between (19) and (24).
the notebook computer that will be used next { v k 1 } are The parameters of the model may be rearranged in or-
taken as the “observations” of the HMM. The brands that der to maximize the preference probability of any obser-
are obtained from the answers given to the survey ques- vation (brand). The model parameters to be rearranged
tions are given in Table 2. include the transition probabilities matrix, the observa-
TABLE 2 tion probabilities matrix and the initial probabilities.
OBSERVATIONS REPRESENTING BRANDS (29a), (29b) and (29c) are given in order to calculate these
values.
5 CONCLUSION
For the model, the P(O ) probabilities, that were
found by using the First Problem of the HMM,
TABLE 1
BRAND PREFERENCE PROBABILITIES AND HIDDEN STATES