Beruflich Dokumente
Kultur Dokumente
The author would like to thank Debasis Mishra for helpful suggestions.
He would also like to thank Pradeep Dubey for numerous conversations
regarding John Nash and his contemporaries.
Arunava Sen (asen@isid.ac.in) teaches at the Indian Statistical Institute,
New Delhi.
Economic & Political Weekly
EPW
vol l no 48
SPECIAL ARTICLE
the matrix above. In every cell, the first and second numbers in
the pair of numbers denote the pay-offs to the shooter and
goalkeeper, respectively.
In this setting, it is meaningless to require a player to maximise his pay-off. A players pay-off maximising strategy depends
on the strategy chosen by the other player. If the goalkeeper
chooses L, the shooters optimal strategy is R. On the other
hand, if the goalkeeper chooses R, his optimal strategy is L.
Similarly, the goalkeepers optimal strategy is L if the shooter
chooses L and R if the shooter chooses R.
Since a player does not know what his opponent will play
(since each is required to play simultaneously), it is reasonable
to assume that each player has beliefs about the choice of the
other player. A belief of a player is a probability distribution
over the strategies of the other player. For instance, the shooter may have seen videos of earlier penalty shootouts involving
this particular goalkeeper and observed that he goes L, 60% of
the time and R, 40% of the time. Similarly, the goal-keeper believes that the shooter goes L with probability p and R with probability 1 p for some p between zero and one.
Every player has a set of strategies that are optimal given a
belief. These are the strategies that maximise the players
expected pay-off given the beliefs.8 They are known as bestresponses to the belief.9
Let us calculate the best-responses in the matching pennies
example. Suppose the row player (the shooter) believes that
the goalkeeper will play L and R with probabilities p and 1 p,
respectively. Then his expected pay-off from playing L is 1.p
+ 1.(1 p) = 1 2p. By a similar argument, the expected pay-off
from playing R is 1.p + 1.(1 p) = 2p 1. Clearly the bestresponse for the row player is to play R if p > 21 and L if p < 21 .
If p = 21 , playing L and R both yield a pay-off of zero. Therefore,
any randomisation by the row player is optimal for the belief
p = 21 . By an analogous argument the goalkeepers bestresponses are as follows: play L if he believes the shooter will
play L with probability q > 21 ; play R if q < 21 and randomise
arbitrarily if q = 21 .
It seems natural to require players to play best-responses in
any equilibrium. But best-responses to what beliefs? Nash
equilibrium specifies these beliefs for a playerthey are precisely those beliefs that are generated from the strategies actually played by the other players in equilibrium. Suppose that
such an equilibrium involves the goalkeeper playing L with
probability 0.7, that is, p = 0.7. Then the best-response of the
shooter is to play R with probability 1. In a Nash equilibrium,
these must, in fact, be the beliefs of the goalkeeper. His bestresponse to these beliefs is to play L with probability one.
However this is inconsistent with our assumption that p = 0.7.
Consequently, p = 0.7 cannot be an equilibrium.
A little reflection should convince the reader that the only
equilibrium in this game, is p* = q* = 21 . If the shooter believes that the goalkeeper will play L and R with equal probability, then playing L and R both give him zero and playing L
and R with equal probability is a best-response. The shooter
playing L and R with equal probability will generate beliefs
q = 21 for the goalkeeper. Playing L and R with equal probability
NOVEMBER 28, 2015
vol l no 48
EPW
SPECIAL ARTICLE
for the goalkeeper is a best response to q = 21 , in turn justifying the beliefs p = 21 held by the shooter.
Let us summarise the discussion above. A strategy for a
player in the game above is a probability distribution over the
set {L, R}. A strategy for player j can be interpreted as a belief
for player i. A strategy profile (p*, q*) is a Nash equilibrium if
p* is a best response to q* and q* is a best-response to p*.
Equivalently, (p*, q*) is a best-response to itself.
This idea can be generalised. Consider an arbitrary game
where N = {1, 2,, n} is the set of players, Si and i : S1
Sn R, are the strategy set and pay-off functions, respectively, for each player i N. We shall refer to the elements of Si as
the set of pure strategies for player i.10 A player is allowed to
choose randomly from his set of pure strategies.11 A randomised strategy over Si is called a mixed strategy. The set of
mixed strategies for player i will be denoted by Mi this is just
the set of probability distributions over Si. The pay-off function
for player i, i, can be extended to a pay-off function i where
i: M1 Mn R in a straightforward way, by taking expectations. The set of players N, the set of mixed strategies Mi
and the new pay-off functions i for each player i N, is
referred to as the mixed extension of the original game.
A Nash equilibrium of the mixed extension game is a collection of strategies m* = (m1*, m2*, , mn*) such that
i(m*) i(m1*, , m*i1, mi, m*i+1, , m*n) for all
mi Mi and for all i N.
Let m* be a Nash equilibrium and let i be a player. The players
beliefs are the strategies of other players specified by the equilibrium, that is, (m1*, , m*i1, m*i+1, m*n). The Nash equilibrium
inequality ensures that m1* is a best-response to these beliefs.
There are several equivalent ways to express the fundamental
idea behind Nash equilibrium. As we noted earlier, a Nash equilibrium is a best response to itselfin Nashs words, it is a counter
to itself. The equilibrium is also self-enforcing in the following
sense: if players made an informal agreement to play it, then no
player has a unilateral incentive to break the agreement.
A Nash equilibrium can also be defined for the game where
players can only play pure strategies, that is, the game where
the strategy set for player i is Si and the pay-off function for i is
i. A Nash equilibrium in this game (or a pure strategy Nash
equilibrium) is a collection of pure strategies s* = (s1*, s2*, ,
s*n) such that
i(s*) i(s*1, , s*i1, si, s*i+1, ,s*n) for all si Si and for all
i N.
It is easy to see that a pure strategy Nash equilibrium is a Nash
equilibrium of the mixed extension game. However, a pure strategy Nash equilibrium may not exist. Consider the matching pennies. Observe that at every pure strategy pair (L, L), (L, R), (R, L)
and (R, R) one of the players is getting minus one. Moreover, the
player can unilaterally deviate and obtain a pay-off of one.
Nashs main contribution to equilibrium theory is not merely
the formulation of a plausible equilibrium concept. In his
1953 paper [9], he showed that the mixed extension of every
game with a finite set of pure strategies, has at least one (Nash)
equilibrium. The existence property can be extended to all
well behaved games.12 It is the existence property of Nash
Economic & Political Weekly
EPW
vol l no 48
3 Nash Bargaining
SPECIAL ARTICLE
vol l no 48
EPW
SPECIAL ARTICLE
for (S, d) must be z as well. The case for the axiom can be
argued along the following lines. Suppose the outcome for
(S, d) is zl which is distinct from z. Since S T , zl was under
consideration in the problem (T, d) but was rejected in favour
of z. Therefore, the choice of zl over z in (S, d) is due to absence
of the alternatives in T\S. But these alternatives ought to be
irrelevant for the problem (S, d). Axioms such as IIA appear
frequently in economic theory.19 The axiom is a natural simplifying assumption. The analysis would be virtually intractable
if bargaining outcomes depended on alternatives that are not
on the table.
The Nash bargaining solution for the bargaining problem
(S, d) is the vector (x1*, x2*) S with the following property:
(x1* d1)(x2* d2) (x1 d1)(x2d2) for all (x1 , x2) S and
x1 d 1 , x 2 d 2 .
The function (x1d1)(x2d2) is known as the Nash product.
The NBS is obtained by maximising the Nash product (a continuous function) over a compact set {x S|x d}. A standard
result in mathematics guarantees the existence of a solution.
Moreover, the solution is unique since the objective function is
strictly quasi-concave and the constraint set is convex. The
maximisation is illustrated in Figure 1.
Figure 1: Nash Bargaining Solution
X2
(X1d1).(X2d2)
NBS
X1
(d1,d2)
EPW
vol l no 48
SPECIAL ARTICLE
properly appreciated only by researchers with advanced training in specific branches of mathematics. In contrast, his papers
in game theory are accessible to a wide audience. They are
written in an informal, almost conversational style. There is
no obsession with technical details. The 1950 paper about the
existence of equilibrium [9], is less than a page long and barely has any mathematical symbols.21 The 1950 axiomatic bargaining paper [10] has examples about the purchase of
Cadillacs and Buicks and individuals Bill and Jack who have to
share objects such as a book, a toy, a pen and so on. The paper
rests on an elegant mathematical argument but it can be fully
explained to someone with no more than a high school background in mathematics. The other two papers use a more
sophisticated mathematical result, the Kakutani Fixed-Point
Theorem. However Nash provides the following assurance:
Notes
1 In a handwritten letter to the National Security Agency (NSA) of the US in 1955, Nash outlined the construction of an enciphering-deciphering machine. Although his idea was not
pursued by the NSA, the letter anticipates developments in computational complexity and
modern cryptography by several decades
see [12].
2 Sadly, he and his wife, Alicia were killed in a
road accident while they were on their way
back from Norway where he had received the
Abel Prize.
3 It was the subject of the Oscar-winning 2001
Hollywood film A Beautiful Mind based on the
book by Sylvia Nassar with the same name.
4 For an informal and wide-ranging assessment
of Nashs work and legacy, see [4].
5 Formally, i is a map from the Cartesian product of the strategy sets to the real line.
6 This description is known as the normal form
representation of the game. An alternative and
equivalent representation is the extensive form.
The latter makes use of game trees.
7 In other words, all players know it, all players
know that all players know it, all players know
that all players know that all players know it,
and so on ad infinitum.
8 An important question is why a player should
choose to maximise his expected pay-off. For
instance, why should it not be reasonable to assume that a player chooses strategies in order
to maximise his worst-case pay-off? The foundations of choice under uncertainty were laid
down by von-Neumann and Morgenstern in
their book. Their fundamental contribution is
the so-called Expected Utility Theorem. Loosely
speaking, it states that if preferences over lotteries over outcomes satisfy certain reasonable
axioms, utility numbers can be assigned to the
outcomes in a way such that choosing the optimal lottery according to these preferences is
equivalent to choosing the lottery that yields
the highest expected utility. The pay-offs to the
players in a game can be thought of as the utilities from the appropriate outcomes. Choosing
an optimal strategy (given a belief) is then
equivalent to choosing the strategy that maximises the expected pay-off from the strategy
where the lottery is specified by the belief.
9 Nash referred to them as counters.
10 In the matching pennies example the set of
pure strategies for each player is the set {L, R}.
11 The randomisation is independent in the statistical sense of the randomisation of other
players.
12 For instance, any game where the strategy sets
80
13
14
15
16
17
18
19
20
21
References
[1] Abel Prize (2015): http://www.abelprize.no/
c63466/seksjon/vis.html?tid=63467.
[2] Bertrand, Joseph (1883): Review of Thorie
mathmatique de la richesse sociale by Leon
Walras and Recherches sur les principles
mathmatiques de la thorie de les richesses
by Augustin Cournot, Journal des Savants
499-508. Translated by Margaret Chevaillier
as Review by Joseph Bertrand of two books,
History of Political Economy, 24 (1992), 64653.
[3] Cournot, Antoine A (1838): Recherches sur
les principles mathmatiques de la thorie de
les richesses, Paris, Hachette. Translated by
Nathaniel T Bacon as Researches into the
mathematical principles of the theory of
wealth, New York, Macmillan, 1897.
[4] Ghosh, Parikshit (2015): John Nash and
Modern Economic Theory, http://www.
ideasforindia.in/article.aspx?article id=1462.
[5] Harsanyi, John C (1967, 1968): Games with
Subscription Numbers
Subscribers are requested to note their
S ubscription Numbers and quote these
numbers when corresponding with the
circulation department.
vol l no 48
EPW