Beruflich Dokumente
Kultur Dokumente
S Forehand Backhand
R
Forehand 90 20
Backhand 30 60
60
55
48
45
30
20 RMB
40 50
Zero-sum game: the tennis case
The mix 40:60, where S aims 40 percent of times to
forehand is the one that minimizes the percentage of
successful returns when R plays its best response (along
the green line): it wants to minmax (minimize the
maximum payoff of R); it is then 48%
Thus 40:60 is Ss best mix
Now let us look at the game from the Receivers point of
view by plotting the percentage of successful returns
against percentage of times R moves to Forehand.
By correctly exploiting Rs move, S can
keep return percentage as low as possible,
along the red line; hence the best choice of
R is to move to its forehand 30% of times:
this keeps its return percentage to the
highest, i.e., 48% SAF 90
60
48
30
SAB
20
30
Zero-sum game: the tennis case
The mix 30:70, where R anticipates 30 percent of
times, and gets prepared to, a service on its
forehand is the one that maximizes the percentage
of successful returns when S plays its best
response (along the red line): it wants to maxmin
(maximize its minimum payoff); it is then 48%
From both point of view, the best mix leads to a
percentage of successful returns of 48%
This is a general property of zero-sum games,
called the min-max theorem
Zero-sum game: the tennis case
When, in zero-sum games, one player attempts to
minimize his opponents maximum payoff, while
his opponent attempts to maximize his opponents
minimum payoff, the surprising conclusion is that
the minimum of the maximum payoffs equals the
maximum of the minimum payoffs
Neither player can improve his position, and so
these strategies form an equilibrium of the game
If R moves 50% on F, then his percentage of
return, as a function of the probability p S aims
at F is: 0.5(90p+20(1-p))+0.5(30p+60(1-p)) =
20p+40 is minimum for p=0
V A B
R
A 150 , 50 250 , 0
B 300 , 0 100 , 100
p[150q+250(1-q)]+(1-p)[300q+100(1-q)] =
(100+200q)+(150-300q)p
30
SAB
20
30 33.3
Payoff changes
Paradoxically, a higher percentage of successful returns is
achieved by using less often the improved backhand
The reason lies in the interaction of the two players
strategies: when the receiver is becoming better with his
backhand, the server aims at the forehand more often (43%
instead of 40%), in response the receiver moves too his
forehand more often too.
A better backhand unlocks the power of your forehand
Similarly for a basket-ball player: suppose her right-hand
shooting is better than her left-hand; an improvement in
her left-hand shooting changes the way she is defended
and allows her to shoot right-hand more often
Equilibrium mix
If one player is not pursuing his equilibrium mix, his
opponent can exploit this to his advantage
So, even though at your opponents equilibrium mix you
are indifferent between your strategies: play your best
mix! otherwise your opponent is going to exploit it
In the tennis example, each serve has to be unpredictable
(if your best mix is to play 40% on forehand, do it
whenever the second hand of your (analogue) watch is
between 1 and 24)
The best way to surprise the others is to surprise
yourself and toss a coin if your best mix is 50/50
Examples
Airlines and discount/stand-by tickets: if last minute ticket
availability were more predictable, there would be a
greater possibility to exploit the system
To motivate compliance at a lower monitoring cost: tax
audits, drug testing, parking meters, etc Explains why
the punishment should not necessarily fit the crime
If parking meter costs 1 euro an hour, then a fine of 20
euros will keep people honest if they believe the
probability of a fine is 5% or higher (risk-neutral);
No enforcement induces misuse of scarce parking places
100% enforcement would be too expensive
But enforcement mechanism should not be any: the expected fine
should be high enough to induce compliance
But then: random walks of officers!