Beruflich Dokumente
Kultur Dokumente
Introduction
Economists have been interested in sport for quite a long time. A first reason
Canada are working in a sports-related activity1 - and leisure - the 2002 Football
World cup for example was watched by over a billion spectators (Hoffman et al,
data. Given the popular interest in sports, a lot of newspapers and fans collect
all kinds of detailed sports data. These detailed data have been used by
example investigates the need for competitive balance. That is, they try to see
whether it is important that teams are not too different in terms of quality so
that there’s some uncertainty about the outcome of a match. Other economists
use sports data to test theories that are also of importance outside the sports
context. Ehrenberg and Bognanno (1990) for example use golf data to test the
1
http://www.parl.gc.ca/InfocomDoc/36/1/cher/Studies/Reports/sinsrp05/04-rep-e.htm
implications of tournament theory – i.e. that differences in pay-off motivate
be less available for European sports – salary data for European sportsmen for
example are very rare compared to salary data for American sportsmen. Still,
focuses on estimating the demand for football and theoretical work on player
mobility, the latter spurred by the 199X Bosman Case2 (see Matheson, 2003 for
to take a look at a whole range of issues that have been covered in the
attendance etc) and allows us to test a number of theories that are of interest
2
The Bosman case made the free transfer of players possible.
Data Description.
1) Champfoot
selected the data from the French Competition. The reason for this is that the
data on the French competition are the most comprehensive data available
for us, data on player performance are available for the French Championship.
• list of names of all players of all teams in the first (D1) and second (D2)
o team
3
The information for the 1996/97 season is not complete.
o Minutes played by match (this allows us to infer whether a player
Note that players that were hired by a team but never played for
o number of spectators
o for first division matches, the quality of the match – the France
L’ Equipe is the biggest French sports daily. Like France Foot, its reporters also
obtained from one of its journalists a file with player ratings for the first
3) Football data.co.uk
Football data.co.uk collects odds data for French football matches in both
division 1 and 2 since the 2000/01 season. Odds data are often used to measure
From Champfoot we can get an idea about the careers of football players
about careers of those players who remained in the first and second division of
competition after 1996/97 or for players who started their career before
the Internet Player History Database and L’Equipe’s player history database.
The player history database is a internet database with over 30000 histories of
football players.
information on:
• nationality
• second nationality
• country of birth
• date of birth
• height
• weight
• retired or not
extracted, for each football player in the Champfoot database, information on:
• nationality
• birthdate
• height
• weight
Combining these 3 databases gives us a much better idea about the careers of
football players4.
Table 1 gives the distribution of France Foot’s player ratings over the 6
different categories for the French first division over the period 1996-2003.
The ratings do show signs of centrality bias and or leniency bias – that is,
extreme ratings are very little used and bad grades are rarely given. Such
biases are typically explained by the fact that supervisors do not like to give
bad grades to people with whom they interact closely (see Murphy,1991 and
agency problem between the supervisor/manager and the owners of the firm:
take time away from better-rewarded activities, and because managers face
evaluations‘. Note that despite the fact that in our setting there’s very little
direct contact between players and reporters, we still notice the same
tendencies.
however, grades in division 2 are higher on average – the main difference with
division 1 being that the low grade 2 is less often used! This finding provides
strong evidence for a leniency bias – despite the fact that second division
division football players are selected for the national team and second division
teams tend to lose from first division teams in the direct confrontations of the
cup matches), subjective performance evaluation does not show such difference.
Besides the ratings by the journalists of France Football, we also have data on
the ratings by journalists of the sports journal, L’Equipe for division one games
in three seasons. The grades of L’Equipe cover a larger range (1 to 10). Still,
Computing the correlation between the France Football ratings and the Equipe
ratings, show a strong positive correlation: 0.59 for 1997 and 0.57 5 for both
5
Rank correlations are of similar magnitudes.
1998 and 2002. Though strongly correlated they are far from perfectly
Also for matches we find leniency and centrality biases – here again explanations
based on personal interaction that leads to a disutility of giving bad grades fail –
Commentators often argue that the quality of the French competition has
transfer market. For example, Raymond Domenech trainer of the French youth
national team claims (Le Monde, 30/03/1998, p. 20) that the departure of the
best French young players has weakened the French clubs6. Looking back at
table 1-4 one can notice that there’s little change over time. Both the
distribution over grades and the average player grade are stable between 1996
and 2003. And the average match score first increased somewhat, then
decreased, which confirms the results based on player ratings of little evidence
Some have argued that football has become more defensive. Mention (2000) for
example shows that during the sixties the goals per game dropped substantially
to remain at a low level ever since. Since we have for each player, the position on
the field – that is whether they are goalkeeper, whether they play in the attack,
the middle or the defense – we can look whether different positions got used to
a different extent over time, i.e. whether French football has, for example,
become more defensive over the period 1996/7-2003/4. The answer can be
found in table 5.
6
Raymond Domenech déplore la baisse de niveau du football français. Dans un entretien au "Monde",
l'entraîneur des Espoirs analyse, avant le choc Metz-Lens, le parcours des deux équipes actuellement en
tête du championnat de première division. Il affirme que le départ de leurs meilleurs jeunes à l'étranger a
affaibli les clubs nationaux.
No clear tendency over time can be observed in the use of different positions
over time, thus rejecting the idea that ‘strategy’ has changed over time. Mention
While at first sight this might be surprising since physical shape and career
effects. Note that also the variance (both individual variance within a season as
effort’. Lazear (1989) for example shows that increasing the prize in a relative
performance contest will not only increase effort but also increase the level of
‘sabotage’ (destructive effort). The intuition behind this is that you can beat
your competitor by working hard but also by preventing him from getting a good
result. Football games are clearly relative performance contests – there are two
competitors and only the best get points. If some games are more important
than others, we would expect that in those matches the players put more
creative effort (i.e. play better) but also do more to prevent the other team
from scoring (for example, by playing less fair). An indicator of creative effort
can be the rating, an indicator for destructive effort, whether one gets a yellow
card or not. Players with yellow card have on average a lower rating (3.2067)
then those without (3.2446) which goes against Lazear’s idea. However, an
alternative explanation for this finding is that the experts evaluate the cards as
something negative and reflect this in their ratings – indeed a red card
for example mention that in England home teams win twice as many matches as
away teams. The same is true in our French data: 49,9% of first division
matches are won by the home team, against 23% by the away team. In the
second division, 46.1% of all games are won by the home team, against 21% by
the away team – in both cases, home teams are about 2.2 times more likely to
win.
One might wonder why home teams are more likely to win. One explanation is
that they are playing better at home. Descriptive statistics seem to indicate
that this is indeed the case. The average player rating of the home team is
Note that an alternative explanation for this finding is that if reporters choose
which matches to visit, they are more likely to visit the teams they support and
hence, they might give these too high points. According to France Football, a
fraction of the home team advantage could come from the fact that the journal
more lenient towards the home team than the FF experts 7 – we checked this by
looking at the difference between home and away matches in division one and
division 2 (period 1997/98 – 2003/04). Hence, the local reporters do NOT seem
Playing well indeed does pays off. That is, teams that play well are more likely to
win. The statistics point in this direction: if the home team is having a higher
player rating than the away team, the home teams wins in 75.9% of all matches,
it draws in 21.67% and loses only 2.43% (using data from L’Equipe these
When the away team is better than the home team, it wins 59,24% of the
matches, draws in 34,33% and loses in 6,43% ( for L’Equipe, 55.08%, 37.43%,
7.49%). This difference between the relation of match score and win percentage
for home and away teams might indicate that the reporters prefer the home
team to win. Indeed, if the average match rating of home team wins is 12.1894,
home and away team have strategies that are indeed somewhat different: home
teams begin a match on average with more offensive players (2.25 vs 2.1) and
7
FF also explains that the higher average rating for D2 players by this higher proportion of local
reporters in Division 2.
Strategy indeed is related to the chance of winning: If the home team has more
offensive players (than the away team), the home team wins in 47.1% of the
cases, draws in 20.5% and loses in 32.3% of the cases. If the away team has
more offensive players (than the home team), the away team wins in 20.7% of
the cases, draws in 45.1% and loses in 34.1% of the cases. This seems to suggest
that for a home team a more offensive tactic pays off while for an away team
There are several championship related incentives for teams in the football
championship – a team can win the championship, it can win a place in the
championship league, it can qualify for the UEFA cup and it can qualify for the
which qualify for the UEFA cup). In addition, there are the incentives coming
from the fear of relegation. Because at the end of the season, the winners and
losers of a championship are already known, many observers expect lower quality
middle of the season, then increases slowly to end up at the same level as
To investigate the reasons for the absence of such effect, we computed the
percentage of matches where one of the teams had nothing to loose or win
+ no Champion
Sure UEFA + no 4.0000 0.17
Champions League
Sure intertoto + no 5.0000 0.22
UEFA
Sure no relegation + no 36.0000 1.57
intertoto
Sure Relegation 15.0000 0.66
It turns out that there only in 2.93% of all matches there is at least one team
that has nothing to win or lose anymore – 4 matches included a team that was
already sure to win the championship, 3 matches had a team that was sure to win
a champions league place but was also sure not to become champion anymore. 36
matches had a team that couldn’t win any prize but also couldn’t be relegated
anymore. And in 15 cases a team was involved that already was sure to be
relegated. Of the 2290 first division matches only 14 matches where between
two teams that had nothing to win or lose anymore (not in the table) - So, as far
as the French first division is concerned, the prize structure is such that
Next, we look at the second division – second division has a less extensive
there only 7.15% of the games had at least one team that had nothing to win or
lose anymore, and only 2.66% of the games (not in table) were between two
Number % of al
of games
games
Sure Promotion 40 0.0146
Sure no relegation + no promotion 131 0.0478
Sure Relegation 25 0.0091
Next we check whether such matches with less (or no) champion-related team
efforts are of lower quality – first division matches where two teams have such
incentives have an average rating of 11.85, against 12 for matches where at least
one such teams has no such incentives. If the home team has no incentives
anymore, the average player rating of the home team is 3.21 (3.32 for the
second division), against 3.25 if there are incentives (3.34 for the second
division) – for the away team, the average player rating is 3.13 (3.23) both when
These statistics suggest that championship related team incentives are not
crucial for the quality of a game or the quality of the players’ play. A potential
explanation for this finding is the presence of other incentives – players can
decide to play well because of match premia that are related to winning a match
(the typical contract will indeed have explicit rewards that are a function of the
We can also check the effect of this on spectators to see the effect of
competitive balance!
team is only considered not to have incentives coming from the champions’ title
championship. Of course, a team that is 20 points behind the leader will probably
not believe it can become champion anymore even if there are still 30 points to
We take the championships from after the second world war and look at the
points evolution of the champion. That is, we look at each match Y of the
competition and ask the following question: suppose that before match Y, we are
X points behind the leader – what is the chance that we still become champion.
To answer this question we run, for each stage of the championship, a logit
regression where the dependent variable is a dummy, that takes one for the
team that eventually becomes champion and zero for the other teams, and the
explicative variable is the difference in points between the leader and the other
teams, for each match, using data from postwar Division championships. Using
the logit results, we can compute predicted probabilities for each point
points behind the leader at different stages of the championship using the 33
post WW2 championships which had 20 teams (18 others had 18 teams).
If after match 1 you have the same level of points as the leader, your ceteris
paribus chances to win the title are about 10% (against 0.05% before the start
of the championship). As one can see, being at the same level as the leader
increases your probability of winning the title as the season continues. After the
37th match, if you still have the same points as the leader, the chance of winning
the championship is 94%. Being one point behind the leader is more detrimental
to the chances to win the cup than at the end – if you are 1 point behind the
leader after the first match of the season, your chance to win the championship
are less than 5%. But being one point behind the leader after the 32nd match
give you more than 30% chance to win the championship. However, if after the
37th match, you are still one point behind the leader, the chance of still winning
the championship is only 9%. While these results are plausible, there are some
shortcomings – if there are two teams sharing the leadership position after the
37th match it’s unlikely both will think of themselves as having 94% chances to
win. Similar, after the 37th the team at the top position has 94% chance while
the team at one point behind the leader has 9%, these probabilities sum up to
E. Competitive Balance
The literature on competitive balance claims that fans will lose interest in the
game when there is no match uncertainty – that is, if it’s unlikely that your team
will win or if you’re sure your team will lose, you will not be interested to watch
the game. The empirical literature only offers limited support for this
(Szymanski, 2004?).
To take a first look at this issue, we look at the correlation between goal
difference and the quality of the match – if experts like unbalanced matches, we
like unbalancedness. Splitting up the sample in home wins and away wins, we find
that the correlation is stronger for home wins (0.31) than for away wins (0.19).
bookmaker) between the home and away team) and our measure of ex post
unbalancedness (the absolute value of the difference between the average home
player performance and the average away player performance) was about zero
(using 2743 matches). Similar, the correlation between the quality of a match
2. Regressions
Before starting with the regressions, let’s look at the factors that will influence
a football game.
The Coach
First we have incentives for the teams before the match. These incentives will
determine the decisions of the coach who will choose the players that are best
for the tactics he sees appropriate for the game. A first decision a coach has
can observe this choice by counting how many offensive players, how many
A second decision is on team heterogeneity - the coach can choose players that
have about the same age or can prefer a mix of young and old. He can prefer to
make it an international team. He can choose players that all speak the same
language or can assume that language doesn’t matter. Several studies have
between two national teams knows for example that there are important
differences in the style and tactic which different national teams use. The
Economists (May 30, 2002) quotes Jorge Valdano (Argentina, 1986 World Cup)
as saying “The Brazilians play like they dance; the Germans play like they make
cars, with lots of technical efficiency and not much left to the imagination; the
English run hard all the time, maybe because of the weather; the Spanish are a
mosaic of regional styles, which has yet to find a national pattern. And the
Italians,” he pauses and frowns, “they are a paradox. In every other area they
export style and flair to the world; but in football they've allowed the ideal of
Empirical work of Yamanaka, Hughes and Lott (1995) confirms this: ‘Because of
the different ways in which soccer has developed throughout the world, the
range of climates in which it is played and the varying temperaments associated
with individual nations, there would appear to be styles of play unique to areas
of the globe’ and conclude their empirical research as following:’ It is clear that
the overall patterns of the British Isles teams differ from the other groups
examined in the way they build up their attack from defense, using the goal kick
and long forward passes. They also showed a dominance in the air. European
teams tended to build the play using short passes, runs, and dribbles, reducing
the risk of losing possession. The South American teams had a high ratio of
shots to crosses with a higher percentage of crosses coming from the final
sixth of the field. It was also concluded that Cameroun exhibited similarities to
the game patterns played by the teams of South America, more so than the
other two groups, from the British Isles and Europe. The main differences were
Mixing players with widely varying styles can be advantageous but is not without
the previous Champion of the Dutch competition, explains the current poor
performance of the team as follows: ‘The problem is too many players left the
team on a short time. Moreover, in most cases, it were Dutch internationals that
left and were replaced by foreigners. This caused, certainly during the first few
other. Obviously, this makes good results very difficult to achieve.’ (HUMO,
16/3/99). Similarly, just before the start of the new competition, a journalist
asked a player of the Belgian first division team Club Brugge the following
8
This evidence was based on the team-performance during the 1990 Worldcup so it isn’t possible to decide
whether these differences are really due to differences in nationalities or differences in coaching-style. Still,
national coaches do have generally the nationality of the team they are coaching.
question: ‘Club Brugge did buy Belgian to replace Van Der Elst [ a famous Belgian
player]. The supporters love it. And the players?’ The player of Brugge
answered:’ It’s nicer that more Dutch is spoken during the training. English and
French are now spoken less. Which obliges the foreigners to learn our language.
It isn’t that foreigners have worse characters than Belgians. The club did well
changing its policy. One language is better than six languages.’ Trainers also
because the team has 5 Dutch players and 3 Belgian players, Dutch is spoken in
the dressing room rather than German. The player’s answer again illustrates the
importance of language : ‘No, Huub Stevens, the trainer, insists that we speak
German. There are also two Tsjech ,one Russian and one Norwegian player, so it
would be quite a mess when everybody spoke his own language. So everybody
players that are part of other national teams can not count on this
synchronization. Implying that if, for example, a French team bought all the
best Dutch players, it would be possible that, every time, the Dutch national
team plays, the French team would have to field their ‘second’-team. That this is
say that teams have to release their players when the latter are called on to
absences of foreign key-players that are selected for their own national teams
done)
• heterogeneity in club tenure: how many players on the team where on the
The players
In addition to picking the players the coach can give recommendations to players
on how to play. Given the incentives of the match, the players will then decide
about their use of effort – they can either do creative efforts which are match
improving, some of which are measured like number of goals or decisive passes
or they can choose for destructive efforts which are match-deteriorating, some
9
If this idea is confirmed, the fact that France became World Champion can be explained by its multi-ethnic but
uni-language composition, while the poor performance of the Belgian team isn’t anything more than the
consequence of combining two languages in one team without the advantages of more ethnic variety. Moreover,
it could give a rationale for naturalizing foreigners to increase the ethnic variety of the teams.
of which leads to measured stats like a yellow card. Of course, some effort is
not captured by objective variables, for example somebody making a foul that is
not sanctioned by the referee or somebody nearly scoring a goal from far
distance. Both the measured and the non-measured actions will be captured by
The Match
The effort of the players and the strategy chosen by the coach will determine
the outcome of the game: whether the team wins, loses or draws but also other
outcomes like how unbalanced the game was. Both these results will influence
Match regressions
To explain the match ratings, we will take 3 different strategies. First, we can
take only objective match performance indicators like goals, the number of
passes etc. The problem with this approach is that we only have a limited number
measured objective indicators– the problem with the latter strategy is that the
information – a goal scored will both influence the goal indicator and the rating
10
An important assumption we make is that a player’s performance rating is independent of the other players’
performance. One could easily imagine that this assumption is violated – giving a good decisive pass to a good
player who then scores might be valuated more than the same good pass to a bad player who then misses.
indicator – one alternative strategy would be the following – we run a players
rating regression and then use the errors of the regression (average for a given
match and team) as additional explicative variable together with the objective
indicators for the match. Those errors then reflect the quality element that is
The numbers of defensive and offensive players are indicators for the
Idea for a “game theory” paper: What determines the probability of getting
We further have
interested in the total number of goals but also in the goal difference.
cards and the number of offensive and defensive players. To allow for a
difference.
• A dummy for the number of the match, with higher numbers meaning a
season effect. One could argue that at the end of the season a lot of
teams have nothing too lose or win anymore so they have less incentives to
win. (A nice thing would be to create dummy that takes one when a team
has nothing to lose anymore and a dummy that takes one when a team has
nothing to win anymore – and check the effect of that on win percentage,
quality of player notes, yellow cards,team strategy etc – title the effect
of carrots versus sticks and also because here incentive rule is exogenous
En effet !
• players rating: the average note of the players on the pitch – note that we
assume that the quality of the match does not influence the notes a
player gets.
• balance between the two teams: the absolute value of the difference
between the average notes of the players of the two teams. We include
including this variable we can check whether more balanced games are
better games or not. If we find that more balanced games are better,
• The average variance of effort – what do experts prefer, that the whole
players’ notes
We use three different estimation methods. Column 1 gives the OLS estimates.
Since we have multiple observations for each specific combination of two teams
(i.e. Bordeaux meets Lyon 2 times in every season) we also run a fixed effect
preferred over OLS a Hausman test chooses fixed effects over random effects.
Since our notes are ordered categories we in fact should use ordered probit
different marginal effects for each category. The significance of the variables
however is not affected by the estimation method. We therefore will focus for
[Probably better to ADD DIFFERENCE IN RED CARDS and then not include
The results are as follows – as far as strategy is concerned, experts do not like
defensive teams – each extra defensive player on the field decreases the match
attackers is insignificant. More goals and more passes lead to higher notes. Each
extra goal is worth 0.3 points, each extra decisive pass 0.1. The number of
(first) yellow cards has a positive effect albeit only marginally significant. Red
cards in contrast have a negative effect on the match note. The variables
indicating unbalancedness give interesting results. Experts like matches that are
unbalanced in terms of scores – a 2-0 victory is getting a half a point score more
than a draw 1-1 draw, ceteris paribus - but do not like matches where teams are
match where one team is very offensive and the other is very defensive is
generally not leading to a high match note. So experts seem to like balancedness
in tactics (ex ante) while unbalancedness in outcome (ex post). Very nice
The (not reported) dummies for the effect of the number of the match show
that matches later in the season do not receive higher notes than the matches
at the beginning. This is a surprising finding since one would expect that, at the
end of the season, several teams have nothing more to lose or to win. Perhaps
The (not reported) coefficients of the dummies for the years are not significant
for the OLS and the ordered probit specification, but the fixed effects
goals, passes and cards, matches in 1997-1999 received higher notes than those
Finally, a Ramsey RESET test using powers of the independent variables rejects
the null hypothesis that the model has no omitted variables at a 5% level. This
outcome may indicate that additional variables are needed to correctly specify
Subjective Variables
performance evaluations.
performance of a team by one point extra (on a scale of six) increases the match
a team with on average 3 points and team with on average 5 points is better than
a match between 2 teams that have about the same point average. Note that
moreover the squared term is positive, stressing even more the importance of
significant indicating that experts like to see matches during which some players
the season and the year we get little significant effects, all in all confirming
that matches later in the season are not worse and that the competition did not
The adjusted R2 is about 0.2 for the objective variables and 0.3 for the
So far we assumed that a goal scored (or a yellow card received) by the home
team is the same as a goal scored (or a yellow card received) by an away team.
However, there are a number of reasons why home and away indicators could be
valued differently. First, it is a stylised fact that home teams are more likely to
win than away team. This might be the consequence of more incentives of the
11
This result is robust to including the strategy variables like the number of defenders and attackers on
the field or the difference in these variables.
home teams to win (because of the spectators) – more incentives lead to better
play, hence this results in more goals for the home team and hence a higher
likelihood to win. However, this better play also might mean that on average the
goals of the home team might be of better quality than the goals of the away
team. Hence, the coefficient of the home team goals should be bigger than the
coefficient of the away goals variable. Another reason would be that are
experts are biased towards the home teams – that too would imply that home
goals are valued more. One way to distinguish between these two explanations is
by looking at the ‘valuation’ of red cards. Assume that the referees are
unbiased, hence a referee will give a red card when the gravity of a foul exceeds
a specific threshold that is the same for both teams. In this case, biased
(towards the home team) experts will punish home red cards less. However,
higher incentives will not only lead to more creative effort but also to more
destructive efforts. Hence a home red card should on average be for a more
Here again the Ramsey RESET test rejects clearly (at the 1% level) the null
hypothesis that the model has no omitted variables, indicating that the right
objective).
By the way, check whether strategy at home and away are different – criticism
Chossat, V. and Gergaud, O. (2003) "Expert Opinion and Gastronomy: The Recipe
for Success", (with V. CHOSSAT), Journal of Cultural Economics, 27, pp. 127-
141.
Ginsburgh, V. and van Ours, J.C. (2002) "Expert Opinion and Compensation:
Evidence from a Musical Competition", The American Economic Review, 93(1), pp.
89-296, March.