Beruflich Dokumente
Kultur Dokumente
Chess Rating
Goal Paired comparison scaling to make a ranking of all active players and to serve as a basis for selection of players for training and tournaments Description Rating and scoring of two-player game Paired-comparison scaling Extrapolation of player relative strength from small number of games Geographically distributed players Generalize diverse set of backgrounds, styles and strategies into one number
Interesting Facts
Kasparov does not hold either of two World Titles in Chess but is universally regarded the best chess player in the world by virtue of his number 1 ranking (2813 rating in Oct 2004)
Rating Scale
ELO/FIDE is most widely accepted ranking system (FIDE - Fdration Internationale des checs)
Source: http://www.ex.ac.uk/~dregis/DR/bcftable.html
B
1609
C
1477
1613
D
1388
1613
E
1586
1613
B
1720
A is penalized for having a worse performance than should be expected given his opponents. If he had scored two wins, one loss, and two draws, for a total score of three points, that would have been slightly better than expected, and his new rating would have been (1613 + 32*(3 - 2.867)) = 1617.
1613
Score: 0+0.5+1+1+0 = 2.5 E_Score: 0.506+0.686+0.785+0.539+0.351 = 2.867 New Rating: 1613+32*(2.5-2.867) = 1601
Source: http://www.wordiq.c om/definition/ELO_rating_system
Conversion of rating difference into likelihood of winning for each player H - higher ranked player L - lower ranked player
No History Rating - Ru
Rating Player with no Rating history: Tournament Play
Scores > 50%
Scores 50%
Ru = Rc
Rc = R a for Unrated Players Ra = Tournament Average Rar = Avg Rating of all Rated Players dpa = determine p for rated players, then determine dp for each, then avg Ra = R ar - d pa * n/(n+1)
Source: http://www.fide.com/official/handbook.asp?level=B0210
History Rating - Rn
Rating Player with Rating history: Tournament Play
Determine Rc For Tournament D = R - Ra Determine PD W = Achieved Score We = Expected Score
We = PD * n for Swiss or individual play
Difference in rating above 350 is treated as 350 for rating purposes Rc = (Ra (n+1) - R) / n R = Rating of Player n = Number of Players Ra = avg rating of all tournament players Calculating new R n : Rn = R 0 + SDR R0 = Old Rating SDR = Sum DR for each new event Source: http://www.fide.com/official/handbook.asp?level=B0210
K = Development Coefficient Players not ranked with R n < 1801 K = 25 until player has completed 30 games K = 15 while Player Rating < 2400 K = 10 once published Player Rating > 2400 (remains permanently at 10 thereafter)
Normal distribution fits these properties and makes sense if we assume that the fluctuations in performance are purely random and are not caused by changing properties of I and j Scaling factor of c = 200 2
1 P( D) = 800 !
Source: http://www.schaakbond.nl/rating/elopaper.html
D c "#
1 2 " t 2
Challenges
Ranking of computer players Retrospective rankings not accurate How to model draws (likely in chess) Performance is not typically normally distributed (kurtosis and skew) Performance variance varies between players but is generally modeled as a constant Players entering and leaving the rating pool (turnover) leads to loss of points from the system not a strict zero-sum system and system average rating is not stable Subpools of players: deflation and inflation
Other Topics
Current research focused influence of covariate information like age, playing routine, style and proficiency level on the reliability of the rating system (can incorporate these into the Kfactor) Rating Databases - Quick Chess (QC), Regular Tournament, Correspondence Chess, Online Algorithm has motivated and moved into other areas Video Games - Xbox Online Splinter Cell Sports - College Football, Soccer, Volleyball, Tennis, Golf British Chess Federation (BCF) scale ELO=(BCF*8)+600: 1 BCF point = 8 ELO point USCF Conversion
USCF initially aimed for an average club player to have a rating of 1500.
http://math.bu.edu/people/mg/ratings/approx/approx.html WorldChessRating.com