AccuRating is a library for accurate player ranking based on match results.

These details have not been verified by PyPI

Project links

Project description

AccuRating

Library computing accurate ratings based on match results.

Is AccuRating accurate?

Yes. With the same data, AccuRating will return much more accurate ratings than chess ELO or EGD. For instance if a player Leon manages to win against player Caesar, Leon will "get" some X points for that victory. If later Caesar will win against strong players Alusia and EmCee, his rating will increase, but that would not affect Leon's rating in chess or EGD. If we want to use data efficienty Leon's rating should be adjusted because Caesar clearly demonstrated his power and it was not long ago that poor Leon lucked out a victory against Caesar.

EGD, Chess ELO go over the data once, so they have no way of taking that into account. AccuRating shows that we need about 1500 - 2000 passes over the data to converge to accurate ratings.

AccuRating is a variant of Whole-History-Rating, which is also used at https://www.goratings.org/.

Interesting discussion on why WHR is good.

What AccuRating ratings numbers mean exactly?

Only differences matter, i.e. adding say 100 to all the ratings would yield equally valid points.

100 AccuRating point difference is 1:2 win odds (33% vs 66%)
200 AccuRating point difference is 1:5 win odds (20% vs 80%)
300 AccuRating point difference is 1:9 win odds (11% vs 89%)

The exact formula is $P(win) = 1 / (1 + 2^{d / 100})$. Optimization algorithm find ratings that maximize probability of the data.

Compared to Chess ELO

Chess ELO is similar, but the points are rescaled by 1.20412:

120.412 chess ELO difference for 1:2 win odds (33% vs 66%)
240.824 chess ELO difference for 1:5 win odds (20% vs 80%)
361.236 chess ELO difference for 1:9 win odds (11% vs 89%)

The Chess ELO formula is $P(win) = 1 / (1 + 10^{d / 400})$

Compared to EGD

In EGD, winning odds for 100 points of rating is not fixed. This is beacuse: 1 dan/kyu difference = 100 EGD = 1 handicap. The nature of Go is that 1 handicap (i.e. 100 EGD) means more on a dan level than on a kyu level.

On the dan level:

90 EGD point difference is approximately 1:2 win odds (33% vs 66%)
180 EGD point difference is approximately 1:5 win odds (20% vs 80%)
270 EGD point difference is approximately 1:9 win odds (11% vs 89%)

On the kyu level:

300 EGD point difference is approximately 1:2 win odds (33% vs 66%)
600 EGD point difference is approximately 1:5 win odds (20% vs 80%)
900 EGD point difference is approximately 1:9 win odds (11% vs 89%)

Based on these tables.

If AccuRating is so accurate why other systems are not using it?

Typical ELO systems (like EGD) use every game result only once and update rating based on it. This model can do 1500 passes over the data until it converged (the first pass returns numbers more or less like standard ELO system, equations are almost the same). However so many passes is too expensive when the number of games is as big as EGD.

What it AccuRating bad for?

This system does not have any gamification incentives so it is bad to player motivation. It uses data efficiently and nicely approximates true strength. It can be compared to hidden MMR used for match-making in games like Starcraft 2, not to the player-visible motivating points with various "bonus points".

Model details

The model finds a single number (ELO strength) for each player. Given ELO of two players, the model predicts probability of win in a game: If we assume that $P$ and $Q$ are rankings of two players, the model assumes:

$$P(\text{P vs Q win}) = \frac{2^P}{2^P + 2^Q} = \frac{1}{1 + 2^{Q-P}} $$

This means that if both rankings are equal to $a$, then: $P(win) = \frac{2^a}{2^a+2^a} = 0.5$. If a ranking difference is one point, we have $P(win) = \frac{2^{a+1}}{2^{a+1}+2^{a}} = \frac{2}{2+1}$ Two point adventage yields $P(win) = \frac{1}{5}$ $n$ point adventage yields $P(win) = \frac{1}{1+2^n}$

For readability reasons we rescale the points by 100. This is exactly equivalent to using this equation:

$$ \frac{1}{1 + 2^{\frac{Q-P}{100}}} $$

Comparison to single-pass systems.

In single pass systems, if you play a game, it will not affect the model estimation of your rating yesterday. In multi-pass system we can estimate every player's rating for every season (or even every day). Then iteratively pass over the data again and again until we find rating curves that best fit the data.

There is a parameter that controlls how fast the rating can change. WHR model assumes that player rating is a gaussian process in time, and this parameter is variance of this gaussian process.

The consequence is that data flows both time directions: if you play game in season 20, it will also affect your ratings for season 19 (a bit) and season 18 (very small bit) etc. The data also flows over the whole graph of games and players on each iteration.

Can I convert these ratings to EGD?

Because EGD is using different exponent base, it is not that easy to convert directly. These is a monotonic conversion function but it is non-linear, and it would take some work to derive the formula.

It would be interesting to plot EGD ratings against AccuRating ratings.

What's implemented

This repo implements:

Implement Bradley-Terry (BT) model for player ranking (better variant of ELO).
Generalization BT model to a variant of WHR model with time as seasons.

ToDo and ideas

distribution fit to account for heavy tail
Chess units
EGD conversion
Fit player variance (high variance player can more easily win against stonger players and more easily lose against weaker players)
Follow this: https://www.marwandebbiche.com/posts/python-package-tooling/

Development

git config --global core.hooksPath .githooks/

git clone https://github.com/lukaszlew/accurating
virtualenv /tmp/venv
source /tmp/venv/bin/activate
pip install poetry
poetry install
poetry run pytest
poetry run mypy .

Testing with iglo use-case:

poetry run python iglo/iglo.py download iglo/matches.json
poetry run python iglo/iglo.py ielo iglo/matches.json iglo/cfg.json iglo/ratings.json iglo/igor.txt

Publish package

# edit stuff; increase version
poetry build
poetry config pypi-token.pypi [token]
poetry publish

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.7.0

Mar 11, 2025

0.6.1

Jan 20, 2025

0.6.0

May 12, 2024

0.5.0

May 16, 2023

0.4.9

May 15, 2023

0.4.8

May 14, 2023

0.4.7

Apr 26, 2023

0.4.6

Apr 26, 2023

0.4.5

Apr 23, 2023

0.4.4

Apr 23, 2023

0.4.3

Apr 17, 2023

0.4.2

Apr 17, 2023

0.4.1

Apr 17, 2023

0.4.0

Apr 17, 2023

0.3.1

Apr 16, 2023

0.3

Apr 16, 2023

0.2

Apr 16, 2023

0.1.1

Apr 16, 2023

0.1.0

Apr 16, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

accurating-0.7.0.tar.gz (10.1 kB view details)

Uploaded Mar 11, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

accurating-0.7.0-py3-none-any.whl (11.3 kB view details)

Uploaded Mar 11, 2025 Python 3

File details

Details for the file accurating-0.7.0.tar.gz.

File metadata

Download URL: accurating-0.7.0.tar.gz
Upload date: Mar 11, 2025
Size: 10.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.0.1 CPython/3.13.1 Linux/6.13.0-0.rc1.20241204gitfeffde684ac2.17.fc42.x86_64

File hashes

Hashes for accurating-0.7.0.tar.gz
Algorithm	Hash digest
SHA256	`a28f56beceaec8d7b32322e222d27c2b9f2ae095d073d59fa941ccebceb48619`
MD5	`3fd83cabc7af1771c6ba5d5ccb53d799`
BLAKE2b-256	`e1ba0615d25c705da2bc7e89064f08dbea52c94e0e657f796172f66dbc19352a`

See more details on using hashes here.

File details

Details for the file accurating-0.7.0-py3-none-any.whl.

File metadata

Download URL: accurating-0.7.0-py3-none-any.whl
Upload date: Mar 11, 2025
Size: 11.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.0.1 CPython/3.13.1 Linux/6.13.0-0.rc1.20241204gitfeffde684ac2.17.fc42.x86_64

File hashes

Hashes for accurating-0.7.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0161f6caff6cb436960f3dcac4b0e4c3aa2811c83c0ad03a89173416fb37a8fd`
MD5	`9136807e6efade6bff5a6e9bd3c635c9`
BLAKE2b-256	`21be9e8b0dba62a51d28fec94fce7a3d77bf24825b853ef0c7dddd71c4f4ce8e`

See more details on using hashes here.

AccuRating 0.7.0

Navigation

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Project description

AccuRating

Is AccuRating accurate?

What AccuRating ratings numbers mean exactly?

Compared to Chess ELO

Compared to EGD

If AccuRating is so accurate why other systems are not using it?

What it AccuRating bad for?

Model details

Comparison to single-pass systems.

Can I convert these ratings to EGD?

What's implemented

ToDo and ideas

Development

Publish package

Project details

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes