Match Predictions based on Player Ratings

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

player-performance-ratings

Framework designed to predict outcomes in sports games using player-based ratings. Ratings can be used to predict game-winner, but also other outcomes such as total points scored, total yards gained, etc.

Installation

pip install player-performance-ratings

Example Useage

Ensure you have a dataset where each row is a unique combination of game_ids and player_ids. Even if the concept of a player doesn't exist in the dataset, you can use team_id instead of player_id.

Utilizing a rating model can be as simple as:

import pandas as pd
from player_performance_ratings import PredictColumnNames

from player_performance_ratings.pipeline import Pipeline
from player_performance_ratings.predictor import GameTeamPredictor

from player_performance_ratings.ratings import UpdateRatingGenerator

from player_performance_ratings.data_structures import ColumnNames

df = pd.read_pickle("data/game_player_subsample.pickle")

# Defines the column names as they appear in the dataframe
column_names = ColumnNames(
    team_id='team_id',
    match_id='game_id',
    start_date="start_date",
    player_id="player_name",
)
# Sorts the dataframe. The dataframe must always be sorted as below
df = df.sort_values(by=[column_names.start_date, column_names.match_id, column_names.team_id, column_names.player_id])

# Defines the target column we inted to predict
df[PredictColumnNames.TARGET] = df['won']

# Drops games with less or more than 2 teams
df = (
    df.assign(team_count=df.groupby(column_names.match_id)[column_names.team_id].transform('nunique'))
    .loc[lambda x: x.team_count == 2]
    .drop(columns=['team_count'])
)

# Pretends the last 10 games are future games. The most will be trained on everything before that.
most_recent_10_games = df[column_names.match_id].unique()[-10:]
historical_df = df[~df[column_names.match_id].isin(most_recent_10_games)]
future_df = df[df[column_names.match_id].isin(most_recent_10_games)].drop(columns=[PredictColumnNames.TARGET, 'won'])

# Defining a simple rating-generator. It will use the "won" column to update the ratings.
# In contrast to a typical Elo, ratings will follow players.
rating_generator = UpdateRatingGenerator(performance_column='won')

# Defines the predictor. A machine-learning model will be used to predict game winner on a game-team-level.
# Mean team-ratings will be calculated (from player-level) and rating-difference between the 2 teams calculated.
# It will also use the location of the game as a feature.
predictor = GameTeamPredictor(
    game_id_colum=column_names.match_id,
    team_id_column=column_names.team_id,
    estimator_features=['location']
)

# Pipeline is whether we define all the steps. Other transformations can take place as well.
# However, in our simple example we only have a simple rating-generator and a predictor.
pipeline = Pipeline(
    rating_generators=rating_generator,
    predictor=predictor,
    column_names=column_names,
)

# Trains the model and returns historical predictions
historical_predictions = pipeline.train_predict(df=historical_df)

# Future predictions on future results
future_predictions = pipeline.future_predict(df=future_df)

#Grouping predictions from game-player level to game-level.
team_grouped_predictions = future_predictions.groupby(column_names.match_id).first()[
    [column_names.start_date, column_names.team_id, 'team_id_opponent', predictor.pred_column]]

print(team_grouped_predictions)

For more advanced usecases, check the examples directory.

Description

The flexibility of the rating model grants the potential for significantly higher accuracy than other models, such as Elo,Glicko and Trueskill which are based on team performance. Both team and player outcomes can be predicted. The user has freedom to combine the ratings with other features, such as home/away, weather, etc. The user can also use some of the already created machine-learning models or create any custom model that they believe will work better.

The framework consists of the following components:

Preprocessing

If the intention is a simple elo-model or equivalent, no preprocessing is required. However, typically a lot of value can be gained through intelligent preprocessing before the ratings are calculated. The rating-model will take a performance_column as input and update ratings on that. A well designed performance that is a good indicator of future success is crucial for the model to work well. For instance, if the user suspects that a players true shooting percentage is a better indicator of future points scored by the player than actual points scored, the user can use that. Or, user can also use a combination of statistics, such as true shooting percentage and points scored to calculate the "match-performance".

The user can configure classes inside the preprocessing folder to create the performance_column. The user can also create custom classes with more specific functionality.

Rating Calculation

PostProcessing

Model Predictions

Scoring

Hyperparameter Tuning

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

This version

5.6.13

Apr 21, 2024

5.6.11

Apr 19, 2024

5.6.10

Apr 19, 2024

5.6.9

Mar 26, 2024

5.6.8

Mar 26, 2024

5.6.7

Mar 26, 2024

5.6.6

Mar 26, 2024

5.6.5

Mar 25, 2024

5.6.4

Mar 25, 2024

5.6.3

Mar 24, 2024

5.6.2

Mar 23, 2024

5.6.1

Mar 23, 2024

5.6

Mar 23, 2024

5.5

Mar 18, 2024

5.4.1

Mar 17, 2024

5.4

Mar 17, 2024

5.3.5

Mar 16, 2024

5.3.4

Mar 16, 2024

5.3.3

Mar 16, 2024

5.3.2

Mar 16, 2024

5.3.1

Mar 16, 2024

5.3

Mar 16, 2024

5.2.8

Mar 12, 2024

5.2.7

Mar 10, 2024

5.2.6

Mar 10, 2024

5.2.5

Mar 9, 2024

5.2.4

Mar 9, 2024

5.2.3

Mar 9, 2024

5.2.2

Mar 9, 2024

5.2.1

Mar 9, 2024

5.2

Mar 9, 2024

5.1.12

Mar 7, 2024

5.1.11

Mar 7, 2024

5.1.10

Mar 6, 2024

5.1.9.1

Mar 6, 2024

5.1.9

Mar 5, 2024

5.1.8

Mar 5, 2024

5.1.7.2

Mar 4, 2024

5.1.7.1

Mar 4, 2024

5.1.7

Mar 4, 2024

5.1.6.1

Mar 3, 2024

5.1.6

Mar 3, 2024

5.1.5.1

Mar 3, 2024

5.1.5

Mar 3, 2024

5.1.4.3

Mar 3, 2024

5.1.4.2

Mar 3, 2024

5.1.4.1

Mar 3, 2024

5.1.4

Mar 3, 2024

5.1.3.2

Mar 3, 2024

5.1.3.1

Mar 3, 2024

5.1.3

Mar 3, 2024

5.1.2

Mar 3, 2024

5.1.1

Mar 2, 2024

5.1.0

Mar 2, 2024

5.0.2

Mar 2, 2024

5.0.1

Mar 2, 2024

5.0.0

Mar 2, 2024

4.8.7.2

Feb 29, 2024

4.8.7.1

Feb 29, 2024

4.8.7

Feb 29, 2024

4.8.6.1

Feb 29, 2024

4.8.6

Feb 29, 2024

4.8.5.5

Feb 29, 2024

4.8.5.4

Feb 29, 2024

4.8.5.3

Feb 29, 2024

4.8.5.2

Feb 28, 2024

4.8.5.1

Feb 28, 2024

4.8.4

Feb 28, 2024

4.8.3

Feb 28, 2024

4.8.2

Feb 28, 2024

4.8.1

Feb 27, 2024

4.8.0

Feb 27, 2024

4.7.11

Feb 27, 2024

4.7.10

Feb 27, 2024

4.7.9

Feb 26, 2024

4.7.8

Feb 26, 2024

4.7.7

Feb 26, 2024

4.7.6

Feb 26, 2024

4.7.5

Feb 24, 2024

4.7.4

Feb 24, 2024

4.7.3

Feb 23, 2024

4.7.2

Feb 23, 2024

4.7.1

Feb 23, 2024

4.7.0

Feb 22, 2024

4.6.11

Feb 22, 2024

4.6.10

Feb 22, 2024

4.6.9

Feb 22, 2024

4.6.8

Feb 22, 2024

4.6.7

Feb 22, 2024

4.6.6

Feb 22, 2024

4.6.5

Feb 21, 2024

4.6.4

Feb 20, 2024

4.6.3

Feb 20, 2024

4.6.2

Feb 19, 2024

4.6.1

Feb 19, 2024

4.6

Feb 19, 2024

4.5.18

Feb 18, 2024

4.5.17

Feb 18, 2024

4.5.16

Feb 18, 2024

4.5.15

Feb 18, 2024

4.5.14

Feb 18, 2024

4.5.13

Feb 18, 2024

4.5.12

Feb 18, 2024

4.5.11

Feb 18, 2024

4.5.10

Feb 18, 2024

4.5.9

Feb 18, 2024

4.5.8

Feb 18, 2024

4.5.7

Feb 18, 2024

4.5.6

Feb 18, 2024

4.5.3.2

Feb 17, 2024

4.5.3.1

Feb 17, 2024

4.5.3

Feb 17, 2024

4.5.2

Feb 17, 2024

4.5.1

Feb 17, 2024

4.5

Feb 17, 2024

4.4

Feb 17, 2024

4.3

Feb 13, 2024

4.2

Feb 13, 2024

4.1

Feb 17, 2024

4.0.8

Feb 13, 2024

4.0.7

Feb 13, 2024

4.0.6

Feb 13, 2024

4.0.5

Feb 13, 2024

4.0.1

Feb 13, 2024

3.26

Jan 6, 2024

3.25

Jan 6, 2024

3.24

Jan 6, 2024

3.23

Jan 6, 2024

3.22

Jan 6, 2024

3.21

Jan 5, 2024

3.2

Jan 5, 2024

3.1.4

Jan 5, 2024

3.1.3

Jan 5, 2024

3.1.2

Jan 5, 2024

3.1.1

Jan 5, 2024

3.1.0

Jan 5, 2024

3.0.0

Jan 2, 2024

2.4.0

Jan 1, 2024

2.2.0

Dec 30, 2023

2.1.0

Dec 30, 2023

2.0

Dec 29, 2023

1.33

Dec 23, 2023

1.32

Dec 23, 2023

1.31

Dec 23, 2023

1.30

Dec 23, 2023

1.29

Dec 23, 2023

1.28

Dec 23, 2023

1.27

Dec 23, 2023

1.26

Dec 23, 2023

1.25

Dec 23, 2023

1.24

Dec 23, 2023

1.23

Dec 23, 2023

1.22

Dec 23, 2023

1.21

Dec 23, 2023

1.6

Dec 27, 2023

1.5

Dec 26, 2023

1.4

Dec 25, 2023

1.2

Dec 23, 2023

1.1

Dec 23, 2023

1.0

Dec 21, 2023

0.12

Oct 23, 2023

0.11

Oct 23, 2023

0.10

Oct 23, 2023

0.9

Oct 23, 2023

0.8

Oct 23, 2023

0.7

Oct 22, 2023

0.6

Oct 22, 2023

0.5

Oct 22, 2023

0.4

Oct 21, 2023

0.3

Oct 21, 2023

0.2

Oct 21, 2023

0.1

Oct 21, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

player_performance_ratings-5.6.13.tar.gz (61.2 kB view hashes)

Uploaded Apr 21, 2024 Source

Built Distribution

player_performance_ratings-5.6.13-py3-none-any.whl (78.6 kB view hashes)

Uploaded Apr 21, 2024 Python 3

Hashes for player_performance_ratings-5.6.13.tar.gz

Hashes for player_performance_ratings-5.6.13.tar.gz
Algorithm	Hash digest
SHA256	`9c8d490a2abd91d343b3aab1bfb3d2d3cfeb039f2192dc1ea6359e9018145c90`
MD5	`0d07dde64a16a1e12d73c753b8719eb4`
BLAKE2b-256	`ec5a1fe79ed0d3c2d6002f521f369090e4143e0309fd29a39eea6b431e69db1a`

Hashes for player_performance_ratings-5.6.13-py3-none-any.whl

Hashes for player_performance_ratings-5.6.13-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1cbc46f2bdf9cf28d6af0f11a0b84db06265c765e8474c88cc6326d7c59f4e94`
MD5	`6466e6e594f856bf7440f551b8d099f2`
BLAKE2b-256	`40db06cde9949ec73b94929a4cb2660e4ded734d2495a94d120174f559632a66`