Match Predictions based on Player Ratings

player-performance-ratings

A framework for predicting outcomes in sports games using player-based ratings or other engineered features such as rolling means. Ratings can be used to predict the game winner, but also other outcomes such as total points scored or total yards gained.

Installation

pip install player-performance-ratings

Examples

Ensure you have a dataset where each row is a unique combination of game_id and player_id. The framework supports several use cases, such as:

Creating ratings for players/teams.
Predicting the outcome of a game.
Creating features or other types of data transformations.
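
The one-row-per-game-and-player requirement can be checked before any rating is calculated. The following is a minimal sketch in plain Python, not part of the framework: the helper name find_duplicate_keys and the sample rows are hypothetical, and the column names are only assumed to match your own data.

```python
from collections import Counter

# Hypothetical rows: one dict per (game, player) appearance.
rows = [
    {"game_id": "g1", "team_id": "A", "player_name": "p1", "won": 1},
    {"game_id": "g1", "team_id": "B", "player_name": "p2", "won": 0},
    {"game_id": "g2", "team_id": "A", "player_name": "p1", "won": 0},
    {"game_id": "g2", "team_id": "A", "player_name": "p1", "won": 0},  # duplicate row
]


def find_duplicate_keys(rows, game_col="game_id", player_col="player_name"):
    """Return the (game, player) pairs that appear on more than one row."""
    counts = Counter((row[game_col], row[player_col]) for row in rows)
    return sorted(key for key, n in counts.items() if n > 1)


duplicates = find_duplicate_keys(rows)
print(duplicates)
```

An empty result means every row is a unique game and player combination; anything else should be deduplicated or aggregated first.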

Training a Rating Model

If you only want to generate ratings, this is simple:

from player_performance_ratings.ratings import UpdateRatingGenerator
from examples import get_sub_sample_nba_data
from player_performance_ratings.data_structures import ColumnNames

df = get_sub_sample_nba_data(as_pandas=True)

# Defines the column names as they appear in the dataframe
column_names = ColumnNames(
    team_id='team_id',
    match_id='game_id',
    start_date="start_date",
    player_id="player_name",
)
# Sorts the dataframe. The dataframe must always be sorted as below
df = df.sort_values(by=[column_names.start_date, column_names.match_id, column_names.team_id, column_names.player_id])


# Drops games with fewer or more than 2 teams
df = (
    df.assign(team_count=df.groupby(column_names.match_id)[column_names.team_id].transform('nunique'))
    .loc[lambda x: x.team_count == 2]
    .drop(columns=['team_count'])
)

# Pretends the last 10 games are future games. The model will be trained on everything before that.
most_recent_10_games = df[column_names.match_id].unique()[-10:]
historical_df = df[~df[column_names.match_id].isin(most_recent_10_games)]
future_df = df[df[column_names.match_id].isin(most_recent_10_games)]

# Defining a simple rating-generator. It will use the "won" column to update the ratings.
# In contrast to a typical Elo, ratings will follow players.
rating_generator = UpdateRatingGenerator(performance_column='won')

# Calculate Ratings on Historical data
historical_df_with_ratings = rating_generator.generate_historical(historical_df, column_names=column_names)

# Printing out the 10 highest rated teams and the ratings of the players for the team
team_ratings = rating_generator.team_ratings
print(team_ratings[:10])

# Calculating ratings for future matches
future_df_with_ratings = rating_generator.generate_future(future_df)
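
To illustrate what "ratings will follow players" means, here is a rough sketch of an Elo-style update applied at player level. It is not the algorithm used by UpdateRatingGenerator; the function names, the starting rating of 1000, the K-factor of 20 and the use of the team mean are all assumptions chosen for illustration.

```python
def expected_score(rating_diff, scale=400.0):
    """Standard Elo logistic curve: win probability for a given rating edge."""
    return 1.0 / (1.0 + 10 ** (-rating_diff / scale))


def update_player_ratings(ratings, team_a, team_b, a_won, k=20.0, start=1000.0):
    """Update every player's rating in place after one game between two teams.

    Assumption: a team's strength is the mean of its players' ratings, and
    every player on a team receives the same adjustment.
    """
    mean_a = sum(ratings.get(p, start) for p in team_a) / len(team_a)
    mean_b = sum(ratings.get(p, start) for p in team_b) / len(team_b)
    delta = k * ((1.0 if a_won else 0.0) - expected_score(mean_a - mean_b))
    for p in team_a:
        ratings[p] = ratings.get(p, start) + delta
    for p in team_b:
        ratings[p] = ratings.get(p, start) - delta
    return ratings


ratings = {}
update_player_ratings(ratings, ["p1", "p2"], ["p3", "p4"], a_won=True)
# p1 now plays with a new teammate and carries over the rating earned above.
update_player_ratings(ratings, ["p1", "p5"], ["p3", "p6"], a_won=True)
print(ratings)
```

Because the rating is stored per player rather than per team, a player who changes teams brings their rating along, which a team-level Elo cannot express.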

Predicting Game-Winner

Ensure you have a dataset where each row is a unique combination of game_ids and player_ids. Even if the concept of a player doesn't exist in the dataset, you can use team_id instead of player_id.

Utilizing a rating model can be as simple as:

from examples import get_sub_sample_nba_data
from player_performance_ratings.pipeline import Pipeline
from player_performance_ratings.predictor import GameTeamPredictor

from player_performance_ratings.ratings import UpdateRatingGenerator

from player_performance_ratings.data_structures import ColumnNames

df = get_sub_sample_nba_data(as_pandas=True)

# Defines the column names as they appear in the dataframe
column_names = ColumnNames(
    team_id='team_id',
    match_id='game_id',
    start_date="start_date",
    player_id="player_name",
)
# Sorts the dataframe. The dataframe must always be sorted as below
df = df.sort_values(by=[column_names.start_date, column_names.match_id, column_names.team_id, column_names.player_id])



# Drops games with fewer or more than 2 teams
df = (
    df.assign(team_count=df.groupby(column_names.match_id)[column_names.team_id].transform('nunique'))
    .loc[lambda x: x.team_count == 2]
    .drop(columns=['team_count'])
)

# Pretends the last 10 games are future games. The model will be trained on everything before that.
most_recent_10_games = df[column_names.match_id].unique()[-10:]
historical_df = df[~df[column_names.match_id].isin(most_recent_10_games)]
future_df = df[df[column_names.match_id].isin(most_recent_10_games)].drop(columns=['won'])

# Defining a simple rating-generator. It will use the "won" column to update the ratings.
# In contrast to a typical Elo, ratings will follow players.
rating_generator = UpdateRatingGenerator(performance_column='won')

# Defines the predictor. A machine-learning model will be used to predict game winner on a game-team-level.
# Mean team-ratings will be calculated (from player-level) and rating-difference between the 2 teams calculated.
# It will also use the location of the game as a feature.
predictor = GameTeamPredictor(
    game_id_colum=column_names.match_id,
    team_id_column=column_names.team_id,
    estimator_features=['location'],
    target='won',
    one_hot_encode_cat_features=True
)

# The Pipeline is where we define all the steps. Other transformations can take place as well.
# However, in our simple example we only have a simple rating-generator and a predictor.
pipeline = Pipeline(
    rating_generators=rating_generator,
    predictor=predictor,
    column_names=column_names,
)

# Trains the model and returns historical predictions
historical_predictions = pipeline.train(df=historical_df)

# Generates predictions for the future games
future_predictions = pipeline.predict(df=future_df)

# Grouping predictions from game-player level to game-level.
team_grouped_predictions = future_predictions.groupby(column_names.match_id).first()[
    [column_names.start_date, column_names.team_id, 'team_id_opponent', predictor.pred_column]]

print(team_grouped_predictions)
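
The comments above mention that mean team ratings are derived from player-level ratings and that the rating difference between the two teams is then calculated. The sketch below shows that aggregation in plain Python under stated assumptions: the rows, the rating column and the function name team_rating_differences are hypothetical, and the framework's internal implementation may differ.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical game-player rows with a pre-game player rating attached.
rows = [
    {"game_id": "g1", "team_id": "A", "player": "p1", "rating": 1050.0},
    {"game_id": "g1", "team_id": "A", "player": "p2", "rating": 1010.0},
    {"game_id": "g1", "team_id": "B", "player": "p3", "rating": 990.0},
    {"game_id": "g1", "team_id": "B", "player": "p4", "rating": 970.0},
]


def team_rating_differences(rows):
    """Return {(game_id, team_id): own mean rating minus opponent mean rating}."""
    by_team = defaultdict(list)
    for row in rows:
        by_team[(row["game_id"], row["team_id"])].append(row["rating"])

    games = defaultdict(dict)
    for (game_id, team_id), values in by_team.items():
        games[game_id][team_id] = mean(values)

    diffs = {}
    for game_id, teams in games.items():
        if len(teams) != 2:  # only two-team games have a single opponent
            continue
        (team_1, rating_1), (team_2, rating_2) = teams.items()
        diffs[(game_id, team_1)] = rating_1 - rating_2
        diffs[(game_id, team_2)] = rating_2 - rating_1
    return diffs


diffs = team_rating_differences(rows)
print(diffs)
```

A feature like this, one value per game and team, is the kind of input a game-team-level estimator can be trained on alongside features such as location.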

Calculating Rolling Means, Lags and Ratings in the same Pipeline

If you simply want to calculate features without feeding them directly into a prediction model, this can be done using PipelineTransformer. The example below calculates rolling means and lags for kills, deaths and the result, and calculates a rating based on the result. It then outputs the dataframe with the new features.

from examples import get_sub_sample_lol_data
from player_performance_ratings import ColumnNames
from player_performance_ratings.pipeline_transformer import PipelineTransformer
from player_performance_ratings.ratings import UpdateRatingGenerator

from player_performance_ratings.transformers import LagTransformer
from player_performance_ratings.transformers.lag_generators import RollingMeanTransformer

column_names = ColumnNames(
    team_id='teamname',
    match_id='gameid',
    start_date="date",
    player_id="playername",
    league='league',
    position='position',
)
df = get_sub_sample_lol_data(as_pandas=True)
df = (
    df.loc[lambda x: x.position != 'team']
    .assign(team_count=df.groupby('gameid')['teamname'].transform('nunique'))
    .loc[lambda x: x.team_count == 2]
    .assign(player_count=df.groupby(['gameid', 'teamname'])['playername'].transform('nunique'))
    .loc[lambda x: x.player_count == 5]
)
# Re-checks the team count, since dropping incomplete teams can leave a game with a single team
df = (
    df.assign(team_count=df.groupby('gameid')['teamname'].transform('nunique'))
    .loc[lambda x: x.team_count == 2]
)


# Pretends the last 10 games are future games. The model will be trained on everything before that.
most_recent_10_games = df[column_names.match_id].unique()[-10:]
historical_df = df[~df[column_names.match_id].isin(most_recent_10_games)]
future_df = df[df[column_names.match_id].isin(most_recent_10_games)].drop(columns=['result'])

rating_generator = UpdateRatingGenerator(
    performance_column='result'
)

lag_generators = [
    LagTransformer(
        features=["kills", "deaths", "result"],
        lag_length=3,
        granularity=['playername']
    ),
    RollingMeanTransformer(
        features=["kills", "deaths", "result"],
        window=20,
        min_periods=1,
        granularity=['playername']
    )
]


transformer = PipelineTransformer(
    column_names=column_names,
    rating_generators=rating_generator,
    lag_generators=lag_generators
)

historical_df = transformer.fit_transform(historical_df)

future_df = transformer.transform(future_df)
print(future_df.head())
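
For readers unfamiliar with lag and rolling-mean features, the following plain-Python sketch shows the general idea per player. It is not the framework's implementation: the function name, the output column names and the choice to build each feature only from earlier games (so the current game's value never leaks into its own features) are assumptions for illustration.

```python
from collections import defaultdict, deque


def add_lag_and_rolling_mean(rows, feature, player_col="playername", window=3):
    """Attach a lag-1 value and a rolling mean built only from earlier games.

    Rows must already be sorted chronologically.
    """
    history = defaultdict(lambda: deque(maxlen=window))
    out = []
    for row in rows:
        past = history[row[player_col]]
        new_row = dict(row)
        new_row[f"lag_1_{feature}"] = past[-1] if past else None
        new_row[f"rolling_mean_{feature}"] = sum(past) / len(past) if past else None
        out.append(new_row)
        past.append(row[feature])  # becomes visible to the player's next game only
    return out


rows = [
    {"gameid": "g1", "playername": "p1", "kills": 2},
    {"gameid": "g2", "playername": "p1", "kills": 4},
    {"gameid": "g3", "playername": "p1", "kills": 9},
    {"gameid": "g3", "playername": "p2", "kills": 1},
]
features = add_lag_and_rolling_mean(rows, "kills")
for row in features:
    print(row)
```

A player's first game has no history, so both features are None there; how such gaps are filled is a modelling decision left to the user.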

Hyperparameter tuning

TODO


Advanced use cases

The examples listed above are quite simple.
However, the framework is designed to be flexible and can easily be extended in order to create better models.
Examples:

* Create a better margin-of-victory measure by combining multiple columns into a performance_column on which the ratings are based.
* Combine rolling means and lags with ratings to create a more complex model.
* Add other features such as weather, home/away, etc.
* Predict outcomes other than the game winner, such as total points scored, total yards gained, etc.
* Create custom transformations utilizing domain knowledge of the sport.
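
As an illustration of the first point, a combined performance value can be computed before it is handed to a rating generator as the performance column. The sketch below is one possible approach in plain Python: the function name weighted_performance, the column names, the weights and the min-max scaling to the range 0 to 1 are all assumptions, not something the framework prescribes.

```python
def weighted_performance(rows, weights):
    """Min-max scale each column to [0, 1], then take a weighted average."""
    total_weight = sum(weights.values())
    bounds = {
        col: (min(r[col] for r in rows), max(r[col] for r in rows))
        for col in weights
    }
    scores = []
    for r in rows:
        score = 0.0
        for col, weight in weights.items():
            low, high = bounds[col]
            # A constant column carries no information; treat it as neutral.
            scaled = 0.5 if high == low else (r[col] - low) / (high - low)
            score += weight * scaled
        scores.append(score / total_weight)
    return scores


# Hypothetical team-game rows: result plus point margin.
rows = [
    {"won": 1, "point_margin": 12},
    {"won": 0, "point_margin": -12},
    {"won": 1, "point_margin": 0},
]
performance = weighted_performance(rows, {"won": 0.5, "point_margin": 0.5})
print(performance)
```

Here a narrow win scores lower than a blowout win, which gives the rating update more signal than the binary won column alone.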

Project details

This version: 5.27.0 (Mar 16, 2025)
